Data Management
The future of computing lies in the hybrid cloud. We're creating a hybrid data fabric that provides secure, governed data access from anywhere, enables self-service discovery of the right data at the right time, and takes a holistic approach to minimizing total cost of ownership for AI and analytics.
Tools + code
Fybrik
A cloud-native platform to unify data access, governance, and orchestration, enabling business agility while securing enterprise data.
Project CodeFlare
A framework to simplify the integration, scaling and acceleration of complex multi-step analytics and machine learning pipelines on the cloud.
Datashim Framework
A Kubernetes-based framework for hassle-free handling of datasets.
Xskipper
A library for creating, managing, and deploying data skipping indexes with Apache Spark.
Publications
Shazia Afzal, Rajmohan C, et al. (2021). SMDS 2021
Diego Didona, Nikolas Ioannou, et al. (2021). VLDB 2021
Anjali Singh, Shamanth R Nayak K, et al. (2021). ICML 2021
Renan Souza, Vitor Silva, et al. (2021). PeerJ Computer Science
Antony Chazapis, Christian Pinto, et al. (2021). CHEOPS 2021
Kristina Spirovska, Diego Didona, et al. (2021). IEEE TPDS