Alluxio¶
Alluxio is data orchestration technology for analytics and AI for the cloud. In the MRS big data ecosystem, Alluxio lies between computing and storage. It provides a data abstraction layer for computing frameworks including Apache Spark, Presto, MapReduce, and Apache Hive, so that upper-layer computing applications can access persistent storage systems including HDFS and OBS through unified client APIs and a global namespace. In this way, computing and storage are separated.
Advantages:
Provides in-memory I/O throughput, and makes elastically scale data-driven applications cost effective.
Simplified cloud and object storage access
Simplified data management and a single point of access to multiple data sources
Easy application deployment