Alluxio

Alluxio is data orchestration technology for analytics and AI for the cloud. In the MRS big data ecosystem, Alluxio lies between computing and storage. It provides a data abstraction layer for computing frameworks including Apache Spark, Presto, MapReduce, and Apache Hive, so that upper-layer computing applications can access persistent storage systems including HDFS and OBS through unified client APIs and a global namespace. In this way, computing and storage are separated.

**Figure 1** Alluxio architecture

Figure 1 Alluxio architecture

Advantages:

  • Provides in-memory I/O throughput, and makes elastically scale data-driven applications cost effective.

  • Simplified cloud and object storage access

  • Simplified data management and a single point of access to multiple data sources

  • Easy application deployment