Big Data Architecture
作者:Thanachart Numnonda

1. Data Ingestion
1.1. Sqoop
1.2. Flume
1.3. KafKa
2. Hadoop Tools
2.1. Cluster Management
2.2. Search
3. Storage
3.1. Hadoop HDFS
3.2. NoSQL
3.2.1. HBase
3.3. RDBMS
3.4. Object Storage
4. Processing
4.1. Batch
4.1.1. MapReduce
4.1.2. Hive
4.1.3. Pig
4.2. SQL
4.3. RealTime
4.3.1. Impala
4.3.2. Spark
4.4. Machine Learning
4.4.1. Mahout
4.4.2. Spark