Big Data Architecture
by Thanachart Numnonda

1. Data Ingestion
1.1. Sqoop
1.2. Flume
1.3. Kafka
2. Hadoop Tools
2.1. Cluster Management
2.2. Search
3. Storage
3.1. Hadoop HDFS
3.2. NoSQL
3.2.1. HBase
3.3. RDBMS
3.4. Object Storage
4. Processing
4.1. Batch
4.1.1. MapReduce
4.1.2. Hive
4.1.3. Pig
4.2. SQL
4.3. Real-Time
4.3.1. Impala
4.3.2. Spark
4.4. Machine Learning
4.4.1. Mahout
4.4.2. Spark