Big Data Architecture
by Thanachart Numnonda
1. Data Ingestion
1.1. Sqoop
1.2. Flume
1.3. KafKa
2. Hadoop Tools
2.1. Cluster Management
2.2. Search
3. Storage
3.1. Hadoop HDFS
3.2. NoSQL
3.2.1. HBase
3.3. RDBMS
3.4. Object Storage
4. Processing
4.1. Batch
4.1.1. MapReduce
4.1.2. Hive
4.1.3. Pig
4.2. SQL
4.3. RealTime
4.3.1. Impala
4.3.2. Spark
4.4. Machine Learning
4.4.1. Mahout
4.4.2. Spark