Big Data - RFX 9D methodology

Create a To-Do list for your upcoming tasks

Get Started. It's Free
or sign up with your email address
Rocket clouds
Big Data - RFX 9D methodology by Mind Map: Big Data -  RFX 9D methodology

1. Data Source

1.1. Social Media API

1.1.1. Facebook Graph API

1.1.2. Twitter API

1.1.3. Google+ API

1.1.4. LinkedIn API

1.2. Master Data

1.2.1. MySQL

1.2.2. Oracle

1.2.3. PostgresSQL

1.2.4. MongoDB

1.3. Server Logs

1.3.1. Access Logs

1.3.1.1. The number of visitors

1.3.1.2. The origin of the visitors

1.3.1.3. How many requests for each page at the site

1.3.1.4. Usage patterns in terms of time of day, day of week, and seasonally

1.4. User Activity Tracking

1.4.1. Contextual Data

1.4.1.1. Time

1.4.1.2. Pageview

1.4.1.3. Location

1.4.1.3.1. Store

1.4.1.4. Referer

1.4.1.4.1. Online Channel

1.4.1.4.2. Friend

1.4.1.5. User Agent

1.4.1.5.1. Device

1.4.2. Behavioral data

1.4.2.1. True Impression

1.4.2.2. True Attention

1.4.2.3. Like

1.4.2.4. Click

1.4.2.5. Comment

1.4.2.6. Purchase

1.4.2.7. CheckIn

1.5. Sensor devices

1.5.1. RFID Readers

1.6. Wearable medium

1.6.1. Smartwatch

2. Data Collector

2.1. Build trusted sources of data

2.1.1. Websites

2.1.1.1. JavaScript Tracking

2.1.2. Mobile

2.1.2.1. SDK for Mobile Analytics

2.1.3. Middle-ware

2.1.3.1. Access Logs

2.1.3.1.1. NginX

2.1.3.2. Debug/Error/Info Logs (e.g: log4j )

2.1.4. Email

2.1.4.1. Tracking

3. Data Storage

3.1. Distributed File System

3.1.1. Real-time processing

3.1.1.1. Apache Kafka

3.1.1.2. Redis

3.1.1.3. Apache Storm

3.1.1.4. Apache Spark Streaming

3.1.1.5. Aerospike

3.1.2. Batch processing

3.1.2.1. Apache Hadoop

3.1.2.1.1. HDFS

3.1.2.2. Apache Cassandra

3.1.2.3. memory-centric distributed storage system

3.1.2.3.1. http://tachyon-project.org/

3.1.2.3.2. https://github.com/google/jimfs

4. Data Query

4.1. Ask Bigger Questions

4.2. Distributed Query Engines

4.2.1. Apache Phoenix

4.2.2. Apache Hive

4.2.3. Apache Spark

4.2.4. Apache Tajo

5. Data Mining

5.1. Find the insights

5.1.1. Ad-hoc Query with SQL

5.1.2. Text Clustering

5.1.3. topic modeling

5.2. Predictive Analytics

5.2.1. Machine Learning

6. Data Visualization

6.1. Open Source Projects

6.1.1. http://www.chartjs.org/

6.1.2. http://nvd3.org/

6.1.3. http://dygraphs.com/

6.1.4. https://github.com/showcases/data-visualization

6.2. Display data in accessible format

6.2.1. Heatmap

6.2.2. D3.js

6.2.3. leafletjs

7. Data-driven Business

7.1. Marketing

7.1.1. Marketing Optimization

7.1.2. Reactive Marketing

7.2. Advertising

7.2.1. Advertising Analytics 2.0

7.3. Media intelligence

7.3.1. Analytics Platform

7.4. Strategy

7.4.1. Planning Automation

7.5. Organization

8. Data Model

8.1. Classical Data Structures

8.1.1. List

8.1.2. Tree

8.1.3. Hash Map

8.1.4. Graph

8.2. Algorithms

8.3. Probabilistic Data Structures

8.4. User Profile Classification

9. Data Actor

9.1. RFX-DB

9.1.1. Active Functor Theory

9.1.2. Reactive Function X (RFX)

9.2. Reactive System

9.3. Processing Actor - Akka.io

9.4. Reactive Streams