Data Lake
by RAM PRAVESH KUMAR
1. ?
1.1. Blob Storage + Gen1 (HDFS)
1.2. Big Data Container
1.2.1. Structured Data
1.2.1.1. Databases
1.2.1.2. Tables
1.2.2. Semi-structured Data
1.2.3. Unstructured Data
2. Exploration (Prep & Train)
2.1. Databricks
2.2. Hadoop Distributions
2.3. Synapse
2.4. HDInsight
3. Ingestion
3.1. From Local Computer
3.1.1. Azure Storage Explorer
3.1.2. AzCopy
3.1.3. PowerShell
3.2. Blob Storage
3.2.1. ADF
3.2.2. AzCopy
3.2.3. DistCp
3.3. Relational Databases
3.3.1. ADF
3.4. Web Server Logs
3.4.1. ADF
3.4.2. PowerShell
3.5. On-Premises Hadoop Cluster
3.5.1. ADF
3.5.2. DistCp
3.6. Streaming Data
3.6.1. Azure Stream Analytics
3.6.2. Storm