Machine Learning

Get Started. It's Free
or sign up with your email address
Machine Learning by Mind Map: Machine Learning

1. Technical skills

1.1. Computer Science's skills

1.1.1. SQL Database – able to write and execute complex queries in SQL

1.2. Anaylytics skill

1.2.1. R – analytics tool

1.3. Programming experience

1.3.1. Python, Java, R

1.4. Statistics and Probability

1.4.1. Prior knowledge in this would help in problem solving

1.5. Basic Linear Algebra

1.5.1. Regression analysis

2. Self-learning resources

2.1. Online courses

2.1.1. Coursera,Udacity, Khan Academy

2.2. Kaggle

2.2.1. Competitions - World most elite machine learning leaderboard Datasets - Analyze public datasets Kernels - Run code in cloud and receive community feedback

2.3. MOOC

2.3.1. Codecademy

2.4. Video Tutorial

2.4.1. Youtube

3. Soft skills

3.1. Intellectual curiosity

3.1.1. A data scientist should invest time and energy in getting into machine learning knowledge

3.2. Communication skills

3.2.1. Needed to translate technical finding to non-technical team such as Marketing departments

3.3. Business acumen

3.3.1. Understanding in industry and company on how to make profit and effective to strategy execution

4. Role Vs Task Vs Skill

4.1. Role

4.1.1. Data Analyst Analyzing data in order to tell a story and produce actionable insights

4.1.2. Machine Learning Enginnering Working software and consists of other software components that run autonomosly with minimal human supervision

4.2. Task

4.2.1. Regression Exp: Regression trees, Linear regression

4.2.2. Classification Exp: Decision trees, Support vector machine

4.2.3. Clustering Exp: Mean-shift(High accuracy), Hierarchical clustering

4.2.4. Multivariate Querying Exp: Nearest neighbors, Range Search

4.2.5. Density Estimation Exp: Kernel density estimation(High accuracy), Mixture of Gaussians

4.2.6. Dimension Reduction Exp: Manifold learning/KPCA(High accuracy), Principal component analysis

4.2.7. Testing and Matching Exp: Minimum spanning tree

4.3. Skill

4.3.1. Computer Science Fundamental and Programming Exp: Data Structures(stacks, queues),Python/C++/R/Java, algorithms(Searching, Sorting)

4.3.2. Probability and Statistics Exp: conditional probability, Bayer rule, distributions(Normal, Binomial, Poisson), analysis methods(ANOVA, hypothesis testing), measures(Mean, Median, Variance)

4.3.3. Data Modeling and Evaluation Exp: correlations, clusters, eigenvectors, classification, regression, anomaly detection

4.3.4. Applying Machine Learning Algorithms and Libraries Exp: libraries/packages/APIs(e.g. scikit-learn, Theano, Spark MLib, H2O, TensorFlow)

4.3.5. Software Engineering and System Design Exp: analysis, system design, modularity, version control, testig, documentaion