Selective Coding

Comienza Ya. Es Gratis
ó regístrate con tu dirección de correo electrónico
Selective Coding por Mind Map: Selective Coding

1. Circumstances

1.1. Limitation of current paper format

1.1.1. Action

1.1.1.1. Delete Exploration

1.1.1.1.1. Consequences

1.2. Peer Review System

1.3. Human Psychology

1.4. Delivering the document to the PI

1.4.1. Action

1.4.1.1. Don't explain technical details

1.4.1.1.1. Consequences

1.5. Delivering the document to the peers

1.5.1. Action

1.5.1.1. Explain technical details

1.5.1.1.1. Consequences

1.6. Scrolling make dizzying effects

1.6.1. Action

1.6.1.1. Make the document short and simple

1.6.1.1.1. Consequences

1.6.1.2. Don't use Jupyter notebook in the class

1.6.1.2.1. Consequences

2. Reproducibility

2.1. Problem

2.1.1. Cause

2.1.1.1. Non - programmer

2.1.1.1.1. Bad annotation

2.1.1.1.2. Bad version control

2.1.1.2. No exact same data set with exact same format

2.1.1.3. Random seed

2.1.1.4. Limitation of size of the paper

2.1.1.5. High variability

2.1.1.6. Poor documentation

2.1.1.6.1. Delete Exploration

2.1.1.6.2. Not enough annotations

2.1.2. Effect

2.1.2.1. Re write the code

2.1.2.2. Time consuming

2.2. Benefits

2.2.1. Advancement of Science

2.2.2. Citation

2.3. What it needs

2.3.1. easy explanation

3. Documentation

3.1. Code+Markdown

3.1.1. Psychological role

3.1.2. Incentive and motivation

3.2. Poorly documented report

3.2.1. affects reproducibility

4. Data Exploration

4.1. Necessary

4.1.1. Scientists who want to reproduce the research

4.1.2. Dead ends are important for future research

4.2. Unnecessary

4.2.1. Who wants to skim the work

4.3. Limitations

4.3.1. Current paper format

4.3.2. Peer reviewing system

4.3.3. Human nature to sell their work

5. Jupyter notebook

5.1. Improvements

5.1.1. Cell slide

5.1.2. Interactive data exploration

5.1.3. Functions in IDE

5.2. Cons

5.2.1. Scrolling cause dizzying effects

5.2.2. not suitable for complex projects

5.3. Pros

5.3.1. Dragging cells

5.3.2. perfect for class projects, report, simple analysis

5.3.3. School, learning purpose

6. Length

6.1. Short and simple

6.1.1. Tend to separate to other documents if it becomes to long

6.1.2. Can't include all the details or explorations that were made

7. New feature

7.1. Auto complete

7.2. Interactive data exploration

7.3. Drag and drop

8. Data comic

8.1. Engaging

8.2. More information in one page

8.3. Different level of importance using panels

8.4. Remember longer

8.5. Overcome limits of text

9. Who's the reader

9.1. Students(Novice)

9.1.1. More context

9.2. Colleague(Same field)

9.2.1. Freely use technical terms and explain technical details

9.2.2. Trust issue

9.3. Colleague(Different field)

9.3.1. Less technical terms and context

9.4. Business professional

9.4.1. Qualitative data only

9.4.2. Powerpoint/text report

9.4.3. Trust what data scientist does

9.4.4. Visualization

10. Data science work flow

10.1. Data cleaning 80%