BLASTER - Programming

Main Pipeline MindMap for SIDER_RepetitiveSearcher project

马上开始. 它是免费的哦
注册 使用您的电邮地址
BLASTER - Programming 作者: Mind Map: BLASTER - Programming

1. **genome_solap_location_filter**

1.1. For each DNA strand we get the **start** and **end** coordinates

2. **genome_solap_location_grouping**

2.1. For each DNA strand it will group coordinates depending on nearness

3. **genome_solap_by_pairs**

3.1. It will analyze two sequence which are overlaping and join them

4. **genome_solap_minmax**

4.1. Depending on each DNA strand it will get the *minimum* or *maximum* of each **group**

5. **genome_solap_main**

6. **chromosome_filter**

6.1. It reads a **FASTA** file and numbers all the sequences with a custom made **prefix**

6.2. The sequences in fasta file NEED to be in order

7. **genome_pre_duplicate_filter**

7.1. Filters the data in a **CSV** for duplicated sequences

8. **genome_duplicate_filter**

8.1. Calls *genome_pre_duplicate_filter* for "+" and "-" DNA strand and unites the results.

8.2. The Output is a **CSV** file with the data filtered.

9. **dash_filter**

9.1. Will filter **dashes**, "-", in a **CSV** file

10. **global_filters_main**

10.1. Mixes other filters

10.2. The Output is a **CSV** which is constantly overwritten

11. **csv_creator**

11.1. Creates a **CSV** file

12. **specific_sequence_corrected**

12.1. It reads the **CSV** file and finds the **correct coordinates** of the sequences.

13. **filter_by_column**

13.1. Filter **CSV** data depending on:

13.1.1. Minimun sequence **length**

13.1.2. Minimum identity **percent**

13.2. Then it Outputs a **CSV** file

14. **blastn_blaster**

14.1. It executes a **BLASTn** search

15. **repetitive_blaster**

16. **specific_sequence_extractor**

16.1. For each **chromosome ID** selected, it will take all the rows in the **CSV** file corresponding to that **chomosome ID**

17. **specific_sequence_1000nt**

17.1. For each **chromosome ID**, it will expand all the selected sequences in the **CSV** to **1000 nt**

18. **fasta_creator**

18.1. It will create a **FASTA** file with the input **CSV** file

19. **blastn_dic**

19.1. It creates a **BLAST database** with the **FASTA** file it reads

20. **genome_specific_chromosome_main**

20.1. Calls all blasters and filters

21. **boxymcboxface**

21.1. Just some aesthetic display

22. **csv_mixer**

22.1. Mixes 2 CSV files into one

23. **folder_creator**

23.1. Creates a folder in the current path