Toon volledige map

Kopieer en bewerk map Kopieer

BLASTER - Programming

Andere

Francisco Pacheco Hinojosa

Volg

Main Pipeline MindMap for SIDER_RepetitiveSearcher project

Laten we beginnen. Het is Gratis

of registreren met je e-mailadres

Vergelijkbare mind mappen Overzicht van map

BLASTER - Programming Door Francisco Pacheco Hinojosa Mind Map: BLASTER - Programming

Mind Map: BLASTER - Programming

1. genome_solap_location_filter

1.1. For each DNA strand we get the start and end coordinates

2. genome_solap_location_grouping

2.1. For each DNA strand it will group coordinates depending on nearness

3. genome_solap_by_pairs

3.1. It will analyze two sequence which are overlaping and join them

4. genome_solap_minmax

4.1. Depending on each DNA strand it will get the minimum or maximum of each group

5. genome_solap_main

6. chromosome_filter

6.1. It reads a FASTA file and numbers all the sequences with a custom made prefix

6.2. The sequences in fasta file NEED to be in order

7. genome_pre_duplicate_filter

7.1. Filters the data in a CSV for duplicated sequences

8. genome_duplicate_filter

8.1. Calls genome_pre_duplicate_filter for "+" and "-" DNA strand and unites the results.

8.2. The Output is a CSV file with the data filtered.

9. dash_filter

9.1. Will filter dashes, "-", in a CSV file

10. global_filters_main

10.1. Mixes other filters

10.2. The Output is a CSV which is constantly overwritten

11. csv_creator

11.1. Creates a CSV file

12. specific_sequence_corrected

12.1. It reads the CSV file and finds the correct coordinates of the sequences.

13. filter_by_column

13.1. Filter CSV data depending on:

13.1.1. Minimun sequence **length**

13.1.2. Minimum identity **percent**

13.2. Then it Outputs a CSV file

14. blastn_blaster

14.1. It executes a BLASTn search

15. repetitive_blaster

16. specific_sequence_extractor

16.1. For each chromosome ID selected, it will take all the rows in the CSV file corresponding to that chomosome ID

17. specific_sequence_1000nt

17.1. For each chromosome ID, it will expand all the selected sequences in the CSV to 1000 nt

18. fasta_creator

18.1. It will create a FASTA file with the input CSV file

19. blastn_dic

19.1. It creates a BLAST database with the FASTA file it reads

20. genome_specific_chromosome_main

20.1. Calls all blasters and filters

21. boxymcboxface

21.1. Just some aesthetic display

22. csv_mixer

22.1. Mixes 2 CSV files into one

23. folder_creator

23.1. Creates a folder in the current path

of Registreren