1.b |
BUSCO |
|
This creates short_summary.txt. »> Tip: Run generate_plot to create a graph of the summary. «< |
2.a |
cd-hit-est |
-c 0.95 |
This clusters 95% similar transcript sequences together in the transcriptome file and creates a non-redundant sequence file |
2.b |
cd-hit-est |
-c 0.90 |
This clusters 90% similar transcript sequences together in the transcriptome file and creates a non-redundant sequence file |
2.c |
BUSCO |
|
This creates short_summary.txt. »> Tip: Run generate_plot to create a graph of the summary. «< |
3.a |
TransDecoder.LongOrfs |
> 100bp |
This creates 4 files. A cds file, a pep file, a gff file and a bed file. |
3.b |
BUSCO |
|
This creates short_summary.txt. »> Tip: Run generate_plot to create a graph of the summary. «< |
4.a |
BLASTP |
-max_target_seqs 1, -outfmt 6 |
This would create 20 files. (5P2) |
4.b |
RBBH - python script |
pull out the reciprocal best hits only. |
This would create 10 files. |
4.c |
Orthologs - python script |
pull out the orthologs among 5 species |
This would create 1 file and let us call this the orthologs file.» Note: The order in which you process the Best Hits file would affect the final number of orthologs.« |
5.a |
Annotation |
use gff file |
Add annotation to the orthologs file. |