-
According to the manual: chromosome names in the annotations GTF file have to match chromosome names in the FASTA genome sequence files However, it seems that STAR ignores everything after the first space. So I cannot find this behaviour documented, though. Later, it's stated that The tabs are not allowed in chromosomes’ names, and spaces are not recommended. Why spaces are not recommended? Thank you. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
I just found it somehow documented here by @alexdobin: the "chromosome names" in the FASTA file (the string after ">" and before the first space) has to match the first field string in the GTF. Which is in agreement with commonly accepted FASTA format specifications (ref1, ref2). It's surprising that there doesn't seem to be an official definition of the format (ref3). |
Beta Was this translation helpful? Give feedback.
I just found it somehow documented here by @alexdobin: the "chromosome names" in the FASTA file (the string after ">" and before the first space) has to match the first field string in the GTF.
Which is in agreement with commonly accepted FASTA format specifications (ref1, ref2). It's surprising that there doesn't seem to be an official definition of the format (ref3).