Skip to content

chromosome names in the annotations GTF file have to match chromosome names in the FASTA genome sequence files #2156

Answered by biounix
biounix asked this question in Q&A
Discussion options

You must be logged in to vote

I just found it somehow documented here by @alexdobin: the "chromosome names" in the FASTA file (the string after ">" and before the first space) has to match the first field string in the GTF.

Which is in agreement with commonly accepted FASTA format specifications (ref1, ref2). It's surprising that there doesn't seem to be an official definition of the format (ref3).

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by biounix
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant