Mapping in Ensembl - there’s always a catch!
One of my recent projects has been to map short sequences of DNA (tags) to specific positions in transcripts (ie. exons, introns, UTRs). Everything was going along nicely, until I was looking through my output data and saw something weird.
I had one sequence map itself to an exon and a 5′UTR - on the same transcript. This caused me some concern (and an afternoon of frustration), because it seemed that according to the Ensembl database, this one position is both an exon and a 5′UTR. I thought I had done something wrong in my code, such as calculating the start and end position of the UTRs wrong or pulling out the wrong starting position for the start of translation.
Seems that wasn’t the problem. What it turned out to be was partially my fault, but also partially the fault of the Ensembl gene annotation.
Posted: July 15th, 2007 under bioinformatics, hmmmmm.
Comments: none