Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bio::FeatureIO gff3 fails to parse canonical GFF3 file from GFF3 spec #8

Open
cjfields opened this issue Oct 8, 2015 · 3 comments
Open

Comments

@cjfields
Copy link
Member

cjfields commented Oct 8, 2015


Author Name: Keith James (Keith James)
Original Redmine Issue: 2429, https://redmine.open-bio.org/issues/2429
Original Date: 2008-01-03
Original Assignee: Bioperl Guts


I was getting this error from all my GFF3 files:

——————- EXCEPTION: Bio::Root::Exception ——————-
MSG: Oops! ID cds00001 exists more than once in your file!
STACK: Error::throw
STACK: Bio::Root::Root::throw /software/perl-5.8.8/lib/site_perl/5.8.8/Bio/Root/Root.pm:359
STACK: Bio::FeatureIO::gff::next_feature_group /software/perl-5.8.8/lib/site_perl/5.8.8/Bio/FeatureIO/gff.pm:206

Several code comments within Bio/FeatureIO/gff.pm indicate that it assumes an unique ID for each line. The spec states that an ID must be unique within a file, but one assumes that this means unique /per feature/ where a feature may be split over several lines. The example GFF in the spec supports this interpretation.

To confirm the bug I created a copy of the canonical GFF3 example from the spec and reproduced this error (see above); the cds00001 feature is split over 4 lines, each with the same ID.

This is with bioperl release bioperl-1.5.2_102 on Linux i686, Perl v5.8.8

@cjfields
Copy link
Member Author

cjfields commented Oct 8, 2015


Original Redmine Comment
Author Name: Keith James
Original Date: 2008-01-03T10:48:29Z


Created an attachment (id=841)
Canonical gene testcase from GFF3 spec

@cjfields
Copy link
Member Author

cjfields commented Oct 8, 2015


Original Redmine Comment
Author Name: Chris Fields
Original Date: 2008-01-07T11:27:31Z


Bio::FeatureIO and other GFF-related functionality are undergoing revision to parse/write GFF3-compliant data and will likely not make the 1.6 release. More realistically it will be in the 1.7 dev release series (which will start up immediately after 1.6 is released).

@cjfields
Copy link
Member Author

cjfields commented Oct 8, 2015


Original Redmine Comment
Author Name: Chris Fields
Original Date: 2010-05-15T10:28:39Z


Moving to Bio::FeatureIO-specific queue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant