Skip to content

Stop codon included in CDS features of ORFanage GTF output #9

@nuoxuxu

Description

@nuoxuxu

Hi Ales,

Thank you so much for developing ORFanage! I have some suggestions for the output format. Currently the ORFanage output GTF file includes stop codons in the CDS features. This could potentially cause problems in downstream tools since it is assumed that the GTF file format does not include stop codons in the CDS feature (see https://agat.readthedocs.io/en/latest/gxf.html#main-points-and-differences-between-gtf-formats).

To adjust the CDS coordinate, I used agat_sp_add_start_and_stop to add the "stop_codon" feature, and then used agat_convert_sp_gff2gtf to convert the resulting gff3 file to a gtf file, from which the CDS coordinates are adjusted so that stop codons are not included.

It would be very helpful to state somewhere in the documentation that the ORFanage output gtf includes stop codons in the CDS features.

Another suggestion is that the current ORFanage gtf output does not included "gene_id" for features like "exon" and "CDS", this could potentially cause problems in downstream tools such as agat, since it is assumed that all features of a GTF file have a gene_id and a transcript_id attribute (see here).

Best,
Nuo

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions