Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Empty genbank produced by gff2gbSmallDNA.pl #419

Open
katharinedickson opened this issue Aug 1, 2024 · 1 comment
Open

Empty genbank produced by gff2gbSmallDNA.pl #419

katharinedickson opened this issue Aug 1, 2024 · 1 comment

Comments

@katharinedickson
Copy link

I would like to train Augustus on three eukaryotic genomes. I have the following for each:

  • a repeat-masked FASTA
  • a GFF file of coding regions, exons, start codons, and stop codons

When I feed the program a command for one of them,
/filepath/gff2gbSmallDNA.pl /filepath/Neolan1_GeneCatalog_genes_20200610.gff /filepath/Neolan1_AssemblyScaffolds_Repeatmasked.fasta 0 /filepath/N_lanati.genbank

I get blank genbank files as output. I have checked and rechecked the filepaths by copy-pasting them from my script into the head command; please assume these are correct.

What is the most likely problem?

@MarioStanke
Copy link
Contributor

That the GFF is not in a suitable format. GFF is not fully specified. It could also be a chromosome name mismatch.
I suggest you try the command with the files genome.fa and traingenes.gff from examples/autoAug/. This should work. And then you can look at the differences between your format and the one from the example and reformat if different.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants