Add concat fastqs from SRA manifest #227
Also, I don't like this hard-coded column 6, but I don't know how to avoid it because it is a column parameter. |
I've incorporated a couple of suggestions into https://usegalaxy.eu/u/wolfgang-maier/w/sralisttoconcatenatedfastqs-imported-from-url, specifically:
I can confirm the remaining problem besides the |
Meanwhile, I was wondering whether we should start by using the 'cut' tool to keep only the first column (the SRA accessions) and the column with the IDs the user wants. Then we could always use column 2 downstream. What do you think? |
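To make the 'cut' idea above concrete, here is a minimal Python sketch, not the workflow's actual tool step. It assumes a tab-separated manifest with the accession in column 1. The function name `keep_accession_and_id` and the accessions and sample names are invented for illustration.

```python
import csv
import io


def keep_accession_and_id(manifest_text, id_column):
    """Keep column 1 (accession) and the 1-based `id_column` of a TSV manifest.

    Hypothetical helper: it mimics what a 'cut' step would do, so that every
    downstream step can rely on the sample ID being in column 2.
    """
    reader = csv.reader(io.StringIO(manifest_text), delimiter="\t")
    return [[row[0], row[id_column - 1]] for row in reader if row]


# Invented example manifest: accession in column 1, sample ID in column 6.
manifest = (
    "SRR0000001\tx\ty\tz\tw\tsampleA\n"
    "SRR0000002\tx\ty\tz\tw\tsampleA\n"
    "SRR0000003\tx\ty\tz\tw\tsampleB\n"
)
reduced = keep_accession_and_id(manifest, id_column=6)
```

With this layout the user-chosen column is consumed only once, at the very first step, and no later step needs a hard-coded column 6.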
Datamash is indeed a good improvement. |
@wm75 I have an issue with 'sra_manifest.tabular'. I cannot get this datatype as output from 'split_file_to_collection': And it is the same with the tool |
The tests pass with the master branch of planemo. Should we release planemo? |
@PierreOsteil for your information |
Yes, splitting to a collection will always produce |
The workflow is working. However, one of the fasterq-dump outputs, whether for a single accession number or for a list of accession numbers, is a list:paired collection. When we run this in parallel we therefore get a list:list:paired collection, and then we run into galaxyproject/galaxy#16878. |
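The nesting described above can be illustrated with plain Python dictionaries. This is only a toy model of collection structure, not Galaxy code: `fake_fasterq_dump`, `nesting_depth`, and the accessions are all invented for the illustration.

```python
def fake_fasterq_dump(accessions):
    """Stand-in for one fasterq-dump run: returns a list:paired-like structure.

    Outer level = one element per accession (the 'list'),
    inner level = forward/reverse files (the 'paired').
    """
    return {
        acc: {"forward": f"{acc}_1.fastq", "reverse": f"{acc}_2.fastq"}
        for acc in accessions
    }


def nesting_depth(collection):
    """Count nested dict levels; a plain string (a dataset) has depth 0."""
    if not isinstance(collection, dict):
        return 0
    return 1 + max((nesting_depth(v) for v in collection.values()), default=0)


# Invented accessions grouped per sample.
per_sample = {
    "sampleA": ["SRR0000001", "SRR0000002"],
    "sampleB": ["SRR0000003"],
}

single_run = fake_fasterq_dump(per_sample["sampleA"])  # list:paired
mapped_over = {s: fake_fasterq_dump(a) for s, a in per_sample.items()}  # list:list:paired
```

In this model `single_run` has two levels and `mapped_over` has three. That extra outer level is the one that appears when the step is run once per sample.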
I'm pretty sure we can fix the Galaxy side before this ... and in either case I'm happy to merge the workflow now if you're happy with it. |
We still need to solve an issue @wm75 identified, which is to make sure we use identifiers that pass the 'relabel' step. |
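As a rough sketch of what making identifiers safe could look like, the following Python function replaces every character outside an allowed set. The default allowed set (letters, digits, underscore) is an assumption made for illustration and is not the Relabel tool's documented rule. `sanitize_identifier` is a hypothetical name, and this is not the awk step used in the workflow.

```python
import re


def sanitize_identifier(name, allowed=r"A-Za-z0-9_", replacement="_"):
    """Replace every character not in `allowed` with `replacement`.

    `allowed` is the inside of a regex character class. The default is an
    assumption for this sketch, not the Relabel tool's actual rule.
    """
    return re.sub(f"[^{allowed}]", replacement, name)


cleaned = sanitize_identifier("sample A (rep 1)")
# A wider allowed set (also hypothetical) leaves dots and dashes untouched.
kept = sanitize_identifier("rep-1.fq", allowed=r"A-Za-z0-9_.\-")
```

One caveat with any such replacement: two different sample names can collapse to the same identifier, so duplicates would still need to be checked separately.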
I think I can include a fix into my awk step: |
@wm75, tell me if it is OK for you like this (keeping the awk step) or if you prefer that I change it. |
Since https://github.com/galaxyproject/galaxy/pull/8757/files, the allowed chars for the Relabel tool also include |
Let me check how complicated it would be to replace awk with Cut and Replace ... |
Good point. The documentation had not been updated. I've just written a PR to fix it. Let me know if you manage with Cut and Replace. |
I've adjusted the README a bit and I think we can handle hypothetical sample names with triple underscores in them correctly with just a minimal change to the APPLY_RULES regex (untested though).
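One way to see why a small regex change can be enough: if the combined identifier has the form `<sample>___<accession>` (an assumption for this sketch, since the actual APPLY_RULES regex is not shown here), a greedy first group makes the split happen at the last `___`, so any earlier triple underscores stay in the sample name. The Python below uses the standard `re` module, not Galaxy's rule builder, and `split_identifier` is a hypothetical name.

```python
import re

# Greedy `(.*)` consumes as much as possible, so the separator that is
# matched literally is the LAST "___" in the string.
SPLIT_ON_LAST_SEPARATOR = re.compile(r"^(.*)___(.+)$")


def split_identifier(identifier):
    """Split '<sample>___<accession>' at the last '___' (assumed layout)."""
    match = SPLIT_ON_LAST_SEPARATOR.match(identifier)
    if match is None:
        raise ValueError(f"no '___' separator in {identifier!r}")
    return match.group(1), match.group(2)


simple = split_identifier("sampleA___SRR0000001")
tricky = split_identifier("my___sample___SRR0000001")
```

A non-greedy first group, `(.*?)`, would instead split at the first `___` and cut the sample name short.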
Accept sample names with '___' Co-authored-by: Wolfgang Maier <[email protected]>
Thanks. I updated the tests... If they pass, we are ready. |
Youhou! May I click on merge? |
Great work @lldelisle ! |
I would say. Great collaboration! Thanks @wm75 |
Hmm, the merge failed with a failing test now:
@mvdbeek any idea why the results would be different from the within-PR testing? |
I've relaunched the CI and then I will check on eu if I can reproduce the error... |
It seems fixed. |
Yes, I assume a temporary job error. We'll have to rework the planemo testing code a little so we always get a report ... |
Here is a new workflow for the case where the samples have been sequenced in different runs.
Points to discuss: