-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add LUAD dataset as BBBC043 #52
Comments
BBBC041 is already taken by the malaria dataset! https://data.broadinstitute.org/bbbc/BBBC041/ so is 042 by another I'm working on. Can you go with 043? |
@shntnu Did this dataset ever get finalized? It was never added to BBBC. |
Not yet AFAIK. This is at least a few months out (it's ok @jccaicedo's plate) |
@AnneCarpenter said:
We set aside a BBBC identifier for it back in 2018 but didn't proceed to create a page at the time. The good news is that we've never said anywhere that the data is available on BBBC, but I know it's odd that we have an id for it but not a page. Going forward, we will follow a different process for profiling datasets going forward (below) @ErinWeisbart and I are making steady progress here https://github.com/orgs/broadinstitute/projects/27/views/3 and once we reach this awslabs/open-data-registry#1003, we will have settled on a process that will lead to a BBBC entry getting created for LUAD ---------- Forwarded message --------- Here is now the adapted version of the 4 steps (from my email to AWS); Beth will be looped in at step 3 1 C-S lab will add a row to https://broad.io/profiling_dataset We will refer to the dataset on IDR if it exists, otherwise RODA
|
Our plan for managing Cell Painting Gallery has been settled! https://new.ipwiki.app/project_profiler_and_datasets Profiling datasets will be listed only Cell Painting Gallery, and not on BBBC. However, for a few datasets for which
we should indeed create an entry in BBBC |
Great, I added the following to the https://new.ipwiki.app/project_profiler_and_datasets wiki page: "Relationship to BBBC and then added your comment above to the page linked as "BBBC". Hope this was a good approach. |
I thought we decided it was fine for BBBC to continue to point to Gallery data sets going forward, such that BBBC still has an ongoing record of good benchmark datat sets |
That was the initial plan
Erin might recollect better, but I think we concluded it is wisest to avoid creating yet another identifier for a dataset and instead just point to RODA as a whole. I like that idea because it avoids redundancy. |
From email thread "Re: Question regarding potential BBBC contribution"
|
As long as we aren't giving it a new identifier, and refer to it on the BBBC page as "cpg-whatever" instead of "BBBC-whatever", I don't see why we WOULDN'T more broadly advertise that these projects exist :) |
You're right – I think I had implicitly assumed that such a plan would require creating a new BBBC identifier for each new CPG dataset, but (you're right) there is no need to So you're saying you'd not only point to https://registry.opendata.aws/cellpainting-gallery/ (similar to the way we point to BBBC here, screenshot below) but also (selectively) list CPG datasets in the Profiling section of the BBBC index page? Sounds good to me; worth getting Erin to sign off on it because she has thought through everything |
I see, using the cpg identifier makes that idea make more sense to me.
There’s still the danger that we update some detail in one location and not
the other but that’s minor and it sounds like Beth doesn’t mind the extra
step/work Of adding to bbbc, so that’s all good. --
Sent from my mobile phone
|
I like the new idea, where we just say at the bottom of current list on
BBBC “Large image-based profiling datasets from 2022 onwards are catalogued
at: URL”
--
Sent from my mobile phone
|
I like doing that because then we need not list everything in the gallery on BBBC, but for data sets we care a lot about I also like listing it as a "real" thing on BBBC, albeit fine to be with a CPG identifier rather than a BBBC one, because CPG is not super "browsable", esp for a biologist vs a CS person. |
We should add a new entry to "Image-based Profiling" table
|
I'm afraid I'm being dense in understanding what those columns are. What is the difference between |
https://bbbc.broadinstitute.org/image_sets
|
ha! right, number of channels. time for another cup of coffee... for this dataset |
Great I've updated my previous comment. Now it's over to @bethac07 who can tag someone to update BBBC |
@jccaicedo Creating this issue as a placeholder so that we have an id for the LUAD dataset
The text was updated successfully, but these errors were encountered: