-
Notifications
You must be signed in to change notification settings - Fork 161
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH] BEP031 - New entity: sample and samples.tsv file #812
[ENH] BEP031 - New entity: sample and samples.tsv file #812
Conversation
@mariehbourget - shouldn't this PR also describe a samples.tsv file providing additional details about a sample? |
Hi @satra, |
they would likely have to be merged at the same time, so perhaps doing it together instead of linking would work better for review. @bids-standard/maintainers thoughts? |
I suggest to add the |
Sounds good to me. |
Hi everyone! |
@mariehbourget - this looks good. just a few clarification comments. |
Couple of open questions that relate more to where those changes should go rather than about their content: Currently those changes read to me like the sample entity will be made "available" for ALL modalities. Describing it in the common principles puts it at the same level as the other entities that are described there and that are pretty much available to all modalities ( Should we possibly only have the entity defined only the in the entity table? Or do we feel that sample is a concept important enough that it deserves to be defined in the common principle? I think a good case can be made for the latter but I just want to make sure we are on the same page. Similarly putting the description of the So maybe it should go somewhere else? What has been done so far is to duplicate, into their respective page, a description of the file: for example with the As a user or reader of the spec this is usually fine because this concentrates all the info about a modality in a single page. As maintainer I am not a big fan of duplication. The other problem is that we don't even have a microscopy or ephys page so far to put this in. I would not be against the creation of a new page where we can put information about things that are shared by some but not all modalities: this could definitely help with "refactoring". That latter point would probably deserves its own issue and PR, so feel free to disregard this for this PR. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some quick thoughts. I'll try to have a more detailed read this week.
@Remi-Gau, thank you for the feedback about where those changes should go, those are important questions and I’m glad to discuss them here.
The rationale for this was twofold:
On the other hand, since no modality is using the
For the same reason as above, I put Finally, as you mentioned, without an official page for microscopy yet, that information has nowhere to go! I am not against creating a page for microscopy with only the So I’m open to make changes to facilitate the integration and maintenance of the specs but I'm not sure what is best, let me know your thoughts. Thanks! |
+1 on listing the samples.tsv file in the modality-agnostic files section ; on top of @mariehbourget 's arguments, I think any future modality that can be recorded in animal models will probably use this file... |
Co-authored-by: Chris Markiewicz <[email protected]>
Hi @bids-standard/maintainers, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks everyone who worked on this BEP -- looks great. My only concern is a bit of "special casing" samples
as to be promoted to the top level, although it is "per-subject" level description.
FWIW: Following my "generalization blurb" at #751 (comment) , I still think that it might have been more consistent if as long as we do not have sample-<sample_id>/
top level directories, to have sub-<label>/sub-<label>_samples.{tsv,json}
(and with inheritance principle do allow for samples.json
on top) analogous to _scans.{tsv,json}
we have.
But if, as proposed in this PR, subject_id
column is retained in such *samples.tsv
at any level -- we "kinda have it", just that this PR promotes it to the top level via inheritance principle, so no conflict for the future if we do decide to support sub-<label>/sub-<label>_samples.{tsv,json}
etc. Thus -- approve! ;)
@yarikoptic I initially (#779 (comment)) preferred using a per-subject table or the for a global table that applied to all via the inheritance principle, but the BEP working group argued that a global table would need to be recomputed by every tool, so dropped the objection (#779 (comment)). |
Thank you everyone for the reviews! Also, this is our first PR for BEP031, are there other things we have to do before the merge? Thanks! |
This looks great to me, and very much in line with the discussions of the BEP031 working group. Thank you @mariehbourget and all the BIDS maintainers who helped with it. Exciting to see the first PR from BEP031 is about to come to life! |
@mariehbourget @jcohenadad @SylvainTakerkart @JuliaSprenger @satra @Remi-Gau @sappelhoff is there any overlap with https://bids-specification.readthedocs.io/en/stable/04-modality-specific-files/08-genetic-descriptor.html SampleOrigin: Describes from which tissue the genetic information was extracted |
Hi @CPernet, For example, a participant The There are no suggested columns in Please let me know if that answer your question or if your have any other concerns. |
ok cool - I just wanted to make sure the names and possibly ontologies to pick information from are in agreement (and by that I mean we can amend genetic if that's easier, re cell type for instance) - thx |
Definitely a good thing to keep genetics concepts in mind as we add microscopy and electrophysiology. Thanks for the nudge, @CPernet! |
i think the sample stuff can be pulled over into the genetic side too, or the columns of samples.tsv augmented with some of the keys. spatial proteomics and transcriptomics across tissues/samples from humans and others are going to share a bunch of things. |
Dear BIDS community,
Context
As part of the development of the Microscopy BEP (BEP031 @mariehbourget, @jcohenadad) and Animal Ephys BEP (BEP032 @SylvainTakerkart, @JuliaSprenger), the “sample” entity was introduced in order to distinguish different tissue samples from the same subject.
In brief, the scope of the BEP031 proposal is to extend the Brain Imaging Data Structure (BIDS) specification to 2D and 3D microscopy images of histological samples, both ex vivo and in vivo. In this context, we were advised to split the BEP into smaller separate PRs when possible.
Contribution
The purpose of this PR is to add a new “sample” entity and
samples.tsv/json
files with required column (sample_type
) and recommended columns (pathology
,derived_from
).The "sample" entity is described as:
See issue #779 for related discussions on this topic.
Related PR #816 for new columns in
participants.tsv
file.