Potential modifications to signature format for major releases #268
Comments
Relevant? https://github.com/marbl/Mash/blob/38d7abd3e545eafd067e627ef55fa171f1fc674c/src/mash/schema-1.0.0.json
(for specifying what we want in the signatures?)
…On Sat, Jun 3, 2017 at 10:40 AM, C. Titus Brown ***@***.***> wrote:
Meta-issue to group together all of the issues that propose to modify the signature format for a sourmash 2.0 release.
add accession/taxonomy info per #195
add actual k-mers strings per #211
add number of bp & number of sequences that went into signature #246
require a license in each signature file #242
provide at least some minimal provenance tracking e.g. #230 (source of data) and maybe command used to generate if e.g. k-mer trimming or preprocessing was done
Other ideas:
Also: provide some sort of verification/signature tester so that before making a bunch of
a great discussion on hierarchical vs tag-based file systems, which is very relevant for how to store metadata too: https://www.nayuki.io/page/designing-better-file-organization-around-tags-not-hierarchies
also, remove email (done in #335).
also add an md5sum for the input sample.
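A minimal sketch of how an md5sum for the input sample might be computed, assuming the sample is a single file on disk. The helper name `md5sum_file` and the chunk size are illustrative choices and are not part of sourmash:

```python
import hashlib


def md5sum_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`.

    Hypothetical helper, not a sourmash function. The file is read in
    chunks so that large sequence files do not need to fit in memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel keeps reading until read() returns b"".
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

The resulting hex string could then be stored alongside the signature so that a signature can later be matched back to the exact input file it was computed from.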
@luizirber here's a proposal - any thoughts? https://hackmd.io/KYdgjATARiCcwFoBmIBsAGBAWYssIENhUBWBEAZhOAgqiQA4pYGg
current text:
Sourmash signatures - metadata thoughts
tl;dr? Keep the core signature format lean and mean, with a few required fields; put other stuff in the
Required fields in a signature:
Reserved block names. We should identify some reserved block names that have special meaning. Obvious ones include:
Content of these should be more completely described and then encoded in software & a software validator.
Reserved metadata block proposal:
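As a rough illustration of the "software validator" idea, here is a sketch that checks a signature record for required fields and for well-formed reserved metadata blocks. The field names (`license`, `input_md5`), the `metadata` key, and the reserved block names (`taxonomy`, `provenance`) are assumptions chosen from topics raised in this thread; they are not the actual lists from the proposal and not the sourmash format:

```python
# Hypothetical names for illustration only; the real required fields and
# reserved block names would come from the finished proposal.
REQUIRED_FIELDS = ("license", "input_md5")
RESERVED_BLOCKS = ("taxonomy", "provenance")


def validate_record(record):
    """Return a list of human-readable problems found in `record`.

    An empty list means the record passed every check in this sketch.
    """
    errors = []
    for field in REQUIRED_FIELDS:
        if field not in record:
            errors.append(f"missing required field: {field}")

    # Optional, free-form metadata lives under one key (an assumption).
    metadata = record.get("metadata", {})
    if not isinstance(metadata, dict):
        errors.append("metadata must be a mapping")
        return errors

    # Reserved blocks are optional, but must be mappings when present.
    for name in RESERVED_BLOCKS:
        if name in metadata and not isinstance(metadata[name], dict):
            errors.append(f"reserved block {name!r} must be a mapping")
    return errors
```

Returning a list of problems rather than raising on the first one lets a checker report everything wrong with a batch of signatures in a single pass.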
I am thinking this should be punted to 3.0, given all the other stuff we have in 2.0 already.
punted to... well, 4.0? 5.0? whatever :)
I'm starting to think that maybe a good goal would be to add a flexible selector framework (ref #1072) that would let us do keyword searches to select subsets of signatures based on tags, taxonomy, etc. This could integrate well with a folksonomy-style tagging approach.
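To make the tag-based selection idea concrete, here is a small sketch, assuming each signature is represented as a dict carrying a free-form `tags` list. Both that layout and the function name `select_by_tags` are hypothetical; this is not the design in #1072 and not an existing sourmash API:

```python
def select_by_tags(records, required_tags):
    """Return the records whose tags include every tag in `required_tags`.

    Each record is assumed to be a dict with an optional "tags" list
    (a hypothetical layout). Records without tags match only when
    `required_tags` is empty.
    """
    required = set(required_tags)
    return [r for r in records if required <= set(r.get("tags", ()))]


# Example: pick out signatures tagged both "marine" and "metagenome".
records = [
    {"name": "sample-a", "tags": ["marine", "metagenome"]},
    {"name": "sample-b", "tags": ["marine"]},
    {"name": "sample-c"},
]
selected = select_by_tags(records, ["marine", "metagenome"])
```

Because tags are plain strings with no fixed hierarchy, new categories can be added without changing the selector, which is the main appeal of a folksonomy-style approach.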
Meta-issue to group together all of the issues that propose to modify the signature format for a sourmash 2.0 release.
"mrnaseq"
? #58