Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

177 improve annotation mapping through readgmt #206

Merged
merged 2 commits into from
Oct 10, 2024

Conversation

axelklenk
Copy link
Collaborator

This PR finally closes #177

Gene ID type can be set and retrieved for all expression data types and containers and gene sets represented as lists of character vectors using the method gsvaAnnotation()

A new function guessGeneIdType() can be used to automatically derive the gene ID type of a list of gene sets.

A new function geneIdsToGeneSetCollection() can be used to construct a GeneSetCollection object from a list of character vectors.

Function readGMT() makes use of these new features and can automatically guess the gene ID type of a GMT file from MSigDb and add this metadata to the imported gene sets, regardless of their format (GeneSetCollection or list).

The metadata is used to perform gene ID mapping for gene sets with arbitrary representations of expression data and gene sets, provided that an appropriate annotation database, e.g. organism annotation, is available.

…sGeneIdType() and geneIdsToGeneSetCollection(); update function readGMT() to make use of these new features
@axelklenk axelklenk self-assigned this Oct 10, 2024
@axelklenk axelklenk merged commit 909787d into devel Oct 10, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

Successfully merging this pull request may close these issues.

Improve annotation mapping through readGMT()
1 participant