-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cross-fraction XIC normalization #344
Comments
Dear @antortjim - thank you for your issue. I will look at it next week (can't do right now), but I definitely like the idea to make |
Dear @lgatto, thanks for answering so fast! In the meantime I am trying out https://github.com/statOmics/MSqRob (which depends on MSnBase for the data structures), and I implemented a custom L-M minimisation with Python's scipy. I am not very sure it is correct though, but I can share it. Cheers |
Dear @antortjim - thank you for your issue. I meant to get back to you already a long time ago but than forgot. It would be great to hear more about your L-M minimisation script. |
Dear @lgatto - thanks for looking back into this! I guess it's summer ;) I don't have time this week as I am submitting my Master thesis on Monday, but next week I have plenty to try get it to work. I am excited to at least attempt to contribute. Imports the required modules including the
-Map every column in the moFF file (one per RAW file) to a sample i.e H1-L3 in the
Use the first
Define the equation to be minimised as a function of N, the normalizing factors.
Constrain the values that the normalizing factors can take to
Find the root (=minimise) using the lm method (Levenberg–Marquardt), constraining each root to the 0-20 interval. The starting value is Code not here: They can be used to multiply each column in the moFF file and then sum those representing the different fractions of the same sample. The final result is a new table with the number of columns equal to the number of conditions and replicates, 6 in this dataset (H1, H2, H3, L1, L2, L3,).
Attached a zip with the full Python script. |
I forgot to say that I don't think it is correct as I don't get the right results. See my second comment here statOmics/MSqRob#49 |
Hi there!!
First of all, thanks for developing an open source tool in R that can handle such complex datasets as those recurrent in proteomics and mass spec.
I am working with the MaxLFQ dataset (paper: http://www.mcponline.org/content/13/9/2513.long) where the developers of the MaxQuant suite mixed the HeLa and E.coli proteomes in 1:1 and 1:3 ratios. One of the crucial steps in their pipeline is the delayed normalization of XICs (extracted ion currents i.e MS1 intensities of the cluster feature detected in the apex intensity extraction step) across fractions of the same sample (for a total of 24 fractions) using a Levenberg-Marquandt minimization approach. The normalization factors used to sum up the peptide intensities across fractions are then set to the values that minimise the overall proteome variance, so called in the paper H(N). It's a nice approach to the problem they formulated in the paragraph below:
I've had a look at the MSnbase documentation, and the issue of sample fractions is mentioned in two sections:
https://bioconductor.org/packages/release/bioc/vignettes/MSnbase/inst/doc/MSnbase-demo.html#2_data_structure_and_content
regarding how to load multiple spectra files in either
mzData
,mzXML
andmzML
formats usingreadMSData()
https://bioconductor.org/packages/release/bioc/vignettes/MSnbase/inst/doc/MSnbase-demo.html#13_combining_msnset_instances
regarding how to combine multiple runs of the same sample i.e technical replicates if I understood correctly.
However, in none of them is this specific issue of cross-fraction XIC normalization addressed.
Is there any readily available function in the package that could read a table similar to the moFF output:
where rows represent peptides and columns represent
Enabling this feature would make it possible to connect moFF output to MSnbase in a straightforward way in datasets where sample fractionation across runs is present.
Thank you very much for your help beforehand
Cheers,
Antonio
The text was updated successfully, but these errors were encountered: