-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CV term for frames picked in IM dimension #361
Comments
It's a larger issue about the data model of an ion mobility frame.
These are missing from the ion mobility + DIA acquisition modes for mzML document located at https://docs.google.com/document/d/13TpfPOQGShp-RsDPB-FE6abLtipzopHXkLZ3jP3FAA0/edit?usp=sharing I haven't seen many tools that do IM feature detection that write out the results in mzML without going all the way through to collapsing the IM dimension and producing flat centroid peak lists. There isn't an ecosystem of producers/consumers to justify it in the same way we produce/consume profile and centroid spectra in mzML. We could add children of New terms:
|
I slapped together an example using a timsTOF data file I had lying around: ims_example.mzML.gz that was converted using ProteoWizard in 3D format. By index:
I don't have an example here for edit: fixed flipped term caught by Timo |
@timosachsenberg for the OpenMS use-case you were discussing, is the ion mobility peak picker attempting to centroid in both dimensions or do feature detection in the IM dimension over centroids in the m/z dimension? |
Hi @mobiusklein , I took a quick look, and for 0 you probably mean "ion mobility profile frame"? I think this would make for a good cv term! Regarding your question: Different components within OpenMS/OpenSWATH handle feature detection and IM differently. For DDA, one focus is adapting existing algorithms to be compatible with IM data. To make this easy, we want to centroid in the m/z and IM dimensions first (pure centroiding without looking at isotopic patterns) and then running/adapting existing algorithms. I hope I understood everything correctly, but I think this approach would give rise to a bit different CV terms because we do this data reduction step from raw to centroided m/z and IM in the beginning (e.g., during loading already). Regarding "I don't discard the profile information because I might want to know the width of the IM feature for grouping with MS2 features." - this is an interesting idea I also thought about. I just wasn't sure if this information is needed at this point - do you have some experience here? We will keep that in mind. |
Thank you for the feedback. You're right, I wrote I may be using different nomenclature, I suspect what I call a feature, you call a mass trace, and what you call a feature is what I treat as a set of features describing an isotopic pattern. If I understood correctly, you're looking to first port components to produce an What would be the representation you would want to write the output in, or would that be RE the last point, I wanted to retain the whole feature profile because I wanted to be able to decouple the "IM trace extraction", "IM isotopic pattern fitting/charge deconvolution" and "RT feature extraction" stages of a program. Later I wanted to build something DIAUmpire-esque for Waters MSe data where knowing if ion mobility peaks overlapped and by how much along with retention time overlap. Naturally, it was also handy for data exploration and understanding the data shape between each phase. I think this is less of an issue for Bruker DIA-PASEF since you still have isolation windows. By "needed at this point", do you mean that there's no ecosystem of tools consuming these intermediary representations because those available today just do the whole process end-to-end? |
Yeah, I think we just use different nomenclature.
Exactly.
Makes sense.
I was mainly referring to keeping the full IM profile for further processing. We currently don't do this in OpenMS but I see that it makes sense for some data/instruments. Thanks again for sharing your insights! |
Describe the question or discussion
Hi,
I may have missed something, but I could not find a suitable CV term for a spectrum obtained by centroiding/peak picking a frame – essentially a concatenated spectrum with a float data array – in the ion mobility (IM) dimension. The resulting, much sparser spectrum has a representation that is essentially the same as the original frame. Without an appropriate CV term, one would need to inspect the actual data to distinguish between raw data in concatenated format and IM picked data.
The text was updated successfully, but these errors were encountered: