Comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometry (GC × GC-TOFMS) is one the most powerful analytical platforms for chemical investigations of complex biological samples. It produces large datasets that are rich in information, but highly complex, and its consistency may be affected by random systemic fluctuations and/or changes in the experimental parameters. This study details the optimization of a data processing strategy that compensates for severe 2D pattern misalignments and detector response fluctuations for saliva samples analyzed across 2 years. The strategy was trained on two batches: one with samples from healthy subjects who had undergone dietary intervention with high/low-Maillard reaction products (dataset A), and the second from healthy/unhealthy obese individuals (dataset B). The combined untargeted and targeted pattern recognition algorithm (i.e., UT fingerprinting) was tuned for key process parameters, the signal-to-noise ratio (S/N), and MS spectrum similarity thresholds, and then tested for the best transform function (global or local, affine or low-degree polynomial) for pattern realignment in the temporal domain. Reliable peak detection achieved its best performance, computed as % of false negative/positive matches, with a S/N threshold of 50 and spectral similarity direct match factor (DMF) of 700. Cross-alignment of bi-dimensional (2D) peaks in the temporal domain was fully effective with a supervised operation including multiple centroids (reference peaks) and a match-and-transform strategy using affine functions. Regarding the performance-derived response fluctuations, the most promising strategy for cross-comparative analysis and data fusion included the mass spectral total useful signal (MSTUS) approach followed by Z-score normalization on the resulting matrix.
Extending the breadth of saliva metabolome fingerprinting by smart template strategies and effective pattern realignment on comprehensive two‑dimensional gas chromatographic data
Simone Squara;Andrea Caratti;Carlo Bicchi;Massimo Collino
;Chiara Cordero
2023-01-01
Abstract
Comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometry (GC × GC-TOFMS) is one the most powerful analytical platforms for chemical investigations of complex biological samples. It produces large datasets that are rich in information, but highly complex, and its consistency may be affected by random systemic fluctuations and/or changes in the experimental parameters. This study details the optimization of a data processing strategy that compensates for severe 2D pattern misalignments and detector response fluctuations for saliva samples analyzed across 2 years. The strategy was trained on two batches: one with samples from healthy subjects who had undergone dietary intervention with high/low-Maillard reaction products (dataset A), and the second from healthy/unhealthy obese individuals (dataset B). The combined untargeted and targeted pattern recognition algorithm (i.e., UT fingerprinting) was tuned for key process parameters, the signal-to-noise ratio (S/N), and MS spectrum similarity thresholds, and then tested for the best transform function (global or local, affine or low-degree polynomial) for pattern realignment in the temporal domain. Reliable peak detection achieved its best performance, computed as % of false negative/positive matches, with a S/N threshold of 50 and spectral similarity direct match factor (DMF) of 700. Cross-alignment of bi-dimensional (2D) peaks in the temporal domain was fully effective with a supervised operation including multiple centroids (reference peaks) and a match-and-transform strategy using affine functions. Regarding the performance-derived response fluctuations, the most promising strategy for cross-comparative analysis and data fusion included the mass spectral total useful signal (MSTUS) approach followed by Z-score normalization on the resulting matrix.File | Dimensione | Formato | |
---|---|---|---|
Squara_et_al-2023-Analytical_and_Bioanalytical_Chemistry.pdf
Accesso aperto
Descrizione: full text
Tipo di file:
PDF EDITORIALE
Dimensione
2.59 MB
Formato
Adobe PDF
|
2.59 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.