{rfName}
A

Indexed in

License and use

Altmetrics

Analysis of institutional authors

Tejedor-Calvo, EvaAuthorMarco-Montori, PedroAuthorGarcia-Barreda, SergiAuthor

Share

February 24, 2025
Publications
>
Article
No

A critical analysis of Adaptive Box-Cox transformation for skewed distributed data management: Metabolomics of Spanish and Argentinian truffles as a case study

Publicated to:Analytica Chimica Acta. 1345 343704- - 2025-04-01 1345(), DOI: 10.1016/j.aca.2025.343704

Authors: Sibono, Leonardo; Grosso, Massimiliano; Tejedor-Calvo, Eva; Casula, Mattia; Marco-Montori, Pedro; Garcia-Barreda, Sergi; Manis, Cristina; Caboni, Pierluigi

Affiliations

Dept Life & Environm Sci, Blocco A,Room 13, I-09042 Monserrato, Italy - Author
Univ Cagliari, Dept Mech Chem & Mat Engn, Via Marengo 2, Cagliari, Italy - Author
Zaragoza Univ, Agrifood Inst Aragon IA2, Agrifood Res & Technol Ctr Aragon CITA, Dept Plant Sci, Av Montanana 930, Zaragoza 50059, Spain - Author

Abstract

Background: Metabolic variations retrieved in metabolomic data are considered a benchmark for detecting biomatrix variability. Therefore, identifying target metabolites is crucial to keep track of any substrate modification and preserve it from any undesired alteration. Unfortunately, such a task can be negatively affected by detecting false positives, often triggered by complicated data distributions. In this work, we undertook an investigation of the metabolic profile of Spanish and Argentine truffles using a robust methodology. The issue of skewed data distributions has been effectively addressed through a normalization preprocessing, enhancing biomarker identification and samples classification. Results: A data normality-improved parametric test (ANOVA) was employed to define the target metabolites, which significantly vary between two regions of origin: Spain and Argentina. Specifically, Adaptive Box-Cox transformation was employed to improve the ANOVA test's performance so that data distributions were fitted to a Gaussian variable. Using the Bonferroni-Holm method for false discovery rate correction, we demonstrated the effectiveness of this transformation for the case under investigation. Results were compared with two non- parametric tests (Kruskall-Wallis and Permutation test), selected as a reference methodology, to provide a better understanding of non-normal distributions often encountered in metabolomic data analysis. 17 metabolites out of the 57 investigated metabolites exhibited notable variability across the two geographical regions. The validity of this methodology was supported through the discrimination of samples belonging to different groups. In this regard, both univariate and multivariate statistical models were tested through Monte Carlo simulations and yielded consistent results. Significance: Data analysis outcomes are sensitive to variables distributions. The present study shows an effective tool to increase data normality, thereby enhancing the statistical power for biomarker discovery and improving models' classification performances. These results find justification from the current knowledge within the field of food sciences, enabling their application in advancing research in the truffle analysis domain.

Keywords

AlimentosBiomarker discoverBiomarker discoveryData preprocessingEspectrometría de masasFoodGeographical originMarcadores genéticosMass spectrometryMetabolómicaMetabolomicsProcedenciaProcesamiento de datos

Quality index

Bibliometric impact. Analysis of the contribution and dissemination channel

The work has been published in the journal Analytica Chimica Acta due to its progression and the good impact it has achieved in recent years, according to the agency WoS (JCR), it has become a reference in its field. In the year of publication of the work, 2025, it was in position 16/111, thus managing to position itself as a Q1 (Primer Cuartil), in the category Chemistry, Analytical.

Independientemente del impacto esperado determinado por el canal de difusión, es importante destacar el impacto real observado de la propia aportación.

Según las diferentes agencias de indexación, el número de citas acumuladas por esta publicación hasta la fecha 2025-07-19:

  • WoS: 1
  • Scopus: 1

Impact and social visibility

From the perspective of influence or social adoption, and based on metrics associated with mentions and interactions provided by agencies specializing in calculating the so-called "Alternative or Social Metrics," we can highlight as of 2025-07-19:

  • The use, from an academic perspective evidenced by the Altmetric agency indicator referring to aggregations made by the personal bibliographic manager Mendeley, gives us a total of: 6.
  • The use of this contribution in bookmarks, code forks, additions to favorite lists for recurrent reading, as well as general views, indicates that someone is using the publication as a basis for their current work. This may be a notable indicator of future more formal and academic citations. This claim is supported by the result of the "Capture" indicator, which yields a total of: 8 (PlumX).

With a more dissemination-oriented intent and targeting more general audiences, we can observe other more global scores such as:

  • The Total Score from Altmetric: 0.5.
  • The number of mentions on the social network X (formerly Twitter): 1 (Altmetric).

It is essential to present evidence supporting full alignment with institutional principles and guidelines on Open Science and the Conservation and Dissemination of Intellectual Heritage. A clear example of this is:

Leadership analysis of institutional authors

This work has been carried out with international collaboration, specifically with researchers from: Italy.