The investigating functional molecular pathways of an organism are at the basis of the understanding of complex microbial ecosystems. In the case of unicellular organisms, the description of protein pathways provides information on the functionality of a microbial consortium. We recently developed a semi-automated pipeline for the analysis of metaproteomics data to depict the newborn mouse gut phylotypes.1 Here we present a refined bioinformatics tool to extend the analysis to a more interesting description of the metabolic functions correlated with the characterized microbial taxa, using a typical PDO Italian cheese as model. Metaproteomics raw data are analyzed to obtain a representation of the bacterial population at different taxonomic levels on the bases of the taxon-specificity of a tryptic peptide list, comparing it with that coming from the already available metaproteomics and metagenomics applications such as Megan2. As a further development we managed for the automated association of taxa to metabolic pathways (KEGG database) and of proteins to groups of cluster orthologs (COG). We optimized parameters to have the maximum number of protein and minimum FDR, and then developed a series of python scripts to integrate and improve, the output of available application, and manipulate raw data. This investigation aims at providing a functional insight in metagenomic analysis, and at offer a direct evaluation of protein functional pathways which are actually controlling the consortia homeostasis.
Bioinformatics pipeline for metaproteomics data analysis : investigation on microbial populations and their respective functional role in cheese
C. Piras;P. Roncada
2014-01-01
Abstract
The investigating functional molecular pathways of an organism are at the basis of the understanding of complex microbial ecosystems. In the case of unicellular organisms, the description of protein pathways provides information on the functionality of a microbial consortium. We recently developed a semi-automated pipeline for the analysis of metaproteomics data to depict the newborn mouse gut phylotypes.1 Here we present a refined bioinformatics tool to extend the analysis to a more interesting description of the metabolic functions correlated with the characterized microbial taxa, using a typical PDO Italian cheese as model. Metaproteomics raw data are analyzed to obtain a representation of the bacterial population at different taxonomic levels on the bases of the taxon-specificity of a tryptic peptide list, comparing it with that coming from the already available metaproteomics and metagenomics applications such as Megan2. As a further development we managed for the automated association of taxa to metabolic pathways (KEGG database) and of proteins to groups of cluster orthologs (COG). We optimized parameters to have the maximum number of protein and minimum FDR, and then developed a series of python scripts to integrate and improve, the output of available application, and manipulate raw data. This investigation aims at providing a functional insight in metagenomic analysis, and at offer a direct evaluation of protein functional pathways which are actually controlling the consortia homeostasis.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.