The Gene Ontology (GO) is used to achieve information about gene and protein functions by using a structured vocabulary of terms (GO Terms). GO Terms are related to biological concepts such as proteins or genes through the annotation process. There exist many different annotation processes identified by different evidence codes (EC). Annotated data are stored in public databases such as the Gene Ontology Annotation (GOA) database. Each term has a different specificity also referred to as Information Content (IC) of terms. Both the structure of GO and the corpora of annotation are continuously subject to change due to novel experimental findings. This process is often referred to as ontology evolution. This work focuses on how changes of annotations affect the IC of terms. The study confirms that statistically significant difference among many whole GOA versions exists on each species. Furthermore, there is also a statistically significant difference considering MF taxonomy for human, yeast, worm and fly. These results convey that annotation corpora changes have a high impact on IC.

Biases in information content measurement of gene ontology terms

Milano M;Agapito G;Guzzi PH;Cannataro M
2014-01-01

Abstract

The Gene Ontology (GO) is used to achieve information about gene and protein functions by using a structured vocabulary of terms (GO Terms). GO Terms are related to biological concepts such as proteins or genes through the annotation process. There exist many different annotation processes identified by different evidence codes (EC). Annotated data are stored in public databases such as the Gene Ontology Annotation (GOA) database. Each term has a different specificity also referred to as Information Content (IC) of terms. Both the structure of GO and the corpora of annotation are continuously subject to change due to novel experimental findings. This process is often referred to as ontology evolution. This work focuses on how changes of annotations affect the IC of terms. The study confirms that statistically significant difference among many whole GOA versions exists on each species. Furthermore, there is also a statistically significant difference considering MF taxonomy for human, yeast, worm and fly. These results convey that annotation corpora changes have a high impact on IC.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12317/19262
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 0
social impact