
Unsupervised identification of redundant domain entries in InterPro database using clustering techniques

  • Middle East Technical University
  • European Molecular Biology Laboratory

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

InterPro is a widely used database that integrates functional signatures from different protein sequence annotation databases, together with manual curation, in order to present a comprehensive database of functional sequence annotation. However, integrating the signatures leads to inconsistent and/or redundant annotations in some cases. In this study, we propose an unsupervised method for the automatic detection of inconsistent and redundant entries in the InterPro database. Two clustering methods, the Markov Cluster Algorithm (MCL) and hierarchical clustering, are employed to investigate to what extent such entries can be detected. Results show that a considerable proportion (~75%) of redundant entries can be identified. The future goal is to develop a system that identifies redundant and inconsistent signatures with very high performance using supervised machine learning techniques. The findings of the study may help InterPro curators fix the problematic entries. They may also serve curators as a road map before new signatures are integrated.
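As a rough illustration of the hierarchical clustering idea mentioned in the abstract, the sketch below groups entries whose matched protein sets overlap heavily. It is not the authors' method: the abstract does not state which similarity measure or linkage was used, so the Jaccard distance, the average linkage, the 0.3 cut-off, and the entry and protein names (`entry_A`, `P1`, and so on) are all assumptions made for this example.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """1 - |a & b| / |a | b|; two empty sets are treated as identical."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def average_linkage_clusters(entries, max_distance):
    """Agglomerative clustering over hypothetical entries.

    entries: dict mapping an entry name to the set of proteins it matches.
    Repeatedly merges the two closest clusters (average linkage) and stops
    once the closest pair is farther apart than max_distance.
    """
    clusters = [[name] for name in sorted(entries)]

    def linkage(c1, c2):
        dists = [jaccard_distance(entries[x], entries[y]) for x in c1 for y in c2]
        return sum(dists) / len(dists)

    while len(clusters) > 1:
        (i, j), best = min(
            (((a, b), linkage(clusters[a], clusters[b]))
             for a, b in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if best > max_distance:
            break
        merged = sorted(clusters[i] + clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return sorted(clusters)


# Toy data (assumed, not taken from InterPro).
toy_entries = {
    "entry_A": {"P1", "P2", "P3", "P4"},
    "entry_B": {"P1", "P2", "P3", "P4", "P5"},
    "entry_C": {"P6", "P7"},
    "entry_D": {"P8", "P9", "P10"},
}

clusters = average_linkage_clusters(toy_entries, max_distance=0.3)
# Clusters with more than one member are candidate redundant groups.
candidates = [c for c in clusters if len(c) > 1]
print(candidates)
```

In this toy setting, `entry_A` and `entry_B` share four of five proteins, so they end up in one cluster and would be flagged for a curator to review. The other two entries stay on their own.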

Original language: English
Title of host publication: BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
Publisher: Association for Computing Machinery, Inc
Pages: 505-506
Number of pages: 2
ISBN (Electronic): 9781450338530
DOIs
Publication status: Published - 9 Sep 2015
Externally published: Yes
Event: 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2015 - Atlanta, United States
Duration: 9 Sep 2015 to 12 Sep 2015

Publication series

Name: BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics

Conference

Conference: 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2015
Country/Territory: United States
City: Atlanta
Period: 9/09/15 to 12/09/15
