
Joint visual-text modeling for automatic retrieval of multimedia documents

  • G. Iyengar
  • P. Duygulu
  • S. Feng
  • P. Ircing
  • S. P. Khudanpur
  • D. Klakow
  • M. R. Krause
  • R. Manmatha
  • H. J. Nock
  • D. Petkova
  • B. Pytlik
  • P. Virga
  • IBM
  • Bilkent University
  • University of Massachusetts
  • University of West Bohemia
  • Johns Hopkins University
  • Saarland University
  • Georgetown University
  • Mt. Holyoke College

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

31 Citations (Scopus)

Abstract

In this paper we describe a novel approach for jointly modeling the text and the visual components of multimedia documents for the purpose of information retrieval (IR). We propose a framework in which individual components are developed to model different relationships between documents and queries and are then combined into a joint retrieval framework. State-of-the-art systems typically rely on a late combination of two independent systems: one analyzing just the text part of such documents, and the other analyzing the visual part without leveraging any knowledge acquired in the text processing. Such systems rarely exceed the performance of any single modality (i.e., text or video) in information retrieval tasks. Our experiments indicate that allowing a rich interaction between the modalities results in a significant improvement in performance over any single modality. We demonstrate these results using the TRECVID03 corpus, which comprises 120 hours of broadcast news videos. Our results demonstrate an improvement of over 14% in IR performance over the best reported text-only baseline and rank among the best results reported on this corpus.

Original language: English
Title of host publication: Proceedings of the 13th ACM International Conference on Multimedia, MM 2005
Publisher: Association for Computing Machinery
Pages: 21-30
Number of pages: 10
ISBN (Print): 1595930442, 9781595930446
Publication status: Published - 2005
Externally published: Yes
Event: 13th ACM International Conference on Multimedia, MM 2005 - Singapore, Singapore
Duration: 6 Nov 2005 to 11 Nov 2005

Publication series

Name: Proceedings of the 13th ACM International Conference on Multimedia, MM 2005

Conference

Conference: 13th ACM International Conference on Multimedia, MM 2005
Country/Territory: Singapore
City: Singapore
Period: 6/11/05 to 11/11/05
