Skip to main navigation Skip to search Skip to main content

Haber videolarinda nesne tanima ve otomatik etiketleme

Translated title of the contribution: Object recognition and auto-annotation in news videos
  • Bilkent University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We propose a new approach to object recognition problem motivated by the availability of large annotated image and video collections. Similar to translation from one language to another, this approach considers the object recognition problem as the translation of visual elements to words. The visual elements represented in feature space are first categorized into a finite set of blobs. Then, the correspondences between the blobs and the words are learned using a method adapted from Statistical Machine Translation. Finally, the correspondences, in the form of a probability table, are used to predict words for particular image regions (region naming), for entire images (auto-annotation), or to associate the automatically generated speech transcript text with the correct video frames (video alignment). Experimental results are presented on TRECVID 2004 data set, which consists of about 150 hours of news videos associated with manual annotations and speech transcript text.

Translated title of the contributionObject recognition and auto-annotation in news videos
Original languageTurkish
Title of host publication2006 IEEE 14th Signal Processing and Communications Applications Conference
DOIs
Publication statusPublished - 2006
Externally publishedYes
Event2006 IEEE 14th Signal Processing and Communications Applications - Antalya, Turkey
Duration: 17 Apr 200619 Apr 2006

Publication series

Name2006 IEEE 14th Signal Processing and Communications Applications Conference
Volume2006

Conference

Conference2006 IEEE 14th Signal Processing and Communications Applications
Country/TerritoryTurkey
CityAntalya
Period17/04/0619/04/06

Fingerprint

Dive into the research topics of 'Object recognition and auto-annotation in news videos'. Together they form a unique fingerprint.

Cite this