Abstract
In MediaEval 2016, we focus on the image interestingness subtask which involves predicting interesting key frames of a video in the form of a movie trailer. We specifically propose three different deep models for this subtask. The first two models are based on fine-tuning two pretrained models, namely AlexNet and MemNet, where we cast the interestingness prediction as a regression problem. Our third deep model, on the other hand, depends on a triplet network which is comprised of three instances of the same feedforward network with shared weights, and trained according to a triplet ranking loss. Our experiments demonstrate that all these models provide relatively similar and promising results on the image interestingness subtask.
| Original language | English |
|---|---|
| Journal | CEUR Workshop Proceedings |
| Volume | 1739 |
| Publication status | Published - 2016 |
| Event | 2016 Multimedia Benchmark Workshop, MediaEval 2016 - Hilversum, Netherlands Duration: 20 Oct 2016 → 21 Oct 2016 |
Fingerprint
Dive into the research topics of 'HUCVL at MediaEval 2016: Predicting interesting key frames with deep models'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver