Ana gezinime atla Aramaya atla Ana içeriğe atla

Cross-lingual visual pre-training for multimodal machine translation

  • Ozan Caglayan
  • , Menekse Kuyu
  • , Mustafa Sercan Amac
  • , Pranava Madhyastha
  • , Erkut Erdem
  • , Aykut Erdem
  • , Lucia Specia
  • Imperial College London
  • Hacettepe University
  • Koc University
  • University of Sheffield
  • Dublin City University

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

44 Alıntılar (Scopus)

Özet

Pre-trained language models have been shown to improve performance in many natural language tasks substantially. Although the early focus of such models was single language pre-training, recent advances have resulted in cross-lingual and visual pre-training methods. In this paper, we combine these two approaches to learn visually-grounded cross-lingual representations. Specifically, we extend the translation language modelling (Lample and Conneau, 2019) with masked region classification and perform pre-training with three-way parallel vision & language corpora. We show that when fine-tuned for multimodal machine translation, these models obtain state-of-the-art performance. We also provide qualitative insights into the usefulness of the learned grounded representations.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıEACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference
YayınlayanAssociation for Computational Linguistics (ACL)
Sayfalar1317-1324
Sayfa sayısı8
ISBN (Elektronik)9781954085022
DOI'lar
Yayın durumuYayınlandı - 2021
Etkinlik16th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2021 - Virtual, Online
Süre: 19 Nis 202123 Nis 2021

Yayın serisi

AdıEACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???16th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2021
ŞehirVirtual, Online
Periyot19/04/2123/04/21

Parmak izi

Cross-lingual visual pre-training for multimodal machine translation' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Bundan alıntı yap