Arşiv ve Dokümantasyon Merkezi
Dijital Arşivi

Neural Sign Language Translation by learning tokenization

Basit öğe kaydını göster

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Akarun, Lale.
dc.contributor.author Orbay, Alptekin.
dc.date.accessioned 2023-03-16T10:04:40Z
dc.date.available 2023-03-16T10:04:40Z
dc.date.issued 2020.
dc.identifier.other CMPE 2020 O73
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12428
dc.description.abstract In this thesis, we propose a multitask learning based method to improve Neural Sign Language Translation (NSLT) consisting of two parts, a tokenization layer and Neural Machine Translation (NMT). The tokenization part focuses on how Sign Language (SL) videos should be represented to be fed into the other part. It has not been studied elaborately whereas NMT research has attracted several researchers contributing enormous advancements. Up to now, there are two main input tokenization levels, namely frame-level and gloss-level tokenization. Glosses are world-like intermediate presentation and unique to SLs. Therefore, we aim to develop a generic sign-level tokenization layer so that it is applicable to other domains without further e ort. We begin with investigating current tokenization approaches and explain their weaknesses with several experiments. To provide a solution, we adapt Transfer Learning, Multitask Learning and Unsupervised Domain Adaptation into this research to leverage additional supervision. We succeed in enabling knowledge transfer between SLs and improve translation quality by 5 points in BLEU-4 and 8 points in ROUGE scores. Secondly, we show the e ects of body parts by extensive experiments in all the tokenization approaches. Apart from these, we adopt 3D-CNNs to improve e ciency in terms of time and space. Lastly, we discuss the advantages of sign-level tokenization over gloss-level tokenization. To sum up, our proposed method eliminates the need for gloss level annotation to obtain higher scores by providing additional supervision by utilizing weak supervision sources.
dc.format.extent 30 cm.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2020.
dc.subject.lcsh Sign language -- Study and teaching.
dc.subject.lcsh Learning strategies.
dc.title Neural Sign Language Translation by learning tokenization
dc.format.pages xiii, 71 leaves ;


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım