File
Improving neural machine translation for morphologically rich languages
Digital Document
Abstract |
Abstract
Machine Translation aims to provide a seamless communication and interaction, thereby overcoming human language barriers. Recently, Neural Machine Translation (NMT) approaches have been very successful and achieve state-of-the-art performance in many language pairs. NMT systems consist of millions of neurons that are optimised to learn the input-output mapping between the source and the target languages. However, these systems produce poor translation quality under low-resource conditions and are unable to handle a large vocabulary particularly for languages with rich morphology such as Turkish, Tamil and German. In this project, we present a source vocabulary expansion technique to handle the problem of translating rare and unknown words by incorporating morphological information in the words. The effectiveness of the proposed technique is demonstrated by translating from two morphologically rich languages to English. Using this technique, we achieve a performance gain of approximately 2 BLEU points for both German → English and Turkish → English. |
---|---|
Persons |
Persons
Author (aut): Gunasekaran, Raja
Thesis advisor (ths): Hartley, Ian
Degree committee member (dgc): Casperson, David
|
Degree Name |
Degree Name
|
Department |
Department
|
DOI |
DOI
https://doi.org/10.24124/2018/58807
|
Collection(s) |
Collection(s)
|
Origin Information |
|
||||||
---|---|---|---|---|---|---|---|
Organizations |
Degree granting institution (dgg): University of Northern British Columbia
|
||||||
Degree Level |
Subject Topic |
Subject Topic
|
---|---|
Keywords |
Keywords
Neural Machine Translation (NMT)
language pairs
neurons
input-output mapping
morphology
2 BLEU
|
Extent |
Extent
1 online resource (vii, 67 pages)
|
---|---|
Physical Form |
Physical Form
|
Content type |
Content type
|
Resource Type |
Resource Type
|
Genre |
Genre
|
Language |
Language
|
Handle |
Handle
Handle placeholder
|
---|
Use and Reproduction |
Use and Reproduction
author
|
---|---|
Rights Statement |
Rights Statement
|
unbc_58807.pdf543.8 KB
20603-Extracted Text.txt79.29 KB
Download
Language |
English
|
---|---|
Name |
Improving neural machine translation for morphologically rich languages
|
Authored on |
|
MIME type |
application/pdf
|
File size |
556848
|
Media Use |