Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorFu, Yingxue
dc.contributor.authorNederhof, Mark Jan
dc.contributor.editorBizzoni, Yuri
dc.contributor.editorTeich, Elke
dc.contributor.editorEspaña-Bonet, Cristina
dc.contributor.editorvan Genabith, Josef
dc.date.accessioned2021-06-03T10:30:14Z
dc.date.available2021-06-03T10:30:14Z
dc.date.issued2021-05-31
dc.identifier273993683
dc.identifier777591c5-314d-4ecb-be42-2d2ae1d834bc
dc.identifier.citationFu , Y & Nederhof , M J 2021 , Automatic classification of human translation and machine translation : a study from the perspective of lexical diversity . in Y Bizzoni , E Teich , C España-Bonet & J van Genabith (eds) , Proceedings for the First Workshop on Modelling Translation : Translatology in the Digital Age . NEALT Proceedings Series , Linkoping University Electronic Press , pp. 91–99 , Workshop on Modelling Translation , Online City , Iceland , 31/05/21 . < https://aclanthology.org/previews/ingest-nodalida/2021.motra-1.10/ >en
dc.identifier.citationworkshopen
dc.identifier.issn1650-3686
dc.identifier.otherORCID: /0000-0002-1845-6829/work/95041670
dc.identifier.urihttps://hdl.handle.net/10023/23304
dc.description.abstractBy using a trigram model and fine-tuning a pretrained BERT model for sequence classification, we show that machine translation and human translation can be classified with an accuracy above chance level, which suggests that machine translation and human translation are different in a systematic way. The classification accuracy of machine translation is much higher than of human translation. We show that this may be explained by the difference in lexical diversity between machine translation and human translation. If machine translation has independent patterns from human translation, automatic metrics which measure the deviation of machine translation from human translation may conflate difference with quality. Our experiment with two different types of automatic metrics shows correlation with the result of the classification task. Therefore, we suggest the difference in lexical diversity between machine translation and human translation be given more attention in machine translation evaluation.
dc.format.extent139015
dc.language.isoeng
dc.publisherLinkoping University Electronic Press
dc.relation.ispartofProceedings for the First Workshop on Modelling Translationen
dc.relation.ispartofseriesNEALT Proceedings Seriesen
dc.subjectQ Science (General)en
dc.subjectArtificial Intelligenceen
dc.subject3rd-DASen
dc.subject.lccQ1en
dc.titleAutomatic classification of human translation and machine translation : a study from the perspective of lexical diversityen
dc.typeConference itemen
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen
dc.identifier.urlhttps://aclanthology.org/previews/ingest-nodalida/2021.motra-1.10/en


This item appears in the following Collection(s)

Show simple item record