
{"id":16740,"date":"2020-09-03T14:19:38","date_gmt":"2020-09-03T12:19:38","guid":{"rendered":"https:\/\/www.textform.com\/?p=16740"},"modified":"2020-12-03T13:09:32","modified_gmt":"2020-12-03T12:09:32","slug":"machine-translation-faux-pas","status":"publish","type":"post","link":"https:\/\/www.textform.com\/en\/blog-en\/machine-translation-faux-pas\/","title":{"rendered":"When machine translation mistakes are no laughing matter"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16741 size-full\" src=\"https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/09\/machine_translation_mistakes_blogpost.jpg\" alt=\"When machine translation mistakes are no laughing matter\" width=\"2000\" height=\"1021\" srcset=\"https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/09\/machine_translation_mistakes_blogpost.jpg 2000w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/09\/machine_translation_mistakes_blogpost-300x153.jpg 300w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/09\/machine_translation_mistakes_blogpost-1024x523.jpg 1024w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/09\/machine_translation_mistakes_blogpost-768x392.jpg 768w\" sizes=\"(max-width: 2000px) 100vw, 2000px\" \/><\/p>\n<p><span data-contrast=\"auto\">During our recent trawling of localization news and events<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">here<\/span><span data-contrast=\"auto\">\u00a0in the marketing department<\/span><span data-contrast=\"auto\">\u00a0at\u00a0<\/span><span data-contrast=\"auto\">text<\/span><span data-contrast=\"auto\">&amp;form<\/span><span data-contrast=\"auto\">\u00a0some<\/span><span data-contrast=\"auto\">\u00a0machine translation<\/span><span data-contrast=\"auto\">\u00a0(MT)<\/span><span data-contrast=\"auto\">\u00a0faux pas have\u00a0<\/span><span data-contrast=\"auto\">really\u00a0<\/span><span 
data-contrast=\"auto\">caught our eye<\/span><span data-contrast=\"auto\">.\u00a0<\/span><span data-contrast=\"auto\">With the use\u00a0<\/span><span data-contrast=\"auto\">of automatic MT<\/span><span data-contrast=\"auto\">\u00a0becoming more popular, easy to use and ultimately more acceptable, we wanted to take<\/span><span data-contrast=\"auto\">\u00a0a look at these<\/span><span data-contrast=\"auto\">\u00a0prominent<\/span><span data-contrast=\"auto\">\u00a0machine translation\u00a0<\/span><span data-contrast=\"auto\">errors<\/span><span data-contrast=\"auto\">\u00a0and\u00a0<\/span><span data-contrast=\"auto\">gauge<\/span><span data-contrast=\"auto\">\u00a0just how damaging these\u00a0<\/span><span data-contrast=\"auto\">mistakes can\u00a0<\/span><span data-contrast=\"auto\">be<\/span><span data-contrast=\"auto\">.<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">But first the funny side.\u00a0<\/span><span data-contrast=\"auto\">If you have\u00a0<\/span><span data-contrast=\"auto\">come across the comedy chat show from the US,\u00a0<\/span><span data-contrast=\"auto\">\u2018<\/span><span data-contrast=\"auto\">The Tonight Show Starring Jimmy Fallon<\/span><span data-contrast=\"auto\">\u2019<\/span><span data-contrast=\"auto\">, you will probably have seen<\/span><span data-contrast=\"auto\">\u00a0the regular<\/span><span data-contrast=\"auto\">\u00a0segment called \u2018<\/span><a href=\"https:\/\/www.youtube.com\/watch?v=7m-NPuXPBqM&amp;t=153s\"><span data-contrast=\"none\">Google Translates Songs<\/span><\/a><span data-contrast=\"auto\">\u2019. 
They invite a famous singer<\/span><span data-contrast=\"auto\">\u00a0or actor (and we are talking big names here lik<\/span><span data-contrast=\"auto\">e Camila Cabello and Idris Elba<\/span><span data-contrast=\"auto\">)<\/span><span data-contrast=\"auto\">\u00a0to\u00a0<\/span><span data-contrast=\"auto\">sing a well-known\u00a0<\/span><span data-contrast=\"auto\">pop song whose lyrics have\u00a0<\/span><span data-contrast=\"auto\">been fed<\/span><span data-contrast=\"auto\">\u00a0into Google Translate, translated to Russian and then back into English. The results are predictably hilarious<\/span><span data-contrast=\"auto\">, if somewhat contrived<\/span><span data-contrast=\"auto\">\u00a0and the long-running nature of this\u00a0<\/span><span data-contrast=\"auto\">feature suggests that translation mistakes are endlessly funny<\/span><span data-contrast=\"auto\">.\u00a0<\/span><span data-contrast=\"auto\">Great then<\/span><span data-contrast=\"auto\">,<\/span><span data-contrast=\"auto\">\u00a0when laughs are the goal, b<\/span><span data-contrast=\"auto\">ut what happens when machine translation<\/span><span data-contrast=\"auto\">\u00a0produces inaccurate results which are not destined for a comedy sketch?<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">In<\/span><span data-contrast=\"auto\">\u00a0July<\/span><span data-contrast=\"auto\">,<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">Facebook<\/span><span data-contrast=\"auto\">\u00a0Thailand<\/span><span data-contrast=\"auto\">\u00a0got itself into\u00a0<\/span><a href=\"https:\/\/slator.com\/industry-news\/thai-mistranslation-shows-risk-of-auto-translating-social-media-content\/\"><span data-contrast=\"none\">serious trouble<\/span><\/a><span data-contrast=\"auto\">\u00a0when it<\/span><span data-contrast=\"auto\">\u00a0<\/span><span 
data-contrast=\"auto\">published a post<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">about the Tha<\/span><span data-contrast=\"auto\">i King\u2019s birthday which the US tech giant\u2019s machine translation<\/span><span data-contrast=\"auto\">\u00a0had<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">automatically<\/span><span data-contrast=\"auto\">\u00a0transformed<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">into Thai from English<\/span><span data-contrast=\"auto\">. The\u00a0<\/span><span data-contrast=\"auto\">post\u00a0<\/span><span data-contrast=\"auto\">caused such offense\u00a0<\/span><span data-contrast=\"auto\">to the Thai monarchy that Facebook\u00a0<\/span><span data-contrast=\"auto\">immediately\u00a0<\/span><span data-contrast=\"auto\">suspended its auto-translate function<\/span><span data-contrast=\"auto\">\u00a0in Thailand<\/span><span data-contrast=\"auto\">, promised a review of its machine translation model and issued a sincere apology.\u00a0<\/span><span data-contrast=\"auto\">The Crime Technology D<\/span><span data-contrast=\"auto\">ivision<\/span><span data-contrast=\"auto\">\u00a0of the Thai police force<\/span><span data-contrast=\"auto\">\u00a0was<\/span><span data-contrast=\"auto\">\u00a0called in to investigate and the public broadcaster<\/span><span data-contrast=\"auto\">,<\/span><span data-contrast=\"auto\">\u00a0Thai PBS also filed a complaint against the American tech company after its viewers complained\u00a0<\/span><span data-contrast=\"auto\">about\u00a0<\/span><span data-contrast=\"auto\">the offensive<\/span><span data-contrast=\"auto\">\u00a0subtitles on its social media pages. 
Qu<\/span><span data-contrast=\"auto\">ite a stir then<\/span><span data-contrast=\"auto\">\u00a0and one that Facebook would undoubtedly have preferred to avoid<\/span><span data-contrast=\"auto\">.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">Previously in\u00a0<\/span><span data-contrast=\"auto\">January this year, Facebook\u00a0<\/span><span data-contrast=\"auto\">als<\/span><span data-contrast=\"auto\">o blamed\u00a0<\/span><a href=\"https:\/\/www.bbc.co.uk\/news\/world-asia-51166339\"><span data-contrast=\"none\">technical issues<\/span><\/a><span data-contrast=\"auto\">\u00a0when it<\/span><span data-contrast=\"auto\">\u00a0automatic<\/span><span data-contrast=\"auto\">ally<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">translated the name<\/span><span data-contrast=\"auto\">\u00a0of the Chinese\u00a0<\/span><span data-contrast=\"auto\">President &#8211; on an official visit to Myanmar &#8211; into an insulting and\u00a0<\/span><span data-contrast=\"auto\">disrespectful English word, this time from Burmese.<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">Although perhaps amusing on the face of it, we won\u2019<\/span><span data-contrast=\"auto\">t be repeating it here<\/span><span data-contrast=\"auto\">\u00a0and it seems pretty certain that the Chinese weren\u2019t laughing along.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">Even more worryingly,\u00a0<\/span><span data-contrast=\"auto\">US immigration officials<\/span><span data-contrast=\"auto\">\u00a0were last year\u00a0<\/span><a href=\"https:\/\/www.propublica.org\/article\/google-says-google-translate-cant-replace-human-translators-immigration-officials-have-used-it-to-vet-refugees\"><span 
data-contrast=\"none\">advised<\/span><\/a><span data-contrast=\"auto\">\u00a0to use Google Translate or other similar free, online machine translation<\/span><span data-contrast=\"auto\">\u00a0tools<\/span><span data-contrast=\"auto\">\u00a0to convert social\u00a0<\/span><span data-contrast=\"auto\">media posts written by refugees<\/span><span data-contrast=\"auto\">\u00a0into American English and use these to assess individuals\u2019 suitability for\u00a0<\/span><span data-contrast=\"auto\">entry into the US<\/span><span data-contrast=\"auto\">.\u00a0<\/span><span data-contrast=\"auto\">Even Google itself\u00a0<\/span><a href=\"https:\/\/cloud.google.com\/translate\/attribution\"><span data-contrast=\"none\">advises against<\/span><\/a><span data-contrast=\"auto\">\u00a0using its MT<\/span><span data-contrast=\"auto\">\u00a0for complex tasks, and it would appear to be a gross misjudgement on the part of the US government to assume that this was an acceptable\u00a0<\/span><span data-contrast=\"auto\">element in<\/span><span data-contrast=\"auto\">\u00a0the\u00a0<\/span><span data-contrast=\"auto\">vetting process.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">Social media\u2019s suitability<\/span><span data-contrast=\"auto\">\u00a0for machine translation is,\u00a0<\/span><span data-contrast=\"auto\">to say the least, debatable. 
The very nature of Faceb<\/span><span data-contrast=\"auto\">ook, Twitter and the like is<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">one of\u00a0<\/span><span data-contrast=\"auto\">relaxe<\/span><span data-contrast=\"auto\">d,<\/span><span data-contrast=\"auto\">\u00a0informal\u00a0<\/span><span data-contrast=\"auto\">and\u00a0<\/span><span data-contrast=\"auto\">fashionable language<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">which<\/span><span data-contrast=\"auto\">\u00a0is often<\/span><span data-contrast=\"auto\">\u00a0colloquial and<\/span><span data-contrast=\"auto\">\u00a0changes<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">constantly<\/span><span data-contrast=\"auto\">. Everything that MT finds challenging.<\/span><span data-contrast=\"auto\">\u00a0\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">Just how quickly inaccurate translation can spread in this age of social media has been seen recently in\u00a0<\/span><span data-contrast=\"none\">Mexico<\/span><span data-contrast=\"auto\">. Alt<\/span><span data-contrast=\"auto\">hough whether or not this was machine translation<\/span><span data-contrast=\"auto\">\u00a0remains unconfirmed, the mistranslations on the Visitmexico.com website this August were trending\u00a0<\/span><span data-contrast=\"auto\">on Twitter and caused the English version of the website to be shut down completely. 
Not ideal during a pandemic when your country is trying to restore confidence\u00a0<\/span><span data-contrast=\"auto\">and attract\u00a0<\/span><span data-contrast=\"auto\">visitors<\/span><span data-contrast=\"auto\">.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">But away from social media, MT errors can be equally damaging.\u00a0<\/span><span data-contrast=\"auto\">I<\/span><span data-contrast=\"auto\">n July this year, a\u00a0<\/span><span data-contrast=\"auto\">Welsh member of the UK Parliament<\/span><span data-contrast=\"auto\">\u00a0was also obliged to\u00a0<\/span><a href=\"https:\/\/www.bbc.co.uk\/news\/uk-wales-politics-53282713\"><span data-contrast=\"none\">apologize for\u00a0<\/span><span data-contrast=\"none\">inaccuracies<\/span><\/a><span data-contrast=\"auto\">\u00a0on her website after machine translation<\/span><span data-contrast=\"auto\">\u00a0(Microsoft this time)<\/span><span data-contrast=\"auto\">\u00a0was\u00a0<\/span><span data-contrast=\"auto\">used<\/span><span data-contrast=\"auto\">\u00a0to transform English into Welsh.\u00a0<\/span><span data-contrast=\"auto\">Her constituents were offered medical surgery rather than the more usual MP\u2019s surgery, which is a meeting to discuss any issues, and the site also referred to an industrial plant as a plant of the vegetable variety.<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">Hardly reassuring for the MP\u2019s Welsh-speaking voters seeking reassurance in their language of choice.\u00a0<\/span><span data-contrast=\"auto\">Thanks, a cabbage for us please.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">Away<\/span><span data-contrast=\"auto\">\u00a0from\u00a0<\/span><span data-contrast=\"auto\">the pandemic, where MT has 
been shown to p<\/span><span data-contrast=\"auto\">rovide\u00a0<\/span><a href=\"https:\/\/translatorswithoutborders.org\/TICO-19-announcement\"><span data-contrast=\"none\">vital health information<\/span><\/a><span data-contrast=\"auto\">\u00a0in minority languages, doctors have<\/span><span data-contrast=\"auto\">\u00a0warned against over-reliance on automatic translation in a medical context.<\/span><span data-contrast=\"auto\">\u00a0<\/span><a href=\"https:\/\/www.physiciansweekly.com\/machine-translation-emergency-department-discharge-instructions\/\"><span data-contrast=\"none\">Research carried out in 2017<\/span><\/a><span data-contrast=\"auto\">\u00a0assessed the use of Google Translate for Chinese and Spanish discharge instructions<\/span><span data-contrast=\"auto\">\u00a0from emergency departments<\/span><span data-contrast=\"auto\">\u00a0in the US and concluded that \u2018<\/span><span data-contrast=\"auto\">2% of Spanish and 8% of Chinese sentence translations\u2014had the potential to cause clinically significant harm\u2019<\/span><span data-contrast=\"auto\">. 
No laughing\u00a0<\/span><span data-contrast=\"auto\">matter then, a<\/span><span data-contrast=\"auto\">gain underlining the constraints<\/span><span data-contrast=\"auto\">\u00a0of these free forms of MT.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">All these examples illustrate an overconfidence in the capacity of machine translation\u00a0<\/span><span data-contrast=\"auto\">to understand the context and complexity of human language\u00a0<\/span><span data-contrast=\"auto\">and often point to\u00a0<\/span><span data-contrast=\"auto\">a misreading<\/span><span data-contrast=\"auto\">\u00a0of its limitations.\u00a0<\/span><span data-contrast=\"auto\">The improvements and\u00a0<\/span><a href=\"https:\/\/slator.com\/machine-translation\/\"><span data-contrast=\"none\">advances in mac<\/span><span data-contrast=\"none\">h<\/span><span data-contrast=\"none\">ine t<\/span><span data-contrast=\"none\">ranslation<\/span><\/a><span data-contrast=\"auto\">\u00a0over the last five years have been wel<\/span><span data-contrast=\"auto\">l documented, and the rise of\u00a0<\/span><span data-contrast=\"none\">NLP<\/span><span data-contrast=\"auto\">\u00a0has been nothing short of revolutionary, but there is a long way to go before machines can translate sensitive or idiomatic language like humans can. Certainly, blaming the technology is just too easy. 
Blame the humans behind the technology and the often misguided refusal to include humanity in the translation process.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\n<hr \/>\n<p><em>by Caroline Strappazzon<\/em>, Marketing Specialist at text&amp;form<\/p>\n<div>Caroline joined <a href=\"https:\/\/www.textform.com\/en\/about-textform\/\">text&amp;form<\/a> in the spring of 2018 and works as a copywriter and content marketing specialist. She has a background in translation (French to English) and an interest in all things linguistic. She is based between <a href=\"https:\/\/en.wikipedia.org\/wiki\/Sheffield\">Sheffield<\/a> in the UK and the French Alps.<\/div>\n","protected":false},"excerpt":{"rendered":"<p>During our recent trawling of localization news and events\u00a0here\u00a0in the marketing department\u00a0at\u00a0text&amp;form\u00a0some\u00a0machine translation\u00a0(MT)\u00a0faux pas have\u00a0really\u00a0caught our eye.\u00a0With the use\u00a0of automatic MT\u00a0becoming more popular, easy to use and ultimately more acceptable, we wanted to take\u00a0a look at these\u00a0prominent\u00a0machine translation\u00a0errors\u00a0and\u00a0gauge\u00a0just how damaging these\u00a0mistakes can\u00a0be.\u00a0\u00a0 But first the funny side.\u00a0If you have\u00a0come across the comedy chat show from 
[&hellip;]<\/p>\n","protected":false},"author":22,"featured_media":13388,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[339,797,737],"tags":[382,915,388,486],"_links":{"self":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/posts\/16740"}],"collection":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/users\/22"}],"replies":[{"embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/comments?post=16740"}],"version-history":[{"count":0,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/posts\/16740\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/media\/13388"}],"wp:attachment":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/media?parent=16740"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/categories?post=16740"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/tags?post=16740"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}