
{"id":16096,"date":"2020-02-24T10:45:13","date_gmt":"2020-02-24T09:45:13","guid":{"rendered":"https:\/\/www.textform.com\/?p=16096"},"modified":"2022-06-23T15:26:20","modified_gmt":"2022-06-23T13:26:20","slug":"bridging-nmt-part-2","status":"publish","type":"post","link":"https:\/\/www.textform.com\/en\/blog-en\/bridging-nmt-part-2\/","title":{"rendered":"Bridging &#038; NMT \u2013 Part 2"},"content":{"rendered":"<h1><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16102 size-large\" src=\"https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/02\/flawed_perfection_part2-1-1024x523.png\" alt=\"\" width=\"1024\" height=\"523\" srcset=\"https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/02\/flawed_perfection_part2-1-1024x523.png 1024w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/02\/flawed_perfection_part2-1-300x153.png 300w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/02\/flawed_perfection_part2-1-768x392.png 768w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/02\/flawed_perfection_part2-1.png 2000w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><br \/>\nBridging<\/h1>\n<p>In the <a href=\"https:\/\/www.textform.com\/en\/from-the-language-lab\/bridging-nmt-part-1\/\">first chapter of this article<\/a>\u00a0 the construction <em>to suffice with<\/em> was used as an example to discuss the incompatibility of <a href=\"https:\/\/en.wikipedia.org\/wiki\/Neural_machine_translation\">neural machine translation<\/a> and informal language. Although the translations provided were not completely wrong, they could not convey the meaning of the original sentence. And when it comes to technical translation, nit-picking is the standard procedure. But I should probably get my own house in order before I dish out further criticism\u00a0: neither is English my mother tongue nor was I aware of this specific use of <em>to suffice with<\/em> before I read the text in question. So I am no better than our test subjects, my personal data set also seems to be incomplete or contaminated. Nevertheless, I was able to understand the sentence as the author intended. Apparently, I (representing humans) possess an ability that the machine lacks at this point. How else would I be able to compensate for my own linguistic incapacity?<\/p>\n<p>A simple answer would be something like: the context doesn\u2019t allow any other interpretation! Or: humans have a brain, machines don\u2019t. But since machine translation is a complex and fascinating subject, it merits an appropriate answer. So here is an attempt to explain this phenomenon on a linguistic-cognitive level\u00a0.<\/p>\n<p>It is true that the meaning of the sentence is made clear by its context. In this case by the second main clause of the sentence, because unlike its predecessor, there is only one way to interpret the aforementioned dependency. Even our test subjects agree on that. This transfer of information can be explained in theory by an indirect anaphoric connection, also called <em>bridging<\/em>. In short, an anaphora is the reference in a clause to something or someone previously mentioned.<\/p>\n<p>A good example of this is a personal pronoun, which establishes an <a href=\"https:\/\/en.wikipedia.org\/wiki\/Anaphora_(linguistics)\">anaphoric connection<\/a> to a person or a thing introduced earlier in a text. An indirect anaphora is a bit more abstract, because it is not based on grammatical rules but rather on world knowledge. A simple example shows how a string defines a later one through bridging:<\/p>\n<p><em>Brexit vote in the House of Commons: Serenity in Brussels<\/em><\/p>\n<p>The word \u201cBrexit\u201d activates the part of our world knowledge around Britain&#8217;s withdrawal from the EU. Based on that, we later interpret the word \u201cBrussels\u201d within that familiar framework. The exact same framework-based interpretation will most likely affect your perception of the two following, legitimate paraphrases of the same sentence:<\/p>\n<p><em>Brexit vote in the House of Commons: EU Parliament does not panic<\/em><\/p>\n<p><em>Brexit vote in the House of Commons: The relaxed inhabitants of Belgium\u2019s capital<\/em><\/p>\n<p>Bridging helps us to choose the correct translation or paraphrase. For the same reason\u00a0 we are able to understand our original example: due to the use of a construction unknown to me, I\u2019m unable to determine the relation between machine-to-machine application and SIM cards. However, since the same relation is unmistakably defined in the second part of the sentence, my world knowledge links the two examples, thus making sense of the first part retrospectively. And as if that wasn\u2019t enough, I also improved my own algorithm by learning the alternative use of <em>to suffice with<\/em>, independently and automatically.<\/p>\n<p>NMT models can certainly handle context, e.g. by including specific TMs or high-quality <a href=\"https:\/\/www.tei-c.org\/release\/doc\/tei-p5-doc\/en\/html\/CC.html\">language corpora<\/a>. That said, a complex contextualization is still a dream of the future\u00a0 because it requires artificial intelligence that is on par with that of a human when it comes to cognition and recursion.<\/p>\n<p>&nbsp;<\/p>\n<h2>An outlook<\/h2>\n<p>Informal language and the inability to properly process context add up to a long list of unaddressed tasks in the development of human-like NMT engines. However, you would have to be very naive to deny the capabilities of neural machine translation. No matter how abstract and complex these examples might be &#8211; I personally have no doubt that one day, these problems will be overcome by the machine. Despite my confidence in NMT, I wouldn\u2019t dare to guess when that will be. Until then, only a small number of translations will be fully automated. That prediction also takes into account revision and maintenance of NMT engines. For the time being, humans still have the final say when it comes to translation, not to mention localization. If you are still looking for the right humans for your <a href=\"https:\/\/www.textform.com\/en\/language-services\/\">translations or localization projects<\/a>, we at <a href=\"https:\/\/www.textform.com\/en\/\">text&amp;form<\/a> will be happy to help you find them.<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p><em>About the Author<\/em><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft wp-image-15706\" src=\"https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/01\/daniel_nad_project_manager_textform-300x300.png\" alt=\"Daniel Nad, Project Manager at text&amp;form\" width=\"250\" height=\"250\" srcset=\"https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/01\/daniel_nad_project_manager_textform-300x300.png 300w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/01\/daniel_nad_project_manager_textform-150x150.png 150w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/01\/daniel_nad_project_manager_textform-768x768.png 768w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/01\/daniel_nad_project_manager_textform-266x266.png 266w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/01\/daniel_nad_project_manager_textform-500x500.png 500w, https:\/\/www.textform.com\/tf2019\/wp-content\/uploads\/2020\/01\/daniel_nad_project_manager_textform.png 1000w\" sizes=\"(max-width: 250px) 100vw, 250px\" \/><\/p>\n<div>Organized sounds, manifested in abstract character strings with thousands of mutually unintelligible variants: human languages are fascinating. At least according to our author Daniel Nad. As a passionate linguist and PM at <a href=\"https:\/\/www.textform.com\/en\/about-us\/our-team\/\">text&amp;form<\/a>, he experiences language on a daily basis &#8211;\u00a0and enjoys sharing his passion with others.<\/div>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><!--HubSpot Call-to-Action Code --><span id=\"hs-cta-wrapper-087b4f07-6a28-438f-a182-3ad47941a092\" class=\"hs-cta-wrapper\"><span id=\"hs-cta-087b4f07-6a28-438f-a182-3ad47941a092\" class=\"hs-cta-node hs-cta-087b4f07-6a28-438f-a182-3ad47941a092\"><!-- [if lte IE 8]>\n\n\n<div id=\"hs-cta-ie-element\"><\/div>\n\n\n<![endif]--><a href=\"https:\/\/cta-redirect.hubspot.com\/cta\/redirect\/5306090\/087b4f07-6a28-438f-a182-3ad47941a092\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" id=\"hs-cta-img-087b4f07-6a28-438f-a182-3ad47941a092\" class=\"hs-cta-img alignnone\" style=\"border-width: 0px;\" src=\"https:\/\/no-cache.hubspot.com\/cta\/default\/5306090\/087b4f07-6a28-438f-a182-3ad47941a092.png\" alt=\"Join the discussion\" \/><\/a><\/span><script charset=\"utf-8\" src=\"https:\/\/js.hscta.net\/cta\/current.js\"><\/script><script type=\"text\/javascript\"> hbspt.cta.load(5306090, '087b4f07-6a28-438f-a182-3ad47941a092', {}); <\/script><\/span><!-- end HubSpot Call-to-Action Code --><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Bridging In the first chapter of this article\u00a0 the construction to suffice with was used as an example to discuss the incompatibility of neural machine translation and informal language. Although the translations provided were not completely wrong, they could not convey the meaning of the original sentence. And when it comes to technical translation, nit-picking [&hellip;]<\/p>\n","protected":false},"author":22,"featured_media":13390,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[339,797,737],"tags":[826,798,653,382,388,488,486,589],"_links":{"self":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/posts\/16096"}],"collection":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/users\/22"}],"replies":[{"embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/comments?post=16096"}],"version-history":[{"count":0,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/posts\/16096\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/media\/13390"}],"wp:attachment":[{"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/media?parent=16096"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/categories?post=16096"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.textform.com\/en\/wp-json\/wp\/v2\/tags?post=16096"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}