Hybrid Translation Workflows with Neural Machine Translation and Translation Memories

神经机器翻译和翻译记忆的混合翻译工作流

2020-03-05 14:46 memoq

本文共829个字,阅读需9分钟

阅读模式 切换至中文

The localization industry has been watching machine translation (MT) for years. Despite some concerns about the practical aspects of using the most radical change the language industry has ever seen, most professionals agree that new technologies are here to stay, certainly when neural machine translation (NMT) is part of the picture. NMT is the biggest megatrend to hit the language industry, the blockbuster of localization. We call it a megatrend because it is particularly fundamental, persistent, and not limited to singular aspects of our work. It will affect everything: business, process, technology, people, quality management, decisions on rates, and the kinds of professional roles needed to do this work well. But NMT can’t stand alone. It is the only language processed by a machine, after all. Humans train and operate the NMT systems and make choices about style, tone, and feeling. The human role will be more important than ever, with new skills required for working with NMT. In order to facilitate the interaction of technology and humans, workflows and functionalities are used that make it easy to interact– (not to type new translations from scratch, but to edit, in most cases) and to rely on previous good translations that are still valid. memoQ, a state-of-the-art translation environment, integrates NMT with translation memory, myriad editing functionalities, QA tools, and the most comfortable working environment available. This all goes toward supporting the humans behind the machines, to get the very best out of the human-technology interaction. We see smart, hybrid workflows coming in which NMT and translation memories go hand-in-hand to enable humans to translate ever increasing volumes. These workflows can be extremely flexible, facilitating translations for many purposes and in many language combinations. We believe MT should not stand alone. MT raw translation for many languages (as long as large corpora are available and collected in advance), but it is the many other functionalities of a translation environment that allow editing and QA of the MT output so that it can be leveraged as validated material for future use. Not only can NMT systems learn and improve their own output, they can also profit from the extended options of a smart, time-tested translation technology when used in a hybrid process, improving results from both systems. How can you exploit NMT and translation memories to translate faster and more efficiently than ever before? Let’s look at two challenges related to NMT: terminology and context. One solution that addresses both is Microsoft’s machine translation plugin, Translator Text API v3.0. This version is based entirely on NMT and supports over three dozen languages. One of the features, Custom Translator, allows you to build and train custom NMT models by uploading term bases, translation memories, and other documents. Custom Translator is a paid service that requires some of your time and expertise. With it you can create a private NMT engine. You train the system with your resources for your own use, and it’s never accessed by third parties. Challenge of Terminology Whether a human or machine translates, terminology needs to be appropriate for the subject matter at hand. NMT can learn from subject-matter specific glossaries to improve accuracy. For example, with Amazon’s NMT service, which is accessible in memoQ through the Intento MT gateway plugin, you can upload your glossaries in various topics (such as law, economy, audiovisual, etc.) for later use in memoQ projects. You select the appropriate glossaries prior to beginning work on a translation. If the system detects a match for a word in a sentence to translate and term in the glossary, it will use the preferred term during translation. Challenge of Context To assure that the context for your translations is correct when using NMT, you can supplement NMT with matches from your translation memories. Let’s say you have a general NMT model (not customized for a specific subject matter) to which you add matching source and target segments from a subject-specific translation memory. In such a case, the MT service translates the given sentence and also modifies the translation according to the uploaded translation memory result, thereby providing the user with a contextually accurate translation. The Human Element in a Hybrid Solution The best solution is undoubtedly one that involves humans at critical points. Even the best possible NMT engine using a custom model and incorporating appropriate terminology and relevant segments from translation memory needs a skilled language professional to edit the NMT result. Armed with memoQ’s suggestions from a wide array of resources like translation memories, term bases, and LiveDocs corpora, the human element in hybrid workflows can increase your confidence in future reuse of translated content. Simply put, when NMT engines are joined with productivity and collaboration tools like memoQ in one workflow, you can handle more localization volume. We are proud to help our industry advance with tools to bring more content to ever-growing international audiences.
多年来,本地化行业一直关注着机器翻译(MT)的发展。尽管有人担忧MT(语言行业有史以来的巨变)的实际运用,但大多数专业人士都认为新技术将逐步推广,特别是MT中的神经机器翻译(NMT)。 NMT是语言产业的大势所趋,本地化对此趋之若鹜。我们称其为大势所趋,因为它是根本性的变化,影响持久,且不限于语言行业。它将影响一切:商业、流程、技术、人员、质量管理、定价,以及以上各行的工作人员。 但是NMT不能单独工作。毕竟,这只是机器处理的语言。人类训练并操作NMT系统,选出恰当的风格、语气和情感。与NMT合作需要新的技能,人的角色将比以往任何时候都要重要。 为了促进技术和人之间的交互,人们启用了工作流,发展了多种功能,使交互变得容易(人们不用再从头翻译,在大多数情况下,可以直接编辑译文),并可以利用过去有效的好译文。 memoQ是最先进的翻译环境,它提供了NMT、翻译记忆、大量编辑功能、QA工具和最舒适的工作环境。为机器背后的人类提供支持,创造最佳的人机交互。 我们看到了智能混合工作流的到来,NMT和翻译记忆协同工作,使人类能够翻译越来越多的内容。这些工作流非常灵活,促进了多种用途和语言组合的翻译。 我们认为MT不应该单独工作。MT可以进行多种语言的原始翻译(只要能提供并预先收集大量的语料库),但是只有翻译环境的许多其他功能才能对MT的译文进行编辑和质量检查,并将其用作有效材料。 NMT系统可以学习和改善自己的输出,加入经过时间考验的智能翻译技术扩展选项后,还能从中受益,从而改善两个系统。 如何利用NMT和翻译记忆,译得更快更有效?让我们看看与NMT相关的两个挑战:术语和语境。 解决这两个问题的一个方案是Microsoft的机器翻译插件Translator Text API v3.0。这个版本完全基于NMT,支持30多种语言。其中一个特色是私人翻译器,它允许用户上传术语库、翻译记忆和其他文档,构建和训练私人NMT模型。私人翻译器是一项收费服务,需要你有一定的时间和专业知识。有了它,你可以创建私人NMT引擎,使用自己的资源对系统进行培训,不予第三方访问。 术语的挑战 无论是人工翻译还是机器翻译,术语都需要符合文章的主题。NMT可以学习特定主题的词汇,提高准确性。例如,亚马逊的NMT服务可以通过安装Intento MT插件在memoQ中使用,你可以上传不同主题的术语表(如法律、经济、视听等),供memoQ项目日后使用。在开始翻译之前,你要先选择适当的术语表。如果系统检测到要翻译的句子中有单词与术语表中的术语相匹配,在翻译时,它将选择最佳术语。 环境的挑战 为了确保使用NMT翻译时语境的正确,你可以搭配翻译记忆中的匹配句段。假设你有个一般的NMT模型(非特定主题定制款),你可以从特定主题的翻译记忆中添加匹配的源语和目标语句段。如此一来,MT能够翻译给定的句子,并根据上传的翻译记忆结果修改译文,为用户提供语境准确的翻译。 混合工作流中的人类角色 最好的解决方案无疑是让人类在关键时刻出马。即使是最好的NMT引擎,加上定制模型,结合准确的术语和翻译记忆的相关句段,最终的译文也需要熟练的语言工作者来编辑。有了memoQ结合翻译记忆、术语库和LiveDocs语料库等海量资源获得的建议,混合工作流中,人类更有信心重复利用翻译过的文本。 简单地说,当NMT引擎与提高生产效率、加强协作的工具(如memoQ)结合在一个工作流中时,您可以处理更多本地化工作。我们很自豪能帮助行业进步,为不断增长的国际受众带来更多内容。 译后编辑:孔越怡 (中山大学)

以上中文文本为机器翻译,存在不同程度偏差和错误,请理解并参考英文原文阅读。

阅读原文