Terminologists have an arduous task. They need to translate terminology into money. How? By presenting convincing examples of terminological mistakes and their consequences in a corporate setting. And there’s nothing that terminologists all over the world would like more than to know that CEOs, CIOs, or CTOs are interested and ready to invest in a well-structured and fully developed terminology database.
In Support of Terminology
In reality, thanks to the explosion of the new economy, terminology has already spread without any CEO, CIO, or CTO noticing. In the ‘90s newly-born web portals needed taxonomies to categorize and make information researchable and relevant, as well as to prevent users from looking through every single resource. Today, in content management systems, taxonomies also help content creators find and re-use content.
Terminology, taxonomy, and related disciplines like library science are the foundation of modern fields of activity such as knowledge architecture, SEO, search engine development, and knowledge management. It’s therefore not an exaggeration to say that terminology is an essential tool for knowledge transfer, be it on paper or digital format.
The Misconception About Terminology
The general belief is that terminology is related only to language and translation. This is probably the reason that almost no one, be it a corporation or a language service provider, is interested in investing in it. Terminology is seen as a part of the translation workflow and a necessary evil that no one wants to turn into an element of their own core business. However, this belief is only true to a certain extent. Here are a few facts and figures to prove this statement.
A white paper published by IDC (International Data Corporation) in 2001 reported that:
Knowledge workers spent 15% to 25% of their time searching for information.
Searches were successfully completed only 50% of the time or less.
Only 21% of workers found the information they needed 85% to 100% of the time.
A 2010 Tekom survey — as presented by TermNet in their Terminology Management course — showed that:
In 85% of the cases, different departments within an organization use different terms for the same product.
In 70% of the cases, employees use different terms for the same concept.
In 50% of the cases, employees don’t understand terms in the product they are documenting.
Knowledge is a commodity that can be accessed anywhere and anytime we want. But this knowledge needs to be organized, and terminology is essential to this end because it is at the center of every single task we perform today.
Terminology from Content Creation to Translation
At what point should terminology enter the content life cycle?
Terminology should come before the creation of any kind of content.
A well-structured termbase will help improve accuracy in technical documentation and reuse content units.
As well as helping writers select the right terms for any concept, with a terminological database at their disposal they will make your content easy to find. At the same time, using of the correct terminology will improve the accuracy and clarity of a corporate message, thus reinforcing branding and protecting an organization from negative consequences.
Also, higher consistency in source content will maximize the effects of any translation technologies, from translation memories to machine translation.
Terminology is central to the development of controlled languages, to build termbases from which they can extract a limited vocabulary of unique terms and concepts.
A limited vocabulary may help reduce ambiguity and complexity. When combined with simplified grammar, a controlled language is obtained that can help technical writers avoid long, convoluted sentences, thus improving the overall readability, comprehension, and usability of technical documentation. Furthermore, a controlled language will also help cut translation times and costs by reducing the number of words in source content and improving content consistency.
The task of creating a termbase for your company might sound daunting, but it’s not really. Keep in mind that, like in the case of translation quality, the “good enough” principle is valid for terminological work as well. It’s better to begin by collecting the minimal data necessary and then build upon it over time.
Manual terminology extraction (also called text mining) is fine for small texts. For larger texts, there are various open-source terminology extraction tools, either in desktop or (even better) online versions.
Once you have your first bundle of data, collect it in a file. Avoid using Word, because it doesn’t allow for much flexibility. Some terminologists are also dead-set against spreadsheets (like Excel), although personally, I don’t mind. If you’re just starting out and lack the skills, the time, or the inclination to create a complex termbase — for example, using MS Access or a specific tool — a spreadsheet is a viable, practical option. Among other things, it offers the Advanced filter dialog box and boolean logic to help you search your file. Most importantly, a spreadsheet can also be saved in .csv format, which is the most accessible and compatible of formats when it comes to importing a termbase in your translation management system (TMS). I know there’s TBX too, but we’ll deal with exchange formats and standards another time.
Don’t restrict yourself to a two-column spreadsheet. In addition to source and target term, enter also other data elements. The most commonly used data categories are: definition, status, department, client, project, product, part of speech, domain, context, and illustrations.
Finally, choose a TMS that fits your workflow and import your termbase. You’ll make your translators happy, while saving time and money.
术语学家的任务艰巨。他们需要把术语转化为金钱。怎么做?通过举出令人信服的例子说明在公司环境中所发生的术语错误及其后果。世界各地的术语学家最想知道的莫过于CEO、CIO或CTO对结构良好、开发完善的术语数据库感兴趣并准备投资。
支持术语
事实上,由于新经济的爆发,术语已经在没有任何CEO、CIO或CTO注意到的情况下传播开来。在90年代,新生的门户网站需要分类法来进行分类,使信息具有可研究性和相关性,同时防止用户浏览每一个单一的资源。今天,在内容管理系统中,分类法还帮助内容创建者查找和重用内容。
术语、分类学和诸如图书馆学等相关学科是知识架构、SEO、搜索引擎开发和知识管理等现代活动领域的基础。因此,可以毫不夸张地说,术语是知识转移的重要工具,无论是纸面形式还是数字形式。
对术语的误解
人们普遍认为术语只与语言和翻译有关。这可能是几乎没有公司或语言服务提供商想要投资它的原因。术语被看作是翻译工作流程的一部分,虽无法避免,却没有人想把它变成自己核心业务的要素。然而,这种想法只在一定程度上是正确的。这里有几个事实和数字来证明这个说法。
2001年IDC(国际数据公司)发表的一份白皮书报告说:
知识型员工花费15%到25%的时间搜索信息。
成功完成搜索的时间只占50%或更少。
只有21%的员工在85%到100%的时间里找到了他们需要的信息。
TermNet在其术语管理课程中介绍的2010年Tekom调查显示:
在85%的情况下,一个组织内的不同部门对同一产品使用不同的术语。
在70%的情况下,员工对同一个概念使用不同的术语。
在50%的情况下,员工不理解他们所记录的产品中的术语。
知识是一种可以随时随地获取的商品。但是这些知识需要被组织起来,而术语对此至关重要,因为它是我们执行每一项单一任务的中心。
从内容创作到翻译的术语
术语应该在什么时候进入内容生命周期?
术语应该出现在任何一种内容的创建之前。
结构良好的术语库将有助于提高技术文档的准确性和重用内容单元。
除了帮助作者为所有概念选择正确的术语外,术语数据库还能使你的内容更容易被找到。同时,使用正确的术语将提高整体信息的准确性和清晰度,从而加强品牌化,保护组织免受负面影响。
此外,源内容的高度一致性将使翻译记忆、机器翻译等翻译技术的使用效果最大化。
术语是受控语言开发的核心,可以构建术语库,从术语库中提取有限的独特术语和概念词汇。
有限的词汇可能有助于减少歧义和复杂性。当与简化语法相结合时,可以获得一种受控的语言,它可以帮助技术作者避免冗长、复杂的句子,从而提高技术文档的整体可读性、理解性和可用性。此外,受控语言还将通过减少源内容字数和提高内容一致性来帮助削减翻译时间和成本。
为您的公司创建术语库的任务听起来可能很艰巨,但实际上并不是这样。请记住,就像翻译质量一样,“足够好”的原则也适用于术语工作。最好先收集所需的最小数据,然后在此基础上逐步建立术语库。
手动提取术语(也称为文本挖掘)对于小文本是很合适的。对于较大的文本,有各种开源的术语提取工具可以使用,这些工具既有桌面版本,也有(更好的)在线版本。
一旦您有了第一组数据,请将其收集到一个文件中。要避免使用Word,因为它没有太大的灵活性。一些术语学家也坚决反对电子表格(比如Excel),尽管我个人并不介意。如果您刚刚起步,缺乏(使用MS Access或特定工具)创建复杂术语库的技能、时间或意愿,那么电子表格是一个可行且实用的选择。它提供高级筛选对话框和布尔逻辑来帮助您搜索文件。最重要的是,电子表格还可以保存为csv格式,在翻译管理系统(TMS)中导入术语库时,这是最易访问和兼容的格式。我知道TBX也有,但我们下次再讨论格式和标准转换。
不要把自己限制在一个两列的电子表格中。除了源术语和目标术语外,还要输入其他数据元素。最常用的数据类别有:定义、状态、部门、客户、项目、产品、词性、领域、上下文和插图。
最后,选择一个适合您工作流程的TMS并导入术语库。这样不仅节省时间和金钱,还能让译者感到轻松愉悦。
译后编辑:王思晴(中山大学)
以上中文文本为机器翻译,存在不同程度偏差和错误,请理解并参考英文原文阅读。
阅读原文