The Evolution of the Internet and Translation Technologies

互联网的发展与翻译技术

2020-08-26 12:00 Andovar

本文共1463个字,阅读需15分钟

阅读模式 切换至中文

Technology has come a long way since the days of box Apple computers and floppy discs. With each step of development (called tech disruptions), we achieve better skills at shaping technologies into something increasingly more accessible and useful. Now in modern times, the internet stands tall as the one of the indisputable achievements of human innovation. Initially a tech disruption itself, the internet has become so widely adopted that it now also experiences its own periodic evolutionary changes, such as Web 0.0 - Web 5.0. In each upgrade, users gain greater control of the many systems that make it up and then shape and use it as desired. Along with the rapid growth of the internet and its many components came the development of translation technologies. Now with the internet becoming a means to easily connect people and translation becoming easier to automate, everyone from large organizations to individuals at home can reach out and communicate with others across the globe.   In light of this, we're taking a quick look back at the growth of the internet and where technology is headed, both on the web and in language translation technologies.   Defining Web 0.0 - 5.0 Web 0.0: Invented by Tim Berners-Lee, this was the birth of the World Wide Web. Though the internet was initially developed as a closed system, existing between only a select few computer systems as early as the 1960s, this period saw the development of the first web servers and browsers in 1990. Web 1.0: Introduced to the public in 1991, this started as a ‘Read-Only’ static version of the web, emerging initially as a way for users to consume information. By 1995, the first online shopping carts were created and companies like Yahoo, Google, Amzon and eBay began to appear as successful web-based services. Web 2.0: This was the 'Read-Write' period of 1999 where web users became publishers themselves, bringing their own creativity and content onto the platform and leading to the launch of sites like LiveJournal and Blogger. The crowd-sourced encyclopedia Wikipedia launched in 2001, social media sites like MySpace and Facebook followed suit in 2003 and 2004, respectively, and YouTube in 2005. The list of innovations and platforms that emerged during this time are numerous, and connectivity and expression between individuals was at an all-time high. Web 3.0: The 'Read-Write-Execute' time of the web in 2006 and the one most people probably know the best. Personalization sees an impressive leap forward with the ability for users to code and analyze data. In addition, not only are people connecting and consuming information at higher rates, but so are web applications. This is now the semantic executing stage of the web, where computer-to-computer systems have become automated. Knowledge stored on the internet is now able to be interpreted by backend processes, allowing for analyses and increased efficiencies. Though the data collected is not necessarily understood or properly contextualized by these systems, the wealth of knowledge that can be gleaned leads to great jumps in development. Artificial intelligence, machine learning, augmented reality and virtual reality all start to become a reality. Web 4.0: The term 'Mobile Web' is coined around the same time as Web 3.0 and runs almost concurrently. While people used to browse a static version of the web prior to this, it is now constantly in motion. This period sees the rapid adoption of phones, bluetooth devices, smartwatches, and various personal items and home systems connected to the internet. We enter an age where mobile devices allow us to stay connected 24-7. Whether at home or on the go, society is steadily becoming more and more digital in every aspect of our lives. Web 5.0: This is an intriguing phase of the web because it’s still in its developing stages today. This is essentially the introduction of the 'Intelligent Web', or the 'Emotional Web'. Systems can now begin to measure feelings, wants and desires based on detection and predictive algorithms, thereby helping to expedite our personal life processes. As a result, websites can begin to convey different and more personal experiences for each user, enhancing the effectiveness of such interactions.    The S-Curve The S-curve (the 'Sigmoid function' in mathematical terms) is a data scale expressing how there is an expansion of scalability. Essentially, what was once accessible to a few becomes, over time, accessible to many. The S-curve in this case represents proprietary services, tools, utilities and features. Just as when building a house, once the foundations are laid, you build upon them, adding beneficial features and functions while continuing to further improve upon them. Just as how the web developed and became simultaneously more intelligent and accessible, the S-curve is important to consider in the development of technologies and software, as is it a requirement of evolution for systems to expand and grow. By following this model, the transition of machine translation (MT) to computer-aided translation (CAT), to artificial intelligence (AI), then to quantum computing (QC), is seemingly inevitable.   Machine translation: Pros and cons Machine translation, or MT, still has a lot to offer. A great deal of MT is the execution of backend coding. This is particularly accurate when it comes to finances, like exchanging currencies. Paired with AI, MT isn’t going anywhere with our current coding system, as it’s already been fully integrated. With Web 5.0 we will see the introduction of a new system format that will begin to integrate the new AI QC systems, which will not only eliminate MT but the way we fundamentally interact with codes and processes. It will be future tech disruptions (like the far-flung Web 6.0-7.0) where we will really see a shift in technology, where hardware and software will be phased out and replaced entirely with AI/QC-compatible software. In our current web rendition, MT on the front end has one major drawback in translation - it lacks the ability to discern culture, locale and social differences, and therefore often requires substantial human intervention in the form of post-editing. Trying to get a correct raw translation from open-source MT engines, like Google Translate, can be problematic in certain languages and content types, as the system is set to translate via dictionary and not social dialect. This means mis-translations can be a common problem. At its base, MT is not intuitive, and that is quickly becoming an issue as intuitive AI begins to show up in more systems.   End of the computer-aided translation era? Computer-aided translation, or CAT, is the process in which human translators use computer software and hardware to support and facilitate translation. At this time, it is the best guarantee to clear and concise translation efforts. CAT processes can include writing systems specific to regional (locale) translation with document editors, terminology management and translation memory. CAT is probably the one system in computer automation that keeps humans in the daily process. With CAT, the need for human eyes to review work is still an integral part, making it a fundamentally human effort and one that offers the highest potential for context success or flaws. Where MT is not intuitive and cannot translate with precision, human review can sense the nature of the work and change it appropriately, ideally to yield correct and natural-sounding lingo. We will likely see an end to CAT systems in the next decade with the advancement of AI as humans have the potential to suffer interest of misguidance, misinterpretation or bias towards content. Theoretically, given AI’s complete lack of social interest or emotional bias, it will be able to translate within the correct locale without any inserted  or unintentional bias. Furthermore, with QC, AI will be able to measure the tone and emotional reception of words and apply translations within the locale context without misinterpretation.   Andovar: Keeping up with the times At Andovar, we are as interested as you are in the evolutionary steps of the internet and the world at large. We want to be right there at the forefront of technology disruptions, integrating new system updates as they become available. With that in mind, our staff has practiced long and hard to become experts in the best systems presently available, and right at the front in this age is CAT. Our team is prepared to help you access the best translation hardware and software available with the promise that as soon as new tech is available, we will be there to help you smoothly transition to them in the future. Andovar’s Language Technology Tools are here to make your life a little easier. Please feel free to get in touch if you have any questions about translation technologies or to see how we can help you with your next localization project!   hbspt.cta._relativeUrls=true;hbspt.cta.load(6085627, '377ab199-f6a5-4365-8bc7-273fe5898506', {});
自从苹果电脑和软盘时代以来,科技已经取得了长足的进步。随着技术的每一次更新(称为技术颠覆),我们在将技术塑造成得越来越容易接触和使用方面获得了更好的技能。当今时代,互联网作为人类创新的无可争辩的成果之一屹立不倒。最初,互联网本身是一种技术上的颠覆,现在它已经被广泛使用,也在经历自己的周期性变化,比如从Web0.0到Web5.0。在每一次升级中,用户获得了对组成它的许多系统的更大控制权,然后根据需要塑造和使用它。 随着互联网及其众多组成部分的迅速发展,翻译技术也随之发展起来。现在,互联网成为人们之间轻松联系的一种手段,翻译也变得更加自动化,从大型组织到家庭中的个人,每个人都可以与全球各地的其他人进行联系和交流。 有鉴于此,我们将快速回顾互联网的发展历程以及技术的发展方向,包括网络和语言翻译技术。 定义从Web 0.0到5.0的发展 Web0.0:由蒂姆·伯纳斯·李发明,这标志着万维网的诞生。虽然因特网最初是作为一个封闭系统发展起来的,早在20世纪60年代仅仅存在于少数几个计算机系统之间,但在1990年出现了第一批网络服务器和浏览器。 Web1.0:1991年向公众推出,它最初是作为一个web的“只读”静态版本,最初是作为用户使用信息的一种方式出现的。到1995年,第一批网上购物车诞生了,雅虎,谷歌,亚马逊和易趣等公司开始作为成功的网络服务商出现。 Web2.0:这是1999年的“读写”时期,网络用户自己成为出版商,将他们自己的创意和内容发布到平台上,并促进了交友网和博客使用者等网站的推出。2001年推出了众包的百科全书维基百科,2003年和2004年推出了我的空间和脸书等社交媒体网站,2005年推出了YouTube。在此期间出现的创新和平台不胜枚举,个体之间的连接和表达也创下了历史新高。 Web3.0:2006年Web的“读-写-执行”时代,也是大多数人最熟悉的时代。随着用户编码和分析数据的能力提高,个性化取得了一个令人印象深刻的飞跃。此外,不仅人们以更高的速率连接和使用信息,网络应用程序也是如此。现在是web的语义执行阶段,计算机到计算机系统已经自动化。储存在因特网上的知识现在可以通过后端程序进行表达,从而可以以较高的效率分析信息。虽然收集的数据不一定被这些系统所理解或适当地情景化,但可以收集到的丰富知识促进了发展的巨大飞跃。人工智能,机器学习,增强现实和虚拟现实都开始成为现实。 Web4.0:术语“移动网络”是与Web3.0同时出现的,并且几乎是同时运行的。在此之前,人们浏览的是web的静态版本,而现在则是动态的。这一时期出现了电话,蓝牙设备,智能手表以及各种个人物品和家庭系统连接到互联网上进行快速使用的现象。我们进入了一个移动设备可以让我们每时每刻保持联系的时代。无论是在家里还是在旅途中,社会在我们生活的各个方面都在稳步地变得越来越数字化。 Web5.0:这是网络发展的一个有趣的阶段,因为它至今仍处于发展阶段。这一时期实质上是引入了‘智能网络’,或‘情感网’。系统现在可以开始基于检测和预测算法来衡量感情,需求和欲望,从而帮助加快我们的个人生活进程。因此,网站开始为每个用户传达不同的,更个人化的体验来增强这种互动的有效性。 S型曲线 S型曲线(数学术语中的“Sigmoid函数”)是一种表示可伸缩性如何扩展的数据尺度。基本上,随着时间的推移,过去少数人可以获得的东西,到现在许多人都有机会接触。本例中的S型曲线表示专有服务,工具,公用程式和特性。就像建房子一样,一旦地基打好了,你就可以在上面继续建造,增加有益的特性和功能,同时继续进一步改进。 正如网络是如何发展并同时变得更智能和更易访问一样,S型曲线在技术和软件的发展中也是重要的考虑因素,因为它是系统扩展和增长的必要条件。按照这种模式,机器翻译(MT)到计算机辅助翻译(CAT),到人工智能(AI),再到量子计算(QC),S型曲线在每次变化的过渡中似乎是不可避免的。 机器翻译的利弊 机器翻译,或者说MT,仍然有很多东西可以提供。大量的机器翻译是后端编码的执行。这在金融方面尤其准确,比如兑换货币。 由于机器翻译已经集成度非常高了,它与人工智能相结合在我们目前的编码系统中不会有任何发展。随着Web5.0的出现,我们将看到一个新的系统格式引入,它将开始整合新的人工智能量子计算系统,这不仅将淘汰机器翻译,而且还将消除我们与代码和流程根本上进行交互的方式。它将是未来的技术颠覆(就像未来的Web6.0和7.0版本),在那里我们将真正看到技术的转变,硬件和软件将逐步淘汰并且完全被人工智能/量子计算兼容的软件所取代。 在我们目前的网络翻译中,前端的机器翻译在翻译中有一个主要的缺点--它缺乏辨别文化,地点和社会差异的能力,因此常常需要大量以后期编辑的形式出现的人工干预。试图从开源机器翻译引擎(如谷歌翻译)获得正确的原始翻译,在某些语言和内容类型中可能会有问题,因为系统设置为通过词典而不是社会方言进行翻译。这意味着错误翻译可能是一个常见的问题。从根本上说,机器翻译并不是直观的,同时直观的人工智能开始在更多的系统中出现使得这正迅速成为一个问题。 计算机辅助翻译时代的终结? 计算机辅助翻译(Computer-aided translation,简称CAT)是人类译者利用计算机软件和硬件来支持和促进翻译的过程。此时,力求翻译清晰,简洁是最好的保证。计算机辅助翻译过程可以包括特定于区域(现场)翻译的写作系统,它带有文档编辑器,术语管理和翻译记忆。 计算机辅助翻译可能是计算机自动化系统中唯一一个使人类保持在日常过程中的系统。即使有了计算机辅助翻译系统,人眼审查工作的需求仍然是不可或缺的一部分,这是人类所需的基本的努力,它也提供了检测机器翻译取得成功或出现缺陷最大的可能性。当机器翻译不直观、不精准时,人工反馈可以反映出工作的性质,并适当地纠正它,最好最终能够产生正确的、地道的行话。 随着人工智能的发展,我们很可能在未来十年见证计算机辅助翻译系统的终结,因为人类有可能遭受机器翻译误读、误译内容。理论上,鉴于人工智能完全缺乏社会兴趣或情感偏见,它将能够在正确的语言环境内进行翻译,而不会有任何插入或无意的偏见。此外,有了量子计算,人工智能将能够感知单词表达的语气和情感,并在现场语境中应用翻译而不会产生曲解。 安多瓦尔:与时俱进 在安多瓦尔,我们和你一样对互联网和整个世界的发展历程感兴趣。我们希望站在技术革新的最前沿,在新系统更新可用时对其进行整合。考虑到这一点,我们的员工经过长期艰苦的实践,成为目前可用的最佳系统的专家,而在这个时代走在最前面的就是计算机辅助翻译系统。 我们的团队准备帮助您获得最好的翻译硬件和软件,并承诺一旦新技术可用,我们未来将帮助您顺利过渡到新技术。安多瓦尔的语言技术工具帮助你的生活变得更顺畅。 如果您对翻译技术有任何疑问,请随时联系我们,或者了解我们如何帮助您完成下一个本地化项目! hbspt.cta._relativeUrls=true;hbspt.cta.load(6085627, '377ab199-f6a5-4365-8bc7-273fe5898506', {});

以上中文文本为机器翻译,存在不同程度偏差和错误,请理解并参考英文原文阅读。

阅读原文