语言数据空间网络研讨会系列--翻译技术速递

In line with the European Data Strategy and the launch of the DIGITAL Programme, the European Commission (CNECT.G3’s Multilingualism Sector) organised a series of seven workshops in May. They targeted several business sectors, such as News, Broadcasting, Advertising, Publishing, Language Technology, Telecommunication Industries as well as Libraries, Archives and Public Administrations. The goal was not only to present the Language Data Space concept, but also to gather insights from the different stakeholders’ groups. The Language Data Space aims to give stakeholders the opportunity to monetise their efforts in terms of language resources (data, tools, services, models, etc.), while also supporting the deployment of language models and AI-based language technology services for their businesses − in one single marketplace. Its objective is to create an interconnected and competitive European data economy for the valorisation and re-use of language resources. The Language Data Space will be financed as follows: • Framework Programme: Digital Europe Work Programme 2021-2022 • Type of Action: PROCUREMENT • Indicative Budget: EUR 6 million • Indicative Time of the Call Opening: June – September 2022 • Indicative Starting Date: Early 2023 More than 100 attendees and their representative organisations (e.g., FEP, FEDMA, ETNO, EBU, etc.) participated in these workshops. All of them showed interest and identified several opportunities, for instance, monetising language data, counteracting the European Language Technologies landscape fragmentation and enriching it with high-quality data, covering different modalities, business domains and use cases. In addition, stakeholders maintained that indispensable ‘enabling conditions’ must be implemented and certain challenges have yet to be overcome, on a technical (e.g., promoting standards, normalising metadata, designing and developing the architecture), legal (e.g., complying with GDPR, implementing IPR clearance and correct licensing) or operational (e.g., defining governance, fostering sustainability and interoperability) level. More to come soon – stay tuned!

根据欧洲数据战略和数字计划的启动，欧盟委员会（CNECT.G3的多语言部门）在5月组织了一系列七个研讨会。它们针对若干商业部门，如新闻、广播、广告、出版、语言技术、电信业以及图书馆、档案馆和公共行政部门。其目的不仅是提出语言数据空间的概念，而且还收集来自不同利益相关方团体的见解。语言数据空间旨在为利益相关方提供机会，使其在语言资源（数据、工具、服务、模型等）方面的努力实现货币化。同时还支持在一个市场中部署语言模型和基于人工智能的语言技术服务。它的目标是建立一个相互联系和竞争的欧洲数据经济，以实现语言资源的价值评估和再利用。语言数据空间的资金来源如下： · 框架计划：2021-2022年数字欧洲工作计划 · 操作类型：采购 · 指示性预算：6百万欧元 · 电话会议开始的指示时间：2022年6月至9月 · 指示性起始日期：2023年初 100多名与会者及其代表机构（如：FEP、FEDMA、ETNO、EBU等）参加了这些讲习班。他们都表现出了兴趣，并发现了一些机会，例如，将语言数据货币化，抵消欧洲语言技术领域的碎片化，并通过高质量的数据丰富它，涵盖不同的模式，业务领域和用例。此外，利益攸关方认为，必须落实不可或缺的“扶持条件”，在技术（例如，促进标准、规范元数据、设计和开发体系结构）、法律（例如，遵守GDPR、实施知识产权许可和正确的许可）或可操作（例如，定义治理、促进可持续性和互操作性）级别。更多内容即将推出—敬请期待!

以上中文文本为机器翻译，存在不同程度偏差和错误，请理解并参考英文原文阅读。

阅读原文

机器翻译

工具

翻译管理

本地化