Language Technology Made in India

印度制造的语言技术

2020-08-17 17:00 Nimdzi Insights

本文共239个字,阅读需3分钟

阅读模式 切换至中文

Report by Rucha L. Sheth. After a slow entry into the language technology space, India promises an interesting journey moving forward, as user preferences increasingly lean towards native language content. The Indian language demographics are as diverse as it gets, with 22 official languages. In all, there are 122 major and 1,599 minor languages. Most of the 10 most widely spoken languages have distinct writing systems. The Digital 2020: India report by Datareportal on India’s digital trends, released in February 2020, reports 687.6 million internet users. What’s most interesting is that, of these 687.6 million users, only 175 million speak English, English being their second or even third language. Rural India is the main contributor to this growth, registering a 35 percent increase in new users as compared to a mere 7 percent in urban India. It is important to add here that rural India prefers content in native languages while urban India primarily consumes English content. Clearly the demand for content in native languages is on a rise. Source: Times Internet for Marketers What’s more, a big chunk of the native language content is being consumed via smartphones. Looking at usage patterns for content in India’s eight most widely spoken languages shows that mobile phones are the clear winner, with Marathi being an exception. Source: Times Internet for Marketers The increase of Internet penetration among non-English speaking users has been the principal factor spurring the development of language technology and language digitization in India.
Rucha L.Sheth报道。 在缓慢进入语言技术领域之后,印度有望迎来一段有趣的前进之旅,因为用户的偏好越来越倾向于母语内容。 使用印度语的人口结构非常多样化,它们共有22种官方语言。主要语言有122种,次要语言有1599种。在这10种使用最广泛的语言中,大多数都有不同的书写系统。 2020年2月,数据门户网站发布了《数字2020:印度数字趋势报告》,报告称有6.876亿互联网用户。最有趣的是,在这6.876亿用户中,只有1.75亿人说英语,英语是他们的第二甚至第三语言。 印度农村地区是这一增长的主要贡献者,新用户增长了35%,而城市地区仅增长了7%。在此需要补充的是,印度农村地区更喜欢母语内容,而城市地区则主要使用英语内容。显然,对母语内容的需求正在上升。 来源:时代营销互联网 更重要的是,很大一部分母语内容是通过智能手机使用的。查看印度使用最广泛的八种语言的内容的使用模式可以发现,手机是明显的赢家,马拉地语则是个例外。 来源:时代营销互联网 在印度,非英语使用者中互联网普及率的提高是推动语言技术和语言数字化发展的主要因素。

以上中文文本为机器翻译,存在不同程度偏差和错误,请理解并参考英文原文阅读。

阅读原文