The Winning Papers from the World’s Largest Natural Language Processing Conference

世界最大自然语言处理大会获奖论文

2019-11-18 10:58 slator

本文共567个字,阅读需6分钟

阅读模式 切换至中文

While many major conferences in Hong Kong are being cancelled due to the ongoing protests, academia’s natural language processing (NLP) community gathered in the city for the latest instalment of the world’s largest NLP conference, EMNLP. EMNLP, short for Empirical Methods in Natural Language Processing, is an annual conference that hosts researchers from across the globe. Attendees and presenters gather to learn about the latest developments spanning the breadth of NLP, including machine translation. Along with the regular cohort of researchers from top academic institutions, the conference also attracts significant attention from big tech — Google, Facebook, Baidu, Apple, Salesforce, eBay, Cisco and Amazon were all among the conference’s sponsors. For them, EMNLP is not only an opportunity to share their own latest NLP research findings, but also to actively scout talent from among the conference attendees. Each year, in the run up to the conference, researchers are asked to submit papers for the conference reviewers’ consideration. Typically, around a quarter or so of all papers submitted are accepted into the conference in some form. EMNLP is a high-stakes, high-brow gathering that gives researchers the opportunity to showcase their latest findings to a roomful of their equally NLP-minded peers. The Hong Kong edition of EMNLP — called EMNLP-IJCNLP 2019 (IJC standing for International Joint Conference) — was held on November 5–7, 2019 and featured 465 long papers, 218 short papers, and 44 demo papers. There were just shy of 3,000 submissions, 37% more than in 2018. At the closing ceremony held on November 9, 2019, EMNLP announced the winners of the four awards up for grabs: Best Paper, Best Paper Runner-Up, Best Demo Paper, and Best Resource Paper. The Best Paper Award went to Xiang Lisa Li and Jason Eisner from John Hopkins University for their paper on Specializing Word Embeddings (for Parsing) by Information Bottleneck. The Runners-Up for Best Paper Award were John Hewitt and Percy Liang from Stanford University for Designing and Interpreting Probes with Control Tasks. The Best Demo Paper Award was won by a group of researchers from the Allen Institute for Artificial Intelligence and the University of California, Irvine for their paper on AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models. The Best Resource Paper Award went to a group of machine translation researchers for their work exploring low-resource languages. The paper, entitled “The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali–English and Sinhala–English,” was co-authored by a group of eight researchers from Facebook Applied Machine Learning, Facebook AI Research (FAIR), Sorbonne Universités, and Johns Hopkins University. The paper’s co-authors are Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, and Marc’Aurelio Ranzato. Low-resource languages are a hot topic in machine translation research, and are of a particular preoccupation for big tech companies such as Facebook, Microsoft, and Alibaba. These companies have their own motivations for seeking to understand (and generate) content in languages that are typically difficult for US- or China-centric corporates to penetrate in meaningful ways. The awards were decided by three separate committees: one for Best Paper (and Runner-Up), another for Best Resource Paper, and one for Best Demo Paper. Initial nominations for the awards came from the 1,700 reviewers, 152 area chairs, and 18 senior area chairs, who put forward a shortlist of five candidates for Best Paper and another five for Best Resource paper.
尽管由于持续的抗议活动,香港的许多主要会议被取消,但学术界的自然语言处理 (NLP) 社区聚集在这座城市,参加了世界上最大的NLP会议EMNLP的最新一期。 EMNLP是自然语言处理中经验方法的缩写,是一年一度的会议,接待了来自全球的研究人员。与会者和主持人聚集在一起,了解NLP的最新发展,包括机器翻译。 除了来自顶级学术机构的定期研究人员外,这次会议还吸引了大型科技公司的极大关注-Google,Facebook,百度,苹果,Salesforce,eBay,Cisco和Amazon都是会议的赞助商。对他们来说,EMNLP不仅是分享他们自己最新的NLP研究成果的机会,也是从会议与会者中积极寻找人才的机会。 每年,在会议召开之前,研究人员都被要求提交论文供会议评审员审议。通常,提交的所有论文中约有四分之一左右以某种形式被会议接受。EMNLP是一个高风险,高眉毛的聚会,使研究人员有机会向一群同样具有NLP思想的同行展示他们的最新发现。 香港EMNLP-名为EMNLP-IJCNLP 2019 (代表国际联席会议的IJC)-在2019年11月5至7日举行,收录了465篇长篇论文,218篇短篇论文和44篇演示论文。只有3,000份提交,比2018年多37% 份。 在2019年11月9日举行的闭幕式上,EMNLP宣布了四个奖项的获奖者: 最佳论文,最佳论文亚军,最佳演示论文和最佳资源论文。 最佳论文奖授予了约翰·霍普金斯大学的向丽莎·李 (Xiang Lisa Li) 和杰森·埃斯纳 (Jason Eisner),以表彰他们关于通过信息瓶颈对单词嵌入 (用于解析) 进行专业研究的论文。最佳论文奖的亚军是斯坦福大学的约翰·休伊特 (John Hewitt) 和珀西·梁 (Percy Liang),他们设计和解释了具有控制任务的探针。 艾伦人工智能研究所和加州大学欧文分校的一组研究人员因其关于AllenNLP解释的论文而获得了最佳演示论文奖: 解释NLP模型预测的框架。 最佳资源论文奖授予了一群机器翻译研究人员,他们致力于探索低资源语言。该论文的标题为 “用于低资源机器翻译的FLORES评估数据集: 尼泊尔语-英语和僧伽罗语-英语”,由来自Facebook应用机器学习,Facebook AI研究 (FAIR),索邦大学和约翰·霍普金斯大学的八名研究人员共同撰写。该论文的合著者是弗朗西斯科·古兹曼,彭珍·陈,迈尔·奥特,胡安·皮诺,纪尧姆·兰普尔,菲利普·科恩,维斯拉夫·乔杜里和马克·奥雷里奥·兰扎托。 低资源语言是机器翻译研究中的热门话题,对于Facebook、微软和阿里巴巴等大型科技公司来说,这是一个特别关注的问题。这些公司有自己的动机来寻求理解 (和生成) 语言的内容,而这些语言通常对于以美国或中国为中心的公司来说很难以有意义的方式渗透。 该奖项由三个独立的委员会决定: 一个是最佳论文 (和亚军),另一个是最佳资源论文,另一个是最佳演示论文。该奖项的初步提名来自1,700位审稿人,152位地区主席和18位高级地区主席,他们提出了五名最佳论文候选人和另外五名最佳资源论文候选人的入围名单。

以上中文文本为机器翻译,存在不同程度偏差和错误,请理解并参考英文原文阅读。

阅读原文