With so many new technologies and AI tools emerging in the market, it can be difficult for enterprise translation and/or localization customers to understand the differences between custom engine development and LLM fine-tuning for specific content types (e.g., marketing copy, legal contracts, e-commerce product descriptions). This blog post will provide a better understanding of these two approaches and explain how In-Context Learning (ICL) can be used to improve the accuracy and efficiency of translation.
Custom Engine Development vs. Adaptive Engine Training
When it comes to machine translation, two of the most popular approaches are custom engine development and adaptive engine training. While both approaches have their own unique advantages and challenges, the difference between custom engine training and LLM fine-tuning lies in how the model's parameters are adjusted to better fit specific content types. Custom engine development involves creating a new system from scratch, while LLM fine-tuning focuses on training existing models to recognize and respond to specific types of content.
Custom Engine Development
Custom engine development is the process of training a model's parameters once using a content-specific dataset and deploying those parameters. In the past, the standard approach to translation was to develop a custom engine for each content type. This involved collecting a large dataset of source and target text pairs for the specific content type, and then training a machine translation model on this data. Once the model was trained, it could be deployed to translate new text.
This approach has been used for many decades and can produce high-accuracy models. However, it is also a very time-consuming and expensive process. It can take months or even years to collect enough data and train a high-quality model. Additionally, custom engines are not always able to keep up with the latest changes in language. This is because they are trained on a static dataset of data, which can become outdated over time. If a company wants to train the model on a new example, it must retrain the entire model and deploy it again, which can be a time-consuming and tedious process and lead to stale models.
Adaptive Engine Training
Adaptive engine training, on the other hand, is a newer approach to translation that addresses some of the limitations of custom engine development and involves continuously updating the deployed model's parameters with a constant stream of new training examples. This approach eliminates the need to retrain and redeploy the entire model, ensuring the model is always trained on the most recent data and improves its accuracy over time. Lilt pioneered and began developing this technology in 2015, revolutionizing the way enterprise companies approach enterprise translation. Adaptive engine training is more efficient than custom engine development, but it is still not as efficient as In-Context Learning. This is because adaptive engine training requires a large amount of data to train the model.
Where We Are Now with In-Context Learning/LLM Fine-Tuning
In-Context Learning or LLM Fine-Tuning takes the adaptive engine training approach further by allowing for rapid customization of a single model to a specific content type. With In-Context Learning, a machine translation model is trained on a single example of the content type to be translated. This allows the model to be customized to the specific content type without the need for a large dataset of training data.
In recent years, In-Context Learning or LLM Fine-Tuning has become the preferred approach for achieving accuracy and efficiency in translation. This approach allows for continuous training of a deployed model and comprises two techniques: fine-tuning and few-shot prompting.
Fine-Tuning
Fine-tuning is the process of adjusting the model's parameters for each new example. It is a technique specific to neural networks that adjusts the model's parameters for each new example, leading to even higher accuracy.
Few-Shot Prompting
Few-shot prompting involves adding training examples to the input of the deployed model to influence the model's output without adjusting its parameters.
One of the biggest advantages of ICL and LLM Fine-Tuning is that it eliminates the need for multiple model evaluations. With a single model, ICL allows for the rapid customization of models for specific content types, resulting in even higher accuracy and efficiency.
That said, In-Context Learning is the most efficient and accurate approach to translation. It is a newer approach, but it is quickly becoming the standard for translation.
Final Considerations: Choosing the Right Approach for Your Business
As with any technology, there is no one-size-fits-all solution when it comes to custom engine development versus LLM fine-tuning. The approach you choose should be tailored to your specific needs, budget, and timeline. By carefully considering these factors, you can choose the approach that will deliver the best results for your company.
Consider the resources you have available to you. Custom engine development requires a significant investment of time, money, and expertise. If you don't have these resources, or if your content is constantly evolving, LLM fine-tuning may be a more practical solution for your business.
Finally, consider your long-term goals. If you plan on scaling your translation efforts and adding new content types in the future, a custom engine may be a better investment. However, if you're looking for a more flexible solution that can adapt to changing content needs, LLM fine-tuning may be the way to go.
While both custom engine training and adaptive engine training can produce high-accuracy models, ICL allows for customization and continuous training, leading to better results for specific content types and making it the preferred approach for many businesses. Ultimately, the right approach for your business will depend on your specific needs and requirements.
随着市场上涌现出如此多的新技术和人工智能工具,企业翻译和/或本地化客户可能很难理解定制引擎开发和LLM微调之间的差异。营销文案、法律合同、电子商务产品说明)。这篇博客文章将提供对这两种方法的更好理解,并解释如何使用上下文学习(ICL)来提高翻译的准确性和效率。
自定义引擎开发与自适应发动机训练
在机器翻译方面,两种最流行的方法是自定义引擎开发和自适应引擎训练。虽然这两种方法都有自己独特的优势和挑战,但自定义引擎训练和LLM微调之间的区别在于如何调整模型的参数以更好地适应特定的内容类型。自定义引擎开发涉及从头开始创建新系统,而LLM微调则侧重于训练现有模型来识别和响应特定类型的内容。
自定义引擎开发
自定义引擎开发是使用特定于内容的数据集训练模型参数并部署这些参数的过程。过去,标准的翻译方法是为每种内容类型开发一个自定义引擎。这涉及收集特定内容类型的源和目标文本对的大型数据集,然后在此数据上训练机器翻译模型。一旦模型经过训练,就可以部署它来翻译新的文本。
这种方法已经使用了几十年,可以产生高精度的模型。然而,这也是一个非常耗时和昂贵的过程。收集足够的数据并训练高质量的模型可能需要数月甚至数年的时间。此外,自定义引擎并不总是能够跟上语言的最新变化。这是因为它们是在静态数据集上训练的,随着时间的推移,这些数据集可能会过时。如果一家公司想要在一个新的示例上训练模型,它必须重新训练整个模型并再次部署它,这可能是一个耗时而繁琐的过程,并导致模型过时。
自适应发动机训练
另一方面,自适应引擎训练是一种较新的翻译方法,它解决了自定义引擎开发的一些限制,并涉及使用恒定的新训练示例流不断更新部署模型的参数。这种方法消除了重新训练和重新部署整个模型的需要,确保模型始终基于最新数据进行训练,并随着时间的推移提高其准确性。Lilt于2015年率先开始开发这项技术,彻底改变了企业公司处理企业翻译的方式。自适应引擎训练比自定义引擎开发更有效,但仍然不如上下文学习有效。这是因为自适应引擎训练需要大量数据来训练模型。
我们现在在哪里与上下文学习/LLM微调
上下文学习或LLM微调通过允许将单个模型快速定制为特定内容类型,进一步采用自适应引擎训练方法。通过上下文学习,机器翻译模型在要翻译的内容类型的单个示例上进行训练。这允许模型被定制为特定的内容类型,而不需要训练数据的大数据集。
近年来,语境学习或LLM微调已成为实现翻译准确性和效率的首选方法。该方法允许对部署的模型进行连续训练,并且包括两种技术:微调和少量提示。
微调
微调是针对每个新示例调整模型参数的过程。这是一种特定于神经网络的技术,它可以为每个新的示例调整模型的参数,从而获得更高的准确性。
少镜头提示
少镜头提示涉及将训练示例添加到已部署模型的输入以影响模型的输出而不调整其参数。
ICL和LLM微调的最大优点之一是它消除了对多个模型评估的需要。通过单一模型,ICL允许针对特定内容类型快速定制模型,从而实现更高的准确性和效率。
也就是说,语境学习是最有效和准确的翻译方法。这是一种较新的方法,但它正在迅速成为翻译的标准。
最后考虑事项:为您的业务选择正确的方法
与任何技术一样,当涉及到自定义引擎开发与LLM微调时,没有一种通用的解决方案。您选择的方法应根据您的特定需求、预算和时间轴进行定制。通过仔细考虑这些因素,您可以选择将为您的公司提供最佳结果的方法。
考虑一下你所拥有的资源。定制引擎开发需要大量的时间、金钱和专业知识的投入。如果您没有这些资源,或者您的内容不断发展,LLM微调可能是您业务的更实用的解决方案。
最后,考虑一下你的长期目标。如果您计划在未来扩展翻译工作并添加新的内容类型,自定义引擎可能是更好的投资。但是,如果您正在寻找一个更灵活的解决方案,可以适应不断变化的内容需求,LLM微调可能是要走的路。
虽然自定义引擎训练和自适应引擎训练都可以生成高精度模型,但ICL允许自定义和持续训练,从而为特定内容类型带来更好的结果,并使其成为许多企业的首选方法。最终,适合您业务的正确方法将取决于您的特定需求和要求。
以上中文文本为机器翻译,存在不同程度偏差和错误,请理解并参考英文原文阅读。
阅读原文