Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Azure AI Foundry custom translation, formerly known as Custom Translator, empowers businesses, app developers, and language service providers to craft tailored neural machine translation (NMT) systems. Integrated within the Azure AI Translator service, it allows users to adapt translations to suit specific requirements and contexts. The custom translation feature provides translation solutions that effortlessly integrate with your current applications, workflows, and websites.
Azure AI Foundry custom translation is built on the robust, cloud-based Translator Text API—the same secure, high-performance platform that powers billions of translations daily—Custom translation enables you to fine-tune and deploy translation systems to and from English. It supports over 100 languages, directly mapping to the extensive NMT capabilities available in the service. For a complete list of supported languages, see to Translator language support.
Custom translation makes building and deploying a custom translation system effortless—even if you have no programming experience. Its intuitive user interface lets you seamlessly upload data, train and test your models, and securely deploy them to a production environment. Depending on your training data size, your translation system can be up and running at scale within a few hours.
If you prefer programmatic control, a dedicated API is available. This API enables you to manage model creation and updates directly from your own applications or web services, integrating custom translation capabilities into your existing workflows.
Features
Custom translation provides different features to build custom translation system.
Feature | Description |
---|---|
Apply neural machine translation technology | Improve your translation by applying neural machine translation (NMT) provided by Custom translator. |
Build systems that knows your business terminology | Customize and build translation systems using parallel documents that understand the terminologies used in your own business and industry. |
Use a dictionary to build your models | If you don't have training data set, you can train a model with only dictionary data. |
Access your custom translation model | You can access your custom translation model anytime using your existing applications/ programs via Azure AI Translator Text API V3. |
Get better translations
Azure AI Translator released Neural Machine Translation (NMT) in 2016. NMT provided major advances in translation quality over the industry-standard Statistical Machine Translation (SMT) technology. Because NMT better captures the context of full sentences before translating them, it provides higher quality, more human-sounding, and more fluent translations. Custom translation provides NMT for your custom models resulting in better translation quality.
Custom translation also accepts data that's parallel at the document level to make data collection and preparation more effective. If users have access to versions of the same content in multiple languages but in separate documents, custom translation is able to automatically match sentences across documents. For a list of supported document format, see Custom translation document formats and naming convention.
If the appropriate type and amount of training data is supplied, it's not uncommon to see BLEU
score gains between 5 and 10 points by using custom translation.
Build and translate with custom models
You create a custom translation model by fine tuning an Azure AI Translator base model with your own data and terminology.
This article is an overview of how to use fine-tuning to create a custom translation model for your applications across all supported languages.
The steps for creating a custom model are as follows:
Create a project. A project is a work area for composing and building your custom translation system. A project can contain multiple language pairs, models, and documents. All the work you do in custom translation is done inside a specific project.
Create a language pair. A language pair is a wrapper for models, documents, and tests. Each language pair includes all documents that are uploaded into that project with the correct language pair. For example, if you have both an English-to-Spanish language pair and a Spanish-to-English language pair, the same documents are included in both language pairs.
Upload parallel documents. Parallel documents are pairs of texts where one document is the original language and the other is its human translation. Each sentence in the original document corresponds to a translated sentence in the paired document. Importantly, the designations "source" and "target" are interchangeable—a single pair can be used to train a translation system in either direction.
Train a model. A model is the translation system built for a specific language pair, and a successful training session results in a fully functional model. To train a model, you must provide three distinct and mutually exclusive sets of documents: training, tuning, and testing data. If you supply only training data when queuing a training session, custom translation automatically extracts a random subset of sentences from your training documents to serve as the tuning and testing data—ensuring that these sentences aren't used during the actual training.
Note
A minimum of 10,000 parallel sentences is required to train a model effectively.
Test a model. The testing set is used to compute the BLEU score. This score indicates the quality of your translation system.
Deploy a model. Your custom model is made available for runtime translation requests in the regions you select.
Translate text. Use the cloud-based, secure, high performance, highly scalable Azure AI Translator Text API to make translation requests.