Revert word breakers used by Search to previous version (SQL Server Search)
Applies to: SQL Server Azure SQL Database
SQL Server installs and enables a version of the word breakers and stemmers for all languages supported by Full-Text Search with the exception of Korean. This article describes how to switch from this version of these components to the previous version, or to switch back from the previous version to the new version.
This article does not discuss the following languages:
English. To revert or restore the English components, see Change the Word Breaker Used for US English and UK English.
Danish, Polish, and Turkish. The third-party word breakers for Danish, Polish, and Turkish that were included with previous releases of SQL Server have been replaced with Microsoft components.
Czech and Greek. There are new word breakers for Czech and Greek. Previous releases of SQL Server Full-Text Search did not include support for these two languages.
Korean. The word breaker and stemmer for the Korean language are not upgraded in this release.
For general information about word breakers and stemmers, see Configure and Manage Word Breakers and Stemmers for Search.
Overview of reverting and restoring word breakers and stemmers
The instructions for reverting and restoring word breakers and stemmers depend on the language. The following table summarizes the three sets of actions that may be required to revert to the previous version of the components.
Current file | Previous file | Number of affected languages | Action for files | Action for registry entries |
---|---|---|---|---|
NaturalLanguage6.dll | NaturalLanguage6.dll | 34 | Obtain and install a previous version of NaturalLanguage6.dll, overwriting the current version of the file. | No action required. The registry keys and values have not changed for this release. |
(Other file name) | NaturalLanguage6.dll | 5 | Obtain and install a previous version of NaturalLanguage6.dll, overwriting the current version of the file. | Change a set of registry entries to specify the previous version of the components. |
(Other file name) | (Other file name) | 6 | No action required. SQL Server setup copies both the current and the previous versions of the components to the Binn folder. |
Change a set of registry entries to specify the previous version of the components. |
Warning
If you replace the current version of the file NaturalLanguage6.dll with a different version, then the behavior of all the languages that use this file is affected.
The files described in this article are DLL files that are installed in the MSSQL\Binn
folder for the SQL Server instance. The full path is typically the following path:
C:\Program Files\Microsoft SQL Server\<instance>\MSSQL\Binn
Languages for which the file name of both the current and previous word breaker is NaturalLanguage6.dll
For the languages in the following table, the file name of both the current and previous word breaker is NaturalLanguage6.dll. To revert or restore these components, you have to overwrite NaturalLanguage6.dll with a different version of the same file. You do not have to change any registry entries, because the registry entries have not changed for this release.
Warning
If you replace the current version of the file NaturalLanguage6.dll with a different version, then the behavior of all the languages that use this file is affected.
List of affected languages
Language | Abbreviation used in the registry |
LCID |
---|---|---|
Bengali | ben |
1093 |
Bulgarian | bgr |
1026 |
Catalan | cat |
1027 |
Spanish | esn |
3082 |
French | fra |
1036 |
Gujarati | guj |
1095 |
Hebrew | heb |
1037 |
Hindi | hin |
1081 |
Croatian | hrv |
1050 |
Indonesian | ind |
1057 |
Icelandic | isl |
1039 |
Italian | ita |
1040 |
Kannada | kan |
1099 |
Lithuanian | lth |
1063 |
Latvian | lvi |
1062 |
Malayalam | mal |
1100 |
Marathi | mar |
1102 |
Malay | msl |
1086 |
Neutral | Neutral |
0000 |
Norwegial Bokmaal | nor |
1044 |
Punjabi | pan |
1094 |
Portuguese (Brazil) | ptb |
1046 |
Portuguese | ptg |
2070 |
Romanian | rom |
1048 |
Slovak | sky |
1051 |
Slovenian | slv |
1060 |
Serbian - Cyrillic | srb |
3098 |
Serbian - Latin | srl |
2074 |
Swedish | sve |
1053 |
Tamil | tam |
1097 |
Telugu | tel |
1098 |
Ukrainian | ukr |
1058 |
Urdu | urd |
1056 |
Vietnamese | vit |
1066 |
The preceding table is sorted alphabetically on the Abbreviation column.
To revert to the previous components
Navigate to the Binn folder described above.
Back up the SQL Server version of NaturalLanguage6.dll to another location.
Copy the previous version of NaturalLanguage6.dll from the Binn folder of an instance of SQL Server 2008 R2 (10.50.x) or SQL Server 2008 (10.0.x) into the Binn folder of the SQL Server instance.
Warning
This change affects all the languages that use NaturalLanguage6.dll in both the current and previous version.
Restart SQL Server.
To restore the current components
Navigate to the location where you backed up the SQL Server version of NaturalLanguage6.dll.
Copy the current version of NaturalLanguage6.dll from the backup location into the Binn folder of the SQL Server instance.
Warning
This change affects all the languages that use NaturalLanguage6.dll in both the current and previous version.
Restart SQL Server.
Languages for which the file name of the previous word breaker only is NaturalLanguage6.dll
For the languages in the following table, the file name of the previous word breaker is different from the file name of the new version. The previous file name is NaturalLanguage6.dll. To revert to the previous version, you have to overwrite the current version of NaturalLanguage6.dll with an earlier version of the same file. You also have to change a set of registry entries to specify the previous or current version of the components.
Warning
If you replace the current version of the file NaturalLanguage6.dll with a different version, then the behavior of all the languages that use this file is affected.
List of affected languages
Language | Abbreviation used in the registry |
LCID |
---|---|---|
Arabic | ara | 1025 |
German | deu | 1031 |
Japanese | jpn | 1041 |
Dutch | nld | 1043 |
Russian | rus | 1049 |
The preceding table is sorted alphabetically on the Abbreviation column.
Use the following instructions together with the list of values in the section File names and registry values for reverting and restoring word breakers and stemmers.
To revert to the previous components
Navigate to the Binn folder described above.
Do not remove the files for the current version of the components from the Binn folder.
Back up the SQL Server version of NaturalLanguage6.dll to another location.
Copy the previous version of NaturalLanguage6.dll from the Binn folder of an instance of SQL Server 2008 R2 (10.50.x) or SQL Server 2008 (10.0.x) into the Binn folder of the new SQL Server instance.
Warning
This change affects all the languages that use NaturalLanguage6.dll in both the current and previous version.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\CLSID.
Use the following steps to add new keys for the COM ClassIDs for the previous word breaker and stemmer interfaces for the selected language:
Add a new key with the value from the table for the previous word breaker.
Update the (Default) data of that key value to the file name of the previous word breaker from the table.
If the selected language uses a stemmer, then add a new key with the value from the table for the previous stemmer.
If the selected language uses a stemmer, then update the (Default) data of that key value to the file name of the previous stemmer from the table.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\Language<language_key>. <language_key> represents the abbreviation for the language that is used in the registry; for example, "fra" for French and "esn" for Spanish.
Update the WBreakerClass key value to the value from the table for the current word breaker.
If the selected language uses a stemmer, then update the StemmerClass key value to the value from the table for the current stemmer.
Restart SQL Server.
To restore the current components
Navigate to the location where you backed up the SQL Server version of NaturalLanguage6.dll.
Copy the current version of NaturalLanguage6.dll from the backup location into the Binn folder of the SQL Server instance.
Warning
This change affects all the languages that use NaturalLanguage6.dll in both the current and previous version.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\CLSID.
If the following keys do not exist, then use the following steps to add new keys for the COM ClassIDs for the current word breaker and stemmer interfaces for the selected language:
Add a new key with the value from the table for the current word breaker.
Update the (Default) data of that key value to the file name of the current word breaker from the table.
If the selected language uses a stemmer, then add a new key with the value from the table for the current stemmer.
If the selected language uses a stemmer, then update the (Default) data of that key value to the file name of the current stemmer from the table.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\Language<language_key>. <language_key> represents the abbreviation for the language that is used in the registry; for example, "fra" for French and "esn" for Spanish.
Update the WBreakerClass key value to the value from the table for the previous word breaker.
If the selected language uses a stemmer, then update the StemmerClass key value to the value from the table for the previous stemmer.
Restart SQL Server.
File names and registry values for reverting and restoring word breakers and stemmers
Use the following list of file names and registry entries together with the instructions in the preceding section. Use the previous values to revert to the previous version, or use the current values to restore the current version of the components.
The following listed is sorted alphabetically on the abbreviation used for each language.
Arabic (ara), LCID 1025
Component | Word breaker | Stemmer |
---|---|---|
Previous CLSID | 7EFD3C7E-9E4B-4a93-9503-DECD74C0AC6D | 483B0283-25DB-4c92-9C15-A65925CB95CE |
Previous file name | NaturalLanguage6.dll | NaturalLanguage6.dll |
Current CLSID | 04b37e30-c9a9-4a7d-8f20-792fc87ddf71 | None |
Current file name | MSWB7.dll | None |
German (deu), LCID 1031
Component | Word breaker | Stemmer |
---|---|---|
Previous CLSID | 45EACA36-DBE9-4e4a-A26D-5C201902346D | 65170AE4-0AD2-4fa5-B3BA-7CD73E2DA825 |
Previous file name | NaturalLanguage6.dll | NaturalLanguage6.dll |
Current CLSID | dfa00c33-bf19-482e-a791-3c785b0149b4 | 8a474d89-6e2f-419c-8dd5-9b50edc8c787 |
Current file name | MsWb7.dll | MsWb7.dll |
Japanese (jpn), LCID 1041
Component | Word breaker | Stemmer |
---|---|---|
Previous CLSID | E1E8F15E-8BEC-45df-83BF-50FF84D0CAB5 | 3D5DF14F-649F-4cbc-853D-F18FEDE9CF5D |
Previous file name | NaturalLanguage6.dll | NaturalLanguage6.dll |
Current CLSID | 04096682-6ece-4e9e-90c1-52d81f0422ed | None |
Current file name | MsWb70011.dll | None |
Dutch (nld), LCID 1043
Component | Word breaker | Stemmer |
---|---|---|
Previous CLSID | 2C9F6BEB-C5B0-42b6-A5EE-84C24DC0D8EF | F7A465EE-13FB-409a-B878-195B420433AF |
Previous file name | NaturalLanguage6.dll | NaturalLanguage6.dll |
Current CLSID | 69483c30-a9af-4552-8f84-a0796ad5285b | CF923CB5-1187-43ab-B053-3E44BED65FFA |
Current file name | MsWb7.dll | MsWb7.dll |
Russian (rus), LCID 1049
Component | Word breaker | Stemmer |
---|---|---|
Previous CLSID | 2CB6CDA4-1C14-4392-A8EC-81EEF1F2E079 | E06A0DDD-E81A-4e93-8A8D-F386C3A1B670 |
Previous file name | NaturalLanguage6.dll | NaturalLanguage6.dll |
Current CLSID | aaa3d3bd-6de7-4317-91a0-d25e7d3babc3 | d42c8b70-adeb-4b81-a52f-c09f24f77dfa |
Current file name | MsWb7.dll | MsWb7.dll |
Languages for which neither the previous nor the current file name is NaturalLanguage6.dll
For the languages in the following table, the file names of the previous word breakers and stemmers are different from the file names of the new versions. Neither the previous nor the current file name is NaturalLanguage6.dll. You do not have to replace any files, because SQL Server setup copies both the current and the previous versions of the components to the Binn folder. However you have to change a set of registry entries to specify the previous or current version of the components.
List of affected languages
Language | Abbreviation used in the registry |
LCID |
---|---|---|
Simplified Chinese | chs | 2052 |
Traditional Chinese | cht | 1028 |
Thai | tha | 1054 |
Chinese Traditional | zh-hk | 3076 |
Chinese Traditional | zh-mo | 5124 |
Chinese Simplified | zh-sg | 4100 |
The preceding table is sorted alphabetically on the Abbreviation column.
Use the following instructions together with the list of values in the section File names and registry values for reverting and restoring word breakers and stemmers.
To revert to the previous components
Do not remove the files for the current version of the components from the Binn folder.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\CLSID.
Use the following steps to add new keys for the COM ClassIDs for the previous word breaker and stemmer interfaces for the selected language:
Add a new key with the value from the table for the previous word breaker.
Update the (Default) data of that key value to the file name of the previous word breaker from the table.
If the selected language uses a stemmer, then add a new key with the value from the table for the previous stemmer.
If the selected language uses a stemmer, then update the (Default) data of that key value to the file name of the previous stemmer from the table.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\Language<language_key>. <language_key> represents the abbreviation for the language that is used in the registry; for example, "fra" for French and "esn" for Spanish.
Update the WBreakerClass key value to the value from the table for the current word breaker.
If the selected language uses a stemmer, then update the StemmerClass key value to the value from the table for the current stemmer.
Restart SQL Server.
To restore the previous components
Do not remove the files for the previous version of the components from the Binn folder.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\CLSID.
If the following keys do not exist, then use the following steps to add new keys for the COM ClassIDs for the current word breaker and stemmer interfaces for the selected language:
Add a new key with the value from the table for the current word breaker.
Update the (Default) data of that key value to the file name of the current word breaker from the table.
If the selected language uses a stemmer, then add a new key with the value from the table for the current stemmer.
If the selected language uses a stemmer, then update the (Default) data of that key value to the file name of the current stemmer from the table.
In the registry, navigate to the following node: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server<InstanceRoot>\MSSearch\Language<language_key>. <language_key> represents the abbreviation for the language that is used in the registry; for example, "fra" for French and "esn" for Spanish.
Update the WBreakerClass key value to the value from the table for the previous word breaker.
If the selected language uses a stemmer, then update the StemmerClass key value to the value from the table for the previous stemmer.
Restart SQL Server.
File names and registry values for reverting and restoring word breakers and stemmers
Use the following list of file names and registry entries together with the instructions in the preceding section. Use the previous values to revert to the previous version, or use the current values to restore the current version of the components.
The following listed is sorted alphabetically on the abbreviation used for each language.
Simplified Chinese (chs), LCID 2052
Component | Word breaker |
---|---|
Previous CLSID | 12CE94A0-DEFB-11D2-B31D-00600893A857 |
Previous file name | chsbrkr.dll |
Current CLSID | E0831C90-BAB0-4ca5-B9BD-EA254B538DAC |
Current file name | MsWb70804.dll |
Traditional Chinese (cht), LCID 1028
Component | Word breaker |
---|---|
Previous CLSID | 1680E7C3-9430-4A51-9B82-1E7E7AEE5258 |
Previous file name | chtbrkr.dll |
Current CLSID | E9B1DF65-08F1-438b-8277-EF462B23A792 |
Current file name | MsWb70404.dll |
Thai (tha), LCID 1054
Component | Word breaker | Stemmer |
---|---|---|
Previous CLSID | CCA22CF4-59FE-11D1-BBFF-00C04FB97FDA | CEDC01C7-59FE-11D1-BBFF-00C04FB97FDA |
Previous file name | Thawbrkr.dll | Thawbrkr.dll |
Current CLSID | F70C0935-6E9F-4ef1-9F06-7876536DB900 | None |
Current file name | MsWb7001e.dll | None |
Chinese Traditional (zh-hk), LCID 3076
Component | Word breaker |
---|---|
Previous CLSID | 1680E7C3-9430-4A51-9B82-1E7E7AEE5258 |
Previous file name | chtbrkr.dll |
Current CLSID | E9B1DF65-08F1-438b-8277-EF462B23A792 |
Current file name | MsWb70404.dll |
Chinese Traditional (zh-mo), LCID 5124
Component | Word breaker |
---|---|
Previous CLSID | 1680E7C3-9430-4A51-9B82-1E7E7AEE5258 |
Previous file name | chtbrkr.dll |
Current CLSID | E9B1DF65-08F1-438b-8277-EF462B23A792 |
Current file name | MsWb70404.dll |
Chinese Simplified (zh-sg), LCID 4100
Component | Word breaker |
---|---|
Previous CLSID | 12CE94A0-DEFB-11D2-B31D-00600893A857 |
Previous file name | chsbrkr.dll |
Current CLSID | E0831C90-BAB0-4ca5-B9BD-EA254B538DAC |
Current file name | MsWb70804.dll |
See Also
Change the Word Breaker Used for US English and UK English
Behavior Changes to Full-Text Search