I work with a number of people based in China, so I'm not sure if that is why, but I have sometimes found when editing their Word documents that the font automatically switches to 'Chinese' (e.g. SimSun or MS Mincho) when I insert advanced punctuation characters. For example, I might insert an en-dash (–) to amend some hyphens, or I might insert a prime (′).
("Sometimes" = circa 25% of the documents, and then it might be say 20–80% of the text within an affected document.)
(By the way, it is not only me: the latest document exhibits the same issue from primes inserted by the original authors in China.)
Note that I am careful to insert these characters as plain text. For example:
- use the numeric keypad (e.g. type 8210 on numeric keypad while holding Alt to get en-dash);
- use Word shortcut/hotkey (e.g. type - on numeric keypad while holding Ctrl); or
- get desired character from Character Map (in an arbitrary font that contains the character), copy, paste into Notepad (to strip font), copy from Notepad, paste into Word.
To be clear, it is only the font of the inserted (i.e. newly typed) character that is affected. Surrounding (unedited) text is unaffected.
In one case, consecutive sentences within the same paragraph were treated differently. I have anonymised the content in what follows.
In one sentence (Maecenas 18-21 urna.), replacing the hyphen with an en-dash never affected the font (Times New Roman).
In the other sentence (Fusce 20-25 imperdiet 60-80.) replacing the hyphen with an en-dash sometimes didn't affect the font, and sometimes it did. Specifically:
- No effect if the hyphen was deleted first, and then the en-dash was typed in.
- No effect if the en-dash was inserted in front of the hyphen.
- Font switched to MS Mincho (method 1) or SimSun (method 2) if EITHER the en-dash was inserted after the hyphen OR the hyphen was selected and then overwritten. [But method 1 (Alt+8242) causes font-switching to SimSun instead when inserting a prime (′) in those two ways.]
- No effect if method 3 is used for insertion of an en-dash after the hyphen or overwriting it. But method 3 still causes font-switching (to SimSun) when inserting a prime (′) anywhere in that same sample text.
I have examined the XML of the Word document, and the only difference I can notice is that in the first sentence "18-21" is treated as a single XML element, whilst in the second sentence each of the ranges ("20-25" and "60-80") are split into multiple XML elements.
(I shall paste the full content of document.xml in a follow-up post, to keep this initial post more concise.)
Earlier I had speculated that it might be due to a Style definition, but the above is not really pointing clearly to a Style problem.
I'm interested to know whether others have encountered this. And whether there are some other tips or tricks to avoid the font switching!