Some unicode character combinations are invalid in paths when used together, but not seperately

Emilia 11 Reputation points
2021-07-16T00:18:43.403+00:00

Problem: some unicode character combinations are invalid in paths when used together but not seperately

We tried to upload blobs with some character combination in the blob name. Some combinations got rejected both in the portal and using REST API

I tested with combinations of the following unicode characters
U+103A3 [OLD PERSIAN SIGN KA] 𐎣
U+FFFC [OBJECT REPLACEMENT CHARACTER] 
U+FE69 [SMALL DOLLAR SIGN] ﹩

Working blobnames:
unicode/_2weirdordering.txt
u/_2weirdordering.txt
u/_2weirdordering_2weirdordering.txt
u/_2weirdordering.txt
u𐎣_3/﹩_smalldollar.txt

Not working:
u𐎣_3/_2weirdordering.txt
u𐎣_345/_2weirdordering.txt
u𐎣_3_2weirdordering.txt
u𐎣_3/t_2weirdordering.txt
u𐎣_3/u𐎣_4/_2weirdordering.txt

When trying to upload these files through the portal you get "Failed to validate file names."
Upload through the REST API also fail for the same set of files.

Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,415 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. deherman-MSFT 33,141 Reputation points Microsoft Employee
    2021-07-19T20:58:15.703+00:00

    @Emilia
    Apologies for the delayed response. For information on what Unicode characters are allowed please this section here. If you feel that the characters should be allowed but are still not please let me know and I can forward your request to the service team.

    -------------------------------

    Please don’t forget to "Accept the answer" and “up-vote” wherever the information provided helps you, this can be beneficial to other community members.