SQL Server and custom R: question marks added to output

Question

Olga Larina 26

I updated R according to this tutorial: https://learn.microsoft.com/en-us/sql/machine-learning/install/custom-runtime-r?view=sql-server-ver15&pivots=platform-windows
Query:

EXEC sp_execute_external_script
@language =N'myR',
@script=N'
print("Hello RExtension!");'

Output:
STDOUT message(s) from external script:
[1] " ��Hello RExtension! ��"

Completion time: 2021-09-14T09:05:00.2416329+02:00

It seems to be an encoding problem. Does anyone have tips on solving it? Thank you!

Seeya Xi-MSFT 16,586 Reputation points

2021-09-16T01:32:18.753+00:00

Hi @Olga Larina ,

We have not received a response from you. Did the reply help you? If the response helped, please click "Accept Answer". If it doesn't work, please let us know about the progress. By doing so, you will benefit all community members who are having a similar issue. Your contribution is highly appreciated.

Best regards,
Seeya

Olga Larina 26 Reputation points

2021-09-16T14:53:45.687+00:00
Hi Seeya,

Thank you for your response. I tried the setting and should point out a few things:

The file was located in Program Files (x86)\Microsoft SQL Server Management Studio 18\Common7\IDE\SqlWorkbenchProjectItems\Sql

The file was empty

I performed the recommended actions and it didn't help. Actually, when I run my default R, everything works fine. The problem appears only with the custom R.

Erland Sommarskog 121.8K Reputation points MVP Volunteer Moderator

2021-09-16T21:19:07.263+00:00

No, I don't think this has anything to do with settings in SSMS. I don't think Seeya has really understood what this question is about. :-)

Anyway, I did some very quick testing, and it appears that the problem is only with print, but not with OutputDataSet, which would be the normal way to get data back.

Olga Larina 26 Reputation points

2021-09-17T06:37:47.837+00:00

Yes, indeed, thank you for the tip! It can be useful.

I'm working with already existing code, so a lot would have to be rewritten... So if you have ideas on how to improve the situation with print, please share.

Answer 1

Seeya Xi-MSFT 16,586

Hi @Olga Larina ,

You can try this setting:
https://joehanna.com/sql-server/changing-the-default-encoding-of-sql-files-in-ssms/

Best regards,
Seeya


Answer 2

Here is a small update. Together with a friend I dug into this a little more and I was able to analyse what is being sent. Consider this script:

   EXEC sp_execute_external_script  
          @language =N'myR',  
          @script=N'  
          x <- "räksmörgås"  
          print(x)  
          print(Encoding(x))'

("räksmörgås" is Swedish for "shrimp sandwich".)

SSMS prints this:

   STDOUT message(s) from external script:   
   [1] " ��rÃ¤ksmÃ¶rgÃ¥s ��"  
   [1] " ��latin1 ��"

The Swedish word has been encoded as UTF-8, but the string is then interpreted as Latin-1.
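
As a rough illustration of that misinterpretation (a minimal Python sketch run outside SQL Server, not the code path the R extension actually uses), encoding the word as UTF-8 and then decoding the bytes as Latin-1 reproduces the garbled text that SSMS shows. The variable names are made up for this example.

```python
# Hypothetical illustration: UTF-8 bytes that are read back as Latin-1.
word = "räksmörgås"

# Step 1: the word is encoded as UTF-8 (each of ä, ö, å becomes two bytes).
utf8_bytes = word.encode("utf-8")

# Step 2: the same bytes are wrongly interpreted as Latin-1,
# so every byte turns into one separate character.
misread = utf8_bytes.decode("latin-1")
print(misread)  # rÃ¤ksmÃ¶rgÃ¥s

# Reversing the wrong step gives the original word back.
restored = misread.encode("latin-1").decode("utf-8")
print(restored == word)  # True
```

The mistake is lossless in this direction, which is why the original text can be recovered once the wrong decoding step has been identified.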

I wrote a small Perl script to run the batch above. This permitted me to capture the actual output and then analyse the bytes. For the second line, I got these bytes for the first couple of characters:

   5b 31 5d 20 22 02 fd fd 72 c3 92 c2 a4 6b   
    [  1  ] SP  "     ý  ý  r  Ã  ƒ  Â  ¤  k

So it seems that this is a UTF-8 encoded string that has been interpreted as Latin-1 and then re-encoded into UTF-8 a second time. SSMS expects the string to be UTF-8, so it decodes only one layer of the UTF-8 encoding. However, the sequence fd-fd is not legal UTF-8, and therefore SSMS displays the rhombus with the question mark. This character is known as REPLACEMENT CHARACTER, and it tells us that there is an encoding error.

The full sequence of mysterious characters is 02-fd-fd, but why it appears here, I don't know.
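
Both observations can be checked with a small Python sketch. It is only an illustration, assuming that Python's UTF-8 decoder treats invalid bytes roughly the way SSMS does. The variable names are hypothetical, and nothing here explains where 02-fd-fd comes from.

```python
import unicodedata

# The three unexplained bytes from the capture: 02 fd fd.
suspect = bytes([0x02, 0xFD, 0xFD])

# A strict UTF-8 decoder rejects them, because 0xFD never occurs in valid UTF-8.
try:
    suspect.decode("utf-8")
    strict_ok = True
except UnicodeDecodeError:
    strict_ok = False

# A lenient decoder substitutes U+FFFD for each invalid byte.
lenient = suspect.decode("utf-8", errors="replace")
names = [unicodedata.name(ch, "<control>") for ch in lenient]
print(strict_ok)
print(names)

# Double encoding: UTF-8 -> misread as Latin-1 -> encoded as UTF-8 again.
word = "räksmörgås"
double_encoded = word.encode("utf-8").decode("latin-1").encode("utf-8")

# Decoding one layer (what a UTF-8 consumer does) still leaves garbled text.
one_layer = double_encoded.decode("utf-8")
print(one_layer)

# Undoing the second layer as well recovers the original word.
recovered = one_layer.encode("latin-1").decode("utf-8")
print(recovered)
```

If the output really is double-encoded in this way, a client that captures the raw bytes could apply the same two-step reversal as a workaround, but that is an untested assumption for the custom R runtime.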

So this does not solve the problem, but at least it gives an idea of what is going on.

Olga Larina 26 Reputation points

2021-09-22T12:30:22.72+00:00

Thank you, Erland! This is really interesting.

Seeya Xi-MSFT 16,586 Reputation points

2021-09-27T08:57:53.117+00:00

Hi @Olga Larina ,

Did the reply help you? If the response helped, please click "Accept Answer". By doing so, you will benefit all community members who are having a similar issue.
Have a nice day!

Best regards,
Seeya

Olga Larina 26 Reputation points

2021-09-27T09:00:35.053+00:00

Hi Seeya,

As Erland mentioned, 'this does not solve the problem, but at least it gives an idea of what is going on'.
So, the problem is not solved yet.
Have a nice day!

Best regards,
Olga
