An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
Hello @Calabria Montero, Salvador (SGRE SE D FP&DC WEF)
In Azure Synapse Analytics, you can install additional Python modules or R packages in Apache Spark pool nodes. However, the installation of Operating System libraries like libgdal-dev and libudunits2-dev is not directly supported.
Azure Synapse Analytics provides built-in support for many popular open-source R packages, including TidyVerse3. You can install or remove these packages into a Spark pool1. Pool-level libraries are available to all notebooks and jobs running on the pool.
To install R packages, you can manage workspace packages. In Synapse, workspace packages can be custom or private R tar.gz files. You can upload these packages to your workspace and later assign them to a specific serverless Apache Spark pool. Once assigned, these workspace packages are installed automatically on all Spark pool sessions started on the corresponding pool.
However, the installation of OS libraries is not directly supported in Azure Synapse Analytics. This is because the environment is managed and does not provide sudo access required to install OS libraries. Therefore, you might need to find a workaround. One possible solution could be to find an R package that includes the necessary binaries or another package that does not have these dependencies.
** Important: Any changes to the OS level would affect the stability and security of the Spark pool nodes, and it’s one of the reasons why such operations are not permitted. If you have a specific requirement, I recommend reaching out to Azure support for more tailored assistance. They might be able to provide a solution or workaround for your specific use case.
More info:
https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-manage-pool-packages https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-azure-portal-add-libraries https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-r-language Wishing you well,