Content
Where is hue service and url in Hdinsight?
Hi, I created a hdinsight (hadoop) cluster and I used the script action to install hue. The installation was Succeeded: But I was unable to find hue service or url in ambari Could you please help me? Regards Fede


how to access hadoop and Apache spark in azure
Hi, I've created my azure student account, but can't access HDinsight to use hadoop and Apachec spark for my master. How to i do? cheers joao acount (sba22203@student.cct.ie)


can i Connect to Azure Hdinsight Hive DB using Python script???
Hello everyone!!! I am having a task to perform on Azure Hdinsight Hive DB and Azure databrick. I have to connect to Azure Hdinsight Hive DB and get all data in excel or csv formate on daily bases automatical that will store in my Storage account…


Migration from AWS EMR to Azure
We are trying to move our spark steps code from AWS EMR cluster to AZURE. we are using the add-steps option with command-runner.jar in EMR. Each step inits a python script which uses large text file in S3 storage and manipulating it with Spark. Example…


Can I use a Student subscription in Azure to create an HDInsight Spark cluster?
Hi all, I am trying to create a Spark cluster in HDInsight (the name of the resource is Azure HDInsight) with my Student subscription. I have tried googling but couldn't find clues in the Microsoft documentation. I have my $100 unused, but when I go…


N How to create HDInsight Interactive query cluster with aditional storage account?
Hi community, I am new with HDInsight, I am asking for help regarding this situation: Pre conditions: I have a data lake gen 2 (hierarchical namespace enable) with my business data. ( csv and parquet files) I need to create 2 clusters. Interactive…


Configuration related exception while trying to run a spark app in HDInsight 5.0 cluster
I am migrating from HDInsight 4.0 to 5.0. Locally, it works. However, when I ran spark jobs in HDInsight cluster, I got the below error. Any idea why "spark.nonjvm.error.forwarding.enabled" is registered multiple times? Command to run spark…


How to execute Hive queries in Synapse spark
Hello! I am replacing a HDI cluster with Azure Synapse. My current HDI spark cluster executes some HIVE queries for data transformation. Is it possible to execute the same HIVE queries into Azure Synapse spark pool? Thanks, DR


Differences between HD Insight and Azure Data bricks?
I know that HDInsight has several types of clusters whereas Databricks is only for Spark type of cluster. I believe there must be some significant differences which will influence what to be chosen for implementation. [Note: As we migrate from MSDN,…


Does Azure has service to migrate data from AWS MSK to Azure Kafka HDInsight
I am looking for way to migrate data from AWS MSK to Azure Kafka. Is there any service available to do that are what are its pre-migration Prerequisites?


Azure HDinsight
What is Resource Provider connection in Azure HDinsight? (In portal, when deploying HDinsight Cluster it gives 2 option first Inbound that has no privatelink tickbox and other is Outbound that has privatelink tickbox). I want to know about both with…


How to fix error in a pipeline with hdi activity?
I try to run a pipeline with a hive activity, I get the Error: Response status code indicates server error: 500 (InternalServerError), with the code 2300. I couldn't find that error in the solution guide, so I don't really know how to go from here. …


convert a result of collect_list into json using spark with scala
please find the sample below - after using below code-- val df4 = df3.groupBy("shop_id").agg(collect_list(map($"variant_id",$"variants1")) as ("variants")) and got data like -- …


HDInsight startup yields linked service error: The storage connection string is invalid.
I'm getting an error when trying to run the demo spark word count in the data factory using HDInsight and a spark activity. All services were successfully created and tested. But when the spark pipeline is triggered, the following error is displayed: …
Json SerDe hive query failing with HDInsight version 4.0
Hi All, I have a hive query which is as follows: ADD JAR ${hiveconf:JsonSerde}; set hive.execution.engine=tez; DROP TABLE IF EXISTS Test; CREATE EXTERNAL TABLE Test( Results array< struct<requestid:string,result:array< …


How can I make user defined parameters required inside a pipeline
If I have a parameter that I defined, how can I make it required like this
HDInsight cluster creation- error on configuration+pricing
Hi, I have been trying to create an HDInsight Cluster from Canada, but it fails at the configuration step. I subscribed for PAYG, I tried by selecting different nodes but none of them is working it gives me the error " You have reached your…


How do I find Hive server information
I'm trying to create a pipeline to copy data from csv to a DataBricks table. To do so, I believe I need to set up a HIVE linked service. However, I'm not sure where I can find the necessary information to fill out the LS form - we had a…


Azure HDInsight Spark job is failing with Logger Error
Hello Team, Our jobs are recently failing with this error- ERROR RawSocketSender [MdsLoggerSenderThread]: org.fluentd.logger.sender.RawSocketSender java.net.SocketException: Broken pipe (Write failed) All these pyspark jobs were running fine…


Real Case Scenarios
Hello, Where can I find case scenarios or real life use cases of for example cloud models or high availability and scalability. What I mean is, for example, hybrid cloud is used by banks because they want to control the database and security. …

