Spark Applications are usually submitting to HDInsight clusters from Azure Data Factory, Jupyter, Zeppelin, JDBC, SSH or Livy directly using curl command.
Front End | Servied Used on HDInsight |
---|---|
Jupyter | Livy |
ADF | Livy |
Zeppelin | Interpreter(Livy, Spark) |
Curl | Livy |
* Slow Performance : Spark application takes more time compared to another HDInsight cluster, still complete successfully.
* Unexpected Failure: Spark Application starts processing data but fails to complete with some exception
* Application fails with Exception :Spark Application starts processing data but fails to complete with some exception
* Application hangs-Never gets into finished state
* Spark application fails to start when submitted from Spark-CLI
java.lang.OutOfMemoryError: unable to create new native thread
in livy logs /var/log/livy/livy-livy-server.out
then follow steps detailed in livy-nativethread-exhaustion./var/log/livy/livy-livy-server.out
),/var/log/jupyter/
) when troubleshooting Spark Application issues that were submitted using jupter notebook.
If application is submitted using JDBC that uses Spark Thrift Service then get Spark Thrift Driver logs from /var/log/spark/sparkthriftdriver.log
In case the Spark job is submitted from spark-shell then get the complete spark-submit command.
For any spark application performance issues (including the three scenarios list above) first note the Application ID, next capture YARN logs for the application that is experiencing performance issue (Slow/Hang) or failures. a. How do I download Yarn logs from HDInsight cluster?, this article details different options to capture YARN Logs. b. Download all Application Master logs. c. Get logs for all containers (Driver and Executor).
Get screenshot of YARN UI showing the start datetime, end datetime and the status for the failed application.
If this application had completed successful early then capture start, end datetime, application status and also the YARN logs for this successfully completed Spark Application How do I download Yarn logs from HDInsight cluster?.
## Spark Streaming
For General Spark Tuning Refer