How to fix 'OpenMPI error on compute when running Azure Machine Learning jobs '

Isha Chopra 0 Reputation points
2023-01-30T23:20:30.2433333+00:00
  1. Seeing below error in log file: user_logs/mpi_log.txt
Error: "Orted process returned unsuccessful exit code: 2. If you are using a version of OpenMPI < 4.0.1, please update to a newer version and try again. If the issue persists, please contact support. Error message: Execution failed. User process '-x' exited with status code 2. Please check log file '/mnt/azureml/cr/j/464bfc758a684b42b512385a391314f3/cap/lifecycler/wd/.azureml_cr_log/mpi_orted_1.txt' for error details. Error: \t/bin/bash [GNU long option] [option] script-file ...\nGNU long options:\n\t--debug\n\t--debugger\n\t--dump-po-strings\n\t--dump-strings\n\t--help\n\t--init-file\n\t--login\n\t--noediting\n\t--noprofile\n\t--norc\n\t--posix\n\t--rcfile\n\t--restricted\n\t--verbose\n\t--version\nShell options:\n\t-ilrsD or -c command or -O shopt_option\t\t(invocation only)\n\t-abefhkmnptuvxBCHP or -o option\n. More details can be found in error file: /mnt/azureml/cr/j/464bfc758a684b42b512385a391314f3/cap/lifecycler/wd/.azureml_cr_log/mpi_orted_1.txt."
  1. Log entry in the system_logs/lifecycler/1/mpi_orted_1.txt
/bin/bash: /azureml-envs/azureml_5542d71450c0cb2def095a8d9037155a/lib/libtinfo.so.5: no version information available (required by /bin/bash)
/bin/bash: - : invalid option
Usage:	/bin/bash [GNU long option] [option] ...
	/bin/bash [GNU long option] [option] script-file ...
GNU long options:
	--debug
	--debugger
	--dump-po-strings
	--dump-strings
	--help
	--init-file
	--login
	--noediting
	--noprofile
	--norc
	--posix
	--rcfile
	--restricted
	--verbose
	--version
Shell options:
	-ilrsD or -c command or -O shopt_option		(invocation only)
	-abefhkmnptuvxBCHP or -o option

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,552 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Isha Chopra 0 Reputation points
    2023-02-13T22:06:32.66+00:00

    @romungi-MSFT Yes, the base image being used on this environment is an older version. Thanks for the pointer, will upgrade to the latest version and let you know if that works.