We have noticed that you rated an answer as not helpful. We appreciate your feedback and are committed to improving your experience with the Q&A. I have worked on finding an alternative solution to address your issue.
As you mentioned, you are not seeing increase in memory, it might be pointing towards deadlock condition (threads are not able to release and join properly). You can refer our Parallel job documentation which gives more robust way to handle job concurrency, timeout, no.of nodes, batch_size, etc and this is more efficient compared to python multiprocessing.pool
. You can optimize the above parameters (batch_size, etc) to optimize inference time and overall performance.
Please refer to: How to use parallel jobs in pipelines - Azure Machine Learning | Microsoft Learn
If the information is helpful, please click on "Upvote".
If you have any further queries, please let us know. Thank you.