Share via

ParallelRunStep doesn´t support multiple input datasets

AI 6 Reputation points
2022-03-20T06:54:30.327+00:00

Currently we are using the parallelrunstep in AML, Is there any workaround/other approach to support distribute datasets in the parallel run step.

Azure Machine Learning

1 answer

Sort by: Most helpful
  1. Ramr-msft 17,836 Reputation points
    2022-03-21T05:31:00.423+00:00

    @AI Thanks for the question. An Azure ML dataset is just metadata pointing to a path or collection of paths in an Azure storage account. You should first "merge" those datasets into a collection of adjacent folders (e.g. root/dataset1/, root/dataset2/, ...) and then run PRS against root/**.

    Was this answer helpful?

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.