@Okwen, Roland T Thanks, Instead of bigger machines with more memory, there are techniques to be used with Aml Compute for larger datasets. The Parallel Run Step is an AzureML Pipeline Step which enables parallel processing or data partitions across multiple workers on multiple nodes. PRS (ParallelRunStep) is designed for embarrassingly parallel workload, e.g. train many models, batch inference, etc.
Also look into using some of the curated images provided for compute clusters.
Specifically look into the DASK image.
Curated environments - Azure Machine Learning | Microsoft Learn