@DineshSundaram-9074 Thanks for the question. We are seeing 3rd party enterprise customers train reinforcement learning agents on up to 512 cores or running their training over multiple days. In practice, it can take millions of trial runs to train an agent. These trial runs happen automatically, rapidly in parallel and the system continuously learns and improves. Azure Machine Learning uses the Ray framework to distribute reinforcement learning training to support large scale training.
Please follow the below GitHub samples for RL.
You can train your own reinforcement learning agents using Azure Machine Learning using the following resources.
• How to: reinforcement learning with Azure Machine Learning
• Github samples
• AI Show: introductory video
Video interview Microsoft Research