Distributed training of a model i ML.NET?

Johan Danforth 6 Reputation points
2021-12-27T12:47:06.28+00:00

Is it possible to do distributed training/fit of a model in ML.NET to multiple workers/servers?

.NET Machine learning
.NET Machine learning
.NET: Microsoft Technologies based on the .NET software framework.Machine learning: A type of artificial intelligence focused on enabling computers to use observed data to evolve new behaviors that have not been explicitly programmed.
150 questions
0 comments No comments
{count} vote

2 answers

Sort by: Most helpful
  1. Wilhelm 1 Reputation point
    2022-08-01T14:30:29.037+00:00

    I'm very interested by this? any answer? I have a few local machines I would like to use
    TY!
    Wil

    0 comments No comments

  2. Wilhelm Thomas 1 Reputation point
    2022-08-08T04:46:30.067+00:00

    hello, I would love for Microsoft/ML.net to allow us to train on multiple machines to go faster. Microsoft should do the following:

    • Version 1: Right now Auto ML starts many algo 1 by 1 on one machine, a simple version 1 would be to start those algos in parallels on multiple computer nodes (slaves) and aggregate the results on the main computer (master node) where AutoM orchestrate the runs and aggregate the results
    • Version 2: allow to add Azure as a slave node in the mesh in combination to local computers...so we can pay for extra boost when needed.
    • version 3: I would work on accelerating the most used algos by distributing the work across the mesh nodes (harder to do, would take more time).

    I think version 1 and 2 should be easy and fast to implement. Those will increase ML.Net usage but also allow people to get a hardware boost when needed by adding azure as a participating computational node
    Version 3 is more complex and will take more time.

    Best

    Wil

    0 comments No comments