Azure ML: ID column for joining data returns "No. Of unique values ... is greater than allowed"

Awp22 31 Reputation points
2020-11-09T15:33:21.023+00:00

Hi.

I am working on an ML model in Designer.

I have a dataset of c. 55,000 rows.

When I add an "ID" column (unique per row - so 55,000 IDs) to my dataset for training / scoring, I receive the error message:

ModuleExceptionMessage:ColumnUniqueValuesExceeded: Number of unique values in column: "ID" is greater than allowed.

Question: is this error caused by a hard cap on the number of rows, or by a capacity limit tied to, e.g., the compute power of the instance?

I can run 20k rows through the model without the ID column, so it seems the number of unique values is the problem.

But then - how do I keep an identifying column in the scored dataset, if there is a cap on unique values?

I need the ID column to join the scored output with other data that cannot be used as features in modelling.

Any guidance welcome!

Azure Machine Learning

Accepted answer
  1. YutongTie-MSFT 53,966 Reputation points Moderator
    2020-11-17T23:07:36.913+00:00

    You can use the Edit Metadata module to mark the ID column as "ClearFeature", so that it is not used as a feature in Train Model. This should prevent the error. Please give it a try and let me know if you have any questions. https://learn.microsoft.com/en-us/azure/machine-learning/algorithm-module-reference/edit-metadata


    Regards,
    Yutong
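
For readers who want to see the underlying idea outside Designer, here is a minimal plain-Python sketch. It is not how Designer or Edit Metadata is implemented; it only illustrates the pattern the answer relies on: the ID is carried alongside each row but never passed to the model, so it is still available afterwards as a join key. The rows, the `extra` lookup, the `FEATURES` list, and the `score` function are all hypothetical stand-ins.

```python
# Hypothetical rows: "ID" is a unique key, the other columns are features.
rows = [
    {"ID": "A001", "age": 34, "income": 52000},
    {"ID": "A002", "age": 51, "income": 61000},
    {"ID": "A003", "age": 27, "income": 38000},
]

# Hypothetical data keyed by the same ID that is not used for modelling.
extra = {
    "A001": {"region": "North"},
    "A002": {"region": "South"},
    "A003": {"region": "East"},
}

# Only these columns are treated as features; "ID" is deliberately excluded.
FEATURES = ["age", "income"]


def score(features):
    """Stand-in for a trained model; it only ever sees feature values."""
    return 1 if features["income"] > 50000 else 0


def score_and_join(rows, extra):
    """Score each row on its features, then join other data back on ID."""
    results = []
    for row in rows:
        # Pass only the feature columns to the model; the ID is held aside.
        features = {name: row[name] for name in FEATURES}
        scored = {"ID": row["ID"], **features, "score": score(features)}
        # The ID was never a model input, so it is still a usable join key.
        scored.update(extra.get(row["ID"], {}))
        results.append(scored)
    return results


scored_rows = score_and_join(rows, extra)
for scored_row in scored_rows:
    print(scored_row)
```

The point of the design is that the number of distinct IDs never matters to the model, because the model is never given the ID in the first place.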

