NasBertTrainer.NasBertOptions Class

Definition

public class NasBertTrainer.NasBertOptions : Microsoft.ML.TorchSharp.TorchSharpBaseTrainer.Options
type NasBertTrainer.NasBertOptions = class
    inherit TorchSharpBaseTrainer.Options
Public Class NasBertTrainer.NasBertOptions
Inherits TorchSharpBaseTrainer.Options

Constructors

NasBertTrainer.NasBertOptions()

Fields

ActivationDropout

Dropout rate after activation functions in FFN layers. Should be within [0, 1).

AdamBetas

Beta coefficients (β1, β2) for the Adam optimizer.

AdamEps

Epsilon (numerical stability term) for the Adam optimizer.

AttentionDropout

Dropout rate for attention weights. Should be within [0, 1).

BatchSize

Number of samples to use for mini-batch training.

(Inherited from TorchSharpBaseTrainer.Options)

ClipNorm

The gradient clipping threshold. Should be within [0, +Inf). A value of 0 disables norm clipping.

Dropout

Dropout rate for general situations. Should be within [0, 1).

DynamicDropout

Whether to use dynamic dropout.

EncoderNormalizeBefore

Whether to apply layer normalization before each encoder block.

FinalLearningRateRatio

The final learning rate ratio for the polynomial decay scheduler.

(Inherited from TorchSharpBaseTrainer.Options)

FreezeEncoder

Whether to freeze encoder parameters.

FreezeTransfer

Whether to freeze transfer module parameters.

LabelColumnName

The label column name.

(Inherited from TorchSharpBaseTrainer.Options)

LayerNormTraining

Whether to train layer norm parameters.

LearningRate

Learning rate for the first N epochs; all epochs after N use LR_N. Note: the scheduler in use may interpret this value differently.

MaxEpoch

Stop training when reaching this number of epochs.

(Inherited from TorchSharpBaseTrainer.Options)

PoolerDropout

Dropout rate in the masked language model pooler layers. Should be within [0, 1).

PredictionColumnName

The prediction column name.

(Inherited from TorchSharpBaseTrainer.Options)

ScoreColumnName

The score column name.

(Inherited from TorchSharpBaseTrainer.Options)

Sentence1ColumnName

The first sentence column.

Sentence2ColumnName

The second sentence column.

StartLearningRateRatio

The starting learning rate ratio for the polynomial decay scheduler.

(Inherited from TorchSharpBaseTrainer.Options)

TaskType

Task type, which is related to the model head.

ValidationSet

The validation set used while training to improve model quality.

(Inherited from TorchSharpBaseTrainer.Options)

WarmupRatio

Proportion of warmup steps for the polynomial decay scheduler.

WeightDecay

Coefficient of weight decay. Should be within [0, +Inf).

(Inherited from TorchSharpBaseTrainer.Options)
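
Example

As a rough illustration of how the fields listed above might be set, the following C# sketch builds an options object with an object initializer. The namespace Microsoft.ML.TorchSharp.NasBert, the numeric field types (double for the dropout, clipping, and weight decay fields; int for BatchSize and MaxEpoch), and every value and column name shown are assumptions chosen only to fall inside the documented ranges. They are not recommended defaults.

```csharp
// Assumption: NasBertTrainer lives in this namespace (requires the
// Microsoft.ML.TorchSharp package). Adjust if your version differs.
using Microsoft.ML.TorchSharp.NasBert;

// Hypothetical configuration; all values are illustrative only.
var options = new NasBertTrainer.NasBertOptions
{
    // Column names are placeholders for the columns in your own data.
    Sentence1ColumnName = "Sentence1",
    Sentence2ColumnName = "Sentence2",
    LabelColumnName = "Label",

    // Dropout rates must stay within [0, 1).
    // Assumption: these fields are of type double.
    Dropout = 0.1,
    AttentionDropout = 0.1,
    ActivationDropout = 0.0,
    PoolerDropout = 0.0,

    // Must be within [0, +Inf); 0 would disable norm clipping.
    ClipNorm = 5.0,

    // Must be within [0, +Inf).
    WeightDecay = 0.0,

    // Fields inherited from TorchSharpBaseTrainer.Options.
    // Assumption: both are of type int.
    BatchSize = 32,
    MaxEpoch = 10,

    // Keep the encoder trainable in this sketch.
    FreezeEncoder = false,
};
```

Fields left out of the initializer keep whatever defaults the class defines.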
