Hi @matsuo_basho ,
I'm glad that the guidance was helpful.
Regarding your next query, you can obtain the metrics for each epoch using the MLflow Python API. To do this, you need to log the metrics for each epoch using the mlflow.log_metric()
function in your training script. Please use a FOR loop in the python script to iterate through all the values you are interested in and log it using log_metric from within the loop.
A quick sample you may want to refer to, please adjust it according to your scenario -
import mlflow
# Start an MLflow run
with mlflow.start_run():
# Train your model
for epoch in range(num_epochs):
# Train your model for one epoch
train_loss, train_acc = train_one_epoch(...)
val_loss, val_acc = validate(...)
# Log the metrics for this epoch
mlflow.log_metric("train_loss", train_loss, step=epoch)
mlflow.log_metric("train_acc", train_acc, step=epoch)
mlflow.log_metric("val_loss", val_loss, step=epoch)
mlflow.log_metric("val_acc", val_acc, step=epoch)
Once you've logged the metrics for each epoch, you can retrieve them using the methods discussed in my previous responses.
Hope this helps.
If this answers your query, do click Accept Answer
and Yes
for was this answer helpful.