Hey @ierezell, this is a good idea in general. Right now we have save_hyperparameters, which automatically logs the hyperparameters passed to the constructor. We could do something similar for DataModules.
Indeed, I'm already using save_hyperparameters; I should have mentioned it.
The request is about "non-init" parameters such as precision (fp16 or fp32), epochs, batch_size, etc., which I pass as Trainer arguments rather than to the model's init.
Maybe a Trainer.fit(log_user_defined_args=True)?
And you're right, the same goes for the dataset (length, size, etc.), so maybe Trainer.fit(log_dataset_metrics=True)?
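To make the request concrete, here is a minimal sketch of what "logging the non-init parameters" could mean. It is not an existing Lightning feature: collect_run_params is a hypothetical helper, the key prefixes are arbitrary, and the attribute names it reads (max_epochs, precision, accumulate_grad_batches, batch_size, dataset) are assumptions about what the trainer and dataloader objects expose. Anything missing is skipped rather than guessed.

```python
from types import SimpleNamespace
from typing import Any, Dict, Optional


def collect_run_params(trainer: Any, dataloader: Optional[Any] = None) -> Dict[str, Any]:
    """Gather Trainer-level settings and basic dataset facts into one flat dict.

    Hypothetical helper: it only reads attributes by name, so it works with
    any object that happens to expose them.
    """
    params: Dict[str, Any] = {}

    # Assumed attribute names; absent or None values are left out.
    for name in ("max_epochs", "precision", "accumulate_grad_batches"):
        value = getattr(trainer, name, None)
        if value is not None:
            params[f"trainer/{name}"] = value

    if dataloader is not None:
        batch_size = getattr(dataloader, "batch_size", None)
        if batch_size is not None:
            params["data/batch_size"] = batch_size
        dataset = getattr(dataloader, "dataset", None)
        # Only sized datasets can report a sample count.
        if dataset is not None and hasattr(dataset, "__len__"):
            params["data/num_samples"] = len(dataset)

    return params


# Stand-in objects so the sketch runs without Lightning or PyTorch installed.
fake_trainer = SimpleNamespace(max_epochs=3, precision="16-mixed")
fake_loader = SimpleNamespace(batch_size=32, dataset=list(range(1000)))

run_params = collect_run_params(fake_trainer, fake_loader)
print(run_params)
```

In a real run, the resulting dict could presumably be handed to the logger (for example via trainer.logger.log_hyperparams from a callback's on_fit_start hook), which is roughly what a log_user_defined_args or log_dataset_metrics flag would automate. That wiring is not shown or tested here.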
Description & Motivation
When training a model, I have to specify dataloaders, epochs, and learning rate, and I would like them to be logged by default (as Hugging Face does).
(This could combine DeviceStatsMonitor, batch throughput, and dataset metrics.)
Pitch
When training a model, many metrics are accessible, and it would be really nice to log them directly, like:
Pseudo-code
Alternatives
Log all the metrics myself for every logger, as described above (long and tedious). A flag like "log_default_metrics=True" would be a nice alternative.
Additional context
I'm using Databricks (MLflow). My custom model metrics are logged, but nothing appears under system metrics, default parameters, or default model metrics.
Thanks for the framework, it's really nice!
cc @Borda @awaelchli