Add a way to train a model before evaluating it #4

dirkgr · 2022-02-25T00:11:08Z

Motivation: Full fine-tuning is a baseline, or rather an upper bound, in many zero-shot and few-shot experiments. @pdasigi has explicitly asked for this.

As part of this work, we'll add a new Tango step to Catwalk that trains a model on a given task/dataset, or on multiple tasks/datasets at the same time. It should call into Tango's training functions to do so. We'll also need to add a method or two to Catwalk's Model class to make this happen. Then we'll do a full evaluation on all reasonable tasks and all reasonable models, to establish good baselines across the board. This might make for a good blog post, too.

As a stretch goal, we should also try to train adaptation methods like prompt tuning, prefix tuning, or even IA3. There are some very nice implementations of some methods at https://github.com/r-three/t-few/tree/master/src.

The text was updated successfully, but these errors were encountered:

Fix missing file for num_model_inputs

dirkgr self-assigned this Feb 25, 2022

This was referenced Feb 25, 2022

Prompt tuning #5

Open

Adapter tuning #6

Open

dirkgr removed their assignment Apr 20, 2022

OyvindTafjord added a commit that referenced this issue May 19, 2023

Merge pull request #4 from OyvindTafjord/log-inputs

ff1dccb

Fix missing file for num_model_inputs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a way to train a model before evaluating it #4

Add a way to train a model before evaluating it #4

dirkgr commented Feb 25, 2022 •

edited

Loading

Add a way to train a model before evaluating it #4

Add a way to train a model before evaluating it #4

Comments

dirkgr commented Feb 25, 2022 • edited Loading

dirkgr commented Feb 25, 2022 •

edited

Loading