Feature request
We would like to implement fine-tuning.
This task involves considering the tradeoffs between various approaches to improving action completions and outcome evaluation via fine-tuning.
More generally, this also involves:
- Creating a training set
- Fine tuning on that training set
- Comparing the results
Motivation
https://arxiv.org/abs/2406.03679
Autonomous agents that control computer interfaces to accomplish human tasks are emerging. Leveraging LLMs to power such agents has been of special interest, but unless fine-tuned on human-collected task demonstrations, performance is still relatively low.
Related
#70
#72
#415
#748
Bounty
A paid bounty is available. Please suggest a price range π
Feature request
We would like to implement fine-tuning.
This task involves considering the tradeoffs between various approaches to improving action completions and outcome evaluation via fine-tuning.
More generally, this also involves:
Motivation
https://arxiv.org/abs/2406.03679
Related
#70
#72
#415
#748
Bounty
A paid bounty is available. Please suggest a price range π