Design: Fine-Tuning #69

### Feature request

We would like to implement fine-tuning.

This task involves weighing the tradeoffs between different approaches to improving action completions and outcome evaluation through fine-tuning.

More generally, this also involves:

1. Creating a training set
2. Fine-tuning on that training set
3. Comparing the results
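
As a starting point for discussion, the three steps above could be sketched roughly as follows. This is a minimal illustration only, not a proposed implementation: the `Step` record, the prompt layout, and the exact-match metric are all assumptions made for the example, and the fine-tuning call itself is deliberately left out because the choice of model and training stack is what this issue is meant to decide.

```python
import json
import random
from dataclasses import dataclass


@dataclass
class Step:
    """One recorded step of a demonstration (hypothetical schema)."""
    observation: str  # assumed: text description of the screen state
    action: str       # assumed: serialized action, e.g. "click(button='Save')"


def build_examples(task: str, steps: list[Step]) -> list[dict]:
    """Turn one demonstration into prompt/completion pairs, one per step."""
    examples = []
    history: list[str] = []
    for step in steps:
        prompt = (
            f"Task: {task}\n"
            f"Previous actions: {json.dumps(history)}\n"
            f"Observation: {step.observation}\n"
            "Next action:"
        )
        examples.append({"prompt": prompt, "completion": step.action})
        history.append(step.action)
    return examples


def split(examples: list[dict], eval_fraction: float = 0.2, seed: int = 0):
    """Shuffle deterministically and hold out a fraction for evaluation."""
    shuffled = examples[:]
    random.Random(seed).shuffle(shuffled)
    n_eval = max(1, int(len(shuffled) * eval_fraction))
    return shuffled[n_eval:], shuffled[:n_eval]


def to_jsonl(examples: list[dict]) -> str:
    """Serialize examples as JSON Lines, one example per line."""
    return "\n".join(json.dumps(e) for e in examples)


def exact_match_accuracy(predict, eval_set: list[dict]) -> float:
    """Fraction of held-out steps where the predicted action equals the recorded one."""
    if not eval_set:
        return 0.0
    hits = sum(
        predict(e["prompt"]).strip() == e["completion"].strip() for e in eval_set
    )
    return hits / len(eval_set)
```

To compare results, the same held-out set would be scored once with the base model and once with the fine-tuned model by passing each as the `predict` callable. Note that this sketch splits at the step level; splitting by whole demonstration would likely give a more honest estimate, since steps from one recording share context.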

### Motivation

https://arxiv.org/abs/2406.03679

> Autonomous agents that control computer interfaces to accomplish human tasks are emerging. Leveraging LLMs to power such agents has been of special interest, but unless fine-tuned on human-collected task demonstrations, performance is still relatively low.

### Related

https://github.com/MLDSAI/OpenAdapt/issues/70
https://github.com/MLDSAI/OpenAdapt/issues/72
https://github.com/OpenAdaptAI/OpenAdapt/issues/415
https://github.com/OpenAdaptAI/OpenAdapt/issues/748

### Bounty

A paid bounty is available. Please suggest a price range 🙏
