Documentation Index
Fetch the complete documentation index at: https://docs.benchgen.com/llms.txt
Use this file to discover all available pages before exploring further.
Upload a Dataset
Train accepts labeled datasets in standard formats. Datasets can be uploaded manually or imported directly from an Eval export.Supported Formats
| Format | Use case |
|---|---|
| JSONL (instruction-response pairs) | General fine-tuning |
| CSV (prompt, completion columns) | Simple tabular data |
| Eval export | Imported automatically from an Eval benchmark run |
Upload Steps
- Go to Train → Datasets → Upload.
- Select your file or drag and drop.
- Choose the dataset format.
- Preview the parsed examples — confirm the prompt/response split looks correct.
- Name the dataset and save.
Dataset Quality Tips
- Minimum size: 50 examples for a meaningful fine-tune; 200+ for reliable results.
- Diversity: include varied phrasing of the same task, not just copies.
- Clean labels: incorrect or inconsistent responses in the training set directly hurt output quality.
- Balance: if the task has multiple subtypes, spread examples roughly evenly across them.