Documentation Index
Fetch the complete documentation index at: https://docs.benchgen.com/llms.txt
Use this file to discover all available pages before exploring further.
Run Inference
Train provides a built-in inference endpoint so you can test a fine-tuned model immediately after training — without merging or deploying first.When to Use
- Quick sanity check right after a training run completes.
- Comparing adapter vs base model responses side by side.
- Validating the model before committing to a merge.
Steps
- Go to Train → Runs and open a completed run.
- Click Test inference.
- Enter a prompt in the chat interface.
- The response is generated using the adapter applied to the base model.
API Access
The inference endpoint is also available as an API during the run’s active window:Next Steps
- Happy with the results? Merge the adapter for a portable checkpoint.
- Want to measure improvement formally? Run a benchmark in Eval.