Create a new tuning Dataset
for the specified Agent
using the provided JSONL or CSV file. A Dataset
is a versioned collection of samples conforming to a particular schema, and can be used as a source of training and test data for tuning jobs.
Each Dataset
is versioned and validated against its schema during creation and subsequent updates. The provided Dataset
file must conform to the schema defined for the dataset_type
.
File schema for dataset_type
tuning_set
is a CSV file or a JSONL file where each line is one JSON object. The following keys are required:
-
knowledge
(list[str]
): Retrieved knowledge used to generate the reference answer.knowledge
is a list of retrieved text chunks. -
reference
(str
): The gold-standard answer to the prompt. -
guideline
(str
): Guidelines for model output. If you do not have special guidelines for the model's output, you can use theSystem Prompt
defined in your Agent configuration as theguideline
. -
prompt
(str
): Question for the model to respond to.
For examples of what tuning_set
should look like, check out our Tune & Evaluation Guide
.