.jsonl
) for dataset files. You can use one of the following dataset types:
The size limit for dataset files is 5 gigabytes.
Conversational data
You can train a model by using chats. Pass along every chat as a singlemessages
line. The following is an example of how a single messages
parameter can look:
Instruction data
You can specify prompts and the expected answers to them:Text data
If you have unstructured data, you can put each piece in atext
line: