turbo-alignment/docs/dataset_example.md at main · rtaran/turbo-alignment

Dataset Descriptions

All datasets are in JSONL format, where:

Common Attributes

id: str - A distinct identifier for each data entry, represented as a string.

Dataset Types:

Chat Dataset
Pair Preferences Dataset
KTO Dataset
Sampling Dataset
Multimodal Dataset
Classification Dataset
DPPO Dataset (⌛️ Work in progress...)

Chat Dataset

messages: list[ChatMessage] — This is a sequence of messages that make up the chat history. Each ChatMessage includes:
- role - The participant's role in the conversation (e.g., user or bot).
- content - The textual content of the message.

Example:

{
  "id": 0,
  "source": "example",
  "messages": [
    {"role": "user", "content": "Can you play chess?"},
    {"role": "bot", "content": "Yes, of course"}
  ]
}

Pair Preferences Dataset

context: list[ChatMessage] — This is a sequence of messages that make up the chat history.
answer_w: ChatMessage — The more preferable response.
answer_l: ChatMessage — The less preferable response.

Example:

{
  "id": 0,
  "source": "example", 
  "context": [
    {"role": "user", "content": "Can you play chess?"}
  ],
  "answer_w": {"role": "bot", "content": "Yes, of course"},
  "answer_l": {"role": "bot", "content": "Get out, I don't want to talk to you!"}
  }

KTO Dataset

context: list[ChatMessage] — This is a sequence of messages that make up the chat history.
answer: ChatMessage — The given response.
is_desirable: bool — Indicator if the provided response is considered as desirable or no.

Example:

{
  "id": 0,
  "source": "example",
  "context": [
    {"role": "user", "content": "Can you play chess?"}
  ],
  "answer": {"role": "bot", "content": "Yes, of course"},
  "is_desirable": true
}
{
  "id": 1,
  "source": "example",
  "context": [
    {"role": "user", "content": "Can you play chess?"}
  ],
  "answer": {"role": "bot", "content": "Get out, I don't want to talk to you!"},
  "is_desirable": false
}

Sampling Dataset

messages: list[ChatMessage] — This is a sequence of messages that make up the chat history.
answers: list[ChatInferenceOutput] - A list of generated responses. Each ChatInferenceOutput is structured as:
- id: str - A unique identifier for the generated response.
- content: str - The content of generated completion

Example:

{
  "id": "0", 
  "dataset_name": "example", 
  "messages": [
    {"role": "user", "content": "hi"}, 
    {"role": "bot", "content": "hi"}, 
    {"role": "user", "content": "how are you"}
  ], 
  "answers": [
    {"content": "good", "id": "0"}, 
    {"content": "not bad", "id": "1"}
  ]
}

Multimodal Dataset

messages: list[MultimodalChatMessage] — This is a sequence of messages that make up the chat history. Each ChatMessage includes:
- role - The participant's role in the conversation (e.g., user or bot).
- type – The type of modality (e.g., text or image)
- content - If the type is text, it's the textual content of the message. If it's image, it's the file path.

Example:

{
  "id": "0",
  "messages": [
    {
      "role": "system",
      "type": "text",
      "content": "You are a Multimodal AI assistant."
    },
    {
      "role": "user",
      "type": "image",
      "content": "/path/to/cat.jpg"
    },
    {
      "role": "user",
      "type": "image",
      "content": "/path/to/dog.jpg"
    },
    {
      "role": "user",
      "type": "text",
      "content": "What's the difference between these two images?"
    },
    {
      "role": "bot",
      "type": "text",
      "content": "The two images in question both feature animals, albeit of different species. The first image depicts a dog, which is generally perceived as an animal that elicits positive emotional responses. The second image features a cat, which is also regarded as an animal that evokes a positive emotional response."
    }
  ]
}

Classification Dataset

messages: list[ChatMessage] — This is a sequence of messages that make up the chat history.
label: int — Label of provided chat history.

Example:

{
  "id": 0,
  "source": "example",
  "messages": [
    {"role": "user", "content": "Can you play chess?"},
    {"role": "bot", "content": "Yes, of course"}
  ],
  "label": 1
}
{
  "id": 1,
  "source": "example",
  "messages": [
    {"role": "user", "content": "Can you play chess?"},
    {"role": "bot", "content": "Get out, I don't want to talk to you!"}
  ],
  "label": 0
}

DDPO Dataset

context: list[ChatMessage] — This is a sequence of messages that make up the chat history.
answer_w: ChatMessage — The more preferable response.
answer_l: ChatMessage — The less preferable response.

Example:

{
  "id": 0,
  "source": "example", 
  "context": [
    {"role": "user", "content": "Can you play chess?"}
  ],
  "answer_w": {"role": "bot", "content": "Yes, of course"},
  "answer_l": {"role": "bot", "content": "Get out, I don't want to talk to you!"}
  }

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset Descriptions

Common Attributes

Dataset Types:

Chat Dataset

Pair Preferences Dataset

KTO Dataset

Sampling Dataset

Multimodal Dataset

Classification Dataset

DDPO Dataset

FilesExpand file tree

dataset_example.md

Latest commit

History

dataset_example.md

File metadata and controls

Dataset Descriptions

Common Attributes

Dataset Types:

Chat Dataset

Pair Preferences Dataset

KTO Dataset

Sampling Dataset

Multimodal Dataset

Classification Dataset

DDPO Dataset