You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Alignment Data
Description: Scenarios that align the model's responses with human values, ethical guidelines, and desired behaviors, especially for handling sensitive topics.
Question and Answer (QA) Data
Description: Pairs of questions and their corresponding answers to help the model generate accurate responses to various queries.
{
"question": "What is the capital city of Australia?",
"answer": "The capital city of Australia is Canberra."
}
Code Data
Description: Code snippets paired with descriptions or requirements, essential for training models to understand, generate, and debug code in various programming languages.
{
"description": "Write a Python function that returns the factorial of a number.",
"code": "def factorial(n):\n if n == 0:\n return 1\n else:\n return n * factorial(n-1)"
}
Classification Data
Description: Texts labeled with specific categories or tags, useful for tasks like sentiment analysis, topic classification, or spam detection.
{
"text": "I absolutely loved the new movie! The storyline was gripping and the characters were well-developed.",
"label": "Positive Sentiment"
}
Entity Recognition Data
Description: Texts with annotated entities such as names, dates, locations, etc., training models to identify and categorize key information within the text.
{
"text": "Apple Inc. was founded by Steve Jobs and Steve Wozniak in Cupertino, California, in 1976.",
"entities": [
{"entity": "Apple Inc.", "type": "Organization"},
{"entity": "Steve Jobs", "type": "Person"},
{"entity": "Steve Wozniak", "type": "Person"},
{"entity": "Cupertino, California", "type": "Location"},
{"entity": "1976", "type": "Date"}
]
}
Solution
No response
Alternatives
No response
Additional context
No response
The text was updated successfully, but these errors were encountered:
Required prerequisites
Motivation
alignment data etc..
Instruction Data
Reference implemented in camel: https://github.com/camel-ai/camel/blob/master/camel/messages/conversion/alpaca.py
Conversation Data
Reference implemented in camel: https://github.com/camel-ai/camel/blob/master/camel/messages/conversion/conversation_models.py
TODO:
Alignment Data
Description: Scenarios that align the model's responses with human values, ethical guidelines, and desired behaviors, especially for handling sensitive topics.
example: https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2.5-Math-Pro-300K-v0.1
Question and Answer (QA) Data
Description: Pairs of questions and their corresponding answers to help the model generate accurate responses to various queries.
Code Data
Description: Code snippets paired with descriptions or requirements, essential for training models to understand, generate, and debug code in various programming languages.
Classification Data
Description: Texts labeled with specific categories or tags, useful for tasks like sentiment analysis, topic classification, or spam detection.
Entity Recognition Data
Description: Texts with annotated entities such as names, dates, locations, etc., training models to identify and categorize key information within the text.
Solution
No response
Alternatives
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: