Introducing CosmoChat, a multi-turn chat dataset based on Cosmopedia that I'm building in the open on the Hub.
Goals:
- Create multi-turn chats seeded from Cosmopedia
- Customize questions for different audience levels
- Evaluate the model's ability to elaborate and clarify
- (I want to learn more about creating valuable synthetic datasets, and I learn best by doing stuff rather than reading stuff.)
CosmoChat is created using the excellent distilabel library.
Explore the current version of the dataset: davanstrien/cosmochat
Read more: https://huggingface.co/blog/davanstrien/cosmochat