Synthetic Data Generator
Build datasets using natural language
Spaces focused on generating synthetic datasets
Build datasets using natural language
Note A space which allows you to build datasets using natural language. Uses Distilabel under the hood
Search and save datasets generated with a LLM in real time
Note Search for a dataset you want and it'll be created just for you using Phi-3-mini-4k-instruct!
Note Would you read a book generated by an LLM? This experimental Space creates an LLM-generated blurb and allows users to vote on whether the blurb is good, contributing to an open preference dataset π€ This Space might give you ideas for creating your synthetic preference dataset from the community!
Note This Space is designed to provide you with an easy way to get started generating synthetic datasets using Spaces compute to host open LLMs. The Space comes with a ready-to-go environment and a series of notebooks showing various examples of generating synthetic datasets
Note This demo showcases Magpie, an innovative approach to generating high-quality data by prompting aligned LLMs with their pre-query templates. This Space also allows users to rate the generations to create preference data!