What is synthetic data generation?
Generation creates realistic, statistically accurate synthetic data from scratch — or based on a sample dataset you provide. Use cases include:
Test datasets for software development
Training data for ML models
Demo data for client presentations
GDPR-compliant stand-ins for real customer data
How to generate data:
Click + New → New Project
Choose Synthetic Data Generation
Either upload a sample file (the AI learns your schema and distribution) or define the schema manually
Set the number of records
Click Generate
Controlling the output:
Use the schema builder to define each column:
Column name — what the field is called in the output
Type — text, number, date, email, phone, address, name, etc.
Description — give the AI context ("This is a US zip code", "Values are between 18 and 65")
Examples — provide 2–3 example values to set the style