Synthetic Data Generation

What is synthetic data generation?

Synthetic data generation creates realistic data that follows the statistical patterns of real data, either from scratch or based on a sample dataset you provide. Use cases include:

  • Test datasets for software development

  • Training data for ML models

  • Demo data for client presentations

  • GDPR-compliant stand-ins for real customer data

How to generate data:

  1. Click + New → New Project

  2. Choose Synthetic Data Generation

  3. Either upload a sample file (the AI learns the schema and value distributions from it) or define the schema manually

  4. Set the number of records

  5. Click Generate
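
To make the idea in step 3 more concrete, here is a minimal sketch of what "learning a schema and distribution" from a sample could look like. This is not Lymnus's actual method or API. The function name `summarize_sample` and the summary format are hypothetical, and the sketch assumes a small CSV sample with a header row. It only uses the Python standard library.

```python
import csv
import io
import statistics


def summarize_sample(csv_text: str) -> dict:
    """Hypothetical helper: profile each column of a small CSV sample."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    if not rows:
        return {}

    summary = {}
    for column in rows[0].keys():
        values = [row[column] for row in rows]
        try:
            # Treat the column as numeric only if every value parses as a number.
            numbers = [float(value) for value in values]
        except ValueError:
            # Otherwise record it as text and count the distinct values.
            summary[column] = {"type": "text", "distinct": len(set(values))}
        else:
            summary[column] = {
                "type": "number",
                "min": min(numbers),
                "max": max(numbers),
                "mean": statistics.mean(numbers),
            }
    return summary


# Illustrative sample data, not taken from any real dataset.
sample = "age,city\n25,Austin\n35,Boston\n45,Austin\n"
profile = summarize_sample(sample)
print(profile)
```

A quick profile like this can help you check that a sample file has clean headers and consistent values before you upload it.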

Controlling the output:

Use the schema builder to define each column:

  • Column name — what the field is called in the output

  • Type — text, number, date, email, phone, address, name, etc.

  • Description — give the AI context ("This is a US zip code", "Values are between 18 and 65")

  • Examples — provide 2–3 example values to set the style
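
If you want to plan your columns before opening the schema builder, the four fields above can be sketched as a small data structure. This is only an illustration: `ColumnSpec`, `KNOWN_TYPES`, and `notes` are hypothetical names, not part of Lymnus, and the type list is limited to the types named above (the product may support more).

```python
from dataclasses import dataclass, field

# Assumption: only the types listed on this page; the real list may be longer.
KNOWN_TYPES = {"text", "number", "date", "email", "phone", "address", "name"}


@dataclass
class ColumnSpec:
    """Hypothetical planning record mirroring the schema builder fields."""
    name: str
    type: str
    description: str = ""
    examples: list[str] = field(default_factory=list)

    def notes(self) -> list[str]:
        """Return reminders based on the guidance on this page."""
        reminders = []
        if not self.name.strip():
            reminders.append("column name is empty")
        if self.type not in KNOWN_TYPES:
            reminders.append(f"unknown type: {self.type}")
        if not 2 <= len(self.examples) <= 3:
            reminders.append("provide 2-3 example values")
        return reminders


# Illustrative columns using the descriptions quoted above.
schema = [
    ColumnSpec("zip_code", "text", "This is a US zip code", ["94103", "10001"]),
    ColumnSpec("age", "number", "Values are between 18 and 65", ["23", "41", "58"]),
]

for column in schema:
    print(column.name, column.notes())
```

Writing the description and examples down first makes it easier to spot columns where the context is too thin for the AI to produce the style you expect.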
