Synthetic Data Generation Resources

The Best Synthetic Data Generators

Introduction


Synthetic data is extremely important whether you are developing a new product, a new world or to test your applications. In this list, you will find websites, companies and open source libraries.

Free Data Generation Websites


These websites allow you to directly generate synthetic data inside the browser.

Mockaroo

http://www.mockaroo.com

GenerateData

https://site.generatedata4.com

JSON Schema Faker

https://json-schema-faker.js.org

Mock Turtle

https://mockturtle.net

Open Source Libraries


Exhaustive list of the best libraries in Javascript and Python to produce synthetic data. These libraries can be integrated into your pipeline to produce fake data.

JSON Schema Faker (Javascript)

https://github.com/json-schema-faker/json-schema-faker

Fony (Javascript)

https://github.com/captainsafia/fony

Casual (Javascript)

https://github.com/boo1ean/casual

Mock (Javascript)

https://github.com/nuysoft/Mock

Fake Data Generator (Javascript)

https://github.com/Cambalab/fake-data-generator

Mocker (Javascript)

https://github.com/danibram/mocker-data-generator

Mockaroo Node (Javascript)

https://github.com/mockaroo/mockaroo-node

FakerJS (Javascript)

https://github.com/marak/Faker.js

Faker (Python)

https://github.com/joke2k/faker

Trumania (Python)

https://github.com/RealImpactAnalytics/trumania

Mimesis (Python)

https://github.com/lk-geimfari/mimesis

Radar (Python)

https://pypi.org/project/radar

Fake2db (Python)

https://github.com/emirozer/fake2db

Companies


Companies that provide solutions and libraries to produce synthetic data. Most of the use cases revolve about producing test data or ensuring that GDPR and other data privacy laws are respected.

Mostly AI

https://mostly.ai

Gen Rocket

https://www.genrocket.com

Exact Data

https://www.exactdata.com

Synth

https://www.getsynth.com

Curiosity

https://opentestingplatform.curiositysoftware.ie