Glossary · Technical concept
Synthetic Data
Data generated by a model rather than collected from real-world sources. Used to augment training data when real samples are scarce, sensitive, or imbalanced. Risks include amplification of biases present in the generating model and loss of realistic edge cases that only exist in genuine data.
Framework references
- NIST AI 600-1