Glossary · Technical concept

Synthetic Data

Data generated by a model rather than collected from real-world sources. Used to augment training data when real samples are scarce, sensitive, or imbalanced. Risks include amplification of biases present in the generating model and loss of realistic edge cases that only exist in genuine data.

Framework references

  • NIST AI 600-1

Relevant Responsible AI Studio tools

More technical concept terms

See the full 80-term glossary →