Today, we are excited to introduce a very powerful new framework to The Synthetic Data Vault : ๐ฐ๐ผ๐ป๐๐๐ฟ๐ฎ๐ถ๐ป๐ ๐ฎ๐๐ด๐บ๐ฒ๐ป๐๐ฒ๐ฑ ๐ด๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐ผ๐ป (
#CAG for short). CAG addresses the shortcomings of generative models in capturing the context buried in enterprise data stores - with human input. (Link to the announcement:
datacebo.com/announcements/iโฆ)
โ ๐๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐๐ฒ ๐๐ ๐บ๐ผ๐ฑ๐ฒ๐น๐ ๐ณ๐ฎ๐ถ๐น ๐๐ผ ๐ฐ๐ฎ๐ฝ๐๐๐ฟ๐ฒ ๐ฑ๐ฒ๐๐ฒ๐ฟ๐บ๐ถ๐ป๐ถ๐๐๐ถ๐ฐ ๐ฟ๐ฒ๐น๐ฎ๐๐ถ๐ผ๐ป๐๐ต๐ถ๐ฝ๐ ๐ฏ๐ฒ๐๐๐ฒ๐ฒ๐ป ๐ฐ๐ผ๐น๐๐บ๐ป๐, ๐ฟ๐ผ๐๐, ๐ฎ๐ป๐ฑ ๐๐ฎ๐ฏ๐น๐ฒ๐. We call such relationships database context. Database context describes hard and fast rules under which data is created and stored.
What is even harder is that usually, this context is not explicitly stored within the database schema itself โ but data teams know that it exists. Downstream applications process this data based on the context using logic within the application software.
When the generative models are used to create
#syntheticdata the expectation is that the
#syntheticdata will also follow the database context.
โ
When we launched The Synthetic Data Vault โ a system to enable enterprises to build generative models for their own
#multitable data โ we provided the ability to include context via what we called
#๐ฐ๐ผ๐ป๐๐๐ฟ๐ฎ๐ถ๐ป๐๐.
๐ฅ Over the years, ๐ฐ๐ผ๐ป๐๐๐ฟ๐ฎ๐ถ๐ป๐๐ ๐ต๐ฎ๐ ๐ฏ๐ฒ๐ฐ๐ผ๐บ๐ฒ ๐ผ๐ป๐ฒ ๐ผ๐ณ ๐๐ต๐ฒ ๐บ๐ผ๐๐ ๐ฝ๐ผ๐ฝ๐๐น๐ฎ๐ฟ ๐ณ๐ฒ๐ฎ๐๐๐ฟ๐ฒ๐ ๐ผ๐ณ ๐ผ๐๐ฟ ๐ฆ๐๐ฉ ๐๐ป๐๐ฒ๐ฟ๐ฝ๐ฟ๐ถ๐๐ฒ ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐.
๐ช ๐ช๐ถ๐๐ต ๐๐๐ ๐๐ฒ ๐ฎ๐ฟ๐ฒ ๐ฑ๐ผ๐๐ฏ๐น๐ถ๐ป๐ด ๐ฑ๐ผ๐๐ป ๐ผ๐ป ๐๐ต๐ถ๐ ๐ณ๐ผ๐ฐ๐๐. To use this new and powerful framework, users can just pick the pre-defined pattern that corresponds to their database context and tell SDV where to apply it. It will then augment your synthesizer directly with this information. And 100% valid
#syntheticdata
๐ฅ๐ฒ๐ฎ๐ฑ ๐บ๐ผ๐ฟ๐ฒ ๐ฎ๐ฏ๐ผ๐๐ ๐๐๐, ๐๐ต๐ฎ๐ ๐ถ๐ ๐บ๐ฒ๐ฎ๐ป๐ ๐ณ๐ผ๐ฟ ๐๐ผ๐, ๐ฎ๐ป๐ฑ ๐ต๐ผ๐ ๐๐ผ ๐ฎ๐ฐ๐ฐ๐ฒ๐๐ ๐ถ๐ ๐ถ๐ป ๐ผ๐๐ฟ ๐น๐ฎ๐๐ฒ๐๐ ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐ ๐ฎ๐ป๐ป๐ผ๐๐ป๐ฐ๐ฒ๐บ๐ฒ๐ป๐ ๐ต๐ฒ๐ฟ๐ฒ:
datacebo.com/announcements/iโฆ
๐๐ฎ๐ฝ๐ฝ๐ ๐ต๐ผ๐น๐ถ๐ฑ๐ฎ๐๐ ๐ฎ๐ป๐ฑ ๐ฒ๐ป๐ท๐ผ๐ ๐๐๐ป๐๐ต๐ฒ๐๐ถ๐๐ถ๐ป๐ด!
- from all of DataCebo Team
#syntheticdata #generativeai #data #machinelearning #ml #ai