Sampling-Based NDV Estimation in Iceberg Tables
NDV (number of distinct values) is one of the most important statistics in cost-based query optimization. It affects selectivity estimates, join ordering, and intermediate cardinality predictions, so...
eng-floe.github.io