How do Categorical Duplicates Affect ML? A New Benchmark and Empirical AnalysesVraj ShahThomas Parashoset al.2024VLDB 2024