IT-OSE: Exploring Optimal Sample Size for Industrial Data Augmentation
New method slashes computational costs by 84% while improving model performance in industrial AI tasks.
Researchers Mingchun Sun, Rongqiang Zhao, Zhennan Huang, Songyu Ding, and Jie Liu developed IT-OSE (Information-Theoretic Optimal Sample Size Estimation), a framework to determine the ideal amount of data for augmenting industrial AI training sets. It increases classification accuracy by an average of 4.38% and reduces regression error (MAPE) by 18.80% compared to empirical methods, while cutting computational costs by 83.97% and data needs by 93.46% versus exhaustive search.
Why It Matters
It enables factories to build more accurate, reliable AI models for predictive maintenance and quality control with far less data and compute.