HOW MANY SAMPLES ARE NEEDED TO TRAIN A DEEP NEURAL NETWORK?
Overview
Paper Summary
This paper establishes a lower bound on the number of samples needed to train a deep ReLU neural network, showing that the prediction error can decay no faster than roughly 1/√n in the sample size n, slower than the rates achieved by classical methods. This theoretical result is supported by experiments on benchmark image-classification and regression datasets. The findings support the common belief that deep learning requires a large amount of data for effective training.
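To make the rate concrete, here is a schematic minimax-style reading of such a bound (a sketch under assumed notation, not the paper's exact theorem statement): writing n for the number of training samples, f* for the unknown target function, and f̂ₙ for any estimator trained on the data,

\[
\inf_{\hat{f}_n} \, \sup_{f^\ast} \, \mathbb{E}\big\| \hat{f}_n - f^\ast \big\|^2 \;\gtrsim\; \frac{1}{\sqrt{n}} .
\]

Read this way, no training procedure can uniformly beat the 1/√n rate. In practical terms, an error shrinking like 1/√n means four times as much data is needed to halve the error; for comparison, a method converging at a classical parametric 1/n rate would need only twice as much.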
Explain Like I'm Five
Deep learning models, like those used for image recognition, need lots of examples to learn well. This paper uses math and experiments to show that, as you give them more data, their mistakes shrink more slowly than those of simpler models.
Possible Conflicts of Interest
None identified.
Identified Limitations
The theoretical analysis is limited to feedforward ReLU networks and assumes high input dimensions.
Rating Explanation
This paper provides a valuable theoretical and empirical analysis of sample complexity in deep learning. The derived lower bound and supporting experiments offer new insights into why deep learning models often require extensive training data. While the theoretical scope is limited to feedforward networks and assumes high input dimensions, the findings are significant and match existing upper bounds. The paper successfully addresses a fundamental question in deep learning, justifying its rating of 4.