Motivated by the recent empirical successes of deep generative models, we study the computational complexity of the following unsupervised learning problem. For an unknown neural network $F: \mathbb{R}^d \to \mathbb{R}^{d'}$, let $D$ be the distribution over $\mathbb{R}^{d'}$ given by pushing the standard Gaussian $\mathcal{N}(0, \mathrm{Id}_d)$ through $F$. Given i.i.d. samples from $D$, the goal is to output any distribution close to $D$ in statistical distance.
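For concreteness, here is a minimal sketch of how samples from such a pushforward distribution arise; the map `F` and its weights below are hypothetical placeholders for illustration, not the construction studied in the paper.

```python
import numpy as np

def sample_pushforward(F, d, n, rng=None):
    """Draw n i.i.d. samples from the pushforward of N(0, Id_d) through F."""
    rng = np.random.default_rng() if rng is None else rng
    z = rng.standard_normal((n, d))        # latent standard Gaussian draws
    return np.stack([F(zi) for zi in z])   # push each latent sample through F

# Hypothetical F whose single output coordinate is a one-hidden-layer ReLU network.
d, m = 16, 4                               # input dimension and hidden width (illustrative)
rng = np.random.default_rng(0)
W, a = rng.standard_normal((m, d)), rng.standard_normal(m)
F = lambda z: a @ np.maximum(W @ z, 0.0)   # a^T ReLU(W z)

samples = sample_pushforward(F, d, n=1000)  # the learner only ever sees these samples
```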
We show under the statistical query (SQ) model that no polynomial-time algorithm can solve this problem even when the output coordinates of $F$ are one-hidden-layer ReLU networks with $\log(d)$ neurons. Previously, the best lower bounds for this problem simply followed from lower bounds for supervised learning and required at least two hidden layers and $\mathrm{poly}(d)$ neurons [Daniely-Vardi '21, Chen-Gollakota-Klivans-Meka '22].
The key ingredient in our proof is an ODE-based construction of a compactly supported, piecewise-linear function $f: \mathbb{R} \to \mathbb{R}$ with polynomially-bounded slopes such that the pushforward of $\mathcal{N}(0,1)$ under $f$ matches all low-degree moments of $\mathcal{N}(0,1)$.
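Spelled out, the moment-matching property asserts the following, where $K$ denotes the low-degree cutoff (a parameter of the construction, not a value stated here):

$$\mathop{\mathbb{E}}_{g \sim \mathcal{N}(0,1)}\!\left[f(g)^k\right] \;=\; \mathop{\mathbb{E}}_{g \sim \mathcal{N}(0,1)}\!\left[g^k\right] \qquad \text{for all integers } 0 \le k \le K.$$

The right-hand side is the $k$-th standard Gaussian moment, which equals $(k-1)!!$ for even $k$ and $0$ for odd $k$, so moment matching renders the pushforward indistinguishable from $\mathcal{N}(0,1)$ to any test based on degree-at-most-$K$ polynomials.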