What Ilya Saw
Let's do a time check: compare what Ilya said ten years ago with what he is saying now.
What Ilya Saw in 2014
The Deep Learning Hypothesis: If you have a large neural network, it can do anything a human can do in a fraction of a second.
The Autoregression Hypothesis: Simple next-token prediction in sequence-to-sequence training will capture the correct distribution, generalizing from translation to all other domains.
The Scaling Hypothesis: If you have a large dataset and train a very large neural network, success is guaranteed.
The Connectionism Hypothesis: If you believe artificial neurons work like biological neurons, then very large neural networks can be "configured to do almost everything we humans do."
What Ilya Saw in 2024
The end of the pre-training era: data is "the fossil fuel of AI," a finite resource.
AI systems will demonstrate "true autonomy" with stronger reasoning capabilities.
Finding new scaling patterns, by analogy with the brain-to-body-mass scaling seen in human evolution.
Future outlook: Agents, synthetic data, inference time compute.
Future
The end of the pre-training era is itself a statement about the future. It has been an emerging consensus over the past year; Ilya simply articulated it. This ending can also be seen as a fork in the road: one path optimizes models for better efficiency under limited data, the other explores new training methods.
The three future trends Ilya mentioned can be consolidated into two, since Agents and synthetic data are converging.
Agents here means super-intelligent agents with reasoning capability and self-awareness, where self-awareness can be understood as proactivity: agents that reason and make decisions on their own initiative.
Synthetic data: today's large models already train on synthetic data, and many vendors position their largest models specifically as generators of synthetic data.
The goal behind synthetic data is to move beyond human data and let AI systems iterate on themselves. There are several directions: optimizing data quality, as phi-4 does for reasoning; generating personalized data for virtual characters to expand the data boundary; and so on. The latter requires Agent participation. I believe synthetic data will gradually spread across the text, speech, image, and video domains, with agents participating in the entire data-synthesis process; hence the convergence of Agents and synthetic data.
Inference-time compute is a further optimization of the o1 technical route.
Top Papers of the week
1). Training LLMs to Reason in a Continuous Latent Space (paper)
Meta proposed Coconut (Continuous Chain of Thought), a novel paradigm enabling LLMs to reason in continuous latent space rather than natural language.
The authors argue that reasoning in continuous latent space, rather than decoding every intermediate step into tokens, strengthens LLMs' reasoning, and their experiments show improved performance on complex reasoning tasks.
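As a rough illustration of the core mechanism, here is a minimal sketch assuming a HuggingFace-style causal LM that accepts `inputs_embeds` and returns hidden states. The paper's special tokens for entering and leaving latent mode, and its staged training curriculum, are omitted; `n_latent_steps` is an illustrative parameter.

```python
# Minimal sketch of Coconut-style "continuous thoughts" (not the authors'
# code): instead of decoding each reasoning step to a token, the last
# hidden state is fed straight back in as the next input embedding.
import torch

def continuous_thoughts(model, inputs_embeds, n_latent_steps):
    for _ in range(n_latent_steps):
        out = model(inputs_embeds=inputs_embeds, output_hidden_states=True)
        last_hidden = out.hidden_states[-1][:, -1:, :]  # (batch, 1, hidden)
        # The continuous thought: append the hidden state itself, so
        # reasoning never round-trips through the discrete vocabulary.
        inputs_embeds = torch.cat([inputs_embeds, last_hidden], dim=1)
    return inputs_embeds  # then switch back to ordinary token decoding
```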
2). Phi-4 Technical Report (paper)
Microsoft's phi-4, a 14B small model, outperforms many models, including Gemini 1.5 Pro, on mathematical reasoning tasks.
The model's excellence in reasoning tasks is attributed to improvements in synthetic data and post-training.
Comment: phi-4 points to a trend: small models and vertical models are the future. It also reflects that the pre-training data wall is approaching; from here on, data generation and utilization will be the foundation of AI progress.
3). The Byte Latent Transformer (BLT) (paper)
Proposed a byte-level language model architecture that matches token-based LLM performance while improving efficiency and robustness.
Uses an entropy-based dynamic method to group bytes into patches, allocating more compute to hard-to-predict regions while using larger patches for more predictable sequences (sketched below).
Links: author's tweet and code
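The patching rule can be sketched in a few lines. This is a simplified global-threshold variant, not the released code: `next_byte_probs` is a stand-in for the small byte-level model the paper uses to score uncertainty, returning a distribution over the 256 possible next bytes.

```python
# Entropy-based byte patching in the spirit of BLT (simplified sketch).
import math

def entropy(probs):
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def patch_bytes(data: bytes, next_byte_probs, threshold: float):
    """Open a new patch whenever next-byte entropy spikes, so hard-to-
    predict regions get smaller patches (and thus more compute)."""
    patches, current = [], bytearray()
    for i, b in enumerate(data):
        h = entropy(next_byte_probs(data[:i]))  # uncertainty before byte i
        if current and h > threshold:
            patches.append(bytes(current))
            current = bytearray()
        current.append(b)
    if current:
        patches.append(bytes(current))
    return patches
```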
4). Asynchronous Function Calling (paper)
Proposed AsyncLM, a system for asynchronous LLM function calls.
The authors designed a context protocol for function calls and interrupts, provided a fine-tuning strategy to adapt to interrupt semantics, and efficiently implemented these mechanisms in LLM inference.
AsyncLM reduces task completion latency by 1.6x to 5.4x compared to synchronous function calls.
It enables LLMs to generate and execute function calls simultaneously.
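The latency benefit is easy to see with plain asyncio. Note this sketch is not the paper's system: AsyncLM fine-tunes the LLM itself to emit calls and consume interrupt tokens in-context, whereas here stand-in tool calls simply run concurrently and their results arrive as simulated interrupts.

```python
# Toy illustration of asynchronous function calling with asyncio.
import asyncio

async def call_tool(name: str, args: str) -> str:
    await asyncio.sleep(1.0)  # stand-in for a slow external API call
    return f"{name}({args}) -> result"

async def main():
    # Launch both calls without blocking; generation could continue here.
    pending = {
        asyncio.create_task(call_tool("weather", "Paris")),
        asyncio.create_task(call_tool("flights", "CDG")),
    }
    while pending:
        done, pending = await asyncio.wait(
            pending, return_when=asyncio.FIRST_COMPLETED)
        for task in done:
            # In AsyncLM the result would be spliced into the model's
            # context via an interrupt token; here we just print it.
            print("[interrupt]", task.result())

asyncio.run(main())  # total wall time ~1s instead of ~2s synchronous
```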
5). MAG-V: A Multi-Agent Framework for Synthetic Data Generation and Verification (paper)
Proposed MAG-V, a multi-agent framework.
It first generates datasets mimicking customer queries.
It then reverse-engineers alternative questions from the agent's responses to verify the agent trajectories (this loop is sketched below).
The generated synthetic data is reported to improve agent performance on real customer queries.
Comment: The combination of Agents and synthetic data is a trend.
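A minimal sketch of that generate-then-verify loop, with a single stand-in `llm` callable in place of the framework's separate investigator, agent, and verifier roles; prompts are illustrative.

```python
# Simplified MAG-V-style loop: generate a trajectory, reverse-engineer an
# alternative question, and keep the pair only if the trajectories agree.
def mag_v_round(llm, seed_queries):
    verified = []
    for query in seed_queries:
        trajectory = llm(f"Answer as the agent: {query}")
        alt_question = llm(f"Write a question answered by: {trajectory}")
        alt_trajectory = llm(f"Answer as the agent: {alt_question}")
        if llm(f"Same underlying task? {trajectory} || {alt_trajectory}") == "yes":
            verified.append((query, trajectory))  # trusted synthetic pair
    return verified
```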
6). Clio: A Platform for Analyzing and Surfacing Private Aggregated Usage Patterns from Millions of Claude.ai Conversations (paper)
Anthropic introduced Clio, a platform that uses AI assistants themselves to analyze and surface aggregated usage patterns from millions of Claude.ai conversations.
It enables understanding real-world AI usage while protecting user privacy.
The system helps identify usage trends, safety risks, and coordinated abuse attempts without human reviewers reading the original conversations (the aggregation step is sketched below).
Additional link: Anthropic tweet
Comment: The paper's analysis shows that programming-related use cases account for 4 of the top categories, together about 23% of conversations, indicating that programming is currently the most common AI usage scenario.
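A toy sketch of the privacy-preserving aggregation idea: a model reduces each conversation to a short, identifier-free topic label, and only clusters above a minimum size are ever surfaced. The `summarize` callable and the `MIN_CLUSTER_SIZE` value are illustrative assumptions, not Anthropic's actual pipeline.

```python
# Toy Clio-style aggregation: no human reads raw conversations; only
# sufficiently large, anonymized clusters are reported.
from collections import Counter

MIN_CLUSTER_SIZE = 1000  # illustrative privacy threshold

def surface_patterns(conversations, summarize):
    # The model strips identifying details while labeling each topic.
    topics = Counter(summarize(c) for c in conversations)
    return {topic: n for topic, n in topics.items() if n >= MIN_CLUSTER_SIZE}
```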
7). AutoReason Improves Multi-step Reasoning (paper)
Proposed a method using CoT prompting to automatically generate reasoning rationales for queries.
This transforms zero-shot queries into few-shot reasoning trajectories that the LLM uses as CoT examples.
The authors claim it can improve the reasoning capabilities of weaker LLMs.
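A minimal sketch of the two-stage idea, with stand-in `strong_llm` and `weak_llm` callables and illustrative prompts:

```python
# AutoReason-style sketch: a stronger model writes the rationale, which
# becomes CoT context for a weaker model.
def auto_reason(strong_llm, weak_llm, query: str) -> str:
    # Stage 1: expand the zero-shot query into explicit reasoning steps.
    rationale = strong_llm(
        f"Decompose this question into step-by-step reasoning:\n{query}")
    # Stage 2: the weak model answers with the rationale as its CoT example.
    return weak_llm(f"{rationale}\n\nFollowing the steps above, answer: {query}")
```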
8). Densing Law of LLMs (paper)
Introduced "capacity density" as a new metric to evaluate LLMs quality, measuring model effectiveness and efficiency by comparing target models with reference models.
Research found that LLMs' capacity density follows a "density law," growing exponentially over time, roughly doubling every three months.
This finding provides new perspectives for LLM development, emphasizing the need to focus on computational efficiency optimization while pursuing performance improvements.
Comment: The paper defines "effective parameter size" as the number of parameters a reference model would need to match the target model's performance; capacity density is then the ratio of effective to actual parameter size, making it a usable measure of model efficiency.
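A worked example with made-up numbers: under this definition, a hypothetical 7B model that matches a 14B reference has capacity density 2.0, and a three-month doubling period implies exponential growth.

```python
# Capacity density = effective parameter size / actual parameter size.
# All figures below are illustrative, not from the paper.
def capacity_density(effective_params_b: float, actual_params_b: float) -> float:
    return effective_params_b / actual_params_b

print(capacity_density(14, 7))  # 7B model matching a 14B reference -> 2.0

def projected_density(d0: float, months: float, doubling_months: float = 3.0) -> float:
    # Exponential growth with the roughly three-month doubling period.
    return d0 * 2 ** (months / doubling_months)

print(projected_density(2.0, 6))  # -> 8.0 after six months
```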
9). Turbo3D: Ultra-fast Text-to-3D Generation (paper)
Introduced Turbo3D, an ultra-fast text-to-3D system capable of generating high-quality Gaussian splatting assets in less than a second.
Turbo3D employs a rapid four-step, four-view diffusion generator and an efficient feed-forward Gaussian reconstructor, both operating in latent space.
10). A Survey on LLMs-as-Judges (paper)
Presented a comprehensive survey exploring the LLMs-as-judges paradigm from five key perspectives: functionality, methodology, applications, meta-evaluation, and limitations.
AIGC News of the week
1). HunyuanVideo
2). DeepSeek-VL2
3). SynCamMaster