**Elon Tusk** @srs@schelling.pt · 2021-03-08T22:57:02Z

RT @DanHendrycks
To find the limits of Transformers, we collected 12,500 math problems. While a three-time IMO gold medalist got 90%, GPT-3 models got ~5%, with accuracy increasing slowly.

If trends continue, ML models are far from achieving mathematical reasoning.

http://arxiv.org/pdf/2103.03874

Mar 08, 2021, 22:57 · Moa
