RT @DanHendrycks
To find the limits of Transformers, we collected 12,500 math problems. While a three-time IMO gold medalist got 90%, GPT-3 models got ~5%, with accuracy increasing slowly.

If trends continue, ML models are far from achieving mathematical reasoning.

arxiv.org/pdf/2103.03874
