**gavin leech** @gl@schelling.pt · Apr 26, 2023

**gavin leech** @gl@schelling.pt · Apr 26, 2023

gavin leech @gl@schelling.pt

Apr 26, 2023

'we can interpret a model’s cross-entropy loss as “how distinguishable” the model is from its training distribution, and use that to upper bound the difficulty of training a model to perform reliable, high-quality reasoning over long sequences.'

https://epochai.org/blog/the-direct-approach

The Direct Approach

Empirical scaling laws can help predict the cross-entropy loss associated with training inputs, such as compute and data. However, in order to predict when AI will…

Epoch

**gavin leech** @gl@schelling.pt · 2023-04-26T10:36:27Z

gavin leech @gl@schelling.pt

Includes an open review, bravely solicited from the grumpiest man in the neighbourhood

https://epochai.org/files/direct-approach-review-nuno-sempere.pdf

April 26, 2023 at 10:36 AM · · Moa · · ·

Trending now

Resources

Developers

What is Mastodon?

schelling.pt

More…