Has anyone tried to train neural networks to predict sudden drops in loss of LLM training?

We can certainly observe many scaling curves from many different tasks
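For instance (purely synthetic numbers, not from any real run), a single scaling curve of loss against compute can be fit as a power law via linear regression in log-log space; a predictor of sudden drops would presumably consume many such curves and their deviations:

```python
import numpy as np

# Hypothetical scaling data: loss L at several compute budgets C, generated
# from a known power law L = a * C**(-b) so we can check the fit recovers it.
compute = np.array([1e18, 1e19, 1e20, 1e21, 1e22])
observed_loss = 4.0 * compute ** -0.05  # synthetic: a = 4.0, b = 0.05

# Power law is linear in log-log space: log L = log a - b * log C,
# so an ordinary degree-1 polynomial fit recovers both parameters.
slope, intercept = np.polyfit(np.log(compute), np.log(observed_loss), 1)
a, b = np.exp(intercept), -slope
print(a, b)  # recovers roughly a = 4.0, b = 0.05
```

The interesting failure mode for such a fit is exactly the one discussed below: an emergent capability shows up as a deviation the smooth power law cannot capture.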

@niplav What kind of loss is that? How does that happen?


@Paradox loss as in predictive loss, a simple measure of predictive accuracy (for LLMs, typically cross-entropy on next-token prediction)

We want loss to be as low as possible, because that corresponds to good performance

Sometimes capabilities emerge with sudden falls in loss, and sometimes loss doesn't change much; we'd like to know in advance when capabilities will emerge or loss will decline sharply
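As a minimal sketch of what "sudden fall in loss" means here (with an entirely synthetic curve and a drop injected by hand; the open question above is predicting such drops before they happen, not detecting them afterwards):

```python
import numpy as np

# Hypothetical loss curve: smooth power-law decay over training steps,
# with an abrupt 0.8-unit drop injected at index 200, plus small noise.
steps = np.arange(100, 600)
loss = 5.0 * steps ** -0.2
loss[200:] -= 0.8  # the injected "emergent" sudden fall in loss
loss += np.random.default_rng(0).normal(0.0, 0.01, size=loss.shape)

# Smooth the curve, then flag steps whose one-step change is an extreme
# outlier (far below the mean change): those mark the sudden drop.
window = 10
smoothed = np.convolve(loss, np.ones(window) / window, mode="valid")
diffs = np.diff(smoothed)
threshold = diffs.mean() - 5 * diffs.std()
drop_idx = np.where(diffs < threshold)[0]
print(drop_idx)  # indices whose smoothing window spans the injected drop
```

A hindsight detector like this is trivial; the hard part is that nothing in the smooth portion of the curve obviously signals the drop in advance.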

@niplav This is a feature of LLMs I'm not educated on. I know the basics of such systems.

Mastodon
