**niplav** @niplav@schelling.pt · 2023-02-25T21:51:26Z

Does regularization of RL policies act as an impact measure?

**Rai** @agentydragon@mastodon.social · Feb 27, 2023, 09:48

@niplav maybe a bit? It might function a bit like making the policy more of a quantilizer than a straight-up utility optimizer, if you, say, look for the optimal policy within some maximum distance from a pretrained LLM. Not in the same formal way as the other impact measures, but it could have a similar practical function.
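
To make the "stay within some distance of the pretrained policy" idea concrete, here is a minimal sketch for a single decision over a finite action set. It uses the soft-penalty form (maximize expected reward minus `beta` times the KL divergence to the base policy) rather than a hard distance limit, and it is not a quantilizer in the formal sense. The function names and the toy numbers are assumptions made up for illustration, not anything from the thread.

```python
import math

def kl_regularized_policy(base_probs, rewards, beta):
    """Maximizer of E_pi[r] - beta * KL(pi || base) over a finite action set.

    For this one-step case the optimum has the closed form
    pi(a) proportional to base(a) * exp(r(a) / beta).
    """
    weights = [p * math.exp(r / beta) for p, r in zip(base_probs, rewards)]
    total = sum(weights)
    return [w / total for w in weights]

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same actions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical toy setup: the base policy rarely takes the high-reward action.
base = [0.7, 0.25, 0.05]
rewards = [0.0, 1.0, 10.0]

# Large beta: heavy regularization, the policy barely moves away from the base.
strong = kl_regularized_policy(base, rewards, beta=100.0)
# Small beta: light regularization, the policy collapses onto the best action.
weak = kl_regularized_policy(base, rewards, beta=0.5)

print("strong regularization:", strong, "KL:", kl_divergence(strong, base))
print("weak regularization:  ", weak, "KL:", kl_divergence(weak, base))
```

With a large `beta` the result stays close to the base distribution, and with a small `beta` it behaves like a plain reward maximizer, which is the sense in which the penalty limits how far the policy departs from pretrained behavior.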
