**niplav** @niplav@schelling.pt · 2022-03-24T15:56:19Z

use known inconsistencies of human preferences as value-learning trip-wires: if the value-learning algorithm hasn't learned them yet, it's operating at the wrong level of abstraction.
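a rough sketch of what such a trip-wire could look like, purely as illustration: assume the learned model exposes a pairwise preference probability, and take an intransitive cycle (A over B, B over C, C over A) as the known inconsistency. the names `tripwire`, `reproduces_cycle`, `utility_model` and `pairwise_model` are hypothetical, and the numbers are made up for the example. a model that fits a single scalar utility can't represent a cycle, so it trips the wire.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical interface: model(a, b) is the predicted probability
# that a human prefers option a over option b.
PreferenceModel = Callable[[str, str], float]
Cycle = Tuple[str, ...]


def reproduces_cycle(model: PreferenceModel, cycle: Cycle,
                     threshold: float = 0.5) -> bool:
    """True if the model predicts every link of the cycle a > b > ... > a."""
    pairs = zip(cycle, cycle[1:] + cycle[:1])
    return all(model(a, b) > threshold for a, b in pairs)


def tripwire(model: PreferenceModel,
             known_cycles: Sequence[Cycle]) -> List[Cycle]:
    """Return the known inconsistencies the model fails to reproduce."""
    return [c for c in known_cycles if not reproduces_cycle(model, c)]


# Illustrative model 1: a single scalar utility per option (made-up values).
utilities = {"A": 3.0, "B": 2.0, "C": 1.0}

def utility_model(a: str, b: str) -> float:
    return 1.0 if utilities[a] > utilities[b] else 0.0


# Illustrative model 2: independent pairwise predictions (made-up values).
pairwise = {("A", "B"): 0.9, ("B", "C"): 0.8, ("C", "A"): 0.7}

def pairwise_model(a: str, b: str) -> float:
    return pairwise.get((a, b), 0.5)


known = [("A", "B", "C")]
print(tripwire(utility_model, known))   # the cycle is reported as missed
print(tripwire(pairwise_model, known))  # nothing is reported
```

the check only says that a known inconsistency is missing from the model; it says nothing about which level of abstraction would be the right one.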
