**niplav** @niplav@schelling.pt · 2022-05-05T10:53:37Z

niplav @niplav@schelling.pt

Maybe it's true that intelligence depends on the environment, but consider: the environments where policy iteration performs better than RL with temporal difference learning are kind of dumb.

May 05, 2022, 10:53 · · · ·

Trending now

Resources

Developers

What is Mastodon?

schelling.pt

More…