I'm reading Sutton and Barto's RL textbook, and I notice that the formalism of the "reward signal" squishes together two inchoate concepts that are worth tracking separately: 'sensory feedback' from the 'environment', and the interpretation of that feedback in terms of what it implies for your wantingness.

I use the word "wantingness" deliberately here: there are many ways you can want things. You can want to achieve a goal (one-time), you can want to maximize the number of paperclips in the world (a continual task with no finish line, only better or worse outcomes), or you can want to stop wanting things (a kind of wantingness that is particularly difficult to formalize).
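
A minimal sketch of that split, under assumptions that are not from the textbook: a toy `Observation` type stands in for raw sensory feedback, and two hypothetical functions, `goal_reward` and `paperclip_reward`, stand in for two kinds of wantingness. The same observations produce different reward signals depending on which interpretation is applied. The third case, wanting to stop wanting, is left out because it is unclear how to write it as a function of observations at all.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Observation:
    """Raw sensory feedback from the environment (hypothetical toy fields).

    Nothing here says whether the situation is good or bad.
    """
    paperclips: int
    at_goal: bool


# An interpretation maps sensory feedback to a scalar reward.
Interpretation = Callable[[Observation], float]


def goal_reward(obs: Observation) -> float:
    """One-time goal: reward only on the step where the goal is reached."""
    return 1.0 if obs.at_goal else 0.0


def paperclip_reward(obs: Observation) -> float:
    """Open-ended wanting: more paperclips is always better."""
    return float(obs.paperclips)


def rewards(observations: List[Observation], interpret: Interpretation) -> List[float]:
    """Apply one interpretation to a fixed stream of sensory feedback."""
    return [interpret(obs) for obs in observations]


# One stream of sensory feedback, two reward signals.
trajectory = [
    Observation(paperclips=0, at_goal=False),
    Observation(paperclips=3, at_goal=False),
    Observation(paperclips=5, at_goal=True),
]
goal_signal = rewards(trajectory, goal_reward)
paperclip_signal = rewards(trajectory, paperclip_reward)
```

In the standard formalism the environment hands back the reward directly, so the interpretation step is folded into the environment; here it is pulled out into a separate function so it can be swapped without touching the observations.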
