I think it would be extremely cool if the "categorical cybernetics" bag of methods could say something about the relationship between inner and outer models, in particular the observation that transformers can learn gradient descent as one of the steps in their algorithm!

@julesh @bgavran @mc

---
RT @wwwojtekk
Among white students admitted to Harvard, 54% are athletes+legacy+dean's list+faculty/staff children (column 2). Just 10% is regular admission

Big athletic school, Harvard... t.co/zooXWhD1Lh
twitter.com/wwwojtekk/status/1

I need your most overwrought metaphor for American cultural domination in the aftermath of WW2. No, that's too overwrought.
---
RT @_F_B_G_
Reminded of the time the leader of WWII Japan and sometime God-Emperor, Hirohito, was loomed over threateningly by Mickey Mouse on a visit to Disneyland. twitter.com/EmmaMAshford/statu t.co/rW5UQQBOIR
twitter.com/_F_B_G_/status/159

