Show newer

law of one player: nothing happens unless you (yes, you specifically) make it happen

common theme rn in alignment research is how value evolve, e.g. concept/value extrapolation, shard theory

niplav boosted

bro you're not scaring the hoes at all. the hoes are actually developing an unassailable confidence and ruthless clarity of purpose that i'm finding quite alarming

if you're german then the term "gaslighting" is slightly queasy

can you money-pump the agents simulated by large language models?

give standard vNM coherence violating scenarios to smarter language models

at higher levels of intelligence, maintaining coherence becomes more difficult since your action space widens and having high coherence might be NP-hard. so the better metric is coherence divided by size of action space

niplav boosted
niplav boosted

i keep thinking about "slow is smooth, smooth is fast"

niplav boosted

onrushing tide, a mere
fifty miles away, and just
our puny channels

should i start a podcast

Disappointed that [1] doesn't actually check for common vNM violations. This must be assuaged

[1]: sohl-dickstein.github.io/2023/

So beware filtered reports of people (allegedly) reaching them quickly

Show thread

For some people jhanas are quite difficult to reach

most require a 10-day retreat

I needed 21 for the first

niplav boosted
Show older
Mastodon

a Schelling point for those who seek one