**niplav** @niplav@schelling.pt · Feb 20, 2023, 22:45

**niplav** @niplav@schelling.pt · Feb 20, 2023, 22:45

niplav @niplav@schelling.pt

niplav @niplav@schelling.pt

7.99K Posts

214 Following

234 Followers

Website: https://niplav.site/index.html

Pronouns: they/them

I operate by Crocker's rules[1].

[1]: https://www.lesswrong.com/tag/crockers-rules

Joined Aug 2021

214 Following 234 Followers

Posts Posts and replies Media

Show newer

Feb 20, 2023, 22:45

niplav @niplav@schelling.pt

Even with ML systems!

I agree that probably with most architectures, if you train them a lot to be capable alignment theorists, they have inner optimizers that are capable consequentialists, but the alignment-theorist-phase might be quite long (I could_{10%} see it going over 100x human ability).

Show thread

**niplav** @niplav@schelling.pt · Feb 20, 2023, 22:41

**niplav** @niplav@schelling.pt · Feb 20, 2023, 22:41

Feb 20, 2023, 22:41

niplav @niplav@schelling.pt

If we had those widely distributed, people would likely use them for capabilities and just widen the gap (e.g. OpenAI who talk about this as a strategy are not to be trusted with that strategy, since I don't see them using it solely for alignment work for half a year, and instead using it on both capabilities and alignment. But their plan is sound in that regard).

But I disagree with the view that you can't have the alignment theorist that is not also a consequentialist.

Show thread

**niplav** @niplav@schelling.pt · Feb 20, 2023, 22:39

**niplav** @niplav@schelling.pt · Feb 20, 2023, 22:39

Feb 20, 2023, 22:39

niplav @niplav@schelling.pt

Hm. I think the type of philosophy/math/cs needed for successful strawberry alignment is close enough to regular theorem-proving that AI systems that aren't seeds for worldcrunchers would still be very helpful.

(Doesn't feel to me like it touches the consequentialist core of cognition, a lot of philosophy is tree-traversal and finding inconsistent options, and math also feels like a MCTS-like thing)

Is the advantage we'd have by good alignment theorist ML systems 1.5x or 10x or 100x?

**niplav** @niplav@schelling.pt · Feb 20, 2023, 12:20

**niplav** @niplav@schelling.pt · Feb 20, 2023, 12:20

Feb 20, 2023, 12:20

niplav @niplav@schelling.pt

Telling my kidnappers about AI alignment until they gag me

**niplav** @niplav@schelling.pt · Feb 20, 2023, 12:15

**niplav** @niplav@schelling.pt · Feb 20, 2023, 12:15

Feb 20, 2023, 12:15

niplav @niplav@schelling.pt

Update: there's a bunch of women using the Replika thing.

I'd like to see the ratio

(95% confidence interval: [10%, 65%])

**niplav** @niplav@schelling.pt · Feb 20, 2023, 12:08

**niplav** @niplav@schelling.pt · Feb 20, 2023, 12:08

Feb 20, 2023, 12:08

niplav @niplav@schelling.pt

Hey @niconiconi did you write this: https://www.lesswrong.com/posts/mHqQxwKuzZS69CXX5/whole-brain-emulation-no-progress-on-c-elgans-after-10-years?

It's great

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:10

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:10

Feb 20, 2023, 11:10

niplav @niplav@schelling.pt

Man I do have a lot more respect for Oliver Habryka after listening to this[1]. Highlights include naming the thing where high status people eschew meritocracy because they can only lose, and the statement that there might be 5-10 years in the medium future that are about as crazy or crazier than 2020.

[1]: https://thefilancabinet.com/episodes/2023/02/05/6-oliver-habryka.html

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:05

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:05

Feb 20, 2023, 11:05

niplav @niplav@schelling.pt

Might've been in The Art of Unix Programming

Show thread

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:03

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:03

Feb 20, 2023, 11:03

niplav @niplav@schelling.pt

Hm, I remember reading somewhere sometime a classification of ways that you can use unix programs in pipes:

Sources (<, cat, programs that just produce output), filters (removing data, such as wc), transformers (?) (such as sort, cut, awk) and sinks (>, programs that just execute). Anyone recollect where I could've gotten that from?

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:02

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:02

Feb 20, 2023, 11:02

niplav @niplav@schelling.pt

https://www.readthesequences.com/

Show thread

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:02

**niplav** @niplav@schelling.pt · Feb 20, 2023, 11:02

Feb 20, 2023, 11:02

niplav @niplav@schelling.pt

people on the timeline are wrong

I have just the right thing

**niplav** · Feb 20, 2023, 01:20 *

niplav boosted

**eatscrayon** @eatscrayon@exploit.social · Feb 20, 2023, 01:20 *

Feb 20, 2023, 01:20 *

eatscrayon @eatscrayon@exploit.social

Just learned set theory and I cannot contain myself.

*edit*
This post hit 500 boosts an 1k likes :D
Trans rights are human rights.
Bash the fash.

**niplav** · Feb 20, 2023, 06:24

niplav boosted

**Dgar** @dgar@aus.social · Feb 20, 2023, 06:24

Feb 20, 2023, 06:24

Dgar @dgar@aus.social

If you rearrange the letters of POSTMEN, they become VERY ANGRY.

**niplav** @niplav@schelling.pt · Feb 20, 2023, 10:43

**niplav** @niplav@schelling.pt · Feb 20, 2023, 10:43

Feb 20, 2023, 10:43

niplav @niplav@schelling.pt

I'll take synthetic training data for $500, sam

https://twitter.com/rgblong/status/1626500027534434305

**niplav** · Feb 19, 2023, 09:01

niplav boosted

**kafkamacchiato** @kafkamacchiato@schelling.pt · Feb 19, 2023, 09:01

Feb 19, 2023, 09:01

kafkamacchiato @kafkamacchiato@schelling.pt

rlhf me daddy

**niplav** @niplav@schelling.pt · Feb 18, 2023, 10:51

**niplav** @niplav@schelling.pt · Feb 18, 2023, 10:51

Feb 18, 2023, 10:51

niplav @niplav@schelling.pt

🤔 🤔

Embarassment is a low status emotion, right?

Show thread

**niplav** @niplav@schelling.pt · Feb 17, 2023, 21:09

**niplav** @niplav@schelling.pt · Feb 17, 2023, 21:09

Feb 17, 2023, 21:09

niplav @niplav@schelling.pt

bnuuy

**niplav** @niplav@schelling.pt · Feb 17, 2023, 20:18

**niplav** @niplav@schelling.pt · Feb 17, 2023, 20:18

Feb 17, 2023, 20:18

niplav @niplav@schelling.pt

do not talk to philosophers. Do not engage in philosophy. Eschew everything that starts with "meta". Do NOT give them a platform. I am so done with this.

**niplav** · Feb 17, 2023, 20:06

niplav boosted

**MOVED to @chjara@meow.tuxcrafting.online** @chjara@snowdin.town · Feb 17, 2023, 20:06

Feb 17, 2023, 20:06

MOVED to @chjara@meow.tuxcrafting.online @chjara@snowdin.town

actually, first a short rant
i hate the libc
even outside the fact it's 99% antiquated nonsense you should never use,
a lot of it (integer types, stdarg, math functions, string/memory operations) should be handled by the compiler instead of the libc - in fact, most of the time libcs do these by just stubbing compiler intrinsics, which bruh
then stuff like memory allocation, file management, and really most IO-adjacent operations are really application/system-specific and should be put in a separate library instead of the libc proper
now you might say, wait then what would remain in the libc

exactly