with stable diffusion, you can make anything you can imagine into a picture.

turns out most people imagine puffy eyed koreans in various states of undress

**niplav** @niplav@schelling.pt · Apr 23, 2023, 11:16

The last points especially might be ameliorated by literally just appending "and don't optimize too hard" and "let yourself be shut down by a human" to the prompt?

Man I feel confused, but assuming that language models aren't infested with inner optimizers now I'm more hopeful?

Or am I missing something crucial here…

**niplav** @niplav@schelling.pt · Apr 23, 2023, 11:13

• Last point especially crucial in situations where such an agent starts recursively improving itself (e.g. training new models)

**niplav** @niplav@schelling.pt · Apr 23, 2023, 11:12

Thinking out loud about what still doesn't work with giving AutoGPT agents instructions like "do X but respect human preferences while doing so".

• Inner optimizers are still a problem if they exist in the GPT models
• Do LLM agents have sufficient goal stability? I.e. when delegating & delegating further does the original goal get perturbed or even lost?
• Limited to the models' understanding of "human values"
• Doesn't solve ambitious value learning, model might generalise badly once in new domains

**niplav** @niplav@schelling.pt · Apr 22, 2023, 22:16

land-value taxes on arbitrary graphs

**niplav** @niplav@schelling.pt · Apr 22, 2023, 21:56

Hm maybe Hodge decomposition can be used to define the goal-directedness of a system?

If your system is in loops, it's not accomplishing much, but the potential part also needs to be high (rocks have no loops but also no direction)
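
A minimal sketch of that idea, assuming the system is modelled as net flows on the edges of a connected directed graph. It splits the flow into a gradient (potential) part and a leftover cyclic part by least squares. The names `hodge_split` and `energies` are hypothetical, and reading the two energies as a measure of goal-directedness is an assumption made for illustration only.

```python
def _solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[k]] for k, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x


def hodge_split(n_nodes, flows):
    """Split edge flows into a gradient part and a cyclic remainder.

    flows maps (i, j) to the net flow from node i to node j.
    Assumes the graph is connected and has at least two nodes.
    """
    # Graph Laplacian and divergence: the least-squares potential phi
    # that minimises sum (f_ij - (phi_j - phi_i))^2 solves L phi = b.
    L = [[0.0] * n_nodes for _ in range(n_nodes)]
    b = [0.0] * n_nodes
    for (i, j), f in flows.items():
        L[i][i] += 1.0
        L[j][j] += 1.0
        L[i][j] -= 1.0
        L[j][i] -= 1.0
        b[j] += f
        b[i] -= f
    # phi is only defined up to a constant, so pin phi[0] = 0.
    reduced = [row[1:] for row in L[1:]]
    phi = [0.0] + _solve(reduced, b[1:])
    gradient = {(i, j): phi[j] - phi[i] for (i, j) in flows}
    cyclic = {e: flows[e] - gradient[e] for e in flows}
    return phi, gradient, cyclic


def energies(n_nodes, flows):
    """Return (gradient energy, cyclic energy) as sums of squares."""
    _, gradient, cyclic = hodge_split(n_nodes, flows)
    return (sum(v * v for v in gradient.values()),
            sum(v * v for v in cyclic.values()))


if __name__ == "__main__":
    loop = {(0, 1): 1.0, (1, 2): 1.0, (2, 0): 1.0}   # going in circles
    path = {(0, 1): 1.0, (1, 2): 1.0}                # heading somewhere
    rock = {(0, 1): 0.0, (1, 2): 0.0}                # no flow at all
    for name, flows in [("loop", loop), ("path", path), ("rock", rock)]:
        print(name, energies(3, flows))
```

Under this reading, the pure loop puts all of its energy in the cyclic part, the path puts all of it in the gradient part, and the rock has zero energy in both, so a score would have to reward a large gradient part rather than only a small cyclic one.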

**niplav** @niplav@schelling.pt · Apr 22, 2023, 21:53

How many people in the 50s knew about John von Neumann? Very few I reckon

**niplav** @niplav@schelling.pt · Apr 22, 2023, 21:51

Since tails come apart, you probably don't know the relevant polymaths in our world and know the gifted communicators much better.

niplav boosted

**Danpiker** @Danpiker@mathstodon.xyz · Apr 21, 2023, 14:03

How many different ways can 4 equal circles be linked in 3d space?

- not counting solutions composed of multiple separate links
- no touching or crossing of the circles
- true geometric circles only, not elongated or distorted
- considering topologically equivalent arrangements to be the same

How about 5 circles? Has someone already catalogued these?
I've seen some enumerations of planar arrangements, and link tables allowing non-circular loops, but haven't yet found one for circles in space.

f7780241087e90b5.png

**niplav** @niplav@schelling.pt · Apr 22, 2023, 00:28

On the object level, this means that I should take climate change people more seriously out of cooperative spirit even tho I don't particularly believe their object level arguments

As partially causal cooperation with worlds where they are in fact right or sth idk

**niplav** @niplav@schelling.pt · Apr 22, 2023, 00:23

So how do you navigate this dilemma? People can't just disagree but avoid each other, setup implies large externalities.

**niplav** @niplav@schelling.pt · Apr 22, 2023, 00:23

So, how *do* you engage in a conflict where one side is trying to avoid apocalyptic but unobservable behavior, but everyone else doesn't believe their arguments?

We might do that with money, but feels insufficient. Assume evaluating object-level arguments is really really difficult here.

Rarely doomers could be right.

**niplav** @niplav@schelling.pt · Apr 21, 2023, 23:14

"I read all your fanfictions."
"Bet with me on the claim X you made."
"No."
"Then you are not of our culture."

**niplav** @niplav@schelling.pt · Apr 21, 2023, 23:01

One of my hot takes is that game theory is basically useless

niplav boosted

**lait accompli** @genmaicha@stereophonic.space · Apr 21, 2023, 16:52

#fridey

8de851a72b5eb0b28a380cddf9477dcf35655ef4550aa65a822a73a99afe5ab9.mp4

**niplav** @niplav@schelling.pt · Apr 21, 2023, 17:39

Ali Maow Maalin was the last person to contract naturally occurring smallpox before it was eradicated.

He caught it in 1977 and made a full recovery.

In the 1990s he was a local coordinator in the fight against polio in Somalia, where he spent years traveling around, distributing vaccines and educating the population.

In 2013, he was again campaigning in the region after polio had been reintroduced, but fell ill with a fever.

On July 22nd 2013, he died of malaria.

Website: https://niplav.site/index.html

Pronouns: they/them

I operate by Crocker's rules[1].

[1]: https://www.lesswrong.com/tag/crockers-rules

Joined Aug 2021
