In version B, we're talking about Inner Alignment failures, where the AI is programmed to maximize human happiness, and the "paperclips" are 10-neuron constructs that count as human to the AI and can only feel happiness.
In version A, we're talking about the Orthogonality Thesis, and the paperclips are actual paperclips*, because the point is that a superintelligent AI might not care about what you care about.
* This also applies to bolts, or Facebook share prices.
@empathy2000 is this just because we use the jargon "tacit knowledge" for that category, or do you think there's more discussion missing?
@flats I think the instrumental convergence argument is still pretty good. It does rely somewhat on the idea that the AI will be trained to optimize a single metric.
When reinforcement learning seemed like the winning technique, this was a big risk. Now that LLMs are the most promising technique, it's less clear. <Minimize next token prediction error> doesn't obviously call for conquering the universe.
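For concreteness, a minimal sketch of what that single metric actually is (assuming a PyTorch-style setup; the random tensors and shapes are just stand-ins for a real model and real training text):

```python
import torch
import torch.nn.functional as F

# Toy stand-in for "minimize next-token prediction error": the entire
# training signal is cross-entropy between the model's predicted
# distribution over the next token and the token that actually came next.
vocab_size, seq_len, batch = 50_000, 128, 4

logits = torch.randn(batch, seq_len, vocab_size)          # pretend model output
tokens = torch.randint(0, vocab_size, (batch, seq_len))   # pretend training text

# Position t is scored on how well it predicts token t+1.
loss = F.cross_entropy(
    logits[:, :-1].reshape(-1, vocab_size),
    tokens[:, 1:].reshape(-1),
)
print(loss)  # one scalar about text statistics, nothing about the world
```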
@flats right. The question is how many of the fundamental arguments were worked out assuming that the goal was to build a CEV sovereign and never rechecked to see if they still apply now that that goal has been abandoned.
@flats If the AI isn't going to acquire godlike power, how many of the issues devolve into the principal-agent problem?
But no one wants to double-check 1000 pages of blog posts to see if the conclusion relies on an unstated assumption.
@flats I think the problem is that a lot of their thinking on AI has a presumed final step <then we give it control over everything and it instantiates heaven on earth>, and a lot of the threats hinge on the implicit assumption that you will give the AI control over everything.
So, an AI might conceal its real goals... Is that an issue if it is only going to get enough power to run the factory?
Maybe, maybe not. But we have to check every argument.
@flats it looks like I won't have time to write a real post anytime soon, so I'll point you to this short summary instead:
https://twitter.com/WomanCorn/status/1631696104403107844?s=19
What I find amazing is that none of the glass parts of the lamp broke. I'd expect those to be the easiest to break.
@lispegistus if you wait until the 1919 eclipse, you don't beat the standard timeline.
Is there a way to do it sooner?