Let f be a function that takes as input an inconsistent preference i and produces a set C of consistent preferences (here preferences are directed graphs over options, and consistent preferences are acyclic tournaments). Let S be the set of subgraphs of i for which it holds that every s∈S is acyclic, is a subgraph of i, and is maximal in i with respect to those properties; S contains all s that fulfill these conditions.

Let PCS be the property that every s∈S is a subgraph of at least one c∈C (every consistent subpreference appears at least once in a consistent version).

Then you have the following impossibilities:

• PCS and f has polynomial runtime. (Proof sketch: finding a largest s is NP-hard, since it is the maximum acyclic subgraph problem, and from the output of a PCS-satisfying f one could read off such an s in polynomial time, so, assuming P≠NP, f can't run in polynomial time.)
• PCS and C has polynomial size. (Proof sketch: you can construct a graph with exponentially many acyclic tournaments as maximal acyclic subgraphs, e.g. by putting edges in both directions between every pair of options; since no single acyclic c can contain two distinct elements of S, C must be exponentially large.)
• PCS and all c∈C have minimum graph edit distance to i. (Proof sketch: there is a graph for which all acyclic tournaments at the same (minimal) graph edit distance fail to contain a specific subgraph. For the graph in the picture, the minimal edit distance is 3 and the non-preserved consistent subgraph is a2→a4. This extends to arbitrarily large consistent subgraphs: replace every edge with an acyclic tournament on n nodes.)

In the context of making inconsistent preferences consistent, these are fairly strong results. I'm not sure about their approximation behavior, but I think this makes becoming a coherent agent very difficult.
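To make the definitions concrete, a brute-force sketch (illustrative only: the helper names are made up, and everything here is deliberately exponential-time, in line with the results above). It takes f to be the trivial function that returns every acyclic tournament over the options and checks PCS on a tiny cyclic preference:

```python
# Illustrative brute-force sketch only; helper names are made up, not from the post.
from itertools import combinations, permutations
import networkx as nx

def maximal_acyclic_subgraphs(i: nx.DiGraph):
    """The set S: edge sets of i that are acyclic and maximal with that property."""
    edges = list(i.edges())
    acyclic = []
    for k in range(len(edges) + 1):
        for subset in combinations(edges, k):
            g = nx.DiGraph(list(subset))
            g.add_nodes_from(i.nodes())
            if nx.is_directed_acyclic_graph(g):
                acyclic.append(frozenset(subset))
    # keep only the maximal acyclic edge sets
    return [s for s in acyclic if not any(s < t for t in acyclic)]

def all_acyclic_tournaments(nodes):
    """Every consistent preference (acyclic tournament) over `nodes`: one per total order."""
    nodes = list(nodes)
    for order in permutations(nodes):
        yield frozenset(
            (order[a], order[b])
            for a in range(len(order))
            for b in range(a + 1, len(order))
        )

def satisfies_pcs(i: nx.DiGraph, C):
    """PCS: every s in S is contained in at least one c in C."""
    return all(any(s <= c for c in C) for s in maximal_acyclic_subgraphs(i))

# Tiny example: the cyclic (inconsistent) preference a > b > c > a.
i = nx.DiGraph([("a", "b"), ("b", "c"), ("c", "a")])
C = list(all_acyclic_tournaments(i.nodes()))  # the trivial f: return every tournament
print(satisfies_pcs(i, C))                    # True, but |C| = n! is not polynomial
```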
I just watched the BBC clip on Emmy Noether :) Absolutely astonished, and I recommend it to anybody on mathstodon.xyz. I knew she was famous and intelligent; I never knew how extensive and foundational her other work was: "modern algebra", topology, gauge theory, ....
From the broadcast, I would put her up with the heroes of mathematics: Euler, Gauss, ...
I know I am smart but I also know that there are people who are _really_ really smart, and it's clear that Noether was one of them.
Before I watched it this morning, it occurred to me: what would the world be like if Gödel and Noether had married (or worked together)? I found it beyond even my sci-fi-fueled imagination :)
Thinking out loud about what still doesn't work when giving AutoGPT agents instructions like "do X but respect human preferences while doing so":

• Inner optimizers are still a problem if they exist in the GPT models
• Do LLM agents have sufficient goal stability? I.e., when delegating and delegating further, does the original goal get perturbed or even lost?
• Limited to the models' understanding of "human values"
• Doesn't solve ambitious value learning; the model might generalise badly once in new domains
• The last point is especially crucial in situations where such an agent starts recursively improving itself (e.g. training new models)

The last points especially might be ameliorated by literally just appending "and don't optimize too hard" and "let yourself be shut down by a human" to the prompt? (A toy version of that is sketched below.)
Man, I feel confused, but assuming that language models aren't infested with inner optimizers, I'm now more hopeful?
Or am I missing something crucial here…
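What appending those clauses could look like, as a minimal sketch (`run_agent` is a hypothetical stand-in for however an AutoGPT-style agent receives its goal, not a real API; whether natural-language constraints like these actually bind the agent's behaviour is exactly the open question above):

```python
# Hypothetical sketch: wrap a task with corrigibility clauses before handing it
# to an AutoGPT-style agent. `run_agent` below is a made-up placeholder.
SAFETY_SUFFIX = (
    " While doing so, respect human preferences, don't optimize too hard,"
    " and let yourself be shut down by a human."
)

def constrained_goal(task: str) -> str:
    """Append the corrigibility clauses to the task before any delegation."""
    return task.rstrip(".") + "." + SAFETY_SUFFIX

# run_agent(constrained_goal("Do X"))  # run_agent is hypothetical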
I operate by Crocker's rules[1].