@TetraspaceGrouping
Hm, true.
Per the universal approximation theorem, neural networks can approximate any continuous function on a compact domain, but some functions are clearly easier to approximate than others.
And the horribly discontinuous (or very steep) ones are probably very hard to approximate.
Perhaps it's that K-Lipschitz continuous functions are easier to approximate for smaller K?
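Not from the thread, just a quick sketch to make that concrete: sin(kx) is k-Lipschitz, so fitting it with the same small MLP for small vs. large k is a crude test of the "smaller K is easier" hunch. The architecture, k values, and training budget are arbitrary choices of mine.

```python
# Fit sin(k*x) -- a k-Lipschitz function -- with the same small MLP for
# small vs. large k and compare final training error. Hypothesis (hedged):
# the steeper function (larger k) ends up with visibly higher MSE.
import torch

torch.manual_seed(0)
x = torch.linspace(-1, 1, 512).unsqueeze(1)

def fit(k, steps=2000):
    y = torch.sin(k * x)
    net = torch.nn.Sequential(
        torch.nn.Linear(1, 64), torch.nn.Tanh(),
        torch.nn.Linear(64, 64), torch.nn.Tanh(),
        torch.nn.Linear(64, 1),
    )
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)
    for _ in range(steps):
        opt.zero_grad()
        loss = torch.nn.functional.mse_loss(net(x), y)
        loss.backward()
        opt.step()
    return loss.item()

for k in (1.0, 30.0):
    print(f"k = {k:5.1f}  final MSE = {fit(k):.6f}")
```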
Okay, new question: which prior do neural networks trained with gradient descent actually implement?
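One way to poke at that question empirically (again my own sketch, not anything established in the thread): fit a wide net to a handful of random points, then measure how "wiggly" the learned interpolant is. If gradient descent carried no bias, any interpolant through the points would be equally likely; in practice the fit tends to stay close to the smoothest curve the data allows, which is the usual spectral-bias / simplicity-prior observation.

```python
# Fit 8 random points with a wide one-hidden-layer net, then compare the
# empirical Lipschitz constant of the learned function against the smallest
# Lipschitz constant ANY interpolant of the data must have (max slope
# between consecutive points, by the mean value theorem). All hyperparameters
# here are arbitrary illustrative choices.
import torch

torch.manual_seed(1)
xs, _ = torch.sort(torch.rand(8, 1) * 2 - 1, dim=0)  # 8 sorted points in [-1, 1]
ys = torch.rand(8, 1) * 2 - 1

net = torch.nn.Sequential(
    torch.nn.Linear(1, 256), torch.nn.Tanh(),
    torch.nn.Linear(256, 1),
)
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
for _ in range(5000):
    opt.zero_grad()
    torch.nn.functional.mse_loss(net(xs), ys).backward()
    opt.step()

# Empirical Lipschitz constant of the learned function on a dense grid...
grid = torch.linspace(-1, 1, 2001).unsqueeze(1)
with torch.no_grad():
    out = net(grid).squeeze()
lip_net = ((out[1:] - out[:-1]).abs()
           / (grid[1:] - grid[:-1]).squeeze().abs()).max()

# ...versus the minimum slope the data itself forces on any interpolant.
slopes = (ys[1:] - ys[:-1]).abs() / (xs[1:] - xs[:-1]).abs()
print(f"net Lipschitz ~ {lip_net.item():.2f}, data needs >= {slopes.max().item():.2f}")
```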