To detect whether text came from LM X, randomly perturb it and compare X's log-probabilities of the original and the perturbed version.
If log p(original) > log p(perturbed), classify the text as LM-generated.
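A minimal sketch of this detector (it resembles the DetectGPT idea: LM-generated text tends to sit near a local maximum of the model's log-probability, so perturbations lower it). The perturbation here is a toy adjacent-word swap, and `logprob` is a stand-in you would replace with a real call to X's scoring API; averaging over several perturbations makes the comparison less noisy.

```python
import random

def perturb(text, rng):
    """Toy perturbation: swap two adjacent words.
    (A real system would use a mask-filling model to rewrite spans.)"""
    words = text.split()
    if len(words) < 2:
        return text
    i = rng.randrange(len(words) - 1)
    words[i], words[i + 1] = words[i + 1], words[i]
    return " ".join(words)

def detect(text, logprob, n_perturbations=20, seed=0):
    """Classify as LM-generated if the model scores the original text
    higher than the average of its perturbed variants."""
    rng = random.Random(seed)
    orig = logprob(text)
    perturbed = [logprob(perturb(text, rng)) for _ in range(n_perturbations)]
    return orig > sum(perturbed) / len(perturbed)
```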
"to increase performance by 10% absolute, just take the majority-vote answer of several LM answers"
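The majority-vote part is a one-liner; sample the model several times, then keep the most common final answer:

```python
from collections import Counter

def majority_vote(answers):
    """Return the most common answer among several sampled LM answers."""
    return Counter(answers).most_common(1)[0][0]
```

In practice you would sample the same prompt at nonzero temperature and vote over the extracted final answers, not the full generations.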
"to reduce resource use by 50%(!), use a large model to do rejection sampling of small models' output"
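A rough sketch of that loop, under assumed interfaces: `draft()` samples from the cheap model, `accept_prob(x)` is the expensive model's acceptance probability for a candidate (both hypothetical stand-ins), and after too many rejections you fall back to generating with the large model directly:

```python
import random

def rejection_sample(draft, accept_prob, rng, max_tries=5, fallback=None):
    """Small model proposes, large model accepts/rejects.
    `draft` and `accept_prob` are stand-ins for real model calls."""
    x = None
    for _ in range(max_tries):
        x = draft()  # cheap proposal
        if rng.random() < accept_prob(x):  # expensive check
            return x
    # Too many rejections: optionally generate with the large model itself.
    return fallback() if fallback is not None else x
```

The saving comes from the large model only scoring candidates (one forward pass) instead of generating them token by token.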
I suppose chain-of-thought prompting is itself one of these tricks: spend more inference-time compute to buy accuracy.