Ben Edelman (@EdelmanBen) Twitter Tweets • TwiCopy

Ben Edelman

@EdelmanBen

Final-year PhD candidate at Harvard CS trying to understand AI scientifically. New to the platform formerly known as Twitter.

ID:2383010678

Link: https://www.benjaminedelman.com/
Joined: 11-03-2014 02:27:18

33 Tweets

113 Followers

20 Following

Boaz Barak

@boazbaraktcs

1 month ago

One challenge in fighting “hallucinations” is distinguishing epistemic uncertainty from the natural “aleatoric” uncertainty of language. One can define the latter as the uncertainty that remains in the limit of infinite compute and data.

Question is whether smaller model can…

18 likes · 0 replies · 1 repost

Gustaf Ahdritz

@gahdritz

1 month ago

In new work (with Tian Qin, Nikhil Vyas, Boaz Barak, and Ben Edelman), we show that LLMs can identify their own epistemic uncertainty in free-form text, suggesting new approaches for combating hallucinations. (1/6)
Paper: arxiv.org/abs/2402.03563
Blog: bit.ly/3U2emAP

Kempner Institute at Harvard University

@KempnerInst

1 month ago

New Deeper Learning blog post: a linear probe can unlock an LM's metacognitive capability to distinguish tokens that are 'knowable' from tokens where its predictions can't be improved. bit.ly/3U2emAP
Gustaf Ahdritz, Tian Qin, Nikhil Vyas, Boaz Barak, Ben Edelman
#AI #LLM

15 likes · 0 replies · 4 reposts

Ekdeep Singh

@EkdeepL

1 month ago

Two (out of 8) things that Sam Bowman wants you to know about LLMs:

(i) LLMs predictably get more capable with increasing investment

(ii) Many important LLM behaviors emerge unpredictably

How can we get ahead of the curve and predict these ‘unpredictable’ behaviors?🧵⬇️

27 likes · 0 replies · 4 reposts

David Krueger

@DavidSKrueger

1 month ago

I’m super excited to release our 100+ page collaborative agenda - led by Usman Anwar - on “Foundational Challenges In Assuring Alignment and Safety of LLMs” alongside 35+ co-authors from NLP, ML, and AI Safety communities!

Some highlights below...

Surbhi Goel

@SurbhiGoel_

2 months ago

New paper on understanding the evolution of induction heads using a clean task of in-context learning Markov chains. Phase transitions, cool theory and videos! Check it out!

Shout-out to Ezra Edelman (first PhD paper!) and Nikos, for leading this work!

27 likes · 0 replies · 5 reposts

Ben Edelman

@EdelmanBen

2 months ago

Top-left: hmm, bumpy loss curve 🤨

Top-right: (looking at functional behavior) Model goes through five phases! In-context 0-grams, 1-grams, 2-grams, 3-grams, 4-grams.

Bottom: Phases correspond to attn heads specializing one by one. Functional emergence <-> circuit emergence!

5 likes · 0 replies · 0 reposts

Ezra Edelman

@ezra_edelman

2 months ago

How do induction heads / in-context learning emerge? We study a new synthetic task to better understand how these circuits evolve (in stages!) over time.
w/ Ben Edelman, Surbhi Goel, Eran Malach & Nikos Tsilivis
Blog: unprovenalgos.github.io/statistical-in…
Paper: arxiv.org/abs/2402.11004 (1/7)
