Robin Jia (@robinomial)'s Twitter Profile
Robin Jia

@robinomial

Assistant Professor @CSatUSC | Previously Visiting Researcher @facebookai | Stanford CS PhD @StanfordNLP

ID:1012392833834029056

🔗 https://robinjia.github.io/ · Joined 28-06-2018 17:50:35

172 Tweets

3.2K Followers

759 Following

Robin Jia (@robinomial)'s Twitter Profile Photo

While studying memorization in LLMs, we struggled with a basic question: can knowledge really be localized to specific parameters? So, we designed two independent benchmarks to answer this. Amazingly, they agree: pruning-based localization is best, works well, but isn’t perfect.

account_circle
Ting-Yun Chang (@CharlotteTYC)'s Twitter Profile Photo

Localization in LLMs is often mentioned. But do localization methods actually localize correctly? In our #NAACL2024 paper, we (w/ @_jessethomason_, @robinomial) develop two benchmarking ways to directly evaluate how well 5 existing methods can localize memorized data in LLMs.
account_circle
Deqing Fu (@DeqingFu)'s Twitter Profile Photo

Do multimodal foundation models treat every modality equally?

Hint: Humans have picture superiority. How about machines?

Introducing IsoBench, a benchmark for multimodal models with isomorphic inputs.

🔗 IsoBench.github.io
account_circle
Robin Jia (@robinomial)'s Twitter Profile Photo

This is personally my favorite experiment from our data watermark work: BLOOM-176B memorizes SHA hashes from its training data, so we can use similar random character sequences to watermark any document collection!
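The tweet's idea can be sketched in a few lines. This is a toy illustration, not the paper's actual pipeline: `make_watermark` and `watermark_documents` are hypothetical helper names, and the sketch just generates a SHA-256-style random hex sequence and appends it to each document in a collection.

```python
import hashlib
import secrets

def make_watermark(num_chars: int = 64) -> str:
    """Generate a random hex sequence (SHA-hash-like) to serve as a watermark."""
    return hashlib.sha256(secrets.token_bytes(32)).hexdigest()[:num_chars]

def watermark_documents(docs: list[str], watermark: str) -> list[str]:
    """Append the same random sequence to every document in the collection."""
    return [doc + "\n" + watermark for doc in docs]

wm = make_watermark()
docs = watermark_documents(["first document", "second document"], wm)
assert len(wm) == 64
assert all(d.endswith(wm) for d in docs)
```

Because the sequence is random, it is vanishingly unlikely to occur in any other training data, which is what makes memorization of it informative.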

account_circle
Robin Jia (@robinomial)'s Twitter Profile Photo

Determining whether an LLM has trained on your data isn't a classification problem; it's a statistical testing problem. Really proud of this work by Johnny Tian-Zheng Wei and Ryan Yixiang Wang on using random watermarks to rigorously detect data usage!
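The "statistical testing, not classification" framing can be sketched as a simple null-hypothesis test. This is a toy sketch, not the paper's actual test: `watermark_pvalue` and the `score` callback are hypothetical names, and the model is replaced by a stand-in scoring function. The idea: if the model trained on your watermarked data, its score on the true watermark should rank unusually high among random watermarks it never saw.

```python
import random

def watermark_pvalue(score, true_wm: str, num_null: int = 999, length: int = 64) -> float:
    """One-sided test: is the model's score on the true watermark
    unusually high relative to random watermarks it never saw?"""
    alphabet = "0123456789abcdef"
    null_scores = []
    for _ in range(num_null):
        fake = "".join(random.choice(alphabet) for _ in range(length))
        null_scores.append(score(fake))
    s = score(true_wm)
    # p-value: fraction of null watermarks scoring at least as high (+1 correction)
    ge = sum(1 for x in null_scores if x >= s)
    return (ge + 1) / (num_null + 1)

# Toy stand-in for an LLM log-likelihood: a "model" that memorized `seen`
seen = "deadbeef" * 8
toy_score = lambda wm: 10.0 if wm == seen else random.random()
p = watermark_pvalue(toy_score, seen)
assert p <= 0.01  # small p-value: evidence the data was trained on
```

A small p-value gives a rigorous, calibrated statement about data usage, which a binary classifier cannot.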

account_circle
Jieyu Zhao@ICLR2024 (@jieyuzhao11)'s Twitter Profile Photo

🔊 Please consider submitting to our Secure and Trustworthy Large Language Model Workshop! Submission deadline is Feb 12.

account_circle
ACL 2024 (@aclmeeting)'s Twitter Profile Photo

ACL announcement:
'The ACL Executive Committee has voted to significantly change ACL's approach to protecting anonymous peer review. The change is effective immediately.' (1/4)

account_circle
Pei Zhou (@peizNLP)'s Twitter Profile Photo

I’m at #NeurIPS23 and on the job market 🎷🧳!! Come talk about anything: LLM reasoning, evaluating communicating agents, human-AI collaboration for new discoveries, or coffee and jazz in NOLA ☕️
account_circle
Robin Jia (@robinomial)'s Twitter Profile Photo

I’ll be presenting SCENE at the poster session this *Saturday* at 11am! Come chat about unanswerable questions, minimal pairs, extrapolation, and the value of synthetic data :)

account_circle
USC Thomas Lord Department of Computer Science (@CSatUSC)'s Twitter Profile Photo

We're hiring! USC Thomas Lord Department of Computer Science is growing fast, with multiple openings for tenure-track and tenured positions.

🔍Security/privacy, AI, machine learning, data science, HCI, but exceptional candidates in all areas considered.

📅Deadline: Jan 5
🔗Details: tinyurl.com/5e99bmb8
account_circle
USC Thomas Lord Department of Computer Science (@CSatUSC)'s Twitter Profile Photo

🚨Now hiring full-time teaching faculty!🚨

Apply now to join our fast-growing, collaborative department, with a brand new building opening this spring.

Find out more about our open faculty positions
(teaching, tenured, tenure track): cs.usc.edu/about/open-fac…

Please share!

🚨Now hiring full-time teaching faculty!🚨 Apply now to join our fast-growing, collaborative department, with a brand new building opening this spring. Find out more about our open faculty positions (teaching, tenured, tenure track): cs.usc.edu/about/open-fac… Please share!
account_circle
USC Thomas Lord Department of Computer Science (@CSatUSC)'s Twitter Profile Photo

We are at this week! 👋

Explore the latest research from the USC Thomas Lord Department of Computer Science and USC ISI, spanning social bias in name translation, AI tools for journalists, ambiguity in LMs, and much more ⬇️

viterbischool.usc.edu/news/2023/12/u…

account_circle
Qinyuan Ye (@qinyuan_ye)'s Twitter Profile Photo

I just arrived in Singapore! 😜

I'll be presenting our work on modeling and predicting LLM capabilities at GenBench workshop poster session (10am tomorrow). Also presenting it virtually in gather.town at 4pm on Dec 8. Check out our thread below 👇

account_circle
Qinyuan Ye (@qinyuan_ye)'s Twitter Profile Photo

What can we learn from thousands of LLM experiment records? Can we predict emergent abilities? Does this give any clue on making LLM benchmarking more efficient and principled? Check out our paper on analyzing BIG-bench and designing “small-bench.”

arxiv.org/abs/2305.14947 🧵⬇️
account_circle
Robin Jia (@robinomial)'s Twitter Profile Photo

ICL reduces the need for labeled *training* data, but what about test data? Fantastic work led by USC undergrad Harvey Yiyun Fu (applying to PhD programs this year!) shows we can estimate ICL accuracy well (on par with using 40 labeled examples) from *unlabeled* data plus model uncertainties.
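The simplest version of this idea can be sketched in a few lines. This is a toy baseline, not the paper's actual estimator: `estimate_accuracy_from_confidence` is a hypothetical name, and it just averages the model's max-class probabilities over unlabeled inputs, which tracks accuracy when the model is well-calibrated.

```python
import numpy as np

def estimate_accuracy_from_confidence(probs: np.ndarray) -> float:
    """Estimate accuracy on unlabeled data as the mean max-class probability.
    probs: (num_examples, num_classes) softmax outputs on unlabeled test inputs."""
    return float(probs.max(axis=1).mean())

# Hypothetical model outputs on 4 unlabeled examples, 3 classes
probs = np.array([
    [0.7, 0.2, 0.1],
    [0.5, 0.3, 0.2],
    [0.9, 0.05, 0.05],
    [0.6, 0.3, 0.1],
])
est = estimate_accuracy_from_confidence(probs)
assert abs(est - 0.675) < 1e-9  # mean of the per-example confidences
```

No labels are touched: the estimate comes entirely from the model's own uncertainties on unlabeled inputs.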

account_circle
Robin Jia (@robinomial)'s Twitter Profile Photo

How do Transformers really do in-context linear regression? One Transformer layer ≈ three steps of a second-order method! It can't be gradient descent, which converges exponentially more slowly. Meanwhile, LSTMs behave more like online GD; they don't learn second-order optimization (likely due to limited memory).
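The convergence gap the tweet mentions is easy to see numerically. A minimal sketch (not the paper's experiment): on a quadratic least-squares objective, a single Newton step (a second-order method) lands on the exact minimizer, while gradient descent only contracts the error by a constant factor per step and so needs many iterations.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
w_true = rng.normal(size=5)
y = X @ w_true

def loss(w):
    r = X @ w - y
    return 0.5 * r @ r / len(y)

# One Newton step: for a quadratic loss, w - H^{-1} grad is the exact minimizer
H = X.T @ X / len(y)                       # Hessian of the least-squares loss
grad = X.T @ (X @ np.zeros(5) - y) / len(y)
w_newton = np.zeros(5) - np.linalg.solve(H, grad)

# Gradient descent: linear (geometric) convergence, many steps needed
w_gd = np.zeros(5)
lr = 0.1
for _ in range(100):
    w_gd -= lr * (X.T @ (X @ w_gd - y) / len(y))

assert loss(w_newton) < 1e-18              # one second-order step: essentially exact
assert loss(w_gd) > loss(w_newton)         # GD still trails after 100 steps
```

This is why a mechanism implementing a few second-order steps fits the observed Transformer behavior far better than a few steps of GD would.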

account_circle