Jimmy Lin (@lintool) Twitter Tweets • TwiCopy

23 hours ago

Neat... seems like cascade ranking has been rediscovered, Don Metzler ? dl.acm.org/doi/10.1145/20… Lag of roughly a dozen years... yup, sounds about right.

thumb_up_off_alt16

repeat1

account_circle

Terrible timing announcing this today... but bring your #LLaMA3 models over to the TREC 2024 RAG track to tackle the all important question: 'How often should you take your toddler to the potty when potty training?'

thumb_up_off_alt11

repeat2

account_circle

Jimmy Lin

3 days ago

OpenAI, Cohere, Google, and now Snowflake... welcome to the embeddings game! Extra props for models released under Apache 2. snowflake.com/blog/introduci…

thumb_up_off_alt36

repeat4

account_circle

Jimmy Lin

1 month ago

We are on a continual quest to simplify reproducibility. Anserini now allows you to reproduce runs with dense and sparse retrieval models (e.g., on MS MARCO and BEIR) directly from a fatjar, 'installed' via wget. Try it out, let us know what you think! anserini.io

thumb_up_off_alt32

repeat7

account_circle

Manveer Singh Tamber

@ManveerTamber

3 months ago

🚀Thrilled to unveil our work in efficient zero-shot listwise reranking! LiT5 harnesses T5 models to challenge state-of-the-art standards, with significantly smaller models. Discover more: arxiv.org/abs/2312.16098

thumb_up_off_alt16

repeat3

account_circle

Jimmy Lin

3 months ago

The obvious question: How do the latest prompt-decoder LLMs for listwise reranking perform on low-resource languages? For four African languages (Hausa, Somali, Swahili, and Yoruba), Mofe Adeyemi Orochimaru's Demeanour 🧜🏿‍♂️ Ronak Pradeep provide the answer: arxiv.org/abs/2312.16159

thumb_up_off_alt44

repeat7

account_circle

Jimmy Lin

3 months ago

Prompt-decoder LLMs for listwise reranking too large for you? Introducing our new LiT5 family of listwise reranking models: nearly as good but *much* smaller. Yup, T5's still got tricks to offer! arxiv.org/abs/2312.16098

account_circle

Nandan Thakur

@nandan__thakur

3 months ago

🌐 Ever wondered whether LLMs know what they don't know? Does your LLM confidently bullshit answers?

🌏 We introduce NoMIRACL to evaluate LLM robustness in RAG across 18 languages!

❌ GPT-4 can hallucinate answers with a high 33.2% hallucination rate!

📜arxiv.org/abs/2312.11361

account_circle

CIRAL Project

@CiralProject

4 months ago

💥💥 Catch us at #FIRE2023 starting tomorrow!
- Track Overview: 10:15am IST, Dec 15th
- Track Session: 11:00am IST, Dec 16th

We wrapped up CIRAL earlier, and we say a big thank you thank to participating teams 💯🎉🎉

Our public leaderboard is available: ciralproject.github.io/#timeline

thumb_up_off_alt3

repeat2

account_circle

Orochimaru's Demeanour 🧜🏿‍♂️

@theYorubayesian

4 months ago

We've been doing some research on scaling pretraining data and language models for African languages and I'm excited to share our research at EMNLP 2023!

Work done with Mofe Adeyemi Oreva Ahia J.O Abraham Owodunni David Ifeoluwa Adelani 🇳🇬 and Jimmy Lin

Here's a primer:

1 /

account_circle

Jimmy Lin

4 months ago

Dueling LLMs for reranking from my students... trying to remain neutral. Maybe let's just build RankZephyr-wo-GPT.

thumb_up_off_alt7

repeat0