Vinh Q. Tran(@vqctran) 's Twitter Profileg
Vinh Q. Tran

@vqctran

i research language models @Google, all thoughts my own, he/him

ID:974097637564665856

linkhttp://vqtran.github.io calendar_today15-03-2018 01:39:09

111 Tweets

1,2K Followers

282 Following

Oriol Vinyals(@OriolVinyalsML) 's Twitter Profile Photo

Gemini 1.5 has arrived. Pro 1.5 with 1M tokens available as an experimental feature via AI Studio and Vertex AI in private preview.

Then there’s this: In our research, we tested Gemini 1.5 on up to 2M tokens for audio, 2.8M tokens for video, and 🤯10M 🤯 tokens for text. From…

Gemini 1.5 has arrived. Pro 1.5 with 1M tokens available as an experimental feature via AI Studio and Vertex AI in private preview. Then there’s this: In our research, we tested Gemini 1.5 on up to 2M tokens for audio, 2.8M tokens for video, and 🤯10M 🤯 tokens for text. From…
account_circle
Jascha Sohl-Dickstein(@jaschasd) 's Twitter Profile Photo

Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.

account_circle
Ibrahim Alabdulmohsin | إبراهيم العبدالمحسن(@ibomohsin) 's Twitter Profile Photo

And there are many more results in the paper so don’t forget to check it out. This is joint work with my awesome colleagues: Vinh Q. Tran & Mostafa Dehghani. We believe we are scratching the surface and we’ll see more applications of fractals in LLMs in the future
arxiv.org/abs/2402.01825

account_circle
Vinh Q. Tran(@vqctran) 's Twitter Profile Photo

fun fact: I was working on DSI right after being released from the hospital for losing all remaining kidney function -- semantic ids were thought of and implemented from a dialysis chair!

account_circle
Vinh Q. Tran(@vqctran) 's Twitter Profile Photo

Congratulations Yi and Reka on the launch!! Going from zero to the most multimodal model to date in 6 months is insane 😮

account_circle
Mahesh Sathiamoorthy(@madiator) 's Twitter Profile Photo

Our work 'Recommender Systems with Generative Retrieval' got accepted to NeurIPS 😊🎉

Congrats again to my co-authors Shashank Rajput, Nikhil Mehta, Vinh Q. Tran, Yi Tay, jonah, Maciej Kula, Ed H. Chi

Latest version at arxiv.org/abs/2305.05065

account_circle
Rylan Schaeffer(@RylanSchaeffer) 's Twitter Profile Photo

Excited to announce my newest breakthrough project!!

🔥🔥 State-of-the-art results (100%!!) on widely used academic benchmarks (MMLU, GSM8K, HumanEval, OpenbookQA, ARC Challenge, etc.) 🔥🔥

1M param LLM trained on 100k tokens 🤯

How??

Introducing **phi-CTNL**

🧵👇

1/6

Excited to announce my newest breakthrough project!! 🔥🔥 State-of-the-art results (100%!!) on widely used academic benchmarks (MMLU, GSM8K, HumanEval, OpenbookQA, ARC Challenge, etc.) 🔥🔥 1M param LLM trained on 100k tokens 🤯 How?? Introducing **phi-CTNL** 🧵👇 1/6
account_circle
Vinh Q. Tran(@vqctran) 's Twitter Profile Photo

to live, eat, and breathe the new york summer with friends and loved ones, what more could you ask for really

account_circle
Vinh Q. Tran(@vqctran) 's Twitter Profile Photo

'I would never wish to incorporate this technology into my work at all. I strongly feel that this is an insult to life itself.' -miyazaki

account_circle