Guillaume Lample @ ICLR 2024 (@GuillaumeLample)'s Twitter Profile
Guillaume Lample @ ICLR 2024

@GuillaumeLample

Cofounder & Chief Scientist https://t.co/hLfvKLkFHd (@MistralAI). Working on LLMs. Ex @MetaAI | PhD @Sorbonne_Univ_ | MSc @CarnegieMellon | X11 @Polytechnique

ID:806058672619212800

06-12-2016 08:52:18

505 Tweets

36.6K Followers

649 Following

Devendra Chaplot (@dchaplot)

We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1:
- Free to use under Apache 2.0 license
- Outperforms all open models
- Native function calling
- Masters English, French, Italian, German and Spanish.
- Seq_len = 64K

mistral.ai/news/mixtral-8…

Mistral AI (@MistralAI)

magnet:?xt=urn:btih:9238b09245d0d8cd915be09927769d5f7584c1c9&dn=mixtral-8x22b&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%https://t.co/OdtBUsbeV5%3A1337%2Fannounce

Guillaume Lample @ ICLR 2024 (@GuillaumeLample)

Due to an unexpected number of requests, Le Chat is temporarily unavailable. We apologize for the inconvenience -- we are working on getting it back up and running as soon as we can, thanks for your patience!

lmsys.org (@lmsysorg)

[Arena] Exciting update! Mistral Medium has gathered 6000+ votes and is showing remarkable performance, reaching the level of Claude. Congrats Mistral AI!

We have also revamped our leaderboard with more Arena stats (votes, CI). Let us know any thoughts :)

Leaderboard

Devendra Chaplot (@dchaplot)

Proud to announce:
Mixtral 8x7B -- Mixtral of Experts

- Free to use under Apache 2.0 license
- outperforms Llama 2 70B with 6x faster inference.
- matches or outperforms GPT3.5
- masters English, French, Italian, German and Spanish.
- seq_len = 32K

mistral.ai/news/mixtral-o…

1/N

Pierre Stock (@PierreStock)

Mixtral 8x7B is here, 11 weeks only after Mistral 7B. Outperforms Llama 2 70B and GPT 3.5 on most benchmarks, at the inference cost of a 12B dense model, with 32k tokens context size.

Georgi Gerganov (@ggerganov)

Adding support for the new Mixtral models

Runs on CPU, CUDA and Metal with quantization support and partial GPU offloading.

Very interesting architecture to play with!

github.com/ggerganov/llam…
