Tsinghua KEG (THUDM)(@thukeg) 's Twitter Profileg
Tsinghua KEG (THUDM)

@thukeg

#ChatGLM #GLM130B #CodeGeeX #CogVLM #CogView #AMiner The Knowledge Engineering Group (KEG) and THUDM at @Tsinghua_Uni @jietang @ericdongyx

ID:1544212427432022016

linkhttps://github.com/THUDM calendar_today05-07-2022 06:51:56

210 Tweets

4,6K Followers

152 Following

Chenhao Tan(@ChenhaoTan) 's Twitter Profile Photo

I am quite excited about this work! Ideation and hypothesis generation are challenging tasks for scientists. It is quite plausible that LLMs will contribute to the next breakthrough in science.

account_circle
Tsinghua CS(@thudcst) 's Twitter Profile Photo

🌟Dive into the future of AI this summer! DCST presents a comprehensive summer school on . Master the art of pre-training, fine-tuning, and beyond. 🚀Open to all 2nd & 3rd year CS undergrads. 🗓Apply before May 1st! 💡Learn more at ss.cs.tsinghua.edu.cn

🌟Dive into the future of AI this summer! #Tsinghua DCST presents a comprehensive summer school on #LLM. Master the art of pre-training, fine-tuning, and beyond. 🚀Open to all 2nd & 3rd year CS undergrads. 🗓Apply before May 1st! 💡Learn more at ss.cs.tsinghua.edu.cn
account_circle
Turi👨‍💻(@QubeeGen) 's Twitter Profile Photo

GitHub Copilot and ChatGPT competitor from that you probably don't know exist.

1. General Language Model - GLM by Tsinghua KEG (THUDM) & ZhipuAI has new improved GLM-4 model. Unlike OpenAI they have open source model.

GLM-4 explaining components of model architecture from image👇

GitHub Copilot and ChatGPT competitor from #China that you probably don't know exist. 1. General Language Model - GLM by @thukeg & ZhipuAI has new improved GLM-4 model. Unlike @OpenAI they have open source model. GLM-4 explaining components of model architecture from image👇
account_circle
Tsinghua KEG (THUDM)(@thukeg) 's Twitter Profile Photo

OAG-Challenge at KDD Cup 2024 has launched! Welcome to join the competition and address the key challenges in academic graph mining. The use of LLMs is encouraged and a free quota of GLM-4 API is provided.

account_circle
AK(@_akhaliq) 's Twitter Profile Photo

AutoWebGLM

Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

Large language models (LLMs) have fueled many intelligent agent tasks, such as web navigation -- but most existing agents perform far from satisfying in real-world webpages due to three

AutoWebGLM Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Large language models (LLMs) have fueled many intelligent agent tasks, such as web navigation -- but most existing agents perform far from satisfying in real-world webpages due to three
account_circle
AK(@_akhaliq) 's Twitter Profile Photo

ChatGLM-Math

Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Large language models (LLMs) have shown excellent mastering of human language, but still struggle in real-world applications that require mathematical problem-solving. While many

ChatGLM-Math Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Large language models (LLMs) have shown excellent mastering of human language, but still struggle in real-world applications that require mathematical problem-solving. While many
account_circle
Aran Komatsuzaki(@arankomatsuzaki) 's Twitter Profile Photo

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Achieves SotA results for opensource LLMs on GSM8K and MATH by self-critique

arxiv.org/abs/2404.02893

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Achieves SotA results for opensource LLMs on GSM8K and MATH by self-critique arxiv.org/abs/2404.02893
account_circle
SIGKDD 2024(@kdd_news) 's Twitter Profile Photo

🏆 Another exciting news! We announce the second KDD Cup 2024 challenge!
Meta KDD Cup 2024 - CRAG: Comprehensive RAG Benchmark
Let the games begin! 🌟🏃‍♂️

A huge thanks to Meta for making this happen!

aicrowd.com/challenges/met…

account_circle
Tanishq Mathew Abraham, Ph.D.(@iScienceLuvr) 's Twitter Profile Photo

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

abs: arxiv.org/abs/2403.05121

Introduces CogView3, which uses relay diffusion (a variant of cascaded diffusion) in latent space with a 3B U-net and T5 XXL text encoder. Trained with LAION-2B, recaptioned…

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion abs: arxiv.org/abs/2403.05121 Introduces CogView3, which uses relay diffusion (a variant of cascaded diffusion) in latent space with a 3B U-net and T5 XXL text encoder. Trained with LAION-2B, recaptioned…
account_circle
AK(@_akhaliq) 's Twitter Profile Photo

CogView3

Finer and Faster Text-to-Image Generation via Relay Diffusion

Recent advancements in text-to-image generative systems have been largely driven by diffusion models. However, single-stage text-to-image diffusion models still face challenges, in terms of

CogView3 Finer and Faster Text-to-Image Generation via Relay Diffusion Recent advancements in text-to-image generative systems have been largely driven by diffusion models. However, single-stage text-to-image diffusion models still face challenges, in terms of
account_circle
Aran Komatsuzaki(@arankomatsuzaki) 's Twitter Profile Photo

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

The distilled variant of CogView3 achieves comparable performance while only utilizing 1/10 of the inference time by SDXL

arxiv.org/abs/2403.05121

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion The distilled variant of CogView3 achieves comparable performance while only utilizing 1/10 of the inference time by SDXL arxiv.org/abs/2403.05121
account_circle
Jan Schnyder(@xShnippi) 's Twitter Profile Photo

Bill Yuchen Lin 🤖 AK Hugging Face Allen Institute for AI UCSB NLP Group Waterloo's Cheriton School of Computer Science Sick!! CogAgent might also be interesting to add (huggingface.co/THUDM/cogagent…). Its an improved version based on CogVLM by Tsinghua KEG (THUDM) . IMO currently the best open-source MLLM one can get, the people at THUDM are cooking 🧑‍🍳

account_circle
Christopher Manning(@chrmanning) 's Twitter Profile Photo

🏅 To me, this feels more like the kind of neural model interpretability research we should be doing than much of the recent work on interpretability of transformer models.

account_circle
Adina Yakup(@AdeenaY8) 's Twitter Profile Photo

LongAlign - A recipe for long context alignment of LLM introduced by Tsinghua KEG (THUDM) 🔥

✨ With LongAlign-10k dataset to support
✨ Outperforms existing LLM recipes by up to 30%, while maintaining the ability to handle short, general tasks

Model huggingface.co/THUDM/LongAlig…
Paper…

account_circle