@slashML
ID:806206253634457600
06-12-2016 18:38:44
15.3K Tweets
121.3K Followers
1 Following
Brian Cheung
This is my hat, there are many like it, but this one is mine. @MIT_CSAIL 🧢 | @berkeley_ai 🎓 | Google B̶r̶a̶i̶n̶ DeepMind 🎩
4 weeks ago
CRISPR-GPT: An LLM Agent for Automated Design of Gene-Editing Experiments reddit.com/r/MachineLearn…
NExT: Teaching Large Language Models to Reason about Code Execution reddit.com/r/MachineLearn…
How much coursework is required to land an entry-level ML job? reddit.com/r/MachineLearn…
Foundational papers for Graph Adversarial Learning? reddit.com/r/MachineLearn…
Do Leads in an AI/DS/ML team always have PhDs, is it a requirement? reddit.com/r/MachineLearn…
Dynamic Gaussians Mesh reddit.com/r/MachineLearn…
You need everything other than ML to win an ML hackathon reddit.com/r/MachineLearn…
1 month ago
How would you diagnose these spikes in the training loss? reddit.com/r/MachineLearn…
Real talk about RAG reddit.com/r/MachineLearn…
Llama-3 based OpenBioLLM-70B & 8B: Outperforms GPT-4, Gemini, Meditron-70B, Med-PaLM-1 & Med-PaLM-2 in Medical-domain reddit.com/r/MachineLearn…
Mathematical aspects of tokenization reddit.com/r/MachineLearn…
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey reddit.com/r/MachineLearn…
LLMs: Why does in-context learning work? What exactly is happening from a technical perspective? reddit.com/r/MachineLearn…
What are your horror stories from being tasked with impossible ML problems? reddit.com/r/MachineLearn…
Why transformers are not trained layer-wise? reddit.com/r/MachineLearn…
Why would such a simple sentence break an LLM? reddit.com/r/MachineLearn…
Meta does everything OpenAI should be reddit.com/r/MachineLearn…
How to and Deploy LLaMA 3 Into Production, and Hardware Requirements reddit.com/r/MachineLearn…
Llama-3 may have just killed proprietary AI models reddit.com/r/MachineLearn…
Why isn't GNN in high demand in industry? reddit.com/r/MachineLearn…