Spencer Schiff(@SpencerKSchiff) 's Twitter Profileg
Spencer Schiff

@SpencerKSchiff

21 | Anarcho-capitalist, tech enthusiast, optimist. The only thing that matters is AGI

ID:956298928076611584

calendar_today24-01-2018 22:53:26

11,9K Tweets

130,7K Followers

285 Following

Jim Fan(@DrJimFan) 's Twitter Profile Photo

The moat of software AI agents is not the thin wrapper layer (Devin, SWE-Agent), but the underlying LLM. Instead of benchmarking the wrapper, I think SWE-Bench is excellent for evaluating coding LLMs instead:

Hold the agent layer fixed and vary only the LLM backend. Provide all…

The moat of software AI agents is not the thin wrapper layer (Devin, SWE-Agent), but the underlying LLM. Instead of benchmarking the wrapper, I think SWE-Bench is excellent for evaluating coding LLMs instead: Hold the agent layer fixed and vary only the LLM backend. Provide all…
account_circle
xlr8harder(@xlr8harder) 's Twitter Profile Photo

This is the first music generation AI that has music quality high enough I could actually listen to it. Incredible.

account_circle
varepsilon(@var_epsilon) 's Twitter Profile Photo

the big labs are gonna keep one upping each other until someone casually ships AGI with a name like 'gpt-5-ultra-2025-preview' or 'claude-5-symphony-20241019'

account_circle
Sully(@SullyOmarr) 's Twitter Profile Photo

Ok so from really early tests the new gpt4 definitely feels better at coding

Less lazy, more willing to write code. Was able to give it a few file, and it wrote perfect code (very uncommon before)

Might be switching away from opus.(gpt4 is cheaper & works better with cursor)

account_circle
Alex Volkov (Thursd/AI)(@altryne) 's Twitter Profile Photo

So just to recap this 🔥 Tuesday:

- Google refreshes 1.5 Pro, with sound understanding
- Google releases Gemma Code 3 models
- OpenAI gave us GA for vision + updated model
and now
Mistral AI doing torrents again, with 170B MoE

This is a day after cohere beat GPT-4 🚀

So just to recap this 🔥 Tuesday: - Google refreshes 1.5 Pro, with sound understanding - Google releases Gemma Code 3 models - OpenAI gave us GA for vision + updated model and now @MistralAI doing torrents again, with 170B MoE This is a day after @cohere beat GPT-4 🚀
account_circle
Harris Rothaermel(@DeveloperHarris) 's Twitter Profile Photo

does anyone have any examples of the new gpt-4-turbo model and how it's better?

supposedly better at reasoning/math but haven't had a chance to test this yet

account_circle