-
Grok โ v4.2Stronger reasoning for math and multi-step logic, with more consistent step-by-step problem solving. Better long-form conversation stability, with fewer abrupt topic jumps and less mid-answer cutoff. Smarter context handling through heavier summarization of earlier turns to stay focused in long chats. More reliable performance on long prompts, with improved coherence across extended back and forth. -
รlvaro Sรกnchez Romรกn๐ 110 karmaOct 18, 2024@NotebookLMAccuracy nice. Free -
AI-powered academic search: find and understand science faster.Open
Tried using it to get answers for a few questions, but what really impressed me was how many sources it pulled in. Didnt expect that, but it actually turned out to be super useful. Nice tool :) -
Gemini โ v3.1 ProUpgraded Pro-tier reasoning for harder multi-step problems. Better handling of novel logic patterns and unfamiliar reasoning setups. Stronger โintelligence appliedโ output quality, producing clearer, more useful explanations and syntheses. More capable support for agentic workflows and longer-horizon task execution. Rolling out as the new Pro model across Gemini app, NotebookLM, Gemini API, and Vertex AI. -

-
Claude โ v4.6Claude Sonnet 5 Release and access: ships as claude-sonnet-5-20260203, available via Anthropic API, Claude Pro, and Google Vertex AI. Coding benchmark jump: posts 82.1% SWE-Bench Verified, positioned above Claude Opus 4.5 at 80.9%. Pricing reset: priced at $3 per 1M input tokens and $15 per 1M output tokens, framed as a major cost drop vs Opus 4.5. Massive context: 1,000,000-token context window for repository-level understanding, whole-codebase prompts, and large refactors without chunking. Agentic engineering stack: built for multi-step workflows with built-in code execution and self-correction, plus a โDev Teamโ mode that spawns parallel sub-agents for implementation, testing, and review. -

