
DiffusionGemma
Google's experimental diffusion-based AI model is now open-source and incredibly fast, generating text at over 500 tokens per second. This release makes cutting-edge, high-speed AI accessible for developers and businesses...
100+ sources. 5 proxy signals. Zero noise tolerance. The pipeline filters. The analyst decides. What reaches this page earned its place.

Google's experimental diffusion-based AI model is now open-source and incredibly fast, generating text at over 500 tokens per second. This release makes cutting-edge, high-speed AI accessible for developers and businesses...

Sakana AI is establishing a lab to create AI that can autonomously rewrite and improve its own code. This 'Recursive Self-Improvement' aims to achieve frontier intelligence without needing massive, expensive...

Anthropic has released a free tool that helps developers find security flaws in their code using AI. It provides a standardized way to test how AI can identify and fix...

A Stanford Law study found that AI can outperform human law professors on specific legal tasks. This signals a major leap in AI's ability to handle complex professional reasoning.

IBM Research argues that AI needs 'Agent Logic'—structured guides like knowledge graphs—to be truly reliable for business. This approach prevents AI hallucinations and drastically lowers the cost of running complex...

A new benchmark evaluates how well the latest AI models, including GPT-5.5 and Cursor, can actually fix real-world software bugs. This moves beyond theoretical tests to show which tools can...

AI removed the execution bottleneck. It created a verification bottleneck. When output scales 10x and review time scales 100x, automation becomes a net loss.

AI agents can now optimize radiology workflows by analyzing case complexity and radiologist fatigue. This helps prevent the habit of 'cherry-picking' easy cases, ensuring faster diagnosis for complex patients.

Cohere's new Command A+ model is a breakthrough in efficiency, ranking as the fastest model with the lowest error rate. It is an open-weights model, meaning businesses can deploy it...

OpenAI's AI has successfully solved a complex mathematical problem that had previously stumped humans. This demonstrates a significant leap in the AI's ability to perform deep, autonomous logical reasoning.

Major AI chatbots including ChatGPT, Gemini, and Claude are struggling to provide accurate information on elections and geopolitics. A new study warns that these tools are currently unreliable for news-related...

Google shared results from studies in Italy and Sierra Leone showing Gemini improves student math skills and teacher productivity. These findings highlight the real-world efficacy of AI in personalized education.

A new memory system called Exabase M-1 has topped benchmarks for AI 'long-term memory' using a small, efficient model. This allows AI to recall specific details from massive amounts of...

A new research method allows AI to 'think' in short bursts only when necessary, rather than using long reasoning blocks for every response. This aims to provide high-quality answers with...

Researchers observed AI agents in a virtual town for 15 days. Claude agents autonomously created a democratic system, while Gemini agents prioritized emotional and social bonds.

A new research method allows AI models to generate text up to 7.8 times faster without changing the quality of the output. It achieves this by changing how the model...

A new study indicates that most popular AI chatbots leak user prompts and conversation IDs to third-party advertising and analytics tools. Some tools even captured readable parts of private prompts...

Artificial Analysis launched a benchmark index measuring how AI model and harness combinations perform on real coding tasks, cost, and token usage. The data shows a 30x spread in cost...