Neural Stream
AI News
- Meta debuts Muse Spark, closing the open-source chapter
- Anthropic restricts Claude Mythos to select partners via Project Glasswing
- Google DeepMind releases Gemma 4 under Apache 2.0
- OpenAI releases GPT-5.4 with native computer use
- DeepSeek V4 reshapes enterprise AI economics
- Claude 3.5 Sonnet announcement by Anthropic
- GPT-4o multimodal model announcement by OpenAI
- FineWeb: 15 trillion tokens of high quality web data
- Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
- LLama 3 released by Meta
- New Claude 3 model released by Anthropic
Microsoft AI
- Data Formulator 0.7: AI-powered data analytics for enterprise data
- Extending Human Intelligence Through AI
- MagenticLite, MagenticBrain, Fara1.5: An agentic experience optimized for small models
- Vega: Zero-knowledge proofs for digital identity in the age of AI
- Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability
Nvidia AI
- NVIDIA Research Advances Robotics From Simulation to the Real World
- The Name’s Gaming … Cloud Gaming: ‘007 First Light’ Launches on GeForce NOW
- AI Factories: The New Infrastructure of Intelligence
- NVIDIA Vera CPU Is ‘Packing a Heavy-Hitting Punch’ Against Competition
- NVIDIA GTC Taipei at COMPUTEX: Live Updates on What’s Next in AI
Deep Mind
Amazon AI
- Comprehensive observability for Amazon SageMaker AI LLM inference: From GPU utilization to LLM quality
- Training Azerbaijani language models on Amazon SageMaker AI
- Build a custom portal with embedded Amazon SageMaker AI MLflow Apps
- Streamline external access to Amazon SageMaker MLflow using a REST API proxy
- Evaluating Deep Agents using LangSmith on AWS
Arxiv AI Papers
- Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software
- VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
- LLMSurgeon: Diagnosing Data Mixture of Large Language Models
- SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations
- Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection
- Unlocking the Working Memory of Large Language Models for Latent Reasoning
- GPIC: A Giant Permissive Image Corpus for Visual Generation
- Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents
- Demystifying Data Organization for Enhanced LLM Training
- Reasoning with Sampling: Cutting at Decision Points