Neural Stream
AI News
arXiv AI Papers
- MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
- Generalization in LLM Problem Solving: The Case of the Shortest Path
- Diagnosing LLM Judge Reliability: Conformal Prediction Sets and Transitivity Violations
- How Do LLMs and VLMs Understand Viewpoint Rotation Without Vision? An Interpretability Study
- AD4AD: Benchmarking Visual Anomaly Detection Models for Safer Autonomous Driving
- Why Do Vision Language Models Struggle To Recognize Human Emotions?
- Prism: Symbolic Superoptimization of Tensor Programs
- SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation
- CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM Agents in Social Dilemmas
- Stability and Generalization in Looped Transformers
Microsoft AI
- New Future of Work: AI is driving rapid change, uneven benefits
- Ideas: Steering AI toward the work future we want
- ADeLe: Predicting and explaining AI performance across tasks
- AsgardBench: A benchmark for visually grounded interactive planning
- GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Nvidia AI
- No Need for Space Gear — Capcom’s ‘PRAGMATA’ Joins GeForce NOW on Launch Day
- Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters
- New Adobe Premiere Color Grading Mode Accelerated on NVIDIA GPUs
- National Robotics Week — Latest Physical AI Research, Breakthroughs and Resources
- Strength and Destiny Collide: ‘Samson: A Tyndalston Story’ Arrives in the Cloud
DeepMind
- Gemini 3.1 Flash TTS: the next generation of expressive AI speech
- Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning
- Gemma 4: Byte for byte, the most capable open models
- Gemini 3.1 Flash Live: Making audio AI more natural and reliable
- Protecting people from harmful manipulation
Amazon AI
- Introducing granular cost attribution for Amazon Bedrock
- Optimize video semantic search intent with Amazon Nova Model Distillation on Amazon Bedrock
- Power video semantic search with Amazon Nova Multimodal Embeddings
- Nova Forge SDK series part 2: Practical guide to fine-tune Nova models using data mixing capabilities
- From hours to minutes: How Agentic AI gave marketers time back for what matters
Google AI
- Generative AI to quantify uncertainty in weather forecasting
- AutoBNN: Probabilistic time series forecasting with compositional Bayesian neural networks
- Computer-aided diagnosis for lung cancer screening
- Using AI to expand global access to reliable flood forecasts
- ScreenAI: A visual language model for UI and visually-situated language understanding