Latest articles
After analyzing 1,000 papers, an Oxford University team finds that AI's thinking process cannot be trusted
2025-07-11

AI finally learns to remember: a Nanyang Technological University team's breakthrough gives virtual worlds lasting memory

A research team at Nanyang Technological University has developed the WorldMem framework, which gives AI genuine long-term memory for the first time and addresses the consistency problem in virtual world simulation. The system stores past scenes in a memory bank and uses a retrieval mechanism to accurately reproduce earlier scenes and events, even after long intervals. Experiments show that it performs well in Minecraft and in real-world scenes, with broad application prospects in games, autonomous driving, robotics, and other fields.
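The store-and-retrieve idea in this summary can be illustrated with a minimal sketch. It assumes that each stored scene is tagged with a 3D position and a timestep and that retrieval picks the entries closest to the current position; the class names, fields, and distance metric are hypothetical and are not taken from WorldMem itself.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class MemoryEntry:
    """One remembered scene (hypothetical structure for illustration)."""
    position: Tuple[float, float, float]  # where the scene was observed
    timestep: int                         # when it was observed
    frame_id: str                         # placeholder for the stored frame


class MemoryBank:
    """Toy memory bank: append-only storage plus nearest-position retrieval."""

    def __init__(self) -> None:
        self.entries: List[MemoryEntry] = []

    def store(self, entry: MemoryEntry) -> None:
        self.entries.append(entry)

    def retrieve(self, position: Tuple[float, float, float], k: int = 1) -> List[MemoryEntry]:
        # Rank every stored entry by Euclidean distance to the query position.
        ranked = sorted(self.entries, key=lambda e: math.dist(e.position, position))
        return ranked[:k]


bank = MemoryBank()
bank.store(MemoryEntry((0.0, 0.0, 0.0), 0, "spawn_point"))
bank.store(MemoryEntry((50.0, 0.0, 10.0), 400, "village"))
bank.store(MemoryEntry((1.0, 0.0, 1.0), 900, "spawn_point_revisited"))

# Returning near the spawn point recalls both nearby scenes, however old they are.
nearest = bank.retrieve((0.5, 0.0, 0.0), k=2)
print([e.frame_id for e in nearest])
```

In this toy version the age of an entry does not affect retrieval, which is what lets a scene be recalled after a long interval.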

 AWS strengthens infrastructure strategy and comprehensively upgrades SageMaker to cope with AI competition

AWS is strengthening its market position by upgrading the SageMaker machine learning platform with observability features, connected coding environments, and GPU cluster performance management. Facing fierce competition from Google and Microsoft, AWS is focusing on providing AI infrastructure for enterprises. The new SageMaker features give deeper insight into the causes of model performance degradation, give developers more control over computing resources, and support connecting from a local IDE for deployment. The updates are driven mainly by customer requests and are intended to solve practical problems in AI model development.

MTS AI launches an "intelligent programming assistant" that lets AI write code the way a writer writes a novel

The MTS AI research team proposed the RewardRanker system, which significantly improves the quality of AI code generation through a reranking model and iterative self-training. With this method, a 13.4B-parameter model surpasses a 33B model, performs well across many programming languages, and even outperforms GPT-4 on C++. By introducing hard negative samples and PPO optimization, the system can select the best solution from multiple code candidates, laying a foundation for practical AI programming assistants.
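Selecting the best of several candidates can be sketched in a few lines. The example below is only an illustration of reranking in general: `toy_score` is a hypothetical stand-in for a learned reward model (it merely checks whether a Python snippet compiles) and is not RewardRanker's actual scorer.

```python
from typing import Callable, List


def rerank(candidates: List[str], score: Callable[[str], float]) -> List[str]:
    """Return the candidates ordered from highest to lowest score."""
    return sorted(candidates, key=score, reverse=True)


def toy_score(code: str) -> float:
    """Hypothetical reward: 1.0 if the snippet is valid Python syntax, else 0.0."""
    try:
        compile(code, "<candidate>", "exec")
    except SyntaxError:
        return 0.0
    return 1.0


candidates = [
    "def add(a, b) return a + b",        # missing colon, does not compile
    "def add(a, b):\n    return a + b",  # valid
]

best = rerank(candidates, toy_score)[0]
print(best)
```

A real reranker would replace `toy_score` with a trained model, but the selection step stays the same: score every candidate, then keep the top one.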

How BigQuery integrates data and AI to drive business transformation

AI has the potential to transform enterprise insight, but success depends on data quality. Most AI project failures stem from messy, scattered data rather than from algorithm limitations. Google's BigQuery, a cloud data and AI platform, breaks down data silos, simplifies governance, and accelerates enterprise AI applications. AI-automated data processing enables real-time analysis, and deep integration with Vertex AI lets enterprises efficiently process structured and unstructured data, turning intelligent business from vision into reality.

A breakthrough in "instruction following" for multimodal large models: a Shanghai AI Laboratory team enables AI to understand the requirements of visual tasks as accurately as humans do

Shanghai AI Laboratory, together with several other institutions, launched the MM-IFEngine system, which targets the "instruction following" problem in multimodal AI. The system automatically generates complex image-instruction training data and creates the MM-IFEval evaluation benchmark, with 400 questions and 32 constraint types. Experiments show that models trained this way improve their instruction-following ability by more than 10% and become significantly better at understanding and executing complex requirements while retaining their original capabilities.

Kioxia UFS 4.1 flash memory promises to improve AI application performance

Kioxia is sampling its latest UFS v4.1 embedded flash memory chip, designed for smartphones and tablets, which can provide faster download speeds and smoother on-device AI application performance. The chip uses 218-layer TLC 3D NAND technology and comes in 256GB, 512GB, and 1TB capacities. Compared with v4.0 products, random write performance improves by about 30%, random read performance by 35-45%, and power efficiency by 15-20%. The new standard also adds host-initiated defragmentation, enhanced exception handling, and other features.

Breakthrough UCLA research: a new era of multi-turn attack and defense in AI dialogue

This research is the first to systematically reveal the serious threat that multi-turn dialogue attacks pose to AI security, and it introduces the X-Teaming attack framework and the XGuard-Train safety dataset. The study shows that current AI systems fail to stop carefully designed multi-turn attacks as much as 98% of the time, but that this risk can be significantly reduced with the newly built large-scale training dataset, which gives AI security work an important tool and new ideas.

 Google Firebase Studio Launches Agent Mode to Realize Automatic Programming

Google released an update to Firebase Studio at its Cloud Summit in London, adding Gemini CLI integration, Model Context Protocol (MCP) support, and an "agent mode". Agent mode offers three levels of AI collaboration: a conversational "Ask" mode for brainstorming, a human-in-the-loop agent that requires developers to confirm code changes, and a nearly autonomous agent. Although Google says millions of applications have been built on the platform, it still requires carefully designed prompts, and non-engineers cannot directly create mature applications with it.

When AI has permanent memory: MemOS, created by a Shanghai Jiao Tong University team, lets large models say goodbye to "amnesia"

A team at Shanghai Jiao Tong University has developed the MemOS memory operating system, which gives AI genuine long-term memory. The system manages three types of memory in a unified way (parametric memory, activation memory, and plaintext memory), and uses the MemCube unit to handle memory life-cycle management and conversion between types. On the LOCOMO benchmark, MemOS achieved the best results in all reasoning tasks, especially multi-hop reasoning and temporal reasoning.

 Google adds image to video generation function for Veo 3

Google announced on Thursday that it is adding image-to-video generation to its Veo 3 AI video generator through the Gemini app. The feature was previously available in Flow, the AI video tool launched at the I/O developer conference in May. Veo 3 video generation is currently available in more than 150 countries, only to Google AI Ultra and Pro subscribers, with a daily limit of 3 videos. Users can upload a photo and add a description of the audio to generate a video. Since the release 7 weeks ago, users have created more than 40 million videos, all of which carry both visible and invisible digital watermarks.

A major breakthrough from Shanghai AI Laboratory: ordinary cameras can capture ultra-high-speed slow motion, and 4D reconstruction technology upends traditional photography

A team at Shanghai AI Laboratory proposed an asynchronous capture scheme that achieves high-speed 4D reconstruction using only ordinary cameras. By staggering the cameras' start times, the method raises the effective frame rate from 25 FPS to 100-200 FPS, and it uses a video diffusion model to repair the reconstruction artifacts caused by sparse viewpoints. Experimental results show that the new method significantly outperforms existing techniques on fast-moving scenes, opening a new path to low-cost, high-quality 4D content creation.
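The frame-rate arithmetic behind staggered capture can be sketched as follows, assuming the cameras are split into groups whose start times are offset by equal fractions of one frame period. The function names and the group counts (4 and 8) are illustrative assumptions chosen to match the 25 FPS to 100-200 FPS range quoted above; they are not details taken from the paper.

```python
from typing import List


def effective_fps(base_fps: float, num_groups: int) -> float:
    """Effective temporal sampling rate when groups are evenly staggered."""
    return base_fps * num_groups


def staggered_timestamps(base_fps: float, num_groups: int, frames_per_camera: int) -> List[float]:
    """Capture times (in seconds) across all groups, merged and sorted."""
    frame_period = 1.0 / base_fps        # time between frames of one camera
    offset = frame_period / num_groups   # start delay between adjacent groups
    stamps = [
        group * offset + frame * frame_period
        for group in range(num_groups)
        for frame in range(frames_per_camera)
    ]
    return sorted(stamps)


print(effective_fps(25, 4))  # 4 staggered groups of 25 FPS cameras
print(effective_fps(25, 8))  # 8 staggered groups of 25 FPS cameras

# With 4 groups, consecutive captures are one quarter of a frame period apart.
stamps = staggered_timestamps(25, 4, frames_per_camera=2)
gaps = [b - a for a, b in zip(stamps, stamps[1:])]
print(gaps)
```

The trade-off is that at any instant only one group is capturing, so each moment is seen from fewer viewpoints, which is the sparse-view problem the summary says the video diffusion model is used to repair.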
