Google Introduces Neural Long-Term Memory Module to Break the Long-Sequence Bottleneck in Large Models

Google researchers have introduced a neural long-term memory module, the core of the Titans architecture, to address the problems Transformers face on long sequences: attention dilution, degraded performance, and heavy VRAM use. The module is itself a deep neural network whose weights are updated at runtime, and it uses a “surprise” mechanism to decide what to remember, loosely analogous to human memory. Google designed three ways to integrate it: MAC (Memory as Context) feeds the memory output in as additional context tokens to strengthen long-range recall; MAG (Memory as Gate) combines memory with attention through a nonlinear gating mechanism; MAL (Memory as Layer) inserts the memory module directly as a network layer. Experiments show clear gains on “needle in a haystack” tests, which could benefit large language models in long-text processing and knowledge base retrieval. Gemini’s current 1M-token context is already sufficient for many uses, but the prospect of scaling to 10M tokens opens up significant opportunities for the AI industry.
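To make the “surprise” idea concrete, here is a minimal sketch of a memory whose weights change at runtime. It is a toy under stated assumptions, not Google's implementation: the memory is a single linear map rather than a deep network, surprise is taken to be the squared prediction error on a key/value pair, and the class name `ToySurpriseMemory` and the parameters `lr` and `decay` are hypothetical.

```python
class ToySurpriseMemory:
    """Toy linear associative memory that is updated at inference time.

    Assumption: "surprise" is the squared error between what the memory
    recalls for a key and the value actually observed. The real module
    described in the article is a deep network; this is a one-layer stand-in.
    """

    def __init__(self, dim, lr=0.25, decay=0.0):
        self.dim = dim
        self.lr = lr          # hypothetical step size for surprise-driven writes
        self.decay = decay    # hypothetical forgetting rate (0.0 = never forget)
        self.weights = [[0.0] * dim for _ in range(dim)]

    def recall(self, key):
        """Return the value the memory currently associates with `key`."""
        return [sum(w * k for w, k in zip(row, key)) for row in self.weights]

    def surprise(self, key, value):
        """Squared prediction error: large when the pair is unexpected."""
        predicted = self.recall(key)
        return sum((p - v) ** 2 for p, v in zip(predicted, value))

    def write(self, key, value):
        """Take one gradient step on the prediction error and return the
        surprise measured before the update. An unexpected pair produces a
        large gradient and a large weight change; a familiar pair barely
        moves the weights."""
        predicted = self.recall(key)
        error = [p - v for p, v in zip(predicted, value)]
        for i in range(self.dim):
            for j in range(self.dim):
                grad = 2.0 * error[i] * key[j]
                self.weights[i][j] = (1.0 - self.decay) * self.weights[i][j] - self.lr * grad
        return sum(e * e for e in error)


if __name__ == "__main__":
    memory = ToySurpriseMemory(dim=3)
    key, value = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]
    for step in range(4):
        # Surprise shrinks as the same association is seen repeatedly.
        print(step, memory.write(key, value))
```

Because the update is proportional to the error, the memory writes strongly on novel inputs and weakly on familiar ones, which is one simple way to read “selectively remembers.” In a MAC-style setup, the output of `recall` would be supplied to the model as extra context; that wiring is omitted here.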

Original link: Linux.do
