Conference DDLs
ccfddl.comSOTA Models
arena.aiIndustry News
The Future of Addictive Design + Going Deep at DeepMind + HatGPT
At the Pentagon, OpenAI is In and Anthropic Is Out · February 27, 2026 • 1 ... Claude's Code: Anthropic Leaks Source Code for A.I. Software Engineering
Silicon Valley Is in a Frenzy Over Bots That Build Themselves
Sam Altman says that by 2028, OpenAI plans to have developed a fully “automated AI researcher.” By then, “we are pretty confident we will have systems that can
I lived through Google's AI-military crisis. Here's why engagement still ...
Deep collaboration between AI companies and the military is harder than withdrawal and harder than compliance. It's also the best option.
Google DeepMind Releases Gemma 4: Its Most Capable Open AI Model ...
The American technology company Google has announced Gemma 4, a new generation of open models designed for advanced reasoning and agentic workflows.
Retrieval-Augmented Generation (RAG) systems enhance the performance of large language models (LLMs) by incorporating supplementary retrieved documents,...
In this work, we propose RRPO to address the fundamental misalignment between static retrieval metrics and the dynamic needs of LLM readers in RAG systems....
In this work, we propose RRPO to address the fundamental misalignment between static retrieval metrics and the dynamic needs of LLM readers in RAG systems....
Prior works explore various strategies which can enable agents to synergise agents' actions and optimize overall system reasoning and problem-solving...
Recent advances in Large Language Model (LLM)-based Multi-Agent Systems (Li et al., 2024) have enhanced Retrieval-Augmented Generation (RAG) (Lewis et al., 2020...
We demonstrate this approach on a GPU-FPGA system by offloading sparse, irregular, and memory-bounded operations to FPGAs while retaining compute-intensive operations on GPUs. Evaluated on an AMD MI210 GPU and an Alveo U55C FPGA, our system is 1.04∼2.2×1.04\sim 2.2\times faster a...
In this paper, we introduce MemBoost, a memory-boosted architecture with three components: (1) an Associative Memory Engine (AME) that performs fast semantic retrieval and supports write-back of newly generated answers; (2) a high-capability Large-LLM Oracle that provides an accu...
We demonstrate MiCP on adaptive RAG and ReAct, where it achieves the target coverage on both single-hop and multi-hop question answering benchmarks while...