WisPaper
WisPaper
学术搜索
学术问答
价格
TrueCite
AAC: Bridging the Gap Between Differentiable Learning and Search Admissibility
总结
问题
方法
结果
要点
摘要

The paper introduces AAC (Architecturally Admissible Compressor), a differentiable landmark selection module for the ALT (A*, Landmarks, Triangle inequality) shortest-path framework. By using a row-stochastic Gumbel-softmax architecture, it yields an admissible heuristic for any parameter setting, matching state-of-the-art expansion reduction while enabling end-to-end training.

TL;DR

AAC (Architecturally Admissible Compressor) is a novel neural module that makes the selection of "landmarks" in A* search differentiable. Unlike previous neural heuristics that "hope" to be admissible through loss penalties, AAC is admissible by architecture. It guarantees optimal paths at any point during training and reduces to a classical, efficient ALT heuristic at deployment, achieving up to 1.5x speedups in real-world road networks.

The Problem: The Admissibility vs. Differentiability Paradox

In the world of shortest-path planning, ALT (A*, Landmarks, and Triangle inequality) is a gold standard for admissible heuristics. It precomputes distances to key "landmarks" to provide a lower bound on travel costs.

However, the field has been split into two camps:

  1. Classical Algorithms: Use Farthest-Point Sampling (FPS) to pick landmarks. They are provably safe (admissible) but "blind"—they don't adapt to specific traffic patterns or query clusters.
  2. Neural Methods: Learn powerful heuristics backpropagated from data. They are adaptive but "unsafe"—they often overestimate distances, leading to suboptimal paths, and enforcing admissibility usually requires clunky loss-function "hacks" that aren't 100% reliable.

Methodology: Admissibility by Construction

The author, An T. Le, asks: Can we make selection itself differentiable and admissible by architecture rather than by loss?

AAC solves this by posing selection as a row-stochastic compression problem. It starts with a large pool of "teacher" landmarks and uses a Gumbel-softmax layer to learn which ones are most useful.

The Secret Sauce: AAC relies on a classical insight from Judea Pearl (1984): any convex combination (a weighted average where weights sum to 1) of admissible heuristics is itself admissible. Because AAC's selection matrix is row-stochastic, the output is guaranteed to be a valid lower bound—no matter how messy the training or how unexpected the query.

Model Architecture In directed graphs, AAC uses dual matrices to handle forward and backward distances independently.

Experiments: Hitting the "FPS Ceiling"

The paper introduces a rigorous matched-memory protocol, comparing AAC to the industry-standard FPS-ALT by ensuring both use the exact same number of bytes per vertex.

Key Findings:

  • The FPS Ceiling: On road networks, the standard FPS strategy is remarkably hard to beat. The paper proves (Theorem 5) that FPS achieves near-optimal coverage, leaving only about 1-4% of "headroom" for any neural selector to improve upon.
  • Speed Advantage: Even when AAC expands slightly more nodes than FPS, it is often 1.2x to 1.5x faster in wall-clock time. This is due to its contiguous memory layout, which plays nicely with modern CPU caches.
  • Zero Violations: Across 1,500+ queries and multiple datasets (DIMACS, OSMnx), AAC recorded zero admissibility violations, even during early training or after training divergence.

Performance Comparison Pareto frontier showing AAC tracking the FPS-ALT performance curve closely while remaining differentiable.

Deep Insight: Training Drift

A fascinating finding in the paper is the "Binding Constraint." It turns out the architecture could perfectly replicate FPS, but standard training often drifts away from the optimal FPS subset. This suggests that the next frontier in neural search isn't just about different model types, but better initialization and optimization strategies that respect the geometry of the search space.

Conclusion

AAC provides a "best of both worlds" solution. It gives researchers a differentiable pathway to integrate search heuristics into larger neural pipelines (like Warcraft terrain cost prediction) while giving engineers the "peace of mind" of a hard-coded admissibility certificate.

As AI moves deeper into safety-critical domains like autonomous robotics and logistics, architectures that are "Safe-by-Design" like AAC will likely become the new standard.

发现相似论文

试试这些示例

  • Search for recent papers on "Architecturally Admissible" or "Constrained Hypothesis Class" approaches to learning search heuristics beyond shortest-path problems.
  • Who first proposed the "Convex Combination of Admissible Heuristics" theorem, and how has this principle been applied in modern neural A* research?
  • Investigate how differentiable landmark selection methods like AAC can be adapted for dynamic graphs where edge costs change frequently (e.g., traffic routing or robotic motion planning).
目录
AAC: Bridging the Gap Between Differentiable Learning and Search Admissibility
1. TL;DR
2. The Problem: The Admissibility vs. Differentiability Paradox
3. Methodology: Admissibility by Construction
4. Experiments: Hitting the "FPS Ceiling"
4.1. Key Findings:
5. Deep Insight: Training Drift
6. Conclusion