## A Survey of LLM-based Deep Search Agents & Adaptive Path Planning via Weighted A* and Heuristic Rewards
When I first read these two papers, my immediate thought was how closely they relate to the concepts we learn in our Artificial Intelligence course, especially search algorithms and intelligent agents. In class we usually study algorithms like BFS, DFS, Best-First Search, and A* using small graph examples. At first these problems can feel very academic. However, while reading these papers, I realized that the same ideas are actively being extended and used in modern AI systems, especially when combined with Large Language Models (LLMs).
Both papers approach the idea of intelligent search and planning, but from different angles. One focuses on how LLM-based agents perform deep search, while the other proposes improvements to classical path-planning algorithms using Weighted A* and heuristic rewards.
### Paper 1: A Survey of LLM-based Deep Search Agents
The goal of this paper is to review and analyze how Large Language Models can act as reasoning agents that perform deep search over possible solutions. Traditional search algorithms explore a state space systematically, but LLM-based agents introduce the ability to reason about the search process itself.
Instead of blindly expanding nodes, these agents can:
- Plan multi-step solutions
- Evaluate intermediate results
- Decide which search branch is more promising
This connects strongly to the agent models we study in AI. In our coursework, we learn about:
- Simple reflex agents
- Model-based agents
- Goal-based agents
- Utility-based agents
LLM-based deep search agents resemble goal-based and utility-based agents, because they evaluate possible actions and choose those that move closer to the goal.
For example, when solving complex reasoning tasks, an LLM agent can:
- Break a problem into smaller steps
- Generate candidate solutions
- Evaluate which branch is most promising
- Continue searching in that direction
This resembles Best-First Search, but guided by language-based reasoning instead of a purely mathematical heuristic.
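To make the analogy concrete, here is a minimal Best-First Search sketch where the scoring function is pluggable. In the toy example below, the `promise` table is a stand-in for the kind of promise estimate an LLM agent would produce through reasoning; the graph and scores are my own illustration, not from the paper:

```python
import heapq

def best_first_search(start, goal, neighbors, score):
    # score(state) estimates how promising a state is (lower is better).
    # An LLM-based agent would supply this judgment through reasoning
    # instead of a fixed mathematical heuristic.
    frontier = [(score(start), start)]
    came_from = {start: None}
    while frontier:
        _, current = heapq.heappop(frontier)
        if current == goal:
            path = []
            while current is not None:
                path.append(current)
                current = came_from[current]
            return path[::-1]
        for nxt in neighbors(current):
            if nxt not in came_from:
                came_from[nxt] = current
                heapq.heappush(frontier, (score(nxt), nxt))
    return None

# Toy graph; "promise" stands in for the agent's branch evaluation.
graph = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}
promise = {"A": 3, "B": 2, "C": 1, "D": 0}
path = best_first_search("A", "D", graph.__getitem__, promise.__getitem__)
print(path)  # ['A', 'C', 'D']
```

Swapping in a different `score` function is all it takes to change the search behavior, which is exactly the lever that language-based reasoning provides.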
### Paper 2: Adaptive Path Planning via Weighted A* and Heuristic Rewards
The second paper focuses on improving path-planning algorithms, particularly the A* search algorithm.
In standard A* search, the evaluation function is:
- g(n) → the cost from the start node to the current node
- h(n) → a heuristic estimate of the remaining cost from the current node to the goal
The total score is calculated as:
f(n) = g(n) + h(n)
However, the paper proposes using Weighted A* to prioritize heuristic information more strongly:
f(n) = g(n) + w × h(n)
Here w is a weight that increases the importance of the heuristic estimate.
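A short grid example makes the effect of w visible. This sketch uses the Manhattan-distance heuristic and 4-connected moves; it is my own illustration of standard Weighted A*, not the paper's implementation:

```python
import heapq

def weighted_astar(start, goal, grid, w=1.5):
    # f(n) = g(n) + w * h(n); w > 1 makes the search greedier,
    # trading the optimality guarantee for faster expansion.
    rows, cols = len(grid), len(grid[0])
    h = lambda p: abs(p[0] - goal[0]) + abs(p[1] - goal[1])  # Manhattan
    frontier = [(w * h(start), 0, start)]
    g = {start: 0}
    came_from = {start: None}
    while frontier:
        _, cost, cur = heapq.heappop(frontier)
        if cur == goal:
            path = []
            while cur is not None:
                path.append(cur)
                cur = came_from[cur]
            return path[::-1]
        r, c = cur
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                ng = cost + 1
                if ng < g.get((nr, nc), float("inf")):
                    g[(nr, nc)] = ng
                    came_from[(nr, nc)] = cur
                    heapq.heappush(frontier, (ng + w * h((nr, nc)), ng, (nr, nc)))
    return None

grid = [[0, 0, 0],
        [1, 1, 0],  # 1 marks an obstacle
        [0, 0, 0]]
path = weighted_astar((0, 0), (2, 0), grid, w=1.5)
print(path)  # the 7-cell detour around the blocked row
```

With w = 1, this reduces to standard A*; larger values push the search toward the goal more aggressively.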
The paper further introduces heuristic rewards, which allow the algorithm to dynamically adjust its guidance based on the environment. Instead of relying only on static heuristics, the system can learn or adapt its evaluation during the search.
This modification is especially useful in environments where:
- The search space is very large
- Conditions change over time
- Quick decisions are required
This concept directly connects to our AI course topics such as:
- The A* search algorithm
- Heuristic design
- Optimization of search strategies
### Practical Example: Autonomous Delivery Robots
A practical real-world application of these ideas is autonomous delivery robots used in smart cities or warehouses.
Imagine a robot delivering packages inside a large warehouse.
#### Using Classical A*
With standard A* search:
- The robot finds the shortest path between two points.
- The heuristic might simply be the distance to the goal.
However, this approach ignores many real-world factors such as:
- Human workers moving around
- Temporary obstacles
- High-traffic zones
- Battery efficiency
#### Using Weighted A* with Adaptive Rewards
With the method described in the second paper:
- The algorithm prioritizes promising routes faster using weighted heuristics.
- The system dynamically adjusts path preferences using heuristic rewards.
For example:
| Path | Distance | Obstacle Risk | Heuristic Reward | Result |
|------|----------|---------------|------------------|--------|
| Path A | Short | High | Low | Avoid |
| Path B | Medium | Low | High | Choose |
| Path C | Long | Medium | Medium | Backup |
Even if Path A is shorter, the algorithm may select Path B because it is safer and faster overall.
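One way to picture this trade-off is a combined score over the three factors. The numbers and weights below are purely illustrative assumptions for this example, not values from the paper:

```python
# Illustrative only: the linear scoring rule and all weights are
# assumptions made for this example, not taken from the paper.
paths = {
    "A": {"distance": 2, "risk": 3, "reward": 1},  # short but high risk
    "B": {"distance": 4, "risk": 1, "reward": 3},  # medium, safe, rewarded
    "C": {"distance": 6, "risk": 2, "reward": 2},  # long backup option
}

def score(p):
    # Lower is better: cost grows with distance and obstacle risk,
    # and shrinks as the heuristic reward grows.
    return p["distance"] + 2 * p["risk"] - 2 * p["reward"]

best = min(paths, key=lambda k: score(paths[k]))
print(best)  # B
```

Even though Path A has the smallest distance term, its risk penalty outweighs that advantage, so the scoring rule picks Path B.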
This leads to:
- Faster deliveries
- Less congestion
- Improved efficiency
### Combining Both Papers
The most interesting insight is that both papers complement each other.
- LLM-based Deep Search Agents provide high-level reasoning and planning.
- Weighted A* provides efficient path optimization.
A future intelligent system could combine both approaches:
- The LLM agent decides strategic goals (which delivery route or task to prioritize).
- The Weighted A* algorithm finds the optimal path to execute that decision.
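The division of labor between the two layers can be sketched with stubs. Here the `strategic_planner` function stands in for the LLM's reasoning and `path_cost` stands in for the Weighted A* planner; the task names, urgency values, and coordinates are all hypothetical:

```python
# Two-layer sketch: a high-level planner (an LLM in the papers; a stub
# here) picks the next task, and a low-level path planner (Weighted A*
# in the papers; simplified to Manhattan distance here) costs it out.
deliveries = {"dock": (5, 1), "shelf_B7": (2, 8), "charging": (0, 0)}

def strategic_planner(tasks):
    # Stand-in for LLM reasoning: prioritize by assumed task urgency.
    urgency = {"shelf_B7": 3, "dock": 2, "charging": 1}
    return max(tasks, key=urgency.get)

def path_cost(start, goal):
    # Stand-in for the Weighted A* layer: straight Manhattan distance.
    return abs(start[0] - goal[0]) + abs(start[1] - goal[1])

robot_at = (0, 0)
task = strategic_planner(deliveries)
print(task, path_cost(robot_at, deliveries[task]))  # shelf_B7 10
```

The point of the sketch is the interface: the top layer chooses *what* to do, the bottom layer computes *how* to do it cheaply.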
This combination could power systems like:
- Autonomous vehicles
- Robotic warehouses
- Intelligent logistics systems
- Disaster-response robots
### Insights from Manual Reading and NotebookLM Exploration
While reading the papers manually, I noticed that both emphasize the importance of hybrid AI systems. Classical algorithms are not replaced by modern AI models; instead, they are enhanced by them.
Using NotebookLM helped highlight key insights:

- LLMs can guide search processes through reasoning.
- Adaptive heuristics improve search efficiency.
- Combining symbolic search algorithms with neural models is a growing research direction.
NotebookLM also helped summarize complex sections of the papers and made it easier to understand how these algorithms scale to real-world environments.
### Personal Reflection
Reading these papers helped me connect our AI course concepts with real research developments. Algorithms like A* that we practice in programming assignments are still fundamental in modern AI systems.
What has changed is that researchers are now integrating them with large language models and adaptive heuristics to make them more intelligent and flexible.
This shows that learning classical algorithms is still extremely valuable because they form the foundation for advanced AI systems.
Mention: @raqeeb_26