Controlling Search Agents to Perform Search with Noisy Observations and Probabilistic Guarantees
P Thaker (Mitsubishi Electric Research Laboratories)
A control system that directs teams of search agents along fuel-constrained paths using a multi-level multi-armed bandit, classifying regions from noisy measurements with probabilistic finite-time guarantees.