Research Papers - Parth K. Thaker

Controlling Search Agents to Perform Search with Noisy Observations and Probabilistic Guarantees

P Thaker (Mitsubishi Electric Research Laboratories)

December 2024

US Patent Application

A control system and a method for controlling search agents to perform search with noisy observations and probabilistic guarantees is provided. The control system collects confidence bounds of a probabilistic classification of at least one region within at least one path of a set of paths. The control system compares aggregations of the confidence bounds of the probabilistic classifications of eac...

PatentsMulti-armed BanditsMulti-agent SystemsAutonomous SearchProbabilistic GuaranteesMMBS

DOI

→

Non-Stationary Bandits with Periodic Behavior: Harnessing Ramanujan Periodicity Transforms to Conquer Time-Varying Challenges

P Thaker, V Gattani, V Tirukkonda et al.

April 2024

ICASSP 2024

We address the challenging problem of non-stationary multi-armed bandits where the reward distributions change over time in a periodic manner. Traditional approaches struggle with time-varying environments due to the exploration-exploitation trade-off becoming more complex when the optimal actions shift cyclically. We propose a novel framework that harnesses Ramanujan Periodicity Transforms to det...

Non-stationary BanditsRamanujan TransformsSignal ProcessingPeriodic BehaviorTime-varying Systems

→

Bandit-based multi-agent search under noisy observations

P Thaker, S Di Cairano, A P Vinod

June 2023

IFAC 2023

We address autonomous search using teams of multiple agents, requiring tractable coordination strategies that can lower the time to identify interesting areas in the search environment, lower the costs/energy usage by the search agents during movement and sensing, and be resilient to the noise present in the sensed data due to the use of low-cost and low-weight sensors. We propose a data-driven, m...

Multi-armed BanditsMulti-agent SystemsAutonomous SearchRoboticsNoise Robustness

DOI

→

Maximizing and Satisficing in Multi-armed Bandits with Graph Information

P Thaker, M Malu, N Rao et al.

November 2022

NeurIPS 2022

We consider the pure exploration problem in stochastic multi-armed bandits where the similarities between the arms are captured by a graph and the rewards may be represented as a smooth signal on this graph. We specifically examine the problem of finding the arm with the maximum reward (maximizing problem) or one with a sufficiently high reward (satisficing problem) under this model. We propose no...

Multi-armed BanditsGraph TheoryPure ExplorationOptimizationMachine Learning

arXiv: 2108.01152

→

Differentiable Programming for Hyperspectral Unmixing using a Physics-based Dispersion Model

J Janiczek, P Thaker, G Dasarathy et al.

July 2020

ECCV 2020

Hyperspectral unmixing is an important remote sensing task with applications including material identification and analysis. Characteristic spectral features make many pure materials identifiable from their visible-to-infrared spectra, but quantifying their presence within a mixture is a challenging task due to nonlinearities and factors of variation. We consider spectral variation from a physics-...

Computer VisionHyperspectral ImagingDifferentiable ProgrammingPhysics-based ModelsRemote Sensing

arXiv: 2007.05996

→

On the Sample Complexity and Optimization Landscape for Quadratic Feasibility Problems

P Thaker

June 2020

ISIT 2020

We consider the problem of recovering a complex vector from quadratic measurements, known as quadratic feasibility, which encompasses the well known phase retrieval problem and has applications in power system state estimation and x-ray crystallography. While the quadratic feasibility problem is generally NP-hard and may be unidentifiable, we establish conditions under which this problem becomes i...

OptimizationNonconvex OptimizationSample ComplexityPhase RetrievalSignal Processing

arXiv: 2002.01066

→

When to arrive in a congested system: Achieving equilibrium via learning algorithm

P Thaker

December 2016

IEEE ISIT 2017

We consider a strategic problem where multiple players compete to access a shared server platform that operates intermittently, switching between ON and OFF periods. Each player incurs costs to sample the server state and receives payoffs inversely proportional to the number of simultaneously connected players. We propose a distributed randomized learning algorithm that enables players to minimize...

Game TheoryLearning AlgorithmsNash EquilibriumResource AllocationCongestion Control

DOI

→