☆ 4.7 Article

Regret and Convergence Bounds for a Class of Continuum-Armed Bandit Problems

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2009)

期刊

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

卷 54, 期 6, 页码 1243-1253

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TAC.2009.2019797

关键词

Adaptive control; sequential decision procedures; stochastic approximation

类别

Automation & Control Systems Engineering, Electrical & Electronic

向作者/读者索取更多资源

Protocol

Reagent

摘要

We consider a class of multi-armed bandit problems where the set of available actions can be mapped to a convex, compact region of R-d, sometimes denoted as the continuum-armed bandit problem. The paper establishes bounds on the efficiency of any arm-selection procedure under certain conditions on the class of possible underlying reward functions. Both finite-time lower bounds on the growth rate of the regret, as well as asymptotic upper bounds on the rates of convergence of the selected control values to the optimum are derived. We explicitly characterize the dependence of these convergence rates (in the minimal rate of variation of the mean reward function in a neighborhood of the optimal control. The bounds can be used to demonstrate the asymptotic optimality of the Kiefer-Wolfowitz method of stochastic approximation with regard to a large class of possible mean reward functions.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7

评分不足

次要评分

新颖性

-

重要性

-

科学严谨性

-

评价这篇论文

推荐

Article Engineering, Industrial

Adaptive fully sequential selection procedures with linear and nonlinear control variates

Shing Chih Tsai, Jun Luo, Guangxin Jiang, Wei Cheng Yeh

Summary: This article introduces an adaptive fully sequential Ranking-and-Selection (R&S) procedure, adopting the classic Indifference-Zone (IZ) formulation in the statistical literature and incorporating control variates method. The proposed procedure is demonstrated to have advantages through simulation experiments.

IISE TRANSACTIONS (2023)

添加到收藏夹

Article Mathematics, Applied

ADAPTIVE SEQUENTIAL SAMPLE AVERAGE APPROXIMATION FOR SOLVING TWO-STAGE STOCHASTIC LINEAR PROGRAMS

Raghu Pasupathy, Yongjia Song

Summary: This study introduces an adaptive sequential SAA algorithm to solve large-scale two-stage stochastic linear programs, achieving favorable performance through a sequential framework with optimal sample size schedule and the use of warm starts. Extensive numerical tests demonstrate the success of the proposed algorithm, providing a solution with a probabilistic guarantee on quality.

SIAM JOURNAL ON OPTIMIZATION (2021)

添加到收藏夹

Article Management

On the finite-sample statistical validity of adaptive fully sequential procedures

Zhenxia Cheng, Jun Luo, Ruijing Wu

Summary: We study the simulation optimization problem of selecting the best system design, known as ranking and selection (R&S). We propose fully sequential procedures that incorporate adaptive sampling rules while preserving finite-sample statistical guarantees. Specifically, we introduce an adaptive sampling rule that utilizes consecutively updated sample mean and variance information by solving a minimization problem of the approximated total sample size. Extensive simulation experiments demonstrate the efficiency of the proposed procedures, and we apply them to solve an ambulance dispatching problem.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2023)

添加到收藏夹

Article Automation & Control Systems

Adaptive Stabilization With Control-Dependent Stochastic Noise

Fengzhong Li, Yungang Liu

Summary: This article discusses the issue of control-dependent stochastic noise in adaptive control. It proposes basic theorems on stochastic convergence and establishes a martingale-based analysis pattern for adaptive control. Global stabilization of a certain class of uncertain nonlinear systems is achieved through the use of dynamic gains.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2023)

添加到收藏夹

Article Operations Research & Management Science

Stochastic Approximation Procedures for Levy-Driven SDEs

Jan Seidler, Ondrej Tybl

Summary: We study a continuous-time Robbins-Monro-type stochastic approximation procedure for a system described by a stochastic differential equation driven by a general Levy process, and we establish sufficient conditions for its convergence using Lyapunov functions. Despite the possible disruption caused by the jump part of the noise, we show that convergence can still be achieved by choosing suitable noise coefficients, even under weaker assumptions on the drift compared to the diffusion case or in the presence of multiple roots of the drift.

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Adaptive-Neuro-Learning Tracking Control for the Permanent Magnet Synchronous Motor with Full-State Prescribed Performances and Time Delays

Tandong Li, Shaobo Li, Junxing Zhang, Hang Sun, Chaojie Zheng, Dongchao Lv

Summary: A new hybrid controller is proposed for high-performance tracking control of permanent magnet synchronous motors in perturbed environments. The controller achieves full-state performance constraints using a prescribed performance method and avoids complexity explosion using a time-varying filter. By combining Lyapunov-Krasovskii functionals with adaptive neural networks, the controller solves the problems of time-delay disturbance and unknown nonlinear dynamics.

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS (2023)

添加到收藏夹

Article Automation & Control Systems

Deep Filtering With Adaptive Learning Rates

Hongjiang Qian, George Yin, Qing Zhang

Summary: This article presents a new deep learning framework for general nonlinear filtering. The main contribution is the development of a computationally feasible procedure. The proposed algorithms can handle challenging filtering problems involving diffusions with randomly-varying switching. The article demonstrates the efficiency of the algorithm through highly nonlinear dynamic system examples.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2023)

添加到收藏夹

Article Automation & Control Systems

Robust and Adaptive Sequential Submodular Optimization

Vasileios Tzoumas, Ali Jadbabaie, George J. Pappas

Summary: In this article, the authors propose a robust and adaptive maximization algorithm for solving discrete optimization problems in adversarial environments. The algorithm, called RAM, runs in an online fashion and adapts to the history of failures in each step. It guarantees near-optimal performance and has both provable per-instance a priori bounds and tight and/or optimal a posteriori bounds.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2022)

添加到收藏夹

Article Automation & Control Systems

Stochastic Adaptive Nonlinear Control With Filterless Least Squares

Wuquan Li, Miroslav Krstic

Summary: A new least-squares identification scheme is proposed for stochastic strict-feedback nonlinear systems with unknown parameters, without regressor filtering. The key new element in the estimator design is a weighted term with design parameters, introduced to handle nonlinear terms and stochastic noise. Adaptive controllers are designed to ensure global stability in probability at the equilibrium point and regulation of states to zero almost surely.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2021)

添加到收藏夹

Article Automation & Control Systems

A distributed stochastic approximation algorithm for stochastic LQ control with unknown uncertainty

Zhaorong Zhang, Juanjuan Xu, Xun Li

Summary: This paper investigates a discrete-time stochastic control problem with linear quadratic criteria over an infinite-time horizon. The focus is on control systems whose system matrices are associated with random parameters involving unknown statistical properties. A distributed stochastic approximation algorithm is designed to solve the Riccati equation and obtain the optimal controller for stabilizing the system. Convergence analysis is provided.

AUTOMATICA (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

A New Adaptive Sparse Pseudospectral Approximation Method and its Application for Stochastic Power Flow

Jikeng Lin, Kaiming Yuan, Lingfeng Wang

Summary: This study introduces a new adaptive sparse pseudospectral approximation method called NA-SPAM, which addresses the issues of inaccurate global error estimate and inefficiency in multi-output systems that were present in the original A-SPAM. Through two improvements, NA-SPAM demonstrates higher estimation accuracy and improved efficiency in calculating multi-output problems.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS (2021)

添加到收藏夹

Article Water Resources

Adaptive water resource planning using decision-rules

Tohid Erfani, Julien J. Harou

Summary: Dealing with uncertainty in water resource planning is complex due to the potential social and environmental costs of insufficient infrastructure. Multistage stochastic optimisation offers a solution to this challenge, but can be difficult and expensive for real systems. The 'Decision-rule' formulation approximates the multistage problem by introducing a series of rules that are functions of uncertainty and system state, ultimately impacting adaptive water resources planning.

ADVANCES IN WATER RESOURCES (2021)

添加到收藏夹

Article Management

Adaptive Sequential Experiments with Unknown Information Arrival Processes

Yonatan Gur, Ahmadreza Momeni

Summary: This paper investigates the performance achieved by leveraging auxiliary information in sequential experiments and proposes effective algorithms. The study shows that upper confidence bound and Thompson sampling algorithms have good performance when the mapping between auxiliary observations and rewards is known, and auxiliary information can improve performance. When the mapping is unknown, an adaptive strategy is proposed to ensure near optimality, and better performance can be achieved by utilizing auxiliary observations in practical applications.

M&SOM-MANUFACTURING & SERVICE OPERATIONS MANAGEMENT (2022)

添加到收藏夹

Article Management

Hybrid strategies using linear and piecewise-linear decision rules for multistage adaptive linear optimization

Said Rahal, Dimitri J. Papageorgiou, Zukui Li

Summary: Decision rules provide a rich framework for solving multistage adaptive optimization problems, with recent literature showing the potential of using both linear and nonlinear decision rules. The study explores hybrid decision rules combining the benefits of the two classes, highlighting the trade-off between solution quality and computational cost. Unexpectedly, a linear decision rule was found to be superior to a more complex piecewise-linear decision rule in a simulator, emphasizing the importance of assessing decision rule quality within a simulator.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2021)

添加到收藏夹

Article Computer Science, Interdisciplinary Applications

Robust Adaptive Submodular Maximization

Shaojie Tang

Summary: The goal of a sequential decision-making problem is to design an interactive policy that adaptively selects a group of items, each selection is based on the feedback from the past, to maximize the expected utility of selected items. This study proposes to study two variants of adaptive submodular optimization problems and introduces a new class of stochastic functions called worst-case submodular functions. Several applications of the theoretical results are also described.

INFORMS JOURNAL ON COMPUTING (2022)

添加到收藏夹

暂无数据

暂无数据

© Peeref 2019-2024. All rights reserved.