4.5 Article

Cusum techniques for timeslot sequences with applications to network surveillance

Journal

COMPUTATIONAL STATISTICS & DATA ANALYSIS
Volume 53, Issue 12, Pages 4332-4344

Publisher

ELSEVIER
DOI: 10.1016/j.csda.2009.05.029

Keywords

-

Ask authors/readers for more resources

We develop two cusum, change-point detection algorithms for data network monitoring applications where numerous and various performance and reliability metrics are available to aid with the early identification of realized or impending failures. We confront three significant challenges with our cusum algorithms: (1) the need for nonparametric techniques so that a wide variety of metrics can be included in the monitoring process, (2) the need to handle time varying distributions for the metrics that reflect natural cycles in work load and traffic patterns, and (3) the need to be computationally efficient with the massive amounts of data that are available for processing. The only critical assumption we make when developing the algorithms is that suitably transformed observations within a defined timeslot structure are independent and identically distributed under normal operating conditions. To facilitate practical implementations of the algorithms, we present asymptotically valid thresholds. Our research was motivated by a real-world application and we use that context to guide the design of a simulation study that examines the sensitivity of the cusum algorithms. (C) 2009 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Statistics & Probability

The Cameron-Martin Theorem for (p-)Slepian Processes

Wolfgang Bischoff, Andreas Gegg

JOURNAL OF THEORETICAL PROBABILITY (2016)

Article Statistics & Probability

Boundary crossing probabilities for (q, d)-Slepian-processes

Wolfgang Bischoff, Andreas Gegg

STATISTICS & PROBABILITY LETTERS (2016)

Article Pharmacology & Pharmacy

A SEAMLESS PHASE II/III DESIGN WITH SAMPLE-SIZE RE-ESTIMATION

Wolfgang Bischoff, Frank Miller

JOURNAL OF BIOPHARMACEUTICAL STATISTICS (2009)

Article Statistics & Probability

The limit of the partial sums process of spatial least squares residuals

Wolfgang Bischoff, Wayan Somayasa

JOURNAL OF MULTIVARIATE ANALYSIS (2009)

Proceedings Paper Statistics & Probability

On Designs for Recursive Least Squares Residuals to Detect Alternatives

Wolfgang Bischoff

MODA 11 - ADVANCES IN MODEL-ORIENTED DESIGN AND ANALYSIS (2016)

Article Statistics & Probability

An asymptotic result for non crossing probabilities of brownian motion with trend

Wolfgang Bischoff, Enkelejd Hashorva, Juerg Huesler

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS (2007)

Article Environmental Sciences

Growth curve models for stochastic modeling and analyzing of natural disinfection of wastewater

Wolfgang Bischoff, Mong-Na Lo Huang, Lei Yang

ENVIRONMETRICS (2006)

Article Statistics & Probability

Efficient lack of fit designs that are optimal to estimate the highest coefficient of a polynomial

Wolfgang Bischoff, Frank Miller

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2006)

Article Statistics & Probability

Lack-of-fit-efficiently optimal designs to estimate the highest coefficient of a polynomial with large degree

Wolfgang Bischoff, Frank Miller

STATISTICS & PROBABILITY LETTERS (2006)

Article Statistics & Probability

Optimal designs which are efficient for lack of fit tests

Wolfgang Bischoff, Frank Miller

ANNALS OF STATISTICS (2006)

Article Statistics & Probability

D-optimally Lack-of-Fit-Test-efficient Designs and Related Simple Designs

Wolfgang Bischoff

AUSTRIAN JOURNAL OF STATISTICS (2008)

Article Computer Science, Interdisciplinary Applications

One point per cluster spatially balanced sampling

Blair Robertson, Chris Price

Summary: Spatial sampling designs are crucial for accurate estimation of population parameters. This study proposes a new design method that generates samples with good spatial spread and performs favorably compared to existing designs.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Simultaneous confidence region of an embedded one-dimensional curve in multi-dimensional space

Hiroya Yamazoe, Kanta Naito

Summary: This paper focuses on the simultaneous confidence region of a one-dimensional curve embedded in multi-dimensional space. An estimator of the curve is obtained through local linear regression on each variable in multi-dimensional data. A method to construct a simultaneous confidence region based on this estimator is proposed, and theoretical results for the estimator and the region are developed. The effectiveness of the region is demonstrated through simulation studies and applications to artificial and real datasets.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Efficient and robust optimal design for quantile regression based on linear programming

Cheng Peng, Drew P. Kouri, Stan Uryasev

Summary: This paper introduces a novel optimal experimental design method for quantifying the distribution tails of uncertain system responses. The method minimizes the variance or conditional value-at-risk of the upper bound of the predicted quantile, and estimates the data uncertainty using quantile regression. The optimal design problems are solved as linear programming problems, making the proposed methods efficient even for large datasets.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Multi-block alternating direction method of multipliers for ultrahigh dimensional quantile fused regression

Xiaofei Wu, Hao Ming, Zhimin Zhang, Zhenyu Cui

Summary: This paper proposes a model that combines quantile regression and fused LASSO penalty, and introduces an iterative algorithm based on ADMM to solve high-dimensional datasets. The paper proves the global convergence and comparable convergence rates of the algorithm, and analyzes the theoretical properties of the model. Numerical experimental results support the superior performance of the model.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Nonparametric augmented probability weighting with sparsity

Xin He, Xiaojun Mao, Zhonglei Wang

Summary: This paper proposes a nonparametric imputation method with sparsity to estimate the finite population mean, using an efficient kernel method and sparse learning for estimation. An augmented inverse probability weighting framework is adopted to achieve a central limit theorem for the proposed estimator under regularity conditions.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Conditional-mean multiplicative operator models for count time series

Christian H. Weiss, Fukang Zhu

Summary: This study introduces a multiplicative error model (CMEMs) for discrete-valued count time series, which is closely related to the integer-valued generalized autoregressive conditional heteroscedasticity (INGARCH) models. It derives the stochastic properties and estimation approaches of different types of INGARCH-CMEMs, and demonstrates their performance and application through simulations and real-world data examples.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Hybrid exact-approximate design approach for sparse functional data

Ming-Hung Kao, Ping-Han Huang

Summary: Optimal designs for sparse functional data under the functional empirical component (FEC) settings are investigated. New computational methods and theoretical results are developed to efficiently obtain optimal exact and approximate designs. A hybrid exact-approximate design approach is proposed and demonstrated to be efficient through simulation studies and a real example.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

GP-BART: A novel Bayesian additive regression trees approach using Gaussian processes

Mateus Maia, Keefe Murphy, Andrew C. Parnell

Summary: The Bayesian additive regression trees (BART) model is a powerful ensemble method for regression tasks, but its lack of smoothness and explicit covariance structure can limit its performance. The Gaussian processes Bayesian additive regression trees (GP-BART) model addresses this limitation by incorporating Gaussian process priors, resulting in superior performance in various scenarios.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Additive partially linear model for pooled biomonitoring data

Xichen Mou, Dewei Wang

Summary: Human biomonitoring is a method of monitoring human health by measuring the accumulation of harmful chemicals in the body. To reduce the high cost of chemical analysis, researchers have adopted a cost-effective approach that combines specimens and analyzes the concentration of toxic substances in the pooled samples. To effectively interpret these aggregated measurements, a new regression framework is proposed by extending the additive partially linear model (APLM). The APLM is versatile in capturing the complex association between outcomes and covariates, making it valuable in assessing the complex interplay between chemical bioaccumulation and potential risk factors.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Laplace approximated quasi-likelihood method for heteroscedastic survival data

Lili Yu, Yichuan Zhao

Summary: The classical accelerated failure time model is a linear model commonly used for right censored survival data, but it cannot handle heteroscedastic survival data. This paper proposes a Laplace approximated quasi-likelihood method with a continuous estimating equation to address this issue, and provides estimation bias and confidence interval estimation formulas.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Standard error estimates in hierarchical generalized linear models

Shaobo Jin, Youngjo Lee

Summary: Hierarchical generalized linear models are widely used for fitting random effects models, but the standard error estimators receive less attention. Current standard error estimation methods are not necessarily accurate, and a sandwich estimator is proposed to improve the accuracy of standard error estimation.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Probability of default estimation in credit risk using mixture cure models

Rebeca Pelaez, Ingrid Van Keilegom, Ricardo Cao, Juan M. Vilar

Summary: This article proposes an estimator for the probability of default (PD) in credit risk, derived from a nonparametric conditional survival function estimator based on cure models. The asymptotic expressions for bias, variance, and normality of the estimator are presented. Through simulation and empirical studies, the performance and practical behavior of the nonparametric estimator are compared with other methods.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Joint modelling of the body and tail of bivariate data

L. M. Andre, J. L. Wadsworth, A. O'Hagan

Summary: This paper proposes a dependence model that captures the entire data range in multi-variable cases. By blending two copulas with different characteristics and using a dynamic weighting function for smooth transition, the model is able to flexibly capture various dependence structures.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Significance test for semiparametric conditional average treatment effects and other structural functions

Niwen Zhou, Xu Guo, Lixing Zhu

Summary: The paper investigates hypothesis testing regarding the potential additional contributions of other covariates to the structural function, given the known covariates. The proposed distance-based test, based on Neyman's orthogonality condition, effectively detects local alternatives and is robust to the influence of nuisance functions. Numerical studies and real data analysis demonstrate the importance of this test in exploring covariates associated with AIDS treatment effects.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Full uncertainty analysis for Bayesian nonparametric mixture models

Blake Moya, Stephen G. Walker

Summary: A full posterior analysis method for nonparametric mixture models using Gibbs-type prior distributions, including the well known Dirichlet process mixture (DPM) model, is presented. The method removes the random mixing distribution and enables a simple-to-implement Markov chain Monte Carlo (MCMC) algorithm. The removal procedure reduces some of the posterior uncertainty and introduces a novel replacement approach. The method only requires the probabilities of a new or an old value associated with the corresponding Gibbs-type exchangeable sequence, without the need for explicit representations of the prior or posterior distributions. This allows the implementation of mixture models with full posterior uncertainty, including one introduced by Gnedin. The paper also provides numerous illustrations and introduces an R-package called CopRe that implements the methodology.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)