4.5 Article

Outlier Detection in Functional Observations With Applications to Profile Monitoring

Journal

TECHNOMETRICS
Volume 54, Issue 3, Pages 308-318

Publisher

AMER STATISTICAL ASSOC
DOI: 10.1080/00401706.2012.694781

Keywords

Asymptotic test; Functional data analysis; Functional principal component analysis; Statistical process control

Funding

  1. NNSF of China [11001138, 11071128, 11131002, 11101306]
  2. RFDP of China [20110031110002]
  3. Nankai Grant [65010731]
  4. National Center for Theoretical Sciences, Math Division

Ask authors/readers for more resources

The presence of outliers has serious adverse effects on the modeling and forecasting of functional data. Therefore, outlier detection, aiming at identifying abnormal functional curves from a dataset, is quite important. This article proposes a new testing procedure based on functional principal component analysis. Under mild conditions, the null distribution of the test statistic is shown to be asymptotically pivotal with a well-known asymptotic distribution. Simulation results demonstrate good finite-sample performance of the asymptotic test and detection procedure. Finally, by illustrating the connection between profile monitoring in statistical process control and outlier detection in functional data, we apply the proposed approach to a real-data example from a manufacturing process and show that it performs quite well in detecting outlying profiles. Supplementary Material for this article is posted online on the journal web site.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Genetics & Heredity

Non-invasive Detection of Exosomal MicroRNAs via Tethered Cationic Lipoplex Nanoparticles (tCLN) Biochip for Lung Cancer Early Detection

Chang Liu, Eric Kannisto, Guan Yu, Yunchen Yang, Mary E. Reid, Santosh K. Patnaik, Yun Wu

FRONTIERS IN GENETICS (2020)

Article Environmental Sciences

Improving the robustness of beach water quality modeling using an ensemble machine learning approach

Leizhi Wang, Zhenduo Zhu, Lauren Sassoubre, Guan Yu, Chen Liao, Qingfang Hu, Yintang Wang

Summary: The study introduces an ensemble machine learning approach known as model stacking for predicting beach water quality reliably. In experiments conducted at three beaches along eastern Lake Erie, the stacking model consistently ranked 1st or 2nd in accuracy every year, with yearly-average accuracy of 78%, 81%, and 82.3% at the three studied beaches, respectively.

SCIENCE OF THE TOTAL ENVIRONMENT (2021)

Article Immunology

Tissue-specific activation of Myd88-dependent pathways governs disease severity in primary Sjo • gren ' s syndrome

Jeremy Kiripolsky, Eileen M. Kasperek, Chengsong Zhu, Quan-Zhen Li, Jia Wang, Guan Yu, Jill M. Kramer

Summary: Myd88 activation plays a crucial role in specific cell types in the pathology of pSS. Deleting Myd88 in hematopoietic cells can mitigate inflammatory responses in salivary tissue and nephritis, but increases pulmonary inflammation; meanwhile, ablation of Myd88 in stromal cells can reduce pulmonary inflammation and alleviate levels of anti-nuclear autoantibodies.

JOURNAL OF AUTOIMMUNITY (2021)

Article Environmental Sciences

Co-exposure to inorganic arsenic and fluoride prominently disrupts gut microbiota equilibrium and induces adverse cardiovascular effects in offspring rats

Xiaoyan Yan, Xushen Chen, Xiaolin Tian, Yulan Qiu, Jie Wang, Guan Yu, Nisha Dong, Jing Feng, Jiaxin Xie, Morgan Nalesnik, Ruiyan Niu, Bo Xiao, Guohua Song, Sarah Quinones, Xuefeng Ren

Summary: Co-exposure to inorganic arsenic and fluoride results in more prominent adverse effects on cardiovascular systems and perturbation of gut microbiota, with certain bacterial genera strongly correlated with higher risk of cardiovascular events.

SCIENCE OF THE TOTAL ENVIRONMENT (2021)

Article Statistics & Probability

High-Dimensional Cost-constrained Regression Via Nonconvex Optimization

Guan Yu, Haoda Fu, Yufeng Liu

Summary: The article explores a new high-dimensional cost-constrained linear regression problem and proposes a new optimization method, demonstrating the convergence of the algorithm and the possibility of a global optimal solution. Experimental results show that the method performs well in high-dimensional problems.

TECHNOMETRICS (2022)

Article Immunology

Immune-Intrinsic Myd88 Directs the Production of Antibodies With Specificity for Extracellular Matrix Components in Primary Sjogren's Syndrome

Jeremy Kiripolsky, Eileen M. Kasperek, Chengsong Zhu, Quan-Zhen Li, Jia Wang, Guan Yu, Jill M. Kramer

Summary: Primary Sjogren's syndrome, predominantly seen in women, is characterized by exocrine gland dysfunction and serious systemic manifestations. Research suggests that ECM degradation may represent a novel source of chronic B cell activation in the context of pSS.

FRONTIERS IN IMMUNOLOGY (2021)

Article Statistics & Probability

Sample-Wise Combined Missing Effect Model with Penalization

Jialu Li, Guan Yu, Qizhai Li, Yufeng Liu

Summary: Modern high-dimensional statistical inference often faces the problem of missing data. In this article, we propose a new method called SCOM to deal with missing data occurring in predictors. SCOM makes full use of all available data and is robust with respect to various missing mechanisms.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2023)

Article Health Care Sciences & Services

Integrative nearest neighbor classifier for block-missing multi-modality data

Guan Yu, Surui Hou

Summary: This paper addresses the classification problem in multi-modality datasets with missing data by developing a new weighted nearest neighbors classifier called the integrative nearest neighbor (INN) classifier. INN efficiently utilizes available information in the training data and feature vector of the test data point to predict its class label without deleting or imputing any missing data. Simulation and real application results demonstrate that INN outperforms other methods in terms of classification performance.

STATISTICAL METHODS IN MEDICAL RESEARCH (2022)

Article Engineering, Environmental

Interpretable tree-based ensemble model for predicting beach water quality

Lingbo Li, Jundong Qiao, Guan Yu, Leizhi Wang, Hong-Yi Li, Chen Liao, Zhenduo Zhu

Summary: Tree-based machine learning models offer low-cost and timely solutions for predicting microbial fecal contamination in beach water. However, many of these models are difficult to interpret. This study evaluates five tree-based models and employs the SHAP explanation method to improve interpretability. LightGBM and XGBoost achieve the highest precision and recall scores in predicting Escherichia coli concentration in beach water.

WATER RESEARCH (2022)

Article Hematology

Effect of dialysate bicarbonate and sodium on blood pH in maintenance hemodialysis-A prospective study

Nagaraja Rao Sridhar, Ziqiang Chen, Guan Yu, Judy Lambert, Mary Muscarella, Madan Nanjundegowda, Mandip Panesar

Summary: Based on the findings of this study, adjusting the dialysate bicarbonate concentration based on pre-HD serum bicarbonate is unnecessary, while higher bicarbonate and lower dialysate sodium are associated with post-HD alkalemia.

THERAPEUTIC APHERESIS AND DIALYSIS (2023)

Article Environmental Sciences

Do Neighborhood Factors Modify the Effects of Lead Exposure on Child Behavior?

Seth Frndak, Gabriel Barg, Elena Queirolo, Nelly Manay, Craig Colder, Guan Yu, Zia Ahmed, Katarzyna Kordas

Summary: Lead exposure and neighborhood characteristics have an impact on children's behavior. However, the study found that the effects of lead on behavior are modified by the distance to greenspace. This suggests that interventions should consider both greenspace access and lead exposure prevention.

TOXICS (2022)

Article Statistics & Probability

TNN: A transfer learning classifier based on weighted nearest neighbors

Haiyang Sheng, Guan Yu

Summary: Traditional weighted nearest neighbors classifiers are designed for independent and identically distributed supervised learning problems, but in real applications, it is often difficult to obtain the desired data distribution. Therefore, we propose a novel transfer learning weighted nearest neighbors classifier that can flexibly combine training samples from different distributions to improve prediction accuracy.

JOURNAL OF MULTIVARIATE ANALYSIS (2023)

Article Chemistry, Multidisciplinary

Simultaneous Detection of Tumor Derived Exosomal Protein-MicroRNA Pairs with an Exo-PROS Biosensor for Cancer Diagnosis

Chang-Chieh Hsu, Yunchen Yang, Eric Kannisto, Xie Zeng, Guan Yu, Santosh K. Patnaik, Grace K. Dy, Mary E. Reid, Qiaoqiang Gan, Yun Wu

Summary: Tumor-derived exosomes (TEXs) are considered as promising biomarkers for cancer liquid biopsy. We developed an ExoPROS biosensor that selectively captures TEXs and enables simultaneous detection of TEX protein-microRNA pairs using a surface plasmon resonance mechanism. The assay demonstrated high accuracy for lung cancer and breast cancer diagnosis compared to conventional methods. The ExoPROS assay is a potent liquid biopsy assay for cancer diagnosis.

ACS NANO (2023)

Article Public, Environmental & Occupational Health

Reducing the complexity of high-dimensional environmental data: An analytical framework using LASSO with considerations of confounding for statistical inference

Seth Frndak, Guan Yu, Youssef Oulhote, Elena I. Queirolo, Gabriel Barg, Marie Vahter, Nelly Manay, Fabiana Peregalli, James R. Olson, Zia Ahmed, Katarzyna Kordas

Summary: This study presents a two-step approach for selecting exposures in high-dimensional environmental datasets, considering confounding. The approach involves using LASSO algorithm for feature selection and linear regression models with confounder adjustment based on directed acyclic graphs (DAGs) for statistical inference. The results identified four variables associated with cognitive ability scores.

INTERNATIONAL JOURNAL OF HYGIENE AND ENVIRONMENTAL HEALTH (2023)

Article Dentistry, Oral Surgery & Medicine

Correlation between craniofacial structures, anthropometric measurements, and nasopharyngeal dimensions in black adolescents

Thikriat Al-Jewair, Simran Marwah, Charles Brian Preston, Yufei Wu, Guan Yu

Summary: This study evaluated the correlation between craniofacial structures and nasopharyngeal dimensions in African Black adolescents, finding significant associations between craniofacial structures and stature/upper body height, maxillary growth and bony nasopharyngeal variables, as well as mandibular growth and soft tissue nasopharyngeal variables. Sexual dimorphism in lower facial height was identified, suggesting the need for further research on managing craniofacial complexity and nasopharyngeal airway in this population.

INTERNATIONAL ORTHODONTICS (2021)

No Data Available