AdaptVarLM: A linear regression model for covariate-dependent non-constant error variance

Date

2024

Authors

Wang, Wanmeng

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In biological research, traditional multiple regression models assume homoscedasticity — constant variance of error terms — an assumption that is difficult to maintain in complex biological data. This thesis introduces AdaptVarLM, a novel linear regression model specialized in dealing with non-constant error variance dependent on one covariate. AdaptVarLM integrates an auxiliary linear relationship between the logarithmic variance of the error term and a specific explanatory variable, and uses maximum likelihood estimation (MLE) in the iterative updating process to improve the parameter estimation accuracy. By modelling non-constant error variance, AdaptVarLM outperforms the traditional regression model in capturing the complex variability inherent in biological data. Applying to the study of Alzheimer's disease, AdaptVarLM detects genetically linked genes associated with the disease and error variance. The results of analyzing both bulk and single-cell data validate the effectiveness of AdaptVarLM in detecting significant genes.

Description

Keywords

statistics, linear regression model, non-constant error variance

Citation