چكيده لاتين
In linear regression modeling, assumptions such as error normality and homoscedas- ticity are fundamental for valid statistical inference, yet are often violated in real-world data due to outliers, skewness, or heteroscedasticity. This thesis reviews some of such techniques and focuses on robust transformations of the response variable to address these issues. In this study, we review the Box–Cox and Yeo–Johnson transformations and their limitations. The new extended transformation method, proposed by Riani et al. (2023), resolves the limitations of the mentioned methods. It is an automated and efficient technique using fan plot, Bayesian information criterion, and forward search al- gorithms simultaneously to select a precise transformation parameter value. This method is able to identify outliers and guard against them, leading to robust regression models. Furthermore, in this study, the extended version of the Yeo–Johnson transformation, pro- posed by Atkinson et al. (2020), is introduced. Some mathematical developments in this novel method provide more flexibility by allowing different transformation parameters for positive and negative responses,generally improving variance stabilization and normality under complex distributions and in the presence of numerous outliers. These new trans- formations not only contribute to variance stabilization and residual normalization but also substantially improve prediction accuracy, model fit, and the validity of statistical inference. Their robustness in the presence of outliers makes them an ideal choice for prac- tical applications. The effectiveness of the proposed approaches is demonstrated through simulations and applications to real datasets, showing superior performance compared to classical and nonparametric methods. Ultimately, this research not only contributes to the theoretical development of regression models but also provides practical and reliable tools for researchers and data analysts to more effectively handle complex and imperfect data. It is recommended that future studies focus on extending these methods to nonlinear and multivariate regression models to broaden their scope of application.