Yohai 1984, which can attain an efficiency up to 33%. Part of the lecture notes in statistics book series lns, volume 26. The breakdown value is a measure of the proportion of contamination that an estimation method can withstand and still maintain its robustness. See rousseeuw 1984 and rousseeuw and leroy 1987 for applications of lms and related ideas to regression and other problems. Rousseeuw born october 1956 is a statistician known for his work on robust statistics and cluster analysis.
Peter rousseeuw be3001 leuven, belgium march 26, 2017. Therefore it can be viewed as a statistical theory dealing with approximate parametric models and a bridge between the fisherian parametric approach and the full nonparametric approach. Sestimators, proposed by rousseeuw and yohai 1984, were the first high. Transformational leadership in open distance learning. A class of robust and fully efficient regression estimators. Ronchetti, rousseeuw, and stahel 1986, maronna, martin, and yohai 2006, and dellaquila and ronchetti 2006 for an overview. Mestimation is the simplest approach both computationally and theoretically but cannot handle data which are contaminated in the covari. Introduction to rousseeuw 1984 least median of squares. Step 2 makes the estimate scale equivariant, whereas the following steps are kind of principal components that replace the eigen values of by robust variances. A combination of the high breakdown value method and mestimation is the mmestimation yohai, 1987. Robust regression and outlier detection rousseeuw, peter j. Its selfcontained treatment allows readers to skip the mathematical material which is concentrated in a few sections. M estimation, s estimation, and mm estimation in robust. It has a higher statistical efficiency than sestimation.
We create a global set of portfolios over the 19972014 time period that offer substantial outperformance of a global stock benchmark by using beaton and tukey, gunst et al. Least squares, for example, minimizes the variance of the residuals and is a special case of sestimators. Robust regression examples worcester polytechnic institute. A novel class of dimension reduction methods is combined with a stochastic multifactor panel regressionbased statespace model in order to model the dynamics of yield curves whilst incorporating regression factors. Outlier detection using distributionally robust optimization. Hestenes, m conjugate direction methods in optimization. E0, f3, f4, g1 abstract three of the most important recent facts in global macroeconomics the sustained rise in the us. Robust regression via lts methods which achieve the goal of being insensitive to changes in a small percentage of the. The results directly paralleled the uncorrected analyses. Leroy provides an applicationsoriented introduction to robust regression and outlier detection, emphasising highbreakdown methods which can cope with a sizeable fraction of contamination.
We refer the reader to the book of rousseeuw and leroy for an elaborate description of these robust regression methods. For comparison to the partial correlation and linear regression analyses summarized above, we also conducted robust regression analyses using the s rousseeuw and yohai, 1984 and mm estimation yohai, 1987 procedures, both of which correct estimates for the effects of outliers. Mar 26, 2017 3,339 rousseeuw 1984 lms and lts regression paper 21 765 rousseeuw yohai 1984 sestimators in a proceedings book paper 23 975 rousseeuw 1985 multivariate estimation in a proceedings book paper 28 6,031 hampelronchettirousseeuwstahel 1986 robust statistics book 1 3,990 rousseeuw 1987 silhouettes display for clustering. Mestimation is the simplest approach both computationally. In this analysis of the risk and return of stocks in global markets, we apply several applications of robust regression techniques in producing stock selection models and several optimization techniques in portfolio construction in global stock universes. With the same breakdown value, it has a higher statistical ef. Rousseeuw and yohai, 1984 and mm estimation yohai, 1987 procedures, both of which correct estimates for the effects of outliers. We refer the reader to the book of rousseeuw and leroy 2005 for a detailed description of these robust regression methods. P application of the conjugate gradient method in electromagnetics and signal. Journal of the american statistical association volume 69, number 348, 1974 r. Heteroscedasticityconsistent standard errors wikipedia.
The breakdown value is a measure of the proportion of contamination that an. We present a distributionally robust optimization dro approach to outlier detection in a linear regression setting, where the closeness of probability distributions is measured using the wasserstein metric. Yohai, a fast algoritm for sregression estimates, jour nal of computational and graphical statistics, 15, no. Note that this lms midpoint is also called the shorth in some more recent literature e. There are at least two reasons why robust regression techniques are useful tools in robust time series. Econometrics free fulltext financial big data solutions. Outlier detection using distributionally robust optimization under the wasserstein metric. Sestimators introduced by rousseeuw and yohai 1984. Yohai, robust regression by means of sestimators, in robust and nonlinear time series analysis, lecture notes in statistics, vol. The mve algorithm is based on the algorithm used in the minvol program by rousseeuw 1984.
S estimation is a high breakdown value method that was introduced byrousseeuw and yohai 1984. Evolution of reinforcement learning in uncertain environments. Huber, 1964, 1973, least median of squares lms rousseeuw, 1984, least trimmed squares lts rousseeuw, 1985, sestimation rousseeuw and yohai, 1984, and mmestimation yohai, 1987, are elaborated in the book of rousseeuw and leroy 2005. A fast algorithm for sregression estimates ubc department of. Robust regression and outlier detection rousseeuw, peter. These are also known as eickerhuberwhite standard errors also huberwhite standard errors or white standard errors, to recognize the contributions of friedhelm eicker, peter j. A fast algorithm for sregression estimates citeseerx. He obtained his phd in 1981 at the vrije universiteit brussel, following research carried out at the eth in zurich in the group of frank hampel, which led to a book on influence functions.
Heidelberg workshop on robust and nonlinear time series, sept. These very robust methods are extremely timeconsuming and therefore only applicable for models with only a few variables. A least squares estimator with asymmetrical trimming and. Caballero, emmanuel farhi, and pierreolivier gourinchas nber working paper no. Rousseeuw and yohai 1984, by permission of springerverlag, new york. Ling comparison of several algorithms for computing sample means and variances. Rousseeuw and yohai 1984 were obtained by minimization. Asymptotic behavior of estimates based on residual autocovariances for arma models. Following seminal papers by box 1953 and tukey 1960, which demonstrated the need for robust statistical procedures, the theory of robust statistics blossomed in the 1960s and 1970s. Mm robust regression techniques, discussed in maronna. Your use of this publication shall be governed by the terms established by the vendor at the time.
Individual differences in the perception of biological motion. In addition, asymptotic distributions of the estimators are given, coupled with second order corrections to the bias of the estimators. Siegel, 1982, least median of squares rousseeuw, 1984, sestimatorsrousseeuw and yohai, 1984, mmestimators yohai, 1987 and testimators yohai and zamar, 1988. Robust regression via lts methods which achieve the goal of being insensitive to changes in a small percentage of the observations have only recently been developed. The asymptotic breakdown point of the sestimator is given by rousseeuw and yohai, 1984. Asymptotic normality of bn was proved by rousseeuw and yohai 1984 and further considered by davies 1990.
Fast very robust methods for the detection of multiple outliers. A robust learning approach for regression models based on. Rousseeuw 1984 developed the first practical robust regression estimators least median squares lms, least trimmed squares lts, and variants. The aforementioned robust estimation procedures focus on modifying the objective func. Individual differences in the perception of biological motion and fragmented figures are not correlated. Yohai, robust regression by mean of s estimators, robust and nonlinear time series analysis, new york, 1984, 256274, doi. S estimation is a high breakdown value method introduced by rousseeuw and yohai 1984. Individual differences in the perception of biological. In order to obtain a more efficient estimator under. We find that 1 that robust regression applications are appropriate for modeling stock returns in global markets. Rousseeuw, 1984 the asymptotic breakdown point is then defined as 2. Therefore, the outputs may differ slightly from those given in rousseeuw and leroy 1987 or those obtained from software based on the older version of progress. The topic of heteroscedasticityconsistent hc standard errors arises in statistics and econometrics in the context of linear regression and time series analysis.
Journal of computational and graphical statistics, volume 15, number 2, pages 414427. Sestimators of regression parameters, proposed by rousseeuw and yohai 1984, search for the. Robust regression by means of sestimators springerlink. Olson comparative robustness of six tests in multivariate analysis of variance. Minimum volume ellipsoid estimator mve the minimum volume ellipsoid mve estimator, first proposed by rousseeuw 1984, has been studied. The performance of this method was improved by the fastlts algorithm ofrousseeuw and van driessen2000. It has a higher statistical e ciency than sestimation. Later he was professor at the delft university of technology, the netherlands, at the. Trimmed squares lts rousseeuw, 1985, sestimation rousseeuw and yohai, 1984, and mmestimation yohai, 1987, are elaborated in the book of rousseeuw and leroy 2005. Journal of computational and graphical statistics, volume 15, number 2, pages 114. Robust dependence modeling for highdimensional covariance matrices with financial applications zhe zhu and roy e.
This is achieved via probabilistic principal component analysis ppca in which new statisticallyrobust variants are derived also treating missing data. To compute it, they use a modified version of the forward search algorithm see e. Mmestimates proposed by yohai 1987 combine a high breakdown point with good efficiency approximately 95% to ls under the gaussmarkov assumption. Yohai, robust regression by means of sestimators, in robust and nonlinear time series analysis, lecture notes in statistics. Use the link below to share a fulltext version of this article with your friends and colleagues. Robust procedure for estimating multivariate location and. An equilibrium model of global imbalances and low interest rates ricardo j. Note that abl is contained in al and b2, so in the sequel we may.
581 972 481 1293 162 880 889 16 923 990 226 668 666 799 544 33 1050 266 446 1476 1300 924 207 512 461 1191 335 766 1235 450 1097 355 84 1487