Boxplots are among the most widely used exploratory data analysis (EDA) tools in statistical prac... more Boxplots are among the most widely used exploratory data analysis (EDA) tools in statistical practice. Typical applications of boxplots include eliciting information about the underlying distribution (shape, location, etc.) as well as identifying possible outliers. This article focuses on a modification using a type of lower and upper fences similar in concept to those used in a traditional boxplot; however, instead of constructing the upper and lower fences using the upper and lower quartiles, respectively, and a multiple of the interquartile range (IQR), multiples of the upper and the lower semiinterquartile ranges (SIQR), respectively, measured from the sample median, are used. Any observation beyond the proposed fences is labeled a potential outlier. An exact expression for the probability that at least one sample observation is wrongly classified as an outlier, the so-called "some-outside rate per sample" ), is derived for the family of location-scale distributions and is used in the determination of the fence constants. Tables for the fence constants are provided for a number of well-known location-scale distributions along with some illustrations with data; the performance of the outlier detection rule is explored in a simulation study.
Li and Liu [New nonparametric tests of multivariate locations and scales. Statist Sci. 2004;19(4)... more Li and Liu [New nonparametric tests of multivariate locations and scales. Statist Sci. 2004;19(4):686-696] introduced two tests for a difference in locations of two multivariate distributions based on the concept of data depth. Using the simplicial depth [Liu RY. On a notion of data depth based on random simplices. Ann Stat. 1990;18(1):405-414], they studied the performance of these tests for symmetric distributions, namely, the normal and the Cauchy, in a simulation study. However, to the best of our knowledge, the performance of these tests for skewed distributions has not been studied in the current literature. This paper is a contribution in that direction and examines the performance of these depth-based tests in an extensive simulation study involving ten distributions belonging to five well-known families of multivariate skewed distributions. The study includes a comparison of the performance of these tests for four popular affine-invariant depth functions. Conclusions and recommendations are offered.
To monitor the quality/reliability of a (production) process, it is sometimes advisable to monito... more To monitor the quality/reliability of a (production) process, it is sometimes advisable to monitor the time between certain events (say occurrence of defects) instead of the number of events, particularly when the events occur rarely. In this case it is common to assume that the times between the events follow an exponential distribution. In this paper, we propose a one-and a two-sided control chart for phase I data from an exponential distribution. The control charts are derived from a modified boxplot procedure. The charting constants are obtained by controlling the overall Type I error rate and are tabulated for some configurations. A numerical example is provided for illustration. The in-control robustness and the out-of-control performance of the proposed charts are examined and compared with those of some existing charts in a simulation study. It is seen that the proposed charts are considerably more in-control robust and have out-control properties comparable to the competing charts.
Uploads
Papers by Herve Dovoedo