Academia.eduAcademia.edu

Multivariate Normal Distribution

description3,109 papers
group233 followers
lightbulbAbout this topic
The multivariate normal distribution is a generalization of the one-dimensional normal distribution to higher dimensions, characterized by a mean vector and a covariance matrix. It describes the joint distribution of multiple correlated random variables, where any linear combination of these variables follows a normal distribution.
lightbulbAbout this topic
The multivariate normal distribution is a generalization of the one-dimensional normal distribution to higher dimensions, characterized by a mean vector and a covariance matrix. It describes the joint distribution of multiple correlated random variables, where any linear combination of these variables follows a normal distribution.

Key research themes

1. How can mixture model extensions improve clustering of data with non-elliptical shapes, skewness, or heavy tails beyond the traditional multivariate normal assumption?

This line of research investigates extending classical Gaussian mixture models to handle complex cluster shapes such as skewed, heavy-tailed, or multi-modal distributions, which the multivariate normal mixture fails to adequately capture. Improving modeling fidelity in clustering is critical for accurate group identification and reducing misclassification, especially in high-dimensional or complex structured data.

Key finding: Introduces mixtures of multivariate normal inverse Gaussian (MNIG) distributions as a more flexible alternative to Gaussian mixtures that handle skewness and heavy tails. Their EM algorithm implementation shows improved... Read more
Key finding: Develops parsimonious mixtures of multivariate leptokurtic-normal (MLN) distributions characterized by parameters directly related to practical moments like kurtosis, addressing limitations of normal mixtures in modeling... Read more
Key finding: Proposes a bimodal extension of the epsilon-skew-normal (ESN) distribution that naturally models bimodal and asymmetric data, which traditional skew-normal and normal mixtures often fail to capture. The paper presents maximum... Read more

2. What are the advanced statistical inference and hypothesis testing techniques tailored for high-dimensional or dependent multivariate normal data?

With the rise of high-dimensional data where dimensionality approaches or exceeds sample size, and the presence of dependence structures in samples, classical multivariate normality tests and hypothesis procedures become invalid. This theme focuses on developing accurate, computationally feasible inference and testing methods that address the challenges posed by high dimensionality and sample dependence.

Key finding: Demonstrates that saddlepoint-based directional tests yield exact, uniformly distributed p-values under null hypotheses even when dimension p increases proportionally to sample size n, under mild conditions ensuring MLE... Read more
Key finding: Extends Mardia's multivariate kurtosis test to dependent samples by deriving explicit asymptotic mean and variance expressions under sample dependence modeled through matrix-valued covariances. The test improves joint... Read more
Key finding: Introduces a novel test statistic (Tnew), based on consistent U-statistic estimators of quadratic and bilinear forms of covariance matrices, for testing equality of two covariance matrices in high-dimensional normal data with... Read more

3. How can computational and methodological innovations enhance estimation and simulation of multivariate normal and related distributions?

This theme covers novel computational techniques for generating, approximating, or estimating multivariate normal and related distributions, improving simulation efficiency and accuracy in applied contexts. This includes transformations, spherical integration methods, and estimation of complicated functionals such as products of normal variables or truncated/skewed extensions.

Key finding: Presents a novel generation method for multivariate normal distributions based on symplectic transformations linked to real Dirac matrices, allowing conversion of independent normal variables into correlated variables with a... Read more
Key finding: Proposes a spherical Monte Carlo integration technique for computing multivariate normal probabilities by decomposing the integral into radial and spherical components, using randomized rotations of well-distributed point... Read more

All papers in Multivariate Normal Distribution

The paper addresses asymptotic estimation of normal means under sparsity. The primary focus is estimation of multivariate normal means where we obtain exact asymptotic minimax error under globallocal shrinkage prior. This extends the... more
In this paper, two-stage and fixed-sample-size procedures are developed for constructing confidence intervals for the common correlation ρ of an equi-correlated multivariate normal distribution. Two different approaches for estimation are... more
We present and study a procedure for testing the null hypothesis of multivariate elliptical symmetry. The procedure is based on the averages of some spherical harmonics over the projections of the scaled residual (1978, N. J. H. Small,... more
By modifying the statistic of , we introduce and study the properties of a notion of multivariate skewness that provides both a magnitude and an overall direction for the skewness present in multivariate data. This notion leads to a test... more
Let X = (X1, X2,..., Xd) ~ be a random vector of positive entries, such that for some )~ = ()h, A2,..., Ad) t, the vector X (~) defined by X~ ~) --(X~ ~ -1)/),~, i = 1,..., d is elliptically symmetric. We describe a procedure based on the... more
This work describes an unsupervised method to objectively quantify the abnormality of general anatomical shapes. The severity of an anatomical deformity often serves as a determinant in the clinical management of patients. However,... more
This work describes an unsupervised method to objectively quantify the abnormality of general anatomical shapes. The severity of an anatomical deformity often serves as a determinant in the clinical management of patients. However,... more
Let us consider (Ω, ℱ, 𝑃) be a probability space of a random experiment. A collection or vector 𝑋 ~ = (𝑋 1 , 𝑋 2 , ⋯ , 𝑋 𝑝 ) 𝑇 defined on (Ω, ℱ, 𝑃) and maps Ω into ℛ 𝑝 i.e., (𝑣𝑖) For all 𝑥 ~= (𝑥 1 , 𝑥 2 , ⋯ , 𝑥 𝑝 ) 𝑇 ∈ ℛ 𝑝 and for all ℎ 𝑖... more
This paper derives the characteristic function of the univariate t-distributiou in terms of a well-known special function, namely, the Macdouald function, which has been extensively studied by several authors in recent years. Moments of... more
In this paper we investigate some of the well known properties of the multivariate t-disribution by directly utilizing the Macdonald function represeatation of its characteristic function. Consequently, we obtain a limit theorem of the... more
In a recent paper Muirhead (1986) derived certaiu useful ideutities iavolviag expectatione taken with respect to the wishart distribution. This uote generalizes the above results by taking expectations with respect to a geaeralized.... more
We consider n pairs of random variables (Xt1,Xzt),(Xn,Xz),...,(Xr,,Xz) having a bivariate elliptically contoured density of the form K(n)lt\l-'12 s where 01, 02 are location parameters and A : ((1,*)) is a 2 x 2 symmetric positive... more
It is shown (Proposition (3.9)) that the asymptotic information bound which is valid for the estimation of a parameter in the structure (mixture) model remains valid in the functional model (incidental nuisance parameters) if only... more
The Fully Bayesian Significance Test (FBST) is a coherent Bayesian significance test for sharp hypotheses. The computation of the evidence measure used on the FBST is performed in two steps: 1) The optimization step consists of finding f... more
We review the definition of the Full Bayesian Significance Test (FBST), and summarize its main statistical and epistemological charac- teristics. We review also the Abstract Belief Calculus (ABC) of Darwiche and Ginsberg, and use it to... more
principal components. Compared to Mclust (model-based clustering), our method shows more consistent results.
The Full Bayesian Significance Test (FBST) for precise hypotheses is applied to a Multivariate Normal Structure (MNS) model. In the FBST we compute the evidence against the precise hypothesis. This evidence is the probability of the... more
Maier et al. (2010) introduced the relational causal model (RCM) for representing and inferring causal relationships in relational data. A lifted representation, called abstract ground graph (AGG), plays a central role in reasoning with... more
Although the hazard rates sharing a number of similar aspects. The hazard functions are far less frequently used. The problem of hazard functions estimation has received much attention in the statistical literature. This paper studies the... more
In this study we present a closed form solution to the moments and, in particular, correlation of two log-normally distributed random variables, where the underlying log-normal distribution is potentially truncated and censored at both... more
This paper introduces a new family of multivariate distributions based on Gram-Charlier and Edgeworth expansions. This family encompasses many of the univariate seminonparametric densities proposed in the financial econometrics as... more
Numerical computation of multivariate normal probability integrals is often required in quantitative genetic studies. In particular, this is the case for the evaluation of the genetic superiorities after independent culling levels... more
Bootstrap methods are considered in the application of statistical process control because they can deal with unknown distributions and are easy to calculate using a personal computer. In this study we propose the use of bootstrap-t... more
Minimax control chart uses the joint probability distribution of the maximum and minimum standardized sample means to obtain the control limits for monitoring purpose. However, the derivation of the joint probability distribution needed... more
We consider a wide class of stochastic process traffic assignment models that capture the dayto-day evolving interaction between traffic congestion and drivers' information acquisition and choice processes. Such models provide a... more
We consider a wide class of stochastic process traffic assignment models that capture the dayto-day evolving interaction between traffic congestion and drivers' information acquisition and choice processes. Such models provide a... more
PurposeThe purpose of this study is to extend the classical noncentral F-distribution under normal settings to noncentral closed skew F-distribution for dealing with independent samples from multivariate skew normal (SN)... more
Despite surface displacements observed by geodesy are linear combinations of slip at faults in an elastic medium, determining the spatial distribution of fault slip remains a ill-posed inverse problem. A widely used approach to circumvent... more
In this paper the problem of estimating the scale matrix in a complex elliptically contoured distribution(complex ECD) is addressed. An extended Haff-Stein identity for this model is derived. It is shown that the minimax estimators of the... more
We present a graph-based technique for estimating sparse covariance matrices and their inverses from high-dimensional data. The method is based on learning a directed acyclic graph (DAG) and estimating parameters of a multivariate... more
We propose a new and computationally efficient algorithm for maximizing the observed log-likelihood for a multivariate normal data matrix with missing values. We show that our procedure based on iteratively regressing the missing on the... more
We propose estimation methods for change points in high-dimensional covariance structures with an emphasis on challenging scenarios with missing values. We advocate three imputation like methods and investigate their implications on... more
We propose estimation methods for change points in high-dimensional covariance structures with an emphasis on challenging scenarios with missing values. We advocate three imputation like methods and investigate their implications on... more
We present a graph-based technique for estimating sparse covariance matrices and their inverses from high-dimensional data. The method is based on learning a directed acyclic graph (DAG) and estimating parameters of a multivariate... more
High-breakdown-point estimators of multivariate location and shape matrices, such as the MM-estimator with smooth hard rejection and the Rocke S-estimator, are generally designed to have high efficiency at the Gaussian distribution.... more
Many applications in signal processing require the estimation of mean and covariance matrices of multivariate complex-valued data. Often, the data are non-Gaussian and are corrupted by outliers or impulsive noise. To mitigate this, robust... more
A copula is an aggregation function, that can be used as a dependency model. Any multivariate distribution function can be characterized by its marginals and a copula. When introducing imprecision in the modelling of those distribution... more
We study the distribution of the maximum likelihood estimate (MLE) in high-dimensional logistic models, where covariates are Gaussian with an arbitrary covariance structure. We prove that in the limit of large problems holding the ratio... more
A depth-based rank sum statistic for multivariate data introduced by Liu and Singh [J. Amer. Statist. Assoc. 88 (1993) 252-260] as an extension of the Wilcoxon rank sum statistic for univariate data has been used in multivariate rank... more
This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will... more
We construct a flexible dynamic linear model for the analysis and prediction of multivariate time series, assuming a two-piece normal initial distribution for the state vector. We derive a novel Kalman filter for this model, obtaining a... more
We propose and study the class of Box-Cox elliptical distributions. It provides alternative distributions for modeling multivariate positive, marginally skewed and possibly heavy-tailed data. This new class of distributions has as a... more
In this paper we obtain an adjusted version of the likelihood ratio test for errors-in-variables multivariate linear regression models. The error terms are allowed to follow a multivariate distribution in the class of the elliptical... more
The problem of reducing the bias of maximum likelihood estimator in a general multivariate elliptical regression model is considered. The model is very flexible and allows the mean vector and the dispersion matrix to have parameters in... more
Download research papers for free!