Research Seminars: Probability and Statistics

Fall 2024

Time & Location: All talks are on Wednesdays in Gibson 126 at 4:00 PM unless otherwise noted.
Organizer: Xiang jiMichelle Lacey and Yuwei Bao

Archives

 

October 9

Title: SOFARI: High-Dimensional Manifold-Based Inference
Jinchi Lv – University of Southern California

Abstract:  Multi-task learning is a widely used technique for harnessing information from various tasks. Recently, the sparse orthogonal factor regression (SOFAR) framework, based on the sparse singular value decomposition (SVD) within the coefficient matrix, was introduced for interpretable multi-task learning, enabling the discovery of meaningful latent feature-response association networks across different layers. However, conducting precise inference on the latent factor matrices has remained challenging due to the orthogonality constraints inherited from the sparse SVD constraints. In this paper, we suggest a novel approach called the high-dimensional manifold-based SOFAR inference (SOFARI), drawing on the Neyman near-orthogonality inference while incorporating the Stiefel manifold structure imposed by the SVD constraints. By leveraging the underlying Stiefel manifold structure that is crucial to enabling inference, SOFARI provides easy-to-use bias-corrected estimators for both latent left factor vectors and singular values, for which we show to enjoy the asymptotic mean-zero normal distributions with estimable variances. We introduce two SOFARI variants to handle strongly and weakly orthogonal latent factors, where the latter covers a broader range of applications. We illustrate the effectiveness of SOFARI and justify our theoretical results through simulation examples and a real data application in economic forecasting. This is a joint work with Yingying Fan, Zemin Zheng and Xin Zhou.

Time: 4:00 pm
Location: Gibson Hall 126

 

October 16

Title: MCMC Importance Sampling via Moreau-Yosida Envelopes
Eric Chi – Rice University

Abstract:  Markov chain Monte Carlo (MCMC) is the workhorse computational algorithm employed for inference in Bayesian statistics. Gradient-based MCMC algorithms are known to yield faster converging Markov chains.  In modern parsimonious models, the use of non-differentiable priors is fairly standard, yielding non-differentiable posteriors. Without differentiability, gradient-based MCMC algorithms cannot be employed effectively. Recently proposed proximal MCMC approaches, however, can partially remedy this limitation. These approaches employ the Moreau-Yosida (MY) envelope to smooth the nondifferentiable prior enabling sampling from an approximation to the target posterior. In this work, we leverage properties of the MY envelope to construct an importance sampling paradigm to correct for this approximation error. We establish asymptotic normality of the importance sampling estimators with an explicit expression for the asymptotic variance which we use to derive a practical metric of sampling efficiency. Numerical studies show that the proposed scheme can yield lower variance estimators compared to existing proximal MCMC alternatives.

Time: 4:00 pm
Location: Gibson Hall 126

 

December 4

Title: Shrinkage-based phylogenetic modeling
Alexander Fisher – Duke University

Abstract:  In many phylogenetic models, the number of parameters to estimate grows with the number of taxa under study. However, parsimonious models of evolution demand local similarity in parameters on subtrees. To achieve scalable inference in such a setting, we employ auto-correlated, shrinkage-based models. We compare inference under these models to previous state-of-the art in a variety of applied settings. In one example, we investigate the heritable clock structure of various surface glycoproteins of influenza A virus in the absence of prior knowledge about molecular clock placement. In another example, we estimate the phylogenetic location of environmental shifts in the ancestry of Anolis lizards.
 

Time: 4:00 pm
Location: Gibson Hall 126