Search

Scholarly Works (3 results)

Sort By:

Thesis
Peer Reviewed

Learning Latent Hierarchical Structures via Probabilistic Models and Deep Learning

Arabshahi, Forough
Advisor(s): Singh, Sameer

UC Irvine Electronic Theses and Dissertations (2018)

Hierarchical structures arise in many real world applications and domains. For example, in social networks people’s relationships and the groups to which they belong form a hierarchy. In natural language and computer programs, parse trees (which have a hierarchical structure) are used to represent the compositionality of expressions. These hierarchies strongly affect the statistics and the behavior of the data. Hence, it is important to develop algorithms that take these structures into account when modeling such data. Apart from these hierarchical structures, some datasets are best explained with hierarchical models even though there is no apparent hierarchy in the data itself. For instance when modeling the occurrence of words in a document, it is more realistic to assume that the words are drawn in a hierarchical manner from a topic distribution rather than independently from a single topic. In this dissertation, we focus on capturing these hierarchies and leveraging them for modeling high dimensional datasets.

Hierarchical structures underlying the data are either observed or latent. For example in the context of computer programs, the syntax tree is inherent to the program and is therefore observed. On the other hand, the statistical dependence of a social network’s users is latent. In this dissertation, we study both types of hierarchies and develop models under both struc- tures because they both arise in many applications and are equally important. Nevertheless, capturing latent hierarchical structures is more challenging. We develop novel probabilistic models to capture latent hierarchies and present statistically efficient and provably consistent parameter learning algorithms for them. When capturing observed hierarchical structures we develop deep learning models that learn low-dimensional continuous representations for the discrete symbols and variables.

Cover page: Learning Latent Hierarchical Structures via Probabilistic Models and Deep Learning

Article
Peer Reviewed

Spectral Methods for Correlated Topic Models

UC Irvine Previously Published Works (2016)

In this paper, we propose guaranteed spectral methods for learning a broad range of topic models, which generalize the popular Latent Dirichlet Allocation (LDA). We overcome the limitation of LDA to incorporate arbitrary topic correlations, by assuming that the hidden topic proportions are drawn from a flexible class of Normalized Infinitely Divisible (NID) distributions. NID distributions are generated through the process of normalizing a family of independent Infinitely Divisible (ID) random variables. The Dirichlet distribution is a special case obtained by normalizing a set of Gamma random variables. We prove that this flexible topic model class can be learned via spectral methods using only moments up to the third order, with (low order) polynomial sample and computational complexity. The proof is based on a key new technique derived here that allows us to diagonalize the moments of the NID distribution through an efficient procedure that requires evaluating only univariate integrals, despite the fact that we are handling high dimensional multivariate moments. In order to assess the performance of our proposed Latent NID topic model, we use two real datasets of articles collected from New York Times and Pubmed. Our experiments yield improved perplexity on both datasets compared with the baseline.

Cover page: Spectral Methods for Correlated Topic Models

Multimedia
Peer Reviewed

Predictive Modeling in Online Learning Environments

UC Irvine Previously Published Works (2015)

Abstract— In this work we study scalable probabilistic modeling and prediction for predicting student performance in a Massive Open Online Course (MOOC). Students’ performance sequence form a high dimensional multivariate time series whose joint prediction is a challenging task. We solve the problem through the discovery of hierarchical latent groups that influence the dynamics of the time series. We introduce a Conditional Latent Tree Model (CLTM), in which the latent variables incorporate the unknown groups. The latent tree itself is conditioned on observed covariates such as seasonality, past activity and node attributes. We propose a statistically efficient framework for learning the hierarchical tree structure, and the parameters of the CLTM. We demonstrate competitive performance compared to the baseline (chain CRF) that does not use the hierarchical latent groupings for prediction. Our modeling framework also provides valuable and interpretable information about the hidden group structures and their effect on the evolution of the time series.

Visit our project page: http://newport.eecs.uci.edu/anandkumar/Lab/Lab_sub/CLTM.html

Faculty Advisor: Animashree Anandkumar, Carter T. Butts

Data Science Initiative, University of California Irvine, May 2015

1 supplemental PDF