【转】collection of papers related with topic models

转载  collection of papers related with topic models[To be added more]

http://blog.sina.com.cn/s/blog_6d06d9bd0100w9rq.html

(This paper list is made based on the list found athttp://hi.baidu.com/flyer_hit/blog/item/e1110d38d3708bd5d562251c.html by removing some irrelevant papers and by adding some important papers. All hyperlinks are attached by me.)

Theory

Fundamentals

Unsupervised Learning by Probabilistic Latent Semantic Analysis
Latent Dirichlet Allocation
Finding Scientific Topics
On Smoothing and Inference for Topic Models
Rethinking LDA: Why Priors Matter
On an Equivalence between PLSI and LDA

Variants

Correlated Topic Models
Hierarchical Topic Models and the Nested Chinese Restaurant Process
Hierarchical Dirichlet Processes
Nonparametric Bayes Pachinko Allocation.
Topic Models with Power-Law Using Pitman-Yor Process
Supervised Topic Models
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression
Discriminative Topic Modeling based on Manifold Learning
Interactive Topic Modeling
Mixtures of Hierarchical Topics with Pachinko Allocation
Incorporating Domain Knowledge into Topic Modeling via Dirichlet Forest Priors
Conditional topic random fields
Markov Random Topic Fields
A Two-dimensional Topic-aspect Model for Discovering Multi-faceted Topics
Generalized Component Analysis for Text with Heterogeneous Attributes
Sparse Additive Generative Models of Text
A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases (using von Mises-Fisher for generating normalized word counts)
Term Weighting Schemes for Latent Dirichlet Allocation

Inference

Finding Scientific Topics
Parameter Estimation for Text Analysis
A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation
Deterministic Single-Pass Algorithm for LDA

Online learning and scalability

Fast Collapsed Gibbs Sampling for Latent Dirichlet Allocation
Distributed Inference for Latent Dirichlet Allocation
Online Inference of Topics with Latent Dirichlet Allocation
Online Variational Inference for the Hierarchical Dirichlet Process
Online Learning for Latent Dirichlet Allocation
Efficient Methods for Topic Model Inference on Streaming Document Collections
Parallel Inference for Latent Dirichlet Allocation on Graphics Processing Units

On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking

Evaluation

Reading Tea Leaves: How Humans Interpret Topic Models
Evaluation Methods for Topic Models
Bayesian Checking for Topic Models

Applications

Supervised learning

DiscLDA: Discriminative Learning for Dimensionality Reduction and Classification
Labeled LDA: A Supervised Topic Model for Credit Attribution in Multi-labeled Corpora
MedLDA: Maximum Margin Supervised Topic Models for Regression and Classification
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression

Network data (social network) mining

Empirical Study of Topic Modeling in Twitter
Characterizing Microblogs with Topic Models
TwitterRank: Finding Topic-sensitive Influential Twitterers (using not collapsed but conventional Gibbs sampling for LDA)
Comparing Twitter and Traditional Media Using Topic Models
Link-PLSA-LDA: A New Unsupervised Model for Topics and Influence of Blogs
Connections between the Lines: Augmenting Social Networks with Text
Relational Topic Models for Document Networks
---> Hierarchical Relational Models for Document Networks (arXiv)
Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email
Group and Topic Discovery from Relations and Text
Probabilistic Models for Discovering E-communities
Arnetminer: Extraction and Mining of Academic Social Networks
Community Evolution in Dynamic Multi-mode Networks
An LDA-based Community Structure Discovery Approach for Large-scale Social Networks
Probabilistic Community Discovery Using Hierarchical latent Gaussian Mixture Model
Joint Group and Topic Discovery from Relations and Text
Social Topic Models for Community Extraction
Topic-Link LDA: Joint Models of Topic and Author Community
Modeling Hidden Topics on Document Manifold (an extension of PLSI)
Topic Modeling with Network Regularization
Mining Topic-Level Influence in Heterogeneous Networks
Utilizing Context in Generative Bayesian Models for Linked Corpus

Latent Topic Models for Hypertext

iTopicModel: Information Network-Integrated Topic Modeling

Sentiment analysis and opinion mining

Rated Aspect Summarization of Short Comments (an extension of PLSI)
Learning Document-level Semantic Properties from Free-text Annotations
Joint Sentiment/Topic Model for Sentiment Analysis
Mining Multi-faceted Overviews of Arbitrary Topics in a Text Collection
Modeling Online Reviews with Multi-grain Topic Models
A Joint Model of Text and Aspect Ratings for Sentiment Summarization
Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs
Opinion Integration through Semi-supervised Topic Modeling (a semi-supervised version of PLSI)
Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation
Latent Aspect Rating Analysis on Review Text Data: A Rating Regression Approach (a non-Bayesian method)
Aspect and Sentiment Unification Model for Online Review Analysis
An Unsupervised Aspect-sentiment Model for Online Reviews (using LDA as a black box)
Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid

Temporal and spatial data analysis

Discovering Evolutionary Theme Patterns from Text: an Exploration of Temporal Text Mining (a non-Bayesian method)
Topics over Time: a Non-Markov Continuous-time Model of Topical Trends
Topic Models over Text Streams: a Study of Batch and Online Unsupervised Learning
Mining Correlated Bursty Topic Patterns from Coordinated Text Streams (an extension of PLSI)
Topic Evolution in a Stream of Documents (a non-Bayesian method)
Evolutionary Hierarchical Dirichlet Processes for Multiple Correlated Time-varying Corpora
Studying the History of Ideas Using Topic Models
Mining Common Topics from Multiple Asynchronous Text Streams (a non-Bayesian method)
Online Multiscale Dynamic Topic Models
The Dynamic Hierarchical Dirichlet Process
Multiscale Topic Tomography

A Latent Variable Model for Geographic Lexical Variation

Dynamic Topic Model
Continuous Time Dynamic Topic Model
A Probabilistic Approach to Spatio Temporal Theme Pattern Mining on Weblogs (a non-Bayesian method)
Dynamic Mixture Models for Multiple Time Series
Spatial Latent Dirichlet Allocation

Scientific publication mining

The Author-Topic Model for Authors and Documents
Statistical Entity-Topic Models
Probabilistic Author-Topic Models for Information Discovery
The Author-Recipient-Topic Model for Topic and Role Discovery in Social Networks
Expertise Modeling for Matching Papers with Reviewers
Topic Evolution and Social Interactions: How Authors Effect Research
Joint Latent Topic Models for Text and Citations
Co-ranking Authors and Documents in a Heterogeneous Network
Mixed-Membership Models of Scientific Publications
Modeling Individual Differences using Dirichlet Processes
Multi-Aspect Expertise Matching for Review Assignment (an extension of PLSI)
Group and Topic Discovery from Relations and Their Attributes
Topic and Trend Detection in Text Collections Using Latent Dirichlet Allocation
Mining a Digital Library for Influential Authors (a non-Bayesian method)
Bibliometric Impact Measures Leveraging Topic Analysis
--> Topical N-grams: Phrase and Topic Discovery, with an Application to Information Retrieval
Context-aware Citation Recommendation (not so closely related to topic models)
Detecting Topic Evolution in Scientific Literature: How Can Citations Help?
Latent Interest-Topic Model: Finding the Causal Relationships behind Dyadic Data
A Topic Modeling Approach and its Integration into the Random Walk Framework for Academic Search

Information retrieval

LDA-based Document Models for Ad-hoc Retrieval
Exploring Social Annotations for Information Retrieval
Modeling General and Specific Aspects of Documents with a Probabilistic Topic Model
Exploring Topic-based Language Models for Effective Web Information Retrieval (not so closely related to topic models)

Information extraction

Optimizing Semantic Coherence in Topic Models
Employing Topic Models for Pattern-based Semantic Class Discovery (using LDA and PLSI as is)
Modeling Documents by Combining Semantic Concepts with Unsupervised Statistical Learning
A Probabilistic Approach for Adapting Information Extraction Wrappers and Discovering New Attributes (a non-Bayesian method)
An Unsupervised Framework for Extracting and Normalizing Product Attributes from Multiple Web Sites
Learning to Adapt Web Information Extraction Knowledge and Discovering New Attributes via a Bayesian Approach
Semi-supervised Extraction of Entity Aspects Using Topic Models

Annotations (or tagging, labeling) and recommendation

Automatic Labeling of Multinomial Topic Models (using PLSI as a black box)
Context Modeling for Ranking and Tagging Bursty Features in Text Streams (a non-Bayesian method)
Learning Document-level Semantic Properties from Free-Text Annotations
Generating Summary Keywords for Emails Using Topics (using LDA as a black box)
Latent Dirichlet Allocation for Tag Recommendation (using LDA as a black box)
Tag-LDA for Scalable Real-time Tag Recommendation
The Topic-Perspective Model for Social Tagging Systems
A Probabilistic Topic-Connection Model for Automatic Image Annotation
Clustering the Tagged Web (Multi-Multinomial LDA)

Summarization

Generating Aspect-oriented Multi-Document Summarization with Event-aspect model
Topical Keyphrase Extraction from Twitter
Bayesian Query-Focused Summarization
Topic-based Multi-document Summarization with Probabilistic Latent Semantic Analysis
Multi-topic based Query-oriented Summarization
Multi-Document Summarization using Sentence-based Topic Models
Generating Impact-Based Summaries for Scientific Literature
A Hybrid Hierarchical Model for Multi-Document Summarization
Generating Templates of Entity Summaries with an Entity-Aspect Model and Pattern Mining
Latent Dirichlet Allocation and Singular Value Decomposition Based Multi-document Summarization(using LDA as a black box)

NLP tasks

Structured Relation Discovery using Generative Models
A Framework for Incorporating General Domain Knowledge into Latent Dirichlet Allocation using First-Order Logic
Word Features for Latent Dirichlet Allocation
Content Modeling Using Latent Permutations
Topic Models for Word Sense Disambiguation and Token-based Idiom Detection
Syntactic Topic Models
Integrating Topics and Syntax
Topic Modeling: Beyond Bag-of-words
A Bayesian LDA-based Model for Semi-supervised Part-of-speech Tagging
Topical n-grams: Phrase and Topic Discovery, with an Application to Information Retrieval
A Topic Model for Word Sense Disambiguation
Named Entity Recognition in Query
Multilingual Topic Models for Unaligned Text
Markov topic models
Modeling Syntactic Structures of Topics with a Nested HMM-LDA [slides (PDF)(PPT)]
Topic Segmentation with an Aspect Hidden Markov Model
Polylingual Topic Models
A Latent Dirichlet Allocation method for Selectional Preferences
Improving Word Sense Disambiguation Using Topic Features
Cross-Lingual Latent Topic Extraction
Exploiting Conversation Structure in Unsupervised Topic Segmentation for Emails
Topic Models for Word Sense Disambiguation and Token-Based Idiom Detection
Exploring Supervised LDA Models for Assigning Attributes to Adjective-Noun Phrases (using Labeled LDA)

DB

Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases

原文地址:https://www.cnblogs.com/parapax/p/3728018.html