Skip to content
This repository has been archived by the owner on Dec 13, 2017. It is now read-only.

senya-ashukha/bigram-anchor-words

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bigram Anchor Words Topic Model

Implementation for the Bigram Anchor Words Topic Model paper. Bag of words is very poor text representation, since that, in traditional topic models, we are losing a lot of information. The project goal is to combine linguistic with statistical topic models. We propose new Anchor Words Topic Model [1] such as bigrams also could be anchor words.

[1] Sanjeev A., Rong G.: A Practical Algorithm for Topic Modeling with Provable Guarantees (NIPS, 2012)

Results

Here are an example of anchor words. Metrics are also good and could be found in the paper.

Experiments

You could use following code to repeat published results. A simple way to repeat experiments is to try to understand examples =) I'm sorry that documentation is absent.

cd bigram-anchor-words
ipython ./examples/{corpus}/{model}.py

Citation

If you found this code useful please cite our paper

@inproceedings{ashuha2016bigram,
  title={Bigram Anchor Words Topic Model},
  author={Ashuha, Arseniy and Loukachevitch, Natalia},
  booktitle={International Conference on Analysis of Images, Social Networks and Texts},
  pages={121--131},
  year={2016},
  organization={Springer}
}

About

An Implementation of Bigram Anchor Words algorithm

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages