Skip to content

perathambkk/probabilistic-lexicon-classification

 
 

Repository files navigation

A Python3 compatible fork of Unsupervised Learning for Lexicon-Based Classification (with a notebook)

This is the (forked 🍴) implementation from the eponymous AAAI 2017 paper.

Please cite:

Eisenstein, Jacob. "Unsupervised learning for lexicon-based classification." Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 31. No. 1. 2017.

BibTex:

@inproceedings{eisenstein2017unsupervised,
  title={Unsupervised learning for lexicon-based classification},
  author={Eisenstein, Jacob},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={31},
  number={1},
  year={2017}
}

My notebook (from bayeslex.py -- the main file) is at Unsup_Lexicon.ipynb.


(Following texts are from the original repo's readme.)

It includes:

I (Prof. Eisenstein) cannot promise to provide much support for the code, but I will try.

Reproducing results

To reproduce the Cornell dataset results, you can run the following line:

python bayeslex.py --epochs 250 cornell liu-pos.utf8 liu-neg.utf8 --optimizer admm --prefilter

The second column of the output contains the AUC numbers reported in the paper.

You can get the other datasets in the paper at the following locations:

The ISOL lexicon was obtained here: http://metashare.upf.edu/repository/browse/isol/f93c81ba65c911e48763000c291ecfc893e66d98695145c0b0066cfa00e6ccda/

About

Python3 compatible with a notebook

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 67.8%
  • Jupyter Notebook 32.2%