Skip to content

shivikasharmaaa/LDA-Document-Classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

LDA-Document-Classifier

A Document Classification project using Latent Dirichlet Allocation

About

This repository demonstrates the use of Topic Modelling for the task of Document Classification. Topic Modelling is useful in learning latent topics in a given document corpus and can be extended to tasks like classification. The LDA.ipynb first implements a rudimentary version of Latent Dirichlet Allocation using Gibbs Sampling (a Markov Chain Monte Carlo (MCMC) algorithm) and then uses its output for the purpose of Document Classification using Support Vector Machines.

Note: The original paper on LDA is an interesting read!

About

Document Classification using Latent Dirichlet Allocation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published