LDA: Latent Dirichlet Allocation

This repository bundles three open-source implementations of LDA based on collapsed Gibbs sampling, each modified by nanjunxiao.

GibbsLDA++: single-threaded, written in C++

ompi-lda: multi-node/multi-threaded, written in C++

online_twitter_lda: multi-threaded, written in Python

Collapsed Gibbs LDA reference: my blog

What's New

1. GibbsLDA++

Fixed bugs:

1). Memory leak. Use 'delete[] p' instead of 'delete p' when p points to an array.

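2). Array index out of bounds. (double)random() / RAND_MAX lies in the closed interval [0, 1], so it can equal 1 and produce the invalid index K. Dividing by RAND_MAX + 1.0 instead keeps the value in [0, 1):

int topic = (int)(((double)random() / RAND_MAX) * K);  -->  int topic = (int)(((double)random() / (RAND_MAX + 1.0)) * K);
double u = ((double)random() / RAND_MAX) * p[K - 1];   -->  double u = ((double)random() / (RAND_MAX + 1.0)) * p[K - 1];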
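
The two fixes above can be illustrated with a minimal sketch. It is not the GibbsLDA++ code: the function names uniform01, random_topic and sample_from_weights are hypothetical, std::rand() stands in for random(), and a std::vector replaces the raw array so that no new[]/delete[] pair has to be matched by hand. The point is only that a draw from the half-open interval [0, 1) can never select index K.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <vector>

// Uniform draw in the half-open interval [0, 1).
// The divisor is RAND_MAX + 1.0 computed as a double, so the result never
// reaches 1 and the addition cannot overflow when RAND_MAX equals INT_MAX.
double uniform01() {
    return static_cast<double>(std::rand()) /
           (static_cast<double>(RAND_MAX) + 1.0);
}

// Initial topic assignment: an index in [0, K - 1].
int random_topic(int K) {
    assert(K > 0);
    return static_cast<int>(uniform01() * K);
}

// Draw an index from unnormalized weights using cumulative sums.
// u01 must lie in [0, 1); the vector owns its buffer, so nothing needs
// delete or delete[].
int sample_from_weights(const std::vector<double>& weights, double u01) {
    assert(!weights.empty());
    assert(u01 >= 0.0 && u01 < 1.0);

    std::vector<double> cumulative(weights.size());
    double running = 0.0;
    for (std::size_t k = 0; k < weights.size(); ++k) {
        running += weights[k];
        cumulative[k] = running;
    }

    // u is strictly below the last cumulative sum, up to rounding.
    const double u = u01 * cumulative.back();
    for (std::size_t k = 0; k < cumulative.size(); ++k) {
        if (u < cumulative[k]) return static_cast<int>(k);
    }
    // Guard against floating-point rounding: fall back to the last index.
    return static_cast<int>(cumulative.size()) - 1;
}
```

A sampler would call sample_from_weights(p, uniform01()) once per token, where p holds the per-topic weights. The explicit fallback on the last line is a defensive choice, so the returned index stays valid even if rounding pushes u up to the final cumulative sum.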

2. ompi-lda

Fixed bugs:

1). Fixed bugs in infer.cc.

2). Removed the 'sampler.UpdateModel(corpus)' call in lda.cc.

Added features:

1). Added theta and twords file output.

2). Bundled the required subset of Boost's hpp/cpp files in the include dir, so the project can be built with make directly.
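
The README does not show how the theta and twords outputs are computed. As a rough sketch only, and assuming the usual smoothed estimate from collapsed Gibbs counts, theta for one document can be derived from its per-topic counts, and the top words of a topic can be picked by count. The names estimate_theta and top_words, the alpha parameter and the count layout are all hypothetical and are not taken from ompi-lda.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Assumed estimate: theta[k] = (n_dk + alpha) / (n_d + K * alpha),
// where n_dk is the number of tokens of the document assigned to topic k
// and n_d is the document length.
std::vector<double> estimate_theta(const std::vector<int>& doc_topic_counts,
                                   double alpha) {
    assert(!doc_topic_counts.empty());
    const std::size_t K = doc_topic_counts.size();
    const int total =
        std::accumulate(doc_topic_counts.begin(), doc_topic_counts.end(), 0);
    const double denom = total + static_cast<double>(K) * alpha;

    std::vector<double> theta(K);
    for (std::size_t k = 0; k < K; ++k) {
        theta[k] = (doc_topic_counts[k] + alpha) / denom;
    }
    return theta;
}

// Indices of the n highest-count words of one topic, highest count first.
// If n exceeds the vocabulary size, all word indices are returned.
std::vector<std::size_t> top_words(const std::vector<int>& topic_word_counts,
                                   std::size_t n) {
    std::vector<std::size_t> ids(topic_word_counts.size());
    std::iota(ids.begin(), ids.end(), std::size_t{0});
    n = std::min(n, ids.size());
    std::partial_sort(ids.begin(), ids.begin() + n, ids.end(),
                      [&](std::size_t a, std::size_t b) {
                          return topic_word_counts[a] > topic_word_counts[b];
                      });
    ids.resize(n);
    return ids;
}
```

Taking n as a parameter instead of a fixed constant is a deliberate choice here: it is the kind of knob the "twordsnum" TODO item asks for.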

3. online_twitter_lda

Added features:

1). Added theta and phi mat file output.

TODO

ompi-lda

1). Make twordsnum configurable.

2). Rewrite cmd_flag without Boost, so the include dir can be removed.

3). Rewrite the makefile.
