- [api] class Simhasher: default argument supported
- README: add https://github.com/yanyiwu/simhash-demo
- demo: removed example/demo.cpp and using github.com/yanyiwu/simhash-demo instead
- class Simhasher: using cppjieba::Jieba instead of cppjieba::keywordextractor
- [submodule/cppjieba] v5.3.1 -> v5.4.0
- cmake: avoid testing when FetchContent by other project
- [stale-issues] stale 1 year ago
- [googletest] removed submodule and add cmake-fetchcontent
- [submodule] cppjieba v5.3.0 -> v5.3.1
- [submodule] cppjieba v5.1.2 -> v5.3.0
- [CI] macos,linux and c++[11,14,17,20]
- [CMake] mini_required 2.6->3.5
- [googletest] git submodule add googletest-1.6.0
- [submodule] add submodules/cppjieba, and remove deps/cppjieba/ and remove dict/
- [submodule] using git submodule, and add submodules/limonp.
- [pr-28] merged.
- [pr-27] merged.
- [pr-26] merged.
- [pr-21] merged.
- [deps] update limonp to v0.6.2
- upgrade cppjieba to v4.5.3
-
add new directory:
deps
forcppjieba
andlimonp
-
change
namespace Simhash
tonamespace simhash
-
mv
src/main.cpp
toexample/demo.cpp
-
mv
src/
toinclude/
-
upgrade limonp to v0.5.4
-
upgrade cppjieba to v4.5.0
- 升级 CppJieba 到 v4.1.2 版本。
- 使用CppJieba v3.0.1 ,修复一些兼容性问题。
- 更新CppJieba用以适配更加低版本的g++。
- 更新CppJieba用以引入在关键词抽取过程中使用停用词(dict/stop_words.utf8)。
- 增加性能测试。
- 更新
KeywordExtractor
提高关键词抽取的速度,性能约提高1.3倍。
- 更新CppJieba用以修复关于头文件包含的小bug
- 完成simhash海明距离的计算
- 修复关键词抽取后权重排序的bug
- 完成分词,关键词抽取,simhash值计算的基本功能