-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BM25, Co-occurrence Matrix, faster ROUGE, Fixing LSA. #165
Conversation
I have ported BM25 and Co-Occurrence Matrix from StringAnalysis.jl. Co-Occurrence Matrix works 10-15x faster than one in #164, uses less space, supports operations over Document and Corpus types. LSA has been fixed. ROUGE - N has been re-implemented, supports languages, 15 - 20% improvement in speed and memory. Tests, docstrings, online documentation added for all these. @aviks, please review. |
I've fixed merge conflicts, and added explicit license. attribution to |
I am porting various implementations from StringAnalysis.jl and fixing various others.
Co-Occurrence Matrix
BM25
Speeding up Rouge.jl
Docstrings and Docs for Evaluation Metrics (Rouge)
Fixing
lsa
Docs and tests for
lsa
As per the discussions in #164 , I am preferring to port COOM from StringAnalysis.jl for various advantages discussed.
There seem to be performance bottlenecks in rouge.jl due to Abstract containers, this also needs to be worked upon.