Use fulltext corpus in MLLM tests which is much faster #570

osma · 2022-02-11T13:22:58Z

I noticed that the unit tests weren't making use of the small fulltext document corpus included in the test suite (tests/corpora/archaeology/fulltext/). In this PR, I've added a fixture to use it and switched the MLLM tests to use it instead of document_corpus. This speeds up the MLLM tests by a lot (24s -> 2s on my laptop).

codecov · 2022-02-11T13:24:47Z

Codecov Report

Merging #570 (90f1537) into master (5948dee) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master     #570   +/-   ##
=======================================
  Coverage   99.47%   99.47%           
=======================================
  Files          84       84           
  Lines        5565     5568    +3     
=======================================
+ Hits         5536     5539    +3     
  Misses         29       29

Impacted Files	Coverage Δ
annif/lexical/mllm.py	`100.00% <ø> (ø)`
annif/parallel.py	`100.00% <ø> (ø)`
tests/conftest.py	`100.00% <100.00%> (ø)`
tests/test_backend_mllm.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 5948dee...90f1537. Read the comment docs.