Skip to content

rank_bm25 dependency is not optional #1

@kaaloo

Description

@kaaloo

I get the following stack trace when running build_wikIR.py if the rank_bm25 dependency is not installed. Perhaps the README file for this project could be updated to remove indications as to the optional character of this dependency.

2021-01-14 1 wikiextractor/WikiExtractor.py:2429: DeprecationWarning: Flags not at the start of the expression '\\[(((?i)bitcoin:|ftp' (truncated)
   ExtLinkBracketedRegex = re.compile(
 wikiextractor/WikiExtractor.py:2436: DeprecationWarning: Flags not at the start of the expression '^(http://|https://)(' (truncated)
   EXT_IMAGE_REGEX = re.compile(
 wikiextractor/WikiExtractor.py:644: DeprecationWarning: The 'warn' function is deprecated, use 'warning' instead
   logging.warn("Template errors in article '%s' (%s): title(%d) recursion(%d, %d, %d)",
 WARNING: Template errors in article 'Prishtina' (222982): title(1) recursion(0, 0, 0)
 Traceback (most recent call last):
   File "build_wikIR.py", line 9, in <module>
     from rank_bm25 import BM25Okapi
 ModuleNotFoundError: No module named 'rank_bm25'

P.S. Thank you so much for your work on this ! 👍🏼

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions