Sort by Initial WordΒΆ

Example

>>> # TEST PREPARATION
>>> import sys
>>> from io import StringIO
>>> from techminer2.thesaurus.user import InitializeThesaurus, SortByInitialWords
>>> # Redirecting stderr to avoid messages during doctests
>>> original_stderr = sys.stderr
>>> sys.stderr = StringIO()
>>> # Reset the thesaurus to initial state
>>> InitializeThesaurus(thesaurus_file="demo.the.txt", field="raw_descriptors",
...     root_directory="examples/fintech/", quiet=True).run()
>>> # Creates, configures, an run the sorter
>>> sorter = (
...     SortByInitialWords(use_colorama=False)
...     .with_thesaurus_file("demo.the.txt")
...     .where_root_directory_is("examples/fintech/")
... )
>>> sorter.run()
>>> # Capture and print stderr output to test the code using doctest
>>> output = sys.stderr.getvalue()
>>> sys.stderr = original_stderr
>>> print(output) 
Sorting thesaurus by common initial words...
  File : examples/fintech/data/thesaurus/demo.the.txt
  214 matching keys found
  Sorting process completed successfully

Printing thesaurus header
  File : examples/fintech/data/thesaurus/demo.the.txt

    ACCELERATE_ACCESS
      ACCELERATE_ACCESS
    ACTIVE_CUSTOMERS
      ACTIVE_CUSTOMERS
    ACTIVE_FINTECH_SOLUTIONS
      ACTIVE_FINTECH_SOLUTIONS
    ACTIVE_POLICIES
      ACTIVE_POLICIES
    ADDITIONAL_DATA
      ADDITIONAL_DATA
    ADDITIONAL_USE
      ADDITIONAL_USE
    ADDRESSES_RISK
      ADDRESSES_RISK
    ADVANCED_ECONOMIES
      ADVANCED_ECONOMIES