Apply ThesaurusΒΆ

Smoke tests:
>>> from techminer2.thesaurus.user import InitializeThesaurus
>>> (
...     InitializeThesaurus()
...     .with_thesaurus_file("demo.the.txt")
...     .with_field("raw_descriptors")
...     .where_root_directory("examples/fintech/")
...     .using_colored_output(False)
...     .run()
... )
INFO: Thesaurus initialized successfully.
  Success : True
  File    : examples/fintech/data/thesaurus/demo.the.txt
  Status  : 1721 keys found
  Header  :
    A_A_THEORY
      A_A_THEORY
    A_BASIC_RANDOM_SAMPLING_STRATEGY
      A_BASIC_RANDOM_SAMPLING_STRATEGY
    A_BEHAVIOURAL_PERSPECTIVE
      A_BEHAVIOURAL_PERSPECTIVE
    A_BETTER_UNDERSTANDING
      A_BETTER_UNDERSTANDING
    A_BLOCKCHAIN_IMPLEMENTATION_STUDY
      A_BLOCKCHAIN_IMPLEMENTATION_STUDY
    A_CASE_STUDY
      A_CASE_STUDY
    A_CHALLENGE
      A_CHALLENGE
    A_CLUSTER_ANALYSIS
      A_CLUSTER_ANALYSIS
>>> from techminer2.thesaurus.user import ApplyThesaurus
>>> (
...     ApplyThesaurus()
...     .with_thesaurus_file("demo.the.txt")
...     .with_field("raw_descriptors")
...     .with_other_field("descriptors_cleaned")
...     .where_root_directory("examples/fintech/")
...     .using_colored_output(False)
...     .run()
... )
INFO: Thesaurus applied successfully.
  Success : True
  File    : examples/fintech/data/thesaurus/demo.the.txt
  Status  : 1788 keys applied
>>> from techminer2.io import Query
>>> Query(
...     query_expression="SELECT descriptors_cleaned FROM database LIMIT 5;",
...     root_directory="examples/fintech/",
...     database="main",
...     record_years_range=(None, None),
...     record_citations_range=(None, None),
... ).run()  
                                 descriptors_cleaned
0  AN_EFFECT; AN_INSTITUTIONAL_ASPECT; AN_MODERAT...
1  ACTOR_NETWORK_THEORY; ANT; AN_UNPRECEDENTED_LE...
2  AN_INITIAL_TECHNOLOGY_ADVANTAGE; CHINA; FINANC...
3  AGGREGATION; ANALYSIS; AN_ADVANTAGE; AN_EXTENS...
4  ACCESS; A_FORM; BEHAVIOURAL_ECONOMICS; DIGITAL...
>>> from techminer2.database.operators import DeleteOperator
>>> (
...     DeleteOperator()
...     .with_field("descriptors_cleaned")
...     .where_root_directory("examples/fintech/")
...     .run()
... )