Apply ThesaurusΒΆ
- Smoke tests:
>>> from techminer2.thesaurus.user import InitializeThesaurus >>> ( ... InitializeThesaurus() ... .with_thesaurus_file("demo.the.txt") ... .with_field("raw_descriptors") ... .where_root_directory("examples/fintech/") ... .using_colored_output(False) ... .run() ... ) INFO: Thesaurus initialized successfully. Success : True File : examples/fintech/data/thesaurus/demo.the.txt Status : 1721 keys found Header : A_A_THEORY A_A_THEORY A_BASIC_RANDOM_SAMPLING_STRATEGY A_BASIC_RANDOM_SAMPLING_STRATEGY A_BEHAVIOURAL_PERSPECTIVE A_BEHAVIOURAL_PERSPECTIVE A_BETTER_UNDERSTANDING A_BETTER_UNDERSTANDING A_BLOCKCHAIN_IMPLEMENTATION_STUDY A_BLOCKCHAIN_IMPLEMENTATION_STUDY A_CASE_STUDY A_CASE_STUDY A_CHALLENGE A_CHALLENGE A_CLUSTER_ANALYSIS A_CLUSTER_ANALYSIS
>>> from techminer2.thesaurus.user import ApplyThesaurus >>> ( ... ApplyThesaurus() ... .with_thesaurus_file("demo.the.txt") ... .with_field("raw_descriptors") ... .with_other_field("descriptors_cleaned") ... .where_root_directory("examples/fintech/") ... .using_colored_output(False) ... .run() ... ) INFO: Thesaurus applied successfully. Success : True File : examples/fintech/data/thesaurus/demo.the.txt Status : 1788 keys applied
>>> from techminer2.io import Query >>> Query( ... query_expression="SELECT descriptors_cleaned FROM database LIMIT 5;", ... root_directory="examples/fintech/", ... database="main", ... record_years_range=(None, None), ... record_citations_range=(None, None), ... ).run() descriptors_cleaned 0 AN_EFFECT; AN_INSTITUTIONAL_ASPECT; AN_MODERAT... 1 ACTOR_NETWORK_THEORY; ANT; AN_UNPRECEDENTED_LE... 2 AN_INITIAL_TECHNOLOGY_ADVANTAGE; CHINA; FINANC... 3 AGGREGATION; ANALYSIS; AN_ADVANTAGE; AN_EXTENS... 4 ACCESS; A_FORM; BEHAVIOURAL_ECONOMICS; DIGITAL...
>>> from techminer2.database.operators import DeleteOperator >>> ( ... DeleteOperator() ... .with_field("descriptors_cleaned") ... .where_root_directory("examples/fintech/") ... .run() ... )