TODO: Thematic Document Classification

Algorithm:

  1. Compute the term-frequency (TF) matrix.

  2. Apply the TF-IDF transformation.

  3. Cluster the documents using cosine distance.

  4. Build the table of units by cluster.
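
The four steps above could be sketched roughly as follows. This is a minimal, standard-library-only illustration and not a definitive implementation. Several things here are assumptions that the outline does not specify: whitespace tokenization, the unsmoothed IDF weight ln(N / df), average-linkage agglomerative clustering as the clustering method, a fixed cluster count of 2, and reading "units" as the documents themselves. The toy corpus and all function names (`tf_matrix`, `tfidf`, `cosine_distance`, `cluster`, `units_by_cluster`) are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tf_matrix(docs):
    """Step 1: raw term counts, one row per document, one column per term."""
    tokenized = [doc.lower().split()  # assumption: whitespace tokenization
                 for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    rows = []
    for toks in tokenized:
        counts = Counter(toks)
        rows.append([counts[term] for term in vocab])
    return vocab, rows


def tfidf(tf):
    """Step 2: scale each count by idf = ln(N / df) (assumed, unsmoothed)."""
    n_docs, n_terms = len(tf), len(tf[0])
    df = [sum(1 for row in tf if row[j] > 0) for j in range(n_terms)]
    idf = [math.log(n_docs / d) for d in df]
    return [[row[j] * idf[j] for j in range(n_terms)] for row in tf]


def cosine_distance(u, v):
    """1 - cosine similarity; zero vectors are treated as maximally distant."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0
    return 1.0 - dot / (norm_u * norm_v)


def cluster(vectors, n_clusters):
    """Step 3: average-linkage agglomerative clustering on cosine distance."""
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > n_clusters:
        best = None  # (distance, index_a, index_b) of the closest pair
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                total = sum(cosine_distance(vectors[i], vectors[j])
                            for i in clusters[a] for j in clusters[b])
                dist = total / (len(clusters[a]) * len(clusters[b]))
                if best is None or dist < best[0]:
                    best = (dist, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    labels = [0] * len(vectors)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


def units_by_cluster(units, labels):
    """Step 4: map each cluster label to the list of units assigned to it."""
    table = defaultdict(list)
    for unit, label in zip(units, labels):
        table[label].append(unit)
    return dict(table)


# Hypothetical toy corpus: two pet-themed and two finance-themed documents.
units = ["doc_a", "doc_b", "doc_c", "doc_d"]
docs = [
    "cat dog pet",
    "dog cat animal",
    "stock market price",
    "market price trade",
]

vocab, tf = tf_matrix(docs)
weights = tfidf(tf)
labels = cluster(weights, n_clusters=2)
table = units_by_cluster(units, labels)

for label, members in sorted(table.items()):
    print(label, members)
```

Cosine distance depends only on the angle between TF-IDF vectors, so documents of different lengths remain comparable. The pairwise search in `cluster` is cubic in the number of documents and is meant only for small illustrative inputs.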