1

I've got the result shown in below figure for my clustering.

enter image description here

Are there any libraries which are similar to fvid_clusters which could produce the plots as shown below ? (USING PYTHON)

enter image description here

6
  • did you try matplotlib Commented Mar 21, 2019 at 17:44
  • Which vectorizer did you use to convert words to vectors ? Commented Mar 21, 2019 at 18:24
  • @mujjiga I use TfidfVectorizer Commented Mar 22, 2019 at 2:24
  • @TuckYew did you got any way to create diagram as above? Commented Nov 23, 2019 at 20:44
  • Did you solve the problem? @Rehan Commented May 14, 2020 at 11:53

2 Answers 2

1

Plots are not exactly like you show, but there two tools that I like to use in order to explore the result of text clustering (both have binds for Python).

Termite: http://vis.stanford.edu/papers/termite

Termite

UMAP: https://umap-learn.readthedocs.io/en/latest/

Here is the result of a topic modeling performed with NMF (from my thesis) using the BBC Sports dataset.

BBC Sports

Sign up to request clarification or add additional context in comments.

1 Comment

can you please share the code? Thank you in advance.
0
  • Step 1: Vectorize the words into vectors (each of size n)
  • Step 2: Use kmean to cluster them into k clusters
  • Step 3: Run PCA on vectors produced in step 1 and reduce them to 2 dimensions (say x,y)
  • Step 4: Plot the words at the location (x, y) where (x,y) are the dimensions reduced by the PCA from its full size vector of size n. Color the word corresponding to which cluster it falls identified in step 2 (use one color per cluster).

Comments

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.