Skip to content

Use of HDBSCAN clustering to visualize the multi-dimensional public good space into well-defined groups of related projects in two dimensions.

License

Notifications You must be signed in to change notification settings

rohitmalekar/grantee-clustering

Repository files navigation

grantee-clustering

Use of HDBSCAN clustering to visualize the multi-dimensional public good space into well-defined groups of related projects in two dimensions.

Step 1: PRE-PROCESSING - embd.py creates embeddings of project descriptions Step 2: CLUSTERING - cluster.py reduces dimensionality using t-SNE and applies HDBSCAN for clustering

Sample results: Here are some algorithmically determined organic groupings of projects (a) saving water bodies, (b) protecting forests, (c) utilizing solar power, respectively

TSNE 0

TSNE 1

TSNE 3

About

Use of HDBSCAN clustering to visualize the multi-dimensional public good space into well-defined groups of related projects in two dimensions.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages