Text clustering usually starts with text mining: natural language processing (NLP) and analytical methods turn raw text into structured data that can then be analyzed.

This post describes how to classify the meaningful textual content of European Union projects into topic clusters and visualize the result.

The steps involved in this process are:

  • Pre-processing text
  • Feature extraction
  • Model building and evaluation
  • Visualization
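The four steps above can be sketched end to end on a toy example. This is only a minimal illustration under stated assumptions, not the method used in the post: the four documents are invented stand-ins for project descriptions, the stop-word list is a tiny hypothetical one, features are TF-IDF weights, the model is plain k-means seeded with two of the documents, evaluation is the within-cluster sum of squares, and printing the top terms per cluster stands in for a real plot. Only the Python standard library is used.

```python
import math
import re
from collections import Counter

# Toy stand-ins for project descriptions (invented, not taken from the dataset).
docs = [
    "Renewable energy storage for solar power grids",
    "Solar energy and wind power integration",
    "Cancer therapy research and clinical trials",
    "Clinical research on cancer drug therapy",
]

# Hypothetical, deliberately tiny stop-word list.
STOP_WORDS = {"a", "an", "and", "the", "of", "for", "in", "to", "on", "with"}


def preprocess(text):
    """Step 1: lowercase, keep alphabetic tokens, drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def tfidf(token_lists):
    """Step 2: build L2-normalised TF-IDF vectors over a sorted vocabulary."""
    vocab = sorted({t for tokens in token_lists for t in tokens})
    n_docs = len(token_lists)
    doc_freq = Counter(t for tokens in token_lists for t in set(tokens))
    idf = {t: math.log((1 + n_docs) / (1 + doc_freq[t])) + 1 for t in vocab}
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        vec = [counts[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0
        vectors.append([v / norm for v in vec])
    return vocab, vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, initial_centroids, n_iter=10):
    """Step 3: k-means with fixed starting centroids (assign, then update)."""
    centroids = [list(c) for c in initial_centroids]
    labels = [0] * len(vectors)
    for _ in range(n_iter):
        labels = [
            min(range(len(centroids)), key=lambda k: sq_dist(v, centroids[k]))
            for v in vectors
        ]
        for k in range(len(centroids)):
            members = [v for v, lab in zip(vectors, labels) if lab == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def top_terms(centroid, vocab, n=3):
    """Step 4 (text stand-in for a plot): heaviest terms of a centroid."""
    ranked = sorted(zip(centroid, vocab), reverse=True)
    return [term for _, term in ranked[:n]]


token_lists = [preprocess(d) for d in docs]
vocab, vectors = tfidf(token_lists)

# Seeding with documents 0 and 2 is an assumption made to keep the run
# deterministic; random or k-means++ initialisation is the usual choice.
labels, centroids = kmeans(vectors, [vectors[0], vectors[2]])

# Evaluation: within-cluster sum of squared distances (lower is tighter).
inertia = sum(sq_dist(v, centroids[lab]) for v, lab in zip(vectors, labels))

print("labels:", labels)
print("inertia:", round(inertia, 3))
for k, centroid in enumerate(centroids):
    print("cluster", k, "top terms:", top_terms(centroid, vocab))
```

On a real corpus the same structure would typically be built with a dedicated library for vectorizing, clustering and plotting; the hand-written version here is only meant to make each step visible.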

Problem definition and identification of the text to collect

The dataset used in this project contains actual projects funded by the European Union, downloaded from the EU Open Data Portal (http://data.europa.eu/88u/dataset/eu-results-projects). …

Silvia Ruffini

Passionate about Data Science
