A summary of the toolbox used for these curated works may be found here.
Clustering
A form of unsupervised machine learning useful in a wide variety of applications.
Hierarchical clustering
Unsupervised machine learning looking for hidden structure in “unlabeled” data.
Interactive web scraping
Web scraping and data visualisation in an interactive web application.
Predictive modelling
A form of supervised machine learning. Regression models are used to make predictions based on patterns in the data.
SPARQL end-point
Using HM Land Registry’s SPARQL end-point to import and visualise house price data.
Textual similarity
Quantitative textual analysis assessing document similarity and readability.
Visualising “bigger data”
A technique for interactively visualising and exploring larger datasets.
Word frequency
Text mining and use of word frequency to enable analysis of planning applications by theme.