Visualising “bigger data”
In the blog post Criminal goings-on in a random forest, we used supervised machine learning to see how well we could predict crime in London. We began by rendering and exploring some of the many facets of the recorded crime summary data at London borough-level .
There comes a point though where the many faces of the data require something more than a static visualisation. And there are alternative options. We can make “bigger data” visualisations more consumable and engaging. In this post we’ll go deeper into the original data with a more interactive and flexible approach. Continue reading “Seeing the wood for the trees”
Why forecast sales?
Humans have the magical ability to plan for future events, for future gain. It’s not quite a uniquely human trait. Because apparently ravens can match a 4-year-old.
An abundance of data, and some very nice R packages, make our ability to plan all the more powerful.
A couple of months ago we looked at sales from an historical perspective in Digital Marketplace. Six months later. In this post, we’ll use the sales data to March 31st to model a time-series forecast for the next two years. The techniques apply to any time series with characteristics of trend, seasonality or longer-term cycles. Continue reading “But can ravens forecast?”
With tensions heightened recently at the United Nations, one might wonder whether we’ve drawn closer, or farther apart, over the decades since the UN was established in 1945.
We’ll see if we can garner a clue by performing cluster analysis on the General Assembly voting of five of the founding members. We’ll focus on the five permanent members of the Security Council. Then later on we can look at whether Security Council vetoes corroborate our findings. Continue reading “An East-West less divided?”
Revisiting an old post
Last September I wrote a post entitled Is the Government realising its ambition for SMEs on G-Cloud? Six months on, I wanted to revisit and update this article, fold in a second Digital Marketplace framework, and share the R code here. Revisiting an old post also provides an opportunity to see if one can simplify and improve older code. Continue reading “Digital Marketplace. Six months later.”
Every story needs a good plot
One could think of data science as “art, grounded in facts”. It tells a story through visualisation. Both story and visualisation rely on a good plot. And an abundance of those has evolved over time. Many have their own dedicated Wikipedia page!
Which generate the most interest? How is the interest in each trending over time? Try this app to find out. Continue reading “The plots thicken”
Unsupervised machine learning
Hansard reports what’s said in the UK Parliament, sets out details of divisions, and records decisions taken during a sitting. The hansard R package provides functions to import its data.
Using the Hansard API (Application Programming Interface), we’ll apply unsupervised machine learning to analyze the voting patterns of 219 Labour Members of Parliament (MPs). We’ll consider all divisions (results of the votes) in the UK House of Commons since the 2017 general election. Continue reading “The “cluster of six””
Responding to a weak property market
In December I looked at how recent events have shaped the property market in London SW10. If short-distance moves are off the table in the current climate, how are property owners responding? When sales are weak, are planning applications in the ascendency? I applied data science techniques to Royal Borough of Kensington and Chelsea (RBKC) planning data to find out.
Continue reading “SW10 digs deep”
The anatomy of SW10
Analyses of house sales often focus on the wider UK market. In this blog, we’ll take a deep dive into one of London’s more-than 100 postcode districts. We’ll draw on 10,000 property transactions to see how key events have shaped the market. The object of our focus will be SW10 which forms part of the Royal Borough of Kensington and Chelsea.
We’ll start with the anatomy of SW10. Over 80% of property transactions were for leasehold flats. In contrast, detached freehold properties are a prized scarcity: Only 40 of the circa 10k transactions, over the past 20 years, were for detached properties. Continue reading “House sales in London SW10 take a few punches”
The importance of keyword-rich descriptions
There are nearly 20,000 services on G-Cloud. Suppliers have strewn their services with G-Cloud keywords designed to grab the attention of buyers. So what should buyers search for, and how does that vary by cloud service category?
Only selected parts of the suppliers’ content are indexed for searching: The service title, a 50-word summary, and bulleted features and benefits. So suppliers must cram in thoughtful keyword-rich phrases to optimise their chances of success.
In this blog, I want to compare and contrast the most frequent keywords used by suppliers. I’ve selected four categories from the Cloud Hosting lot for this purpose: Continue reading “The key to unlocking services on G-Cloud”
Background to G-Cloud pricing
The Digital Marketplace is helping those transforming public services by making it simpler, clearer and faster for them to buy what they need. G-Cloud focuses on cloud-based services. Since its launch in 2012, it has evolved through multiple iterations, with the current version being G-Cloud 9.
So, the introduction of a set of categories in G-Cloud 9 provided a natural step forward. These offered a level of granularity below the three lots of Cloud Hosting, Software and Support. As a result, buyers are able to find and compare groups of suitable products more easily.
Yet there is plenty of opportunity to further simplify the buyer’s task in future G-Cloud iterations. For example, around price comparison. Continue reading “Could G-Cloud pricing be simplified?”