Cubic: predictive analytics is putting fortune tellers out of business

November 23, 2018
© Cherriesjd | Dreamstime.com
The rise of machine learning and artificial intelligence means that fortune tellers will soon be out of business. Ed Chavis takes a behind the scenes look at the world of predictive analytics


Ever since organisations started taking advantage of insights derived from Big Data, data scientists have concentrated their efforts on the ability to make correct assumptions about the future. A few years on, with the help of automation, developments in machine learning (ML) and advancements in the application of artificial intelligence (AI), the ability to make predictions is no longer wishful thinking. We now live in an era of predictive analytics.

Predictive analytics incorporates both ML and AI to make predictions about unknown future events by analysing rich historical data. Although the technology applies to almost any industry, in transportation - where travel patterns remain reasonably regular - its uses can be particularly enlightening. It can empower cities to become more efficient by optimising network performance, it can help transit agencies provide a better customer experience by preventing service downtime, and it can ensure smooth throughput on main city arteries by predicting potential areas of congestion long before the first car comes to a stop in traffic.

But how does the technology actually work? How are the predictions made and what conditions must be met to ensure maximum accuracy and reliability? And finally – are our cities ready to reap the benefits of predictive analytics?  

Data crunching  


Although predictive analytics can apply to various areas of the transportation industry, we’ll focus on two key categories: predictive analytics for services (predictive maintenance) and predictive analytics for operations. The former helps transit agencies pre-empt equipment failures and improve customer satisfaction, while the latter helps cities deliver more robust solutions for traffic management with confidence and at a reduced cost.

In the predictive maintenance model, the source for the modelling and forecasting typically comes from historical equipment data and external variables, such as passenger throughput and meteorological conditions. Equipment data can cover the history of all assets, down to the component level of each machine (let’s say a ticket vending machine), including elements such as the smart card reader, coin vault, bill handling unit, and more.
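As a rough illustration of the kind of input described above, the sketch below models one day of history for a single component alongside the external variables. The record layout, field names and the `faults_by_component` helper are all hypothetical, chosen for illustration; they are not a description of Cubic's actual data model.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class ComponentRecord:
    """One day of history for one component of one asset (hypothetical layout)."""
    asset_id: str              # e.g. a single ticket vending machine
    component: str             # e.g. "smart_card_reader", "coin_vault"
    day: date
    fault_count: int           # faults logged for this component on this day
    passenger_throughput: int  # external variable: passengers served that day
    rainfall_mm: float         # external variable: a simple weather measure


def faults_by_component(records):
    """Total the logged faults per (asset, component) pair."""
    totals = Counter()
    for record in records:
        totals[(record.asset_id, record.component)] += record.fault_count
    return dict(totals)
```

Keeping the history at component level, rather than one row per machine, is what would let a model learn that, say, a coin vault and a card reader on the same machine wear out on different schedules.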

This information, alongside other historical data such as repair call logs, feeds into the ML part of the predictive analytics engine. The engine assimilates the data, creates a historical understanding of each asset’s behaviour and performance over time, looks for patterns and correlations, and then, with the help of a dedicated algorithm, builds a predictive base model of how the asset is likely to behave in the future. For the base model to be deemed accurate, it must make correct predictions at least eight out of 10 times. Importantly, the predictive tool projects the residual life of an asset, not just its next issue, allowing transit agencies to take proactive action ahead of equipment failure. This information feeds into life cycle management for the devices and enables agencies to improve overall asset maintenance.
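The 'eight out of 10' bar can be read as a simple accuracy threshold. The sketch below is a minimal illustration under that assumption: it compares predicted and observed failures and checks whether the share of correct predictions reaches 80%. The `Outcome` record and function names are hypothetical, and a real evaluation would probably look at more than raw accuracy.

```python
from dataclasses import dataclass

ACCURACY_THRESHOLD = 0.8  # "at least eight out of 10" from the text


@dataclass(frozen=True)
class Outcome:
    """A prediction paired with what actually happened (hypothetical record)."""
    asset_id: str
    predicted_failure: bool
    actual_failure: bool


def accuracy(outcomes):
    """Share of outcomes where the prediction matched reality."""
    if not outcomes:
        raise ValueError("at least one outcome is required")
    correct = sum(o.predicted_failure == o.actual_failure for o in outcomes)
    return correct / len(outcomes)


def meets_threshold(outcomes, threshold=ACCURACY_THRESHOLD):
    """True if the base model is accurate enough to be accepted."""
    return accuracy(outcomes) >= threshold
```

With ten outcomes of which eight match, `accuracy` returns 0.8 and the model passes; with seven matches it does not.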

It takes more than simply flicking the ‘ON’ button for predictive analytics to work effectively. For instance, the reliability of the predictive model often depends on the quantity and quality of data. Unfortunately, not all agencies can offer a robust data source. Even when an agency does monitor and collect information about its assets, data can become corrupted or lost, information might not be collected consistently, or might be insufficient for predictive analytics. Equally, not all equipment can collect, store and transmit large amounts of real-time data into the central system.

One of the vital lessons from Cubic’s US predictive maintenance trials, first in Atlanta and then in Los Angeles, was that a lack of appropriate hardware is a reasonably common barrier among agencies. Therefore, an effective predictive analytics programme may require both the agency and the technology provider to go back to the hardware and work out a way to improve its data generation capabilities. In other scenarios, predictive analytics programmes may involve a preliminary period dedicated to real-time data collection to supplement the existing database and make the data sample more representative.

To further ensure the accuracy of the predictive engine, a transit agency can rely on a combination of predictive technology and human expertise. This approach is particularly important in the early stages of predictive maintenance. When the model flags a particular piece of equipment as likely to fail, a subject matter expert deployed to fix the issue can confirm the prediction by, for instance, reporting significant wear and tear, indicating failure was imminent. Such validation is not only helpful for optimising the predictive algorithm, but it also provides a critical feedback loop for continually improving the quality of predictions.
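One way to picture this feedback loop is as a log of field reports, where each visit records whether the expert confirmed the model's flag. The sketch below is an assumption-laden illustration: the `FieldReport` and `FeedbackLog` names are hypothetical, and the 'confirmation rate' is simply the share of flagged assets that the expert agreed were close to failure.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class FieldReport:
    """Result of one maintenance visit (hypothetical record)."""
    asset_id: str
    flagged_by_model: bool
    expert_confirmed: bool  # expert found wear suggesting imminent failure


@dataclass
class FeedbackLog:
    reports: list = field(default_factory=list)

    def record(self, report):
        self.reports.append(report)

    def confirmation_rate(self):
        """Share of model flags the expert confirmed, or None if no flags yet."""
        flagged = [r for r in self.reports if r.flagged_by_model]
        if not flagged:
            return None
        return sum(r.expert_confirmed for r in flagged) / len(flagged)

    def training_labels(self):
        """(asset_id, confirmed) pairs that could feed the next retraining run."""
        return [(r.asset_id, r.expert_confirmed)
                for r in self.reports if r.flagged_by_model]
```

A falling confirmation rate would be an early sign that the model needs adjusting, while the labelled pairs give the algorithm fresh examples to learn from.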


Extreme multi-tasking


Earlier this year, the government of New South Wales, Australia’s most populous state, pledged to spend millions of dollars on enhancing the monitoring and management of the road network across the region. As part of the initiative, it tasked Cubic with delivering an intelligent congestion management programme – an example of predictive analytics for operations. This data-driven transport management platform enables cities to predict traffic patterns, reduce congestion, and improve both major event planning and the response to incidents on the transport network. When ready by the end of 2020, it will make Sydney the first city in the world to manage its transportation network based on a predictive analytics model.

However, projecting traffic patterns and incidents for an entire transportation network in a city as large as Sydney is a tough ask that involves several moving parts. The prediction engine must assemble and synthesise data from multiple input points throughout the city. These include pedestrians, private cars, public transit vehicles, third-party transportation services (such as ride-hailing services, scooters and micro-transit), as well as an entire host of city infrastructure endpoints – traffic cameras, traffic and street lights, parking, bus stops, railway stations – the list can go on. On top of that, the system must be smart enough to incorporate variable data that impacts the network, such as weather and seasonality (a Black Friday shopping event, a local football game, a holiday).

Assembling all the information is not always possible. Although the notion of ‘smart cities’ has captivated urban planners and city agencies alike, not many metropolises can yet claim the title. With significant gaps in smart city infrastructure, the lack of a common language for information sharing between various city systems, and inadequate network coverage, urban areas often fall short of the ideal environment for making predictions. In such situations, the predictive engine needs to not only analyse, understand and mimic the actual transportation system but, inevitably, fill in information gaps with AI-based simulations. As cities upgrade their smart infrastructure and agencies invest in connectivity, they can close those gaps with real data, leading to more accurate predictions.

Thankfully, despite all the complexity, predictive models are easily scalable. Although the initial investment of time and resources to build the base model is significant, once the legwork has been done, models can then be applied to other cities, with the possibility of improving their performance over time. If, for instance, the initial accuracy of a base model in a new city is 70%, the predictive algorithm can be adjusted with new data sources to yield better results.

Future focus


Confirming the accuracy of predictions in the operations model is less straightforward than in the predictive maintenance model, since predictions are made on the basis of complex and dynamic algorithms that constantly adjust and re-evaluate information. It is, nevertheless, possible through an analysis of AI simulations of various traffic events and benchmarking against similar historical scenarios. It’s important to keep in mind that responding to a traffic event in one part of the network - e.g. redirecting drivers to alternative routes due to a broken-down vehicle - may inadvertently affect other parts. Therefore, when assessing the effectiveness of the predictive analytics model, improving the overall efficiency of the network must always take priority over achieving isolated gains around individual traffic events.
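The difference between a local gain and a network-wide gain can be shown with a small calculation. The sketch below assumes, purely for illustration, that network efficiency is measured as total delay in minutes summed over road segments; the segment names, figures and function names are hypothetical.

```python
def delay_change(before, after, segments=None):
    """Change in total delay (minutes) over the chosen segments.

    Negative means less delay. With segments=None the whole network is used.
    """
    if before.keys() != after.keys():
        raise ValueError("before and after must cover the same segments")
    chosen = before.keys() if segments is None else segments
    return sum(after[s] - before[s] for s in chosen)


def is_net_improvement(before, after):
    """True only if total delay across the whole network went down."""
    return delay_change(before, after) < 0


# Hypothetical delays before and after rerouting drivers away from segment A.
before = {"A": 30, "B": 10, "C": 12}
after = {"A": 20, "B": 17, "C": 19}

local_change = delay_change(before, after, segments=["A"])  # -10 minutes
network_change = delay_change(before, after)                # +4 minutes
```

In this made-up case the response cuts delay on the affected segment by 10 minutes but adds 14 minutes elsewhere, so the network as a whole is worse off even though the local picture looks like a success.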

Although Cubic’s tagline for Sydney’s transport management platform is ‘Predict 30 minutes into the future, act in 5’, longer-range predictions are only a matter of time. With access to the right information, a good understanding of the network, and the imminent arrival of 5G, cities won’t have to wait long to predict an hour, 12 hours or a day into the future.

With time, we can reasonably expect cities’ predictive abilities to grow exponentially, paving the way for automated city networks in which traffic lights instantly recognise and give priority to emergency vehicles, dynamic lanes accommodate changing traffic conditions, drivers’ phones alert them to road obstructions and automatically redirect them to an alternative route, and autonomous cars never get stuck in traffic.

While the future is undoubtedly impressive, it’s important to keep in mind that predictive analytics cannot operate in a vacuum. Without the appropriate regulatory environment, the coming together of different stakeholders, and the overall investments in city infrastructure, even the most advanced analytics technology will fail to make a difference. By understanding the technicalities and practical problems faced by cities that have invested in the technology, transit agencies, city authorities and tech companies alike can ensure cities are adequately prepared to make the most of predictive analytics, today and well into the future.
