Skip to main content

Statistical improvement for short-term travel time predictions

Researchers at Imperial College in London have developed a generic three-stage short-term travel prediction model that promises to give greater accuracy under both normal and abnormal conditions. As travellers do not like the randomness of non-recurrent traffic congestion and delays, it is particularly useful for network managers to know how the ongoing traffic situation will develop when an atypical event occurs.
June 2, 2014 Read time: 6 mins
Fangce Guo
Researchers at Imperial College in London have developed a generic three-stage short-term travel prediction model that promises to give greater accuracy under both normal and abnormal conditions.

As travellers do not like the randomness of non-recurrent traffic congestion and delays, it is particularly useful for network managers to know how the ongoing traffic situation will develop when an atypical event occurs. In that way they can take appropriate actions to mitigate the delays and congestion or at least inform motorists how much of a delay they can expect. Traffic predictions in normal conditions are reasonably well catered for but predictions for abnormal conditions, whether caused by planned events or unexpected incidents, remains scientifically challenging. Therefore the main objective of researchers Fangce Guo, John Polak and Rajesh Krishnan at 500 Imperial College in London was to identify the best traffic prediction framework for both normal and abnormal conditions.

Previous studies of short-term traffic prediction models tended to focus on statistical and machine learning methods and the apparent superiority of one prediction method over others when applied to a specific short-term prediction problem. Little attention has previously been paid to the issues surrounding the overall prediction model structure, in particular data smoothing and error feedback. So in developing a short-term traffic prediction model, Imperial’s researchers used a three-stage framework, including data smoothing step and error feedback mechanism, as research assistant Fangce Guo explained: “We illustrated the effect of the data smoothing and error feedback structure using five different machine learning methods: k-Nearest Neighbour approach, Grey system Model, Neural Networks (NN), Random Forests (RF) and Support Vector Regression (SVR). Not only are these methods less complicated and burdensome to implement, they may also potentially enable the prediction process to adapt more easily to normal or abnormal traffic regimes.”
The existing researches show that a wide range of methods have been used in short-term traffic prediction and highlight the apparent superiority of one prediction method over the other when applied to a particular set of circumstances. However, the accuracy of the traffic prediction using a given model depends not only on the choice of the underlying statistical or machine learning prediction tool, but also on the model structure and error feedback structure in which the prediction techniques are embedded. It has been demonstrated that simple traffic data cleaning processes can significantly improve the accuracy of traffic estimation models. This concept is also used in other disciplines such as hydrology, but to date the value of formal data smoothing techniques has not been systematically explored in the context of traffic prediction.

One of the research objectives is to develop a model that predicts traffic variables under abnormal traffic conditions, and this objective raises particular challenges. Compared with the averaged historical data, traffic patterns can suddenly change during abnormal periods resulting in parameters outside the computational window of the system.

To counter this, an error feedback mechanism that makes use of this error relationship was used to make the proposed framework dynamically self-adaptive in response to such changes, in order to maintain the accuracy of short-term traffic predictions. Therefore a data smoothing step and a mechanism of error feedback are used to create the three-stage framework for short-term traffic prediction under normal and abnormal traffic conditions.

In order to evaluate the impact of a data smoothing step and error feedback mechanism, this research used the following three different prediction frameworks for comparison in short-term traffic predictions:

  • A one-stage framework which uses only one machine learning method.
  • A two-stage framework that applies data smoothing step before machine learning. 
  • A three-stage framework which adds an error feedback mechanism to the two-stage framework.
The proposed framework is generic and can use any suitable data smoothing, machine learning and error feedback methods.
Data for the study came from automatic number plate recognition (ANPR) cameras in London. Travel time data from link 1309 of the A40 road in central London (see Figure 1) were extracted in this experiment. Data from traffic under normal conditions was used in the training dataset and the testing dataset consists of known traffic incidents.  Public holidays were not included in the training data but incident days were. One eastbound lane was blocked from around 17:57 to 18:40 on the testing day because of a broken down vehicle.

Travel time prediction accuracy using three traffic prediction frameworks with five machine learning methods are compared using mean percentage error (MP), mean absolute percentage error (MAPE) and root mean square error (RMSE) metrics in Figure 1.

The five methods used in this research have a similar level of prediction accuracy during normal conditions and it is clear that both data smoothing structures and feedback mechanisms can, in general, improve prediction accuracy under normal conditions.

Guo said: “Under abnormal traffic conditions, we found that the use of data smoothing and feedback structures caused a decrease in the values of MAPE metric with all five machine learning methods. This means the three-stage framework is applicable to all five machine learning methods and gave an average improvement in the MAPE value of 21.5% for short-term predictions on the A40 during the abnormal period.

“The Singular Spectrum Analysis (SSA) data smoothing structure and feedback mechanism can improve the prediction accuracy during the abnormal event period.
Moreover, the kNN based method can detect the drop in the traffic profile better than other methods and provides the best predictions.”

This improvement can be seen in Figure 2 which shows the prediction results using data from during the incident period.

Figure 3 is a scatter-plot of predicted and observed travel time data, the error auto-correlation plot of predictions, the histogram of error distribution and the time-series plot between predicted and observed travel time of the kNN method with the three-stage prediction framework. Both the data smoothing structure and feedback mechanism can improve the prediction accuracy and reduce the prediction error in short-term traffic prediction under abnormal traffic conditions using the kNN method.

In the three-stage prediction framework model the RF based method produced the most accurate predictions.

All five machine learning methods used have a similar level of prediction accuracy during normal conditions and can be improved by the two-stage traffic prediction framework. However the error feedback mechanism used in the three-stage model does not help the prediction model to significantly improve one-step ahead traffic prediction. For example, the value of MAPE by kNN using the two-stage framework for one-step ahead prediction is 3.76% which reduces to 3.66% using feedback mechanism, resulting in an improvement of 2.7%.

“These prediction results show that the three-stage framework can improve traffic prediction accuracy regardless of the machine learning method used. This is true during both normal and abnormal traffic conditions. The kNN based method showed the best ability to respond to the sudden change of traffic patterns caused by abnormal traffic events. This is because lazy learning approaches such as kNN do not involve any model construction before it is required for testing; it can quickly detect pattern changes and has the flexibility to match the best patterns from historical datasets. The findings suggest that attention should be paid to combine data smoothing and error feedback elements to arrive at a robust and accurate prediction model.
UTC

Related Content

  • December 5, 2012
    Reducing congestion with Tomtom's historical traffic data
    Historical traffic data provided by TomTom is being used by the local government in Spain’s Basque region to reduce road congestion at less cost. Old habits die hard. Photos from as far back as the 1930s show people counting cars by the roadside in order to provide congestion data to those running road networks. Today, such techniques are still used, albeit augmented by a range of automation technologies such as inductive loops, infra-red sensors and number plate recognition. Even with these advances, howe
  • August 10, 2016
    Calculating the cost of stellar solutions
    The increasing availability and accuracy of global navigation satellite system (GNSS) is opening up low-cost options in many areas as David Crawford finds out. Boosting commercialisation of European global navigation satellite system (EGNSS) technologies for ITS initially depends heavily on demonstrating competitive and cost/benefit advantages obtainable from the deployment of EGNOS (the current European Geostationary Navigation Overlay Service), and ultimately the EU’s Galileo constellation (see box). So,
  • August 1, 2012
    InfoConnect delivers accurate travel information on all levels
    Deryk Whyte provides an overview of how the New Zealand Transport Agency's InfoConnect concept was developed. Historically, the New Zealand Transport Agency (NZTA) (formerly Transit New Zealand) has faced challenges in communicating effectively with road users, its customers, about highway-related events or incidents in a timely, accurate manner. Prior to 2007, Transit relied on a third-party organisation to collect and disseminate national road condition information. This often resulted in incomplete infor
  • February 25, 2016
    System predicts train delays and informs response
    David Crawford looks into the near-term future for Stockholm’s rail commuters. Swedish rail operator Stockholmståg, which runs commuter services in and around the country’s capital, is claiming a world first with the introduction of its automated Pendelprognosen (commuter prognosis) service. Developed to enable the prediction of delays as much as two hours before they are likely to occur, this offers the operator the scope for much earlier remedial action than previously - for example by filling in the expe