Abstract
A fuzzy multivariate based line traffic prediction model and a station traffic proportion inference model are proposed in order to predict line entrance traffic and station traffic in the Shanghai subway system. The nonlinear autoregressive with external input (NARX) is adopted to predict subway line traffic. Correlative features that influence the line traffic time series are identified by time series trend analysis, including meteorological features, time features, and proportion of commuter passenger features. A time series correlation method and fuzzy c-means (FCM) is proposed to simplify the feature set by deleting the features of small coefficients between features and traffic series. We determine the statistics of all stations’ traffic proportion of a subway line in order to construct a proportion matrix, and an eigenvector based station traffic proportion inference model is proposed to predict the future station traffic proportion, which is combined with the subway line traffic prediction results to realize subway station traffic prediction. We evaluate our model on the dataset of smart card records and weather condition dataset of one month in the Shanghai subway system. Experiment results confirm our proposed model’s advantages over baseline approaches.
Introduction
In recent years, there has been a marked increase in passenger flow on subway systems in China and it has become very important to accurately forecast passenger flow to effectively prevent emergencies that can occur because of large gathered crowds [10]. Subway passenger flow prediction is a research problem that adopts technologies to forecast passenger flow at future intervals, which greatly assists subway system management departments in being able to dynamically adjust train schedules during peak hours [7]. Researchers have proposed many methods to satisfy the need for early warnings of subway traffic congestion. Cai et al., proposed a multiply ARIMA model to predict entrance and exit passenger flow at urban railway stations [1], Sun Y. and S. Clark used time series theory to adopt nonparametric regression models to predict subway transfer station traffic[5, 11]. A passenger source prediction model has been proposed through analysis of passengers’ origins and destinations [3]. Zhou Y. proposed a model to estimate the travel time for subway transfers [13].Wei Y. brought the neural network into passenger flow prediction [12]. As a kind of neural network, nonlinear autoregressive with external input (NARX).[14] is used in wind speed prediction [2] and chaotic flow prediction combined with fuzzy functions [6].
In this paper, we investigate a big data based method of subway passenger entrance flow prediction in the Shanghai subway system. Similar to factors used in bike sharing systems [9], the factors that we use include weather conditions, day of the week, hour of the day, and proportion of the passenger population that consists of commuters. Same as coefficient modeling in [8], we apply fuzzy cluster algorithms to the extracted features and calculate the relationship between factors and passenger traffic. The identified features are dropped if the coefficient is small. Simplified features are set as the external input of NARX to predict the subway line traffic. Because specific line traffic is composed of station traffic, we propose a station traffic contribution model based on eigenvector analysis of the station traffic contribution proportion matrix.
Our main contributions in this paper are: In predicting passenger flow, we take into account both temporal characteristics, and the influences of weather conditions commuter passenger proportion, day of the week, and hour of the day. A fuzzy multivariate time series-based NARX model is proposed to forecast subway line traffic. The correlation of multivariate and traffic time series identifies the ultimate correlative features. A station traffic proportion inference model is proposed to predict the future proportion of station traffic contribution vectors. Station traffic is predicted through combining the results of subway line traffic predictions with the results of station traffic proportion inferences.
Experiments are presented which show supportive evidence for the proposed model of subway traffic prediction.
Fuzzy time series and neural net
Fuzzy multivariate correlation
Both line traffic and station traffic in subway systems are affected by multiple factors. Correlation calculation between relative time series and prediction time series is as shown in [4]. The symbolization method used is the (Statistic Vector based Series Symbolization) SVSS method, which uses the intrinsic statistic characteristics to symbolize the time series. Statistical characteristics include frequency of intervals of equal probability and the relationship between two and three adjacent data points. In the statistics of frequency in intervals of equal probability, first, we must divide the space into intervals of equal probability. Samples are divided into intervals of equal probability I1, I2, ... , Im, where m refers to the interval number. Then, the series is symbolized, and the characteristic vector is expressed as:
After symbolizing, the relationship between the vectors can be seen in in Equation (2):
Where Xi,t and Xj,t are respectively the tth value, and H is the length of historic time series. If the relevance is small, the variable must be selected again. Otherwise, the sample series is considered closely relevant to the corresponding series. Then each element in this time series is clustered using the fuzzy c-means (FCM) algorithm, the objective function and cluster number in this paper are calculated usingEquation (3):
Respectively, Dmax, Dmin refer to the maximum and minimum, C1, C2, …, C K are k clusters, ci is the cluster center point and uij denotes whether xj belongs to cluster Ci. Then subscripts of the fuzzy subsets are used to represent the fuzzy subsets. In this way, each element of a time series is replaced by the fuzzy set subscript.
Dynamic neural networks, which include tapped delay lines, are used for nonlinear filtering and prediction. The NARX is a recurrent dynamic network, with feedback connections enclosing several layers of the network. The NARX model is based on the linear ARX model, which is commonly used in time-seriesmodeling. The defining equation for the NARX model is Equation (4).
Where the next value of the dependent output signal y (t) is regressed on d previous values of the output signal and d previous values of an independent input signal. As shown in Fig. 1, where a two-layer feed forward network is used for the approximation. The the regression performance function used for the NARX network to optimize the network isEquation (5):
where γ is the performance ratio, W is the vector of the neurons’ connection weight.
Correlative feature identification
Line traffic prediction methodology
Line traffic prediction is based on the correlative features extracted in Section 3.1. Combined with fuzzy time series theory, the coefficient between line traffic and its different correlative features are calculated as described in Section 2.1, which is set as the input of the neural net time series as described in Section 2.2. The time feature is described in Table 1. According to time feature, the passenger is labeled as “commuter” when the trip is initiated either during the morning rush or evening rush. We compute the commuter proportion as a commute feature. The modeling process is shown as follows.
Proportion inference model
We can obtain the station traffic contribution proportion matrix Msk,m,ti,n for each line in each day using the smart card data, where ti,j stands for the time slot j of the ith day in the historical dataset and sk,m refers to the station m of line k.
We can obtain a set S with M matrix, given that we have smart card expense data for M days. Each matrix is transformed into a vector Γ of size N and placed into the set.
The mean matrix of the set can be calculated using the following formula:
Assume that the time slot of the station traffic we want to predict is in the ith day, then the difference between the predicting matrix and the mean matrix is Φ:
Where Ψ is the mean matrix of the set S, and Γ i is the station traffic contribution proportion vector of the ith day.
Then we construct a different matrix set A as shown below. The covariance matrix C is obtained in the following manner:
Assuming that u
k
, λ
k
are respectively the eigenvectors and eigenvalues of the covariance matrix C and a set of M orthonormal vectors u can best describe the distribution of the data set. The kth vector uk is chosen such that
is a maximum, subject to
Assume that we want to predict the station for the sth time slot in the dth day. First we delete the sth to n column of all the matrixes in set S before vector transformation. Then, predicting the day’s station traffic contribution matrix is projected to eigenvectors:
Lastly, we determine which day’s matrix provides the best description for predicting the day’s station traffic contribution matrix. This is done by minimizing the Euclidean distance. Then the predicting matrix’s sth column is set to the matching matrix’s sth column which represents the station traffic contribution in the predicting time slot.
Datasets and metric
We conducted experiments on the datasets of the Shanghai subway system, which were collected from April 1 to April 30 2015. The dataset information is described in Table 2.
The metrics we adopt to measure the accuracy of the prediction results are the Root Mean Squared Error (RMSE) and Mean Relative Error (MRE).
Results and comparison
In order to compare the experiment performance of different methods, two baselines are included as shown below
The NARX model generated in our experiment is by MATLAB software as shown in Fig. 7. Owing to space limitation, we only present the line 2 traffic prediction comparison graph as shown in Fig. 8. The station traffic contribution of each subway line is described in Table 3. The overall comparison of the traffic prediction error is in Table 4.
As shown in Table 3, the fitting degree of all station traffic contribution is 0.9768, which means that the stations’ traffic contribution of a specific line is relatively stable. As shown in Table 4, the prediction error of the station traffic increases by 0.07% when we use the FCM clustering method for the extracted features. While, when we apply the coefficient limitation to features, the error of line traffic and station traffic decreases by about 22.16% and 0.79%,respectively.
Conclusion
We proposed a subway system traffic prediction methods respectively for line and station passenger entrance flow. Before modeling, affecting features are extracted: rain, air quality index, whether it is a weekday or holiday, and time interval of day, and a correlation calculation method is proposed. For line traffic prediction, affecting features are selected by coefficient limitation after being clustered by FCM, and the NARX model is adopted with the features as external input. Passenger flow traffic of each subway station is predicted by combining the stations’ traffic proportion prediction and the line traffic prediction results. Experiments on the Shanghai subway dataset shows that our proposed model shows more prediction accuracy. Further work needs to be done to test the model on more datasets and compare it with more baselines.
Footnotes
Acknowledgments
This research is supported by China Postdoctoral Science Foundation under Grant No. 2015M71369, NSFC No. 51278221 and NSFC No. 51378076.
