Abstract
This study develops an approximate optimal control problem to produce time‐dependent bid prices for the airline network revenue management problem. The main contributions of our study are the analysis of time‐dependent bid prices in continuous time and the use of splines to modify the problem into an approximate second‐order cone program (ASOCP). The spline representation of bid prices permits the number of variables to depend solely on the number of resources and not on the size of the booking horizon. The advantage of this framework is the ASOCP's scalability, which we demonstrate by solving for bid prices on an industrial‐sized network. The numerical experiments highlight the ASOCP's ability to solve industrial sized problems in seconds.
Introduction
Stochastic network capacity control problems are characterized by customers arriving at random instances in time requesting products, which require different combinations of perishable resources. The most common application of network capacity control problems in revenue management (RM) is the control of reservations for airline tickets. In the airline RM problem, the single‐leg flights correspond to the perishable resources and the available origin–destination (OD) itineraries comprise the set of products. Airlines can maximize their revenue by determining whether or not to accept an itinerary request. Given sufficient resources, the optimal policy for an airline is to accept a booking request for an itinerary if its fare is greater than the opportunity cost of required resources. The opportunity cost of each flight depends on the number of available seats in the network and the remaining time in each flight's booking horizon. Although the optimal controls can theoretically be found using the Bellman equation, even for a modest sized airline network the problem suffers from the curse of dimensionality.
The intractability of the airline RM problem has lead to an extensive literature proposing various heuristics and approximate solutions. One fundamental approach is to derive bid prices, which approximate the value of each resource, using a deterministic linear program (LP). The resulting control policy is to sell the itinerary if its fare is greater than the sum of the bid prices of seats on flights in the requested itinerary. The shortcoming of the bid price policy based on the LP is that the bid prices are static over the booking horizon. Thus, the bid prices do not adjust as the system evolves until the LP is re‐optimized using the updated capacity levels.
Adelman (2007) proposed an alternative bid price approach, the dynamic linear program (DLP), by modeling the revenue function with an affine approximation to obtain time‐dependent bid prices. Adelman derived the DLP directly from the Bellman's equation, developed appropriate bounds on the performance of the method, demonstrated structural properties, and proposed a solution approach based on column generation. However, approximating the revenue function and generating time‐dependent bid prices remain difficult for large networks due to the number of variables and constraints in the DLP problem. Network models that generate dynamic (time or capacity dependent) bid prices have become an active area of RM research, since Adelman's seminal paper. Research on dynamic bid prices extends existing formulations to incorporate other aspects of RM or provides novel computational approaches to improve performance.
Topaloglu (2009) decomposed itinerary request decision by flight leg using Lagrangian relaxation to create capacity‐dependent bid prices. Kunnumkal and Topaloglu (2010) subsequently extended this model by also incorporating time‐dependence into the inventory‐sensitive bid prices. Extensions of existing models include Zhang and Adelman (2009), who integrated Liu and van Ryzin's customer choice based LP into Adelman's initial model and Kunnumkal and Topaloglu (2008), who incorporated customer choice behavior into their Lagrangian relaxation time‐sensitive bid prices model. Meissner and Strauss (2012) formulated an inventory‐sensitive bid price model with customer choice and demonstrated that their model encapsulates the choice based LPs by Zhang and Adelman (2009), Kunnumkal and Topaloglu (2008), and Liu and Van Ryzin (2008). Since we focus strictly on time‐dependent bid prices, the phrases “dynamic bid prices” and “time‐dependent bid prices” are used interchangeably throughout this study.
Both Tong and Topaloglu (2014) and Vossen and Zhang (2015) present new methods for producing time‐dependent bid prices, recognizing that the number of variables and constraints as well as the column generation procedure prevent the DLP from being implemented in practice. Tong and Topaloglu (2014) showed that the column generation problem for the dual to Adelman's DLP can be solved in polynomial time by using a minimum cost network flow problem. Using the network flow structure, they were able to reduce the number of constraints from exponential to linear in the number of flight legs. Vossen and Zhang (2015) use a Dantzig–Wolfe reformulation to provide the same reduction found in Tong and Topaloglu (2014). Vossen and Zhang (2015) also develop a novel dynamic disaggregation algorithm to solve the reduced programs, utilizing the fact that bid prices only change toward the end of the booking horizon.
Although network RM problems are traditionally modeled based on discrete time, in reality, customer arrivals occur in continuous time. As Bitran and Caldentey (2003) point out in their overview of RM and pricing, continuous time models of RM problems are more appropriate than their discrete counterparts given the growth of e‐commerce and the Internet. Bitran and Caldentey's observation could not be more relevant for network RM in the current business environment as exemplified by travel websites such as Expedia, Priceline, Travelocity, and Orbitz which attract over 7.5 millions people each day.
While continuous time formulations are used extensively in dynamic pricing research, only Akan and Ata (2009) have used a continuous time framework to obtain and study bid prices in the network RM setting. Akan and Ata highlight that the nonoptimality of classical bid‐price controls stems from discreteness and implement a generalized bid price control, which combines bid‐prices with a capacity usage limit process to determine how much of the demand rate to accept. Using a nonnegative stochastic process to characterize bid prices, they demonstrated that the optimal generalized bid price process forms a martingale. While Akan and Ata's model is not readily applicable for solving large‐scale problems, their model is general and provides several important theoretical insights into the structure of bid prices. Rather than representing the bid prices as stochastic processes, our model (like Adelman) utilizes an affine functional form to approximate the value function and develop bid prices. Working in continuous time diminishes the impact of time on the number of variables and the number of constraints, which helps address the shortcomings of previous approaches regarding the problem size. The challenge with the continuous time formulation is replacing the infinite collection of constraints with a finite set and selecting a finite‐dimensional family of functions to approximate the bid prices. Applying properties from the optimality conditions of the problem, we construct a scalable computational procedure by representing the derivatives of the bid price functions with cubic splines and reformulating the problem as a second‐order cone program. The cubic splines allow the number of the bid price variables to be independent of the size of the booking horizon making it scalable to larger networks.
Our numerical experiments demonstrate that the revenue benefits from discrete time dynamic bid prices dissipate in a continuous time setting as the problem scale increases. In comparison, by developing an approach from continuous time, our method is able to maintain revenue improvements over an LP bid price policy that is updated several thousands of times as the network size increases. By implementing our method on the Porter Airlines network, which consists of 182 flight legs, 2196 products, and 70 seats on each flight, we establish that our continuous time formulation can generate effective time‐dependent bid prices on an industrial scale.
The remainder of the study is organized as follows. In section 2, we introduce notation, formally define the network RM problem, and present the Hamilton–Jacobi–Bellman (HJB) conditions, which serves as a starting point for our model. In section 3, we derive our bid price approach directly from the optimal control problem of the network RM problem by introducing two approximations into the problem. The first approximation introduces approximation of the value function using time‐dependent bid prices and the second reduces the number of variables and constraints in the problem by an implicit treatment of the accept/reject controls. We establish the optimality conditions and structural properties for the approximate optimal control problem in section 4. In section 5, we use the established properties to select a finite‐dimensional family of functions of time that can approximate the derivatives of bid prices and reduce the infinite collection of constraints to a finite set. The resulting model is an approximate second‐order cone program (ASOCP). Three sets of numerical experiments illustrating our model's effectiveness are presented in section 6. The first two sets of experiments compare discrete and continuous dynamic bid prices against LP bid prices over a randomly generated network in continuous time. The third set of experiments computes bid prices from our solution method over the Porter Airlines network to demonstrate the potential of our method's performance in an industrial setting. Finally, in section 7 we provide a brief summary and directions for future research. Please see Appendix S1 for proofs.
Network Revenue Management Problem in Continuous Time
The capacity control approach to network RM can be described as follows. Consider a firm which has m perishable resources with capacities
In particular, it is still a common practice among airlines to fix prices
The capacity control problem is to find a policy
Approximate Optimal Control Problem
We start by contrasting our approach to approximating the network capacity control problem with that of Adelman (2007), who starts with a discrete time version of the problem where it is assumed that the time unit is sufficiently small for
Our approach differs from that of Adelman (2007) by the use of continuous time methods in approximating the value function as well as the use of a different starting point – Equation 6 rather than Equation 2. Consider a variational problem of the form
The idea of the proof is to observe that
We now consider a restriction of the Problem (7)–(9), where: The feasible trajectories are restricted to a subspace represented as The max operator is approximated from above by the following lemma (the proof is immediate):
For any g,
Constraint 8 uses terms of the form
The optimal value of AOCP is an upper bound for
Although AOCP belongs to the general class of control problems for unbounded differential inclusions, we are able to derive rather simple optimality conditions for it using results of Loewen and Rockafellar (1994). The inclusion is unbounded since
Optimality Conditions for AOCP
For convenience, we let the function of t,
If
The optimality conditions of Theorem 1 are essential for further analysis of the optimal solution to AOCP. In particular, we obtain the following monotonic property:
The adjoint trajectory has the following properties:
The second property asserted in Corollary 1 means that
The optimal bid price trajectory of AOCP is decreasing and nonnegative in every component
In addition to the managerial implication that dynamic bid prices produced by the continuous time model decrease over time, this result reveals that each bid price remains constant for some period of time starting from the beginning of the planning horizon. The length of these intervals is determined from the adjoint variables. Corollary 2 also suggests a way to eliminate a bulk of constraints from the problem. Consider any decreasing bid price trajectory
If we modify the Problem 10–12 by adding a constraint that all state components
The resulting reduced set of constraints is indexed by the set
Corollaries 1–3 parallel findings in Adelman (2007). The interpretation of
An optimal solution to AOCP yields a feasible solution to the standard LP and its value is bounded from above by
The proposition shows that the bound on
The theorems and lemmas presented in this section assume the existence of an optimal solution to AOCP. Augmenting AOCP with a lower bound constraint on
Finite‐Dimensional Computational Strategies for AOCP
In this section, we discuss finite‐dimensional computational strategies for solving AOCP. There are several key points that need to be taken into account. First, we need to select an appropriate family of functions of time that can approximate feasible state trajectories
Reduction to a Finite‐Dimensional Problem
Bid‐price trajectories
Since we already know that AOCP is equivalent to AOCPM, it makes sense to restrict
In a practical implementation, variables
We complete the construction of the approximate problem by observing that constraints 11 restricted to
The resulting approximate SOCP problem (ASOCP) is to minimize the linear objective Equation 25 with respect to the variables
Constraint Generation Procedure
The proposed constraint‐sampling algorithm works with a subset of constraints in 39 and maintains this subset as a collection of lists of capacity vectors
Constraints (42)–(44) in the row generation problem are equivalent to the constraints in a fixed cost selection problem, which has an integral optimal solution (Rhys 1970). Thus, it follows
The linear programming problem (41)‐(44) is equivalent to the binary optimization problem 40 and has a binary optimal solution (i.e., all
The following proposition shows that the problem of finding the most violated constraint in 39 for each
The termination condition of the algorithm is based on the error measure obtained from solving the ASOCP with constraint set Initialize sets Solve the Master problem corresponding to constraints indexed by For each Find the error measure Let
Given the Master trajectory, we now consider a modified bid‐price trajectory
On termination of the algorithm, the modified bid‐price trajectory
The initialization step 1 depends on the number and placement of the knot points. The best approach to handling the selection of knot points would be to model the knot points as variables. However, introducing these additional variables creates a highly nonlinear and nonconvex optimization problem. Considering the scale of the problems we are trying to solve, this method does not seem appropriate. A practical approach would be to simply adjust the knot points in order to obtain the best performance in terms of expected revenue. We took this approach when constructing our experiments. Another heuristic would be to simulate the arrival process and bid price updates using the dual variables from the LP at various points throughout the booking horizon, selecting the knot points so that the splines can better approximate the expected behavior of the bid prices. The number of knot points can be chosen to balance the computational time with the accuracy of the approximation. The numerical experiments with the proposed computational strategy show that a choice of three or four knot points is sufficient for achieving effective bid prices.
Figure 1 graphs the bid prices produced by the algorithm for a four‐resource network with

ASOCP Bid Prices Trajectories for a 4 Resource Network
Numerical Experiments
In this section, we present the results from three sets of numerical experiments. The experiments compared the performance of various bid price control policies used to make accept/reject decisions for a simulated stream of customers. The purpose of these experiments were to provide empirical answers to the following questions: How do discrete time dynamic bid prices compare to continuous time dynamic bid prices in a continuous time setting? How do continuous time dynamic bid prices perform in comparison to LP bid prices that are re‐optimized a sufficient number of times? Are continuous time dynamic bid prices scalable to industrial sized networks?
The first two questions were analyzed using network structures similar to the problem instances in Adelman (2007). Each spoke had flights traveling to and from the hub, departing at the end of the booking horizon. For cases with more than one hub, each hub had flights traveling to and from the other hub(s). These flights made up the network's resources. If H was the number of hubs in the network and L was the number of spokes, then the total number of resources in the network was the sum of the number of hub‐spoke legs and the number of hub‐hub legs. Thus, the network had m = 2HL + H(H − 1) single‐leg flights. The set of products for each network included all possible OD itineraries. Both high‐fare and low‐fare tickets were offered for each OD pair. The load factor was calculated as the expected demand for seats over all itinerary divided by the capacity of the network and was equal to
The third set of experiments utilized the Porter Airlines' schedule to create an industrial scale network. Porter Airlines is a regional airline based in Toronto, Canada, with hubs in Toronto, Montreal, Ottawa, and Halifax. The airline provides “short haul” flights to cities in Ontario, Quebec, and Atlantic Canada, as well as cities in the United States, including daily flights to Boston, Newark, and Chicago. Since Porter's business model emphasizes service, speed, and convenience for business and leisure travelers, the airline offers frequent flights between major Canadian and US cities, flying up to 20 times daily from Toronto to Ottawa and 11 times daily from Toronto to Newark‐New York. The experiments over the Porter Airlines network demonstrate the potential of the ASOCP as a solution method for problems of industrial size.
All solution methods for each set of experiments were executed through SHARCNET, a high‐performance Canadian research computing consortium. The first set of experiments was run on the SHARCNET serial throughput cluster Kraken using single threaded AMD Opteron 2.2 GHz processors. The second and third sets of experiments, which have larger networks and re‐optimization, were run on the SHARCNET serial throughput cluster ORCA using 8 threaded Intel Xeon 2.7 GHz processors. For further details on the experimental design and additional documentation of the results please see Appendix S3.
Fixed Bid Price Experiments
The first set of experiments examines whether discrete time dynamic bid prices provide a suitable control policy when customer arrivals occur in continuous time. To analyze this question, fixed bid price control policies from the LP, DLP, and ASOCP were used to generate revenues from a simulated stream of customers modeled by a Poisson process. We chose to use the dynamic disaggregation (DD) approach to solve the DLP, since the approach represents state of the art computational efficiency for dynamic bid prices. The numerical experiments in Vossen and Zhang (2015) demonstrated that the DD is the fastest method for producing dynamic bid prices in discrete time. The DD approach produces time‐dependent bid prices by solving the LP and adding variables and constraints simultaneously to the problem until optimality is obtained. Since the DLP bid prices are constant for the majority of the booking horizon, the dynamics of the bid prices are captured entirely by the columns and rows added in the constraint generation procedure. For brevity, we refer the reader to section 3 of Vossen and Zhang (2015) for a complete description of the DD algorithm and optimization problems. To approximate the Poisson process in discrete time, we set the probability of an arrival, ρ, to 0.8 producing a discrete booking horizon of
The bid price policies were tested on a 1 hub 3 spoke (six resources and 24 products) network. Similar to Cooper (2002) and Jasin and Kumar (2013) the problems were scaled by the parameter κ, such that the capacity was
Table 1 reports the upper bound on the revenue, the simulated revenues for each bid price policy, and the average run‐times for fixed bid price experiments. The objective value of the DD was selected as the upper bound, since DD provides a tighter upper bound compared to the LP. For each network, the ASOCP generated the highest revenue, capturing between 87.6–93.5% of the upper bound, while the LP and DD captured 79.2–84.8% and 83.8–86.7% of the upper bound, respectively. Although the ASOCP generates greater revenue relative to the DD when arrivals are in continuous time, the run‐time of the DD is considerably faster. The ASOCP's average run‐time across the various scale factors ranged within 20–60 times the run‐time of the DD. From the standpoint of a fixed bid price policy, the DD offers a compromise relative to the LP and ASOCP in terms of the revenue vs. run‐time performance tradeoff. There is a distinct revenue improvement relative to the LP, with a substantial run‐time improvement relative to the ASOCP.
Revenue and Run‐Time in Seconds for Fixed Bid Price Experiments
To account for the stochastic nature of demand and capacity consumption, airlines update capacity vectors and re‐optimize bid prices periodically throughout the booking horizon. Consequently, when airlines solve for the optimal bid prices, the capacity of each resources is unlikely to be balanced. For either the DD or ASOCP to be a functional solution method for dynamic bid prices, the run‐times should not change significantly when the available capacities across resources vary. To test the DD and ASOCP performance given asymmetric capacities, the first set of experiments was re‐simulated with randomized capacities such that
Table 2 reports the average run‐times for the DD and ASOCP across the different values of ν and κ. For each value of κ, the average DD run‐time appears to increase substantially. The run‐times for ν = 4 increase with κ and range between 45 − 187 times the run‐times for ν = 1. On the other hand, the ASOCP run‐times for ν = 4 are 1.003 − 1.782 times the run‐times for ν = 1. These distinct behaviors result from the structure of dynamic bid prices and substantial differences in computing methodology. Theorem 2 of Adelman (2007) states that the value of the bid price for a given resource is static from the start of the booking horizon until a critical time where the bid price becomes dynamic. As the capacity vectors increase in variability, select flight legs become scarcer at the start of the horizon. Bookings for these resources have a greater impact on the dynamics of the bid price, resulting in the critical times occurring earlier in the booking horizon as ν increases. Therefore, as the capacity vectors increase in variability, the DD requires adding a greater number of rows and columns before the algorithm terminates. On the other hand, the ASOCP considers the entire booking horizon using the spline approximation regardless of the capacity vector. Thus, there is little impact on the algorithm's timing and it is fairly robust to variation in capacity. Finally, despite the dramatic increase in run‐time for the DD, Table 3 shows that the average revenue is still less than the revenue generated by the ASOCP. Since the run‐times increase as the variability in the starting capacity grows, we conclude that the DD is at a disadvantage for situations with frequent updating and unbalanced capacities, when arrivals occur in continuous times.
Run‐Time in Seconds for Fixed Bid Price Experiments with Asymmetric Capacities
Revenue for Fixed Bid Price Experiments with Asymmetric Capacities
Bid Price Experiments with Updates
In a study comparing the behavior of LP heuristics, Jasin and Kumar (2013) provide theoretical results that advocate using LP bid prices over the asymptotically optimal booking limit heuristics, provided that the LP is resolved sufficiently frequently. For the ASOCP to have value as a bid price solution method, it must be able to provide higher revenues than the LP bid price control that is updated sufficiently often throughout the booking horizon. To ensure adequate updating, we re‐optimized the LP at uniformly spaced intervals as many times as the expected number of arrivals over the course of the booking horizon. On the other hand, we limited the number of optimization updates for the ASOCP to 10 (also at evenly spaced intervals) in order to keep the run‐time for both bid price controls at a similar order of magnitude. The experiments establish that the ASOCP provides an effective bid price policy and demonstrates the value of time‐dependent bid price compared to static bid prices that are updated repeatedly.
The experiment networks consisted of H ∈ {1, 2, 3} hubs connected to L = 3 spokes. The initial capacity for each single‐leg flight was c = 150. The arrival rates λ were varied in order to produce different load factors. For the one hub experiments λ ∈ {1000, 1100, 1200}, for the two hub experiments λ ∈ {1600, 1800, 2000}, and for the three hub experiments λ ∈ {2400, 2600, 2800}. Each network was simulated 500 times with the bid price policies facing identical customer arrivals modeled by a Poisson process with the same parameters as described in subsection 6.1.
Similar to the fixed bid price experiments in subsection 6.1, we evaluated the performance of the ASOCP by benchmarking revenues against the LP. The average revenues and the upper bound as well as the run‐times and remaining inventory are listed in Table 4. The percentage of the revenue generated by the LP relative to the LP bound ranged from 96.5% to 98.5%, with an average of 97.6%, while the revenue percentage for the ASOCP ranged from 98.4% to 99.4%, with an average of 99.1%. The low performance gap is due to the asymptotic optimality of the LP Bound; however, the ASOCP is still able to provide a revenue boost compared to LP bid prices. Under a direct comparison of bid price policy performance, the ASOCP provided an average improvement from 0.81% to 2.58% (1.01% across the entire set). Although the total run‐time was 25 times larger on average, with the LP being updated 1000–2800 times, the solution time for each network was under a minute, which is reasonable considering the revenue improvements. The experiments also show that ASOCP can achieve good performance with limited updating, unlike the LP, which requires frequent re‐optimization. Another interesting observation is that the LP had a higher utilization of resources for each network. This implies that a significant amount of the ASOCP revenue improvement stems from denying product requests in favor of reserving capacity for more profitable itineraries.
Revenue and Run‐Time in Seconds for Multi‐Hub Network with Updates
Porter Airlines Network
We simulated bid prices using the ASOCP and LP over an entire day in Porter's network, which consisted of 182 single‐leg flights and 1098 possible itineraries. Table 5 provides a list of Porter's single‐leg flights and their frequency of service on December 1, 2011. We consider the network with one fare class and two fare classes for each itinerary. Similar to subsections 6.1 and 6.2, the booking horizon was scaled to the interval [0, 1] and time was continuous. The capacity of each flight was 70 seats, coinciding with Porter's fleet of Bombardier Q400 aircraft and the expected demand over the entire network for both the single and two‐fare cases was given by λ = 12000. The LP and ASOCP optimization problems were optimized ϕ ∈ Φ≡{5, 10, 20, 35} times at uniform intervals over the booking horizon. The Porter network was simulated 200 times, with randomly sampled expected demand and price for each itinerary (see the Appendix for details).
List of Single‐Leg Flights for Porter Airlines Network (December 1, 2011)
Table 6 compares the revenues and run‐time for the LP and ASOCP bid price controls. For both one fare and two fare class problems, the ASOCP produced higher revenues than the LP bid price for each ϕ ∈ Φ. In addition, for a single fare class, the ASOCP‐based policy with ϕ = 5 generated more revenue than each LP‐based policy. The improvement offered by the ASOCP ranged between 0.27–3.37% and 0.26–0.53% for the one and two‐fare class problems, respectively. Although the ASOCP run‐times are several orders of magnitude greater than the LP run‐times, we argue that the ASOCP is a promising approach for solving large‐scale capacity control problems. Airlines have access to superior computational power through industrial server farms and work stations, which would increase the optimization speed for computing ASOCP bid prices. On the other hand, airlines are restricted in the number of times that they can re‐optimize bid prices. Bid price controls involve extensive scenario analysis and constant monitoring, often leaving bid prices to be computed overnight (Talluri and van Ryzin 2004). If the booking horizon represents a 10‐week period and optimization occur overnight, then the values of ϕ ∈ {5, 10, 20, 35} corresponds to re‐optimizing the bid prices once every two weeks, once a week, twice a week, and every other day, respectively, over the 10‐week period. From this perspective, on the nights when the airline updated the bid prices, the average run‐time, regardless of the number of fare classes was under 25 seconds. Finally, SOCP solvers have not evolved to the same extent as LP solvers in terms of solution speed. As SOCP solvers mature, the ASOCP will become even more viable as a solution method for computing bid prices for industrial networks.
Average Revenue and Run‐Time in Seconds per Optimization for the Porter Airlines Experiments
Conclusion and Future Research
In this article, we construct a bid price control policy starting from a continuous time network RM framework. After substituting the optimal control policy into the HJB equation and reformulating it as a differential inclusion, we introduce two approximations into the inclusion to establish the AOCP. Using the monotonicity of the bid prices, approximation theory is used to develop the ASOCP, which makes the number of variables independent of the time horizon. Finally, we employ an efficient constraint generation procedure allowing the ASOCP to produce time‐dependent bid prices by considering only a select number of time‐capacity vectors. The numerical experiments highlight the effectiveness of the proposed approach in generating bid prices as well as its scalability by solving problems on an industrial sized network. Future research could extend the ASOCP structure to incorporate customer choice. It would also be interesting to combine ASOCP with robust optimization to incorporate uncertainty in the arrival rates. The ASOCP not only has the potential to improve revenues for large airlines, but its methodology can be applied to other areas of RM and large‐scale approximate optimal control problems.
Footnotes
Acknowledgments
This research was supported in part by the Natural Sciences and Engineering Research Council of Canada (grant number 341412‐2011) and Queen's School of Business. The authors thank the Department and Senior Editors, and the Referees for their constructive suggestions, which helped to improve the results. The computational component was made possible by the facilities of the Shared Hierarchical Academic Research Computing Network (SHARCNET:
