Abstract
We address an inventory rationing problem in a lost sales make‐to‐stock (MTS) production system with batch ordering and multiple demand classes. Each production order contains a single batch of a fixed lot size and the processing time of each batch is random. Assuming that there is at most one order outstanding at any point in time, we first address the case with the general production time distribution. We show that the optimal order policy is characterized by a reorder point and the optimal rationing policy is characterized by time‐dependent rationing levels. We then approximate the production time distribution with a phase‐type distribution and show that the optimal policy can be characterized by a reorder point and state‐dependent rationing levels. Using the Erlang production time distribution, we generalize the model to a tandem MTS system in which there may be multiple outstanding orders. We introduce a state‐transformation approach to perform the structural analysis and show that both the reorder point and rationing levels are state dependent. We show the monotonicity of the optimal reorder point and rationing levels for the outstanding orders, and generate new theoretical and managerial insights from the research findings.
Introduction
Inventory rationing among multiple customer classes (segments) lies at the heart of the yield management problem (Deshpande et al. 2003). It is an important tactic for coordinating demand management and inventory control in many industries where the service level requirements vary widely among demand classes. For example, Cohen et al. (1988) study service parts management with priority demand classes in the computer industry where a retailer could place either regular orders or emergency orders. Deshpande et al. (2003) provide an example of inventory rationing in the US military. Another example is Dell Computer, which segments customers by type (e.g., family, industry, government, academic, etc.) and sells the same product (computers) to different segments at different prices (McWilliams 2001). Similar problems also exist in service industries with fixed and perishable capacity (e.g., airlines, car rentals, hotels, etc.) where the critical decisions include the prices charged to demand classes and the rationing levels (see, e.g., Kimes 1989, Talluri and van Ryzin 2004). Kleijn and Dekker (1999) provide an overview of the inventory rationing problem and present applications ranging from airlines to petrochemical firms.
Our study aims to address several salient features that are commonly observed in the inventory management of many production and supply systems. First, demand is uncertain and can be segmented into multiple classes according to customers' willingness to pay and their service level requirements. Second, the production and supply processes may consist of multiple sequential phases with uncertain delivery lead times. For example, an in‐house production system may have a sequential production process that includes raw materials processing, work‐in‐process component processing, assembly, inspection, and packaging. A typical supply system of a global firm may consist of multiple sequential delivery stages including order processing, multi‐phase shipping, and custom clearance. The end customer demand and the time required to complete each stage and the whole process may fluctuate over time. See Zipkin (2000) for more discussions on sequential supply systems. Third, replenishment orders are often restricted to fixed batch sizes (i.e., full truck loads or containers).
In this study we consider an inventory rationing problem of a continuous review make‐to‐stock (MTS) system with batch production and multiple demand classes. Demands arrive according to Poisson processes. Unmet demand is lost and a penalty cost is incurred. For any incoming customer order, the system manager determines whether to satisfy it with on‐hand inventory (if there is any) or reject it. The demand classes have different values for the same product, which are represented by class‐specific prices and penalty costs for lost sales (or shortages). Each production order contains a single batch of a fixed lot size (e.g., in a full truckload or a full container). The production processing time is random. The objective of the system is to maximize the total discounted profit over an infinite horizon.
We formulate the problem as a Markov decision process (MDP). We first address the case where the production times are generally distributed and there is at most one outstanding order. We show that the optimal ordering policy can be characterized by a critical stock level. That is, the reorder point policy is optimal. The inventory rationing control for each demand class is characterized by time‐dependent critical stock levels, also called rationing levels. We show that the rationing levels are decreasing in the elapsed production time of the outstanding order. Since it is difficult to further generalize the structural analysis under general production times to the case that allows multiple outstanding orders, we approximate the production time distribution with a phase‐type distribution and show that the optimal policy can be characterized by a reorder point and state‐dependent rationing levels. We then use a tandem MTS system to address the issue of allowing multiple outstanding orders. Assuming that the production time follows an Erlang distribution, we show that both the reorder point and inventory rationing levels are state dependent. We characterize the monotonicity of the optimal reorder point and rationing levels for the pipeline of outstanding orders and discuss the managerial insights. The numerical results show that when the batch size is relatively large it may be sufficient to restrict the system to allow at most one order outstanding.
Our contributions are two twofold. First, we generalize the lost sales inventory rationing models to batch‐ordering systems while allowing multiple orders outstanding simultaneously. The intrinsic difficulty in analyzing the lost sales system with multiple orders outstanding arises from the fact that the decision maker needs to take into account the status of all the outstanding orders while making inventory replenishment and rationing decisions. Second, when addressing the case with multiple outstanding orders, we introduce a state‐transformation approach to treat the system as a serial system. This approach enables us to tackle the multi‐outstanding‐order problem and provide new insights into the inventory rationing problem. We show that when an order is placed, the subsequent reorder point decreases, but will increase in the time since the order was placed. This has the effect of decreasing the likelihood of placing a second order until such time when the first order is more likely to arrive. Conversely, we show that the rationing levels decrease in the time since the order was placed. This implies that when an order is placed, little changes in the rationing policy, but as the order arrival approaches, the rationing levels fall to ensure that excess inventory is removed prior to the arrival of the order. To the best of our knowledge, these structural results are shown for the first time for batch production inventory rationing problems.
The remainder of this article is organized as follows. Section 2 reviews the related literature. Section 3 addresses the case with general production times and a single outstanding order and approximates the general production time distributions with phase‐type distributions. Section 3 generalizes the analysis to the case with multiple outstanding orders. Section 5 provides the concluding remarks. All the proofs are placed in Appendix S1.
Related Literature
This study follows the growing literature on the inventory rationing problem initiated by Veinott (1965). See Ding et al. (2006), Arslan et al. (2007), Möllering and Thonemann (2008), Fadiloglu and Bulut (2010), and Cheng et al. (2011) for comprehensive reviews of recent developments. Our study fits into the stream of continuous‐review MTS models with lost sales. The most related works in this literature are Ha (1997, 2000), Melchiors et al. (2000), and Melchiors (2001).
Ha (1997) considers a MTS production system with several demand classes and lost sales. He shows that the critical‐level policy is optimal. Ha (2000) extends Ha (1997) to the case with Erlang distributed production times. Assuming that each production order contains a unit of product and there is only one outstanding order at any point in time, Ha (2000) finds that by combining the two state variables—inventory level and status of the outstanding order—into a single dimensional state, called work storage, the structural results can be readily obtained by backward induction. His results give some insights into the inventory rationing problem by incorporating production status information into the inventory allocation decision, but application of his model is relatively limited due to the restrictive assumptions of unit production and single outstanding order. He points out that one direction important future research is to address the general production time distributions. Our study not only address his concern on the general production time distribution but also further generalizes the model to allow batch production. Note that his approach of work storage state variable aggregation no longer works in our model. Moreover, with Erlang distribution lead times, we allow multiple orders outstanding simultaneously.
Melchiors et al. (2000) consider an (Q, R) inventory system with lost sales and two priority demand classes, under the assumptions of unit Poisson demand, deterministic constant lead times, and at most one order outstanding. They introduce a lead time–independent critical‐level policy, derive the exact formulation of the average cost, and propose a simple optimization procedure. Melchiors (2001) extends the model of Melchiors et al. (2000) to multiple demand classes with generally distributed replenishment lead times. He analyzes the rationing policy for an (R, Q) system with exogenously given reorder point R and order size Q, where R < Q. He shows that the optimal rationing policy can be characterized by time‐dependent critical stock levels. Note that the restriction R < Q implies that there is at most one order outstanding at any point in time. For the case of constant lead times, he shows that the optimal critical levels are a decreasing function of the elapsed lead time of the outstanding order. He does not analyze the optimality of the reorder point policy or the (R, Q) policy. In this study we also consider a batch production system with an exogenously given batch size. However, our model differs from Melchiors (2001) in that the reorder policy is endogenously determined, although it turns out that the optimal production policy can be characterized by a critical reorder point when there is at most one order outstanding at any point in time. Note that the single‐outstanding‐order assumption does not imply R < Q. Nevertheless, when the production times are approximated by a phase‐type distribution, we are able to generalize the model with a single outstanding order to the case with multiple outstanding orders and show that the state‐dependent reorder point policy is optimal.
When there is only one demand class, the inventory rationing model of the (Q, R) system reduces to the traditional (Q, R) model (see, e.g., Hadley and Whitin 1963). For tractability, it is often assumed that there is at most one order outstanding (see, e.g., Buchanan and Love 1985, Hill and Johansen 2004, Nahmias and Demmy 1981). Johansen and Thorstenson (2004) attempt to generalize the lost sales (Q,R) model to the case where more than one order may be outstanding. Assuming that orders do not cross over time, they obtain the equilibrium equations for the underlying MDP and develop a computational algorithm. Different from them, we focus on characterizing the optimal policy structure for an inventory rationing problem with a fixed order batch size, assuming the production times follow an Erlang distribution and allowing multiple outstanding orders.
Also related is the joint pricing and inventory control literature, see, e.g., Elmaghraby and Keskinocak (2003) and Chen and Simchi‐Levi (2012) for comprehensive reviews. Both pricing and rationing are typical marketing instruments for demand management. Chen et al. (2006, 2009) address the joint pricing and production control problem for the unit and batch (exponential) production system, respectively. Pang and Chen (2010) generalize Chen et al. (2009) to the case with Erlangian lead times and two outstanding orders. But their analysis cannot be readily extended to the more general case that allows any number of outstanding orders. We address the multiple outstanding orders issue. Employing a state‐transformation approach, we are able to characterize the structural properties of the optimal policies.
Single Outstanding Order
The Model with General Production Times
Consider a single‐facility MTS production system that offers a single product to N demand classes (subscript
The product is produced in batches with a fixed lot size Q, an exogenously given positive integer (e.g., a truck load or a full container). This treatment with a (exogenously given) fixed ordering lot size is common in the literature (see, e.g., Chen 2000, Song 2000). The production of each batch incurs a fixed cost, C f > 0, and a variable cost c per unit. The fixed cost includes administrative costs, transportation costs, and fixed payments to the supplier. So the total cost of each production order is C = C f + c·Q. We assume that the payment occurs when an order is completed.
The production processing time of each batch, τ, is assumed to be generally distributed with probability density function f and distribution function F. Deterministic production time can be seen as a special case. When τ is a positive random variable, we assume that the failure rate function
In this section, we assume that at any point in time there is at most one batch being processed in the facility. This assumption ensures that at the time before an order is placed, the inventory position and the net inventory level coincide. Although this assumption is rather restrictive, it provides mathematical tractability while preserving the major characteristics of batch production systems. We refer to Hadley and Whitin (1963), Nahmias and Demmy (1981), and Berk and Gürler (2008) for a similar treatment in the analysis of lost sales (Q, R) inventory systems.
The system state is described by a two‐dimensional state variable (X(t), S(t)), where
Different from the system where the transition time between each two consecutive inventory states is exponentially distributed (see, e.g., Ha 2000), the transitions of inventory states in our model are not time memoryless since the transitions depend on the elapsed time (or age) of the outstanding order (if there is any). Let
The objective of the system is to find the optimal control policy u
* that maximizes the expected discounted profit over an infinite horizon:
Let Δ be a positive small interval and
By Bellman's principle, we have
Re‐arranging the terms, dividing both sides by Δ, and letting Δ → 0 yields the HJB equations: If s = 0,
Structure of the Optimal Policy
It is difficult to analyze the solutions of differential equation systems 8 and 9 directly. In the following analysis we perform backward induction on the first‐order approximate optimality equation 2. For convenience, we omit the notation o(Δ) in the analysis.
We first define a set of structural functions. Let
Dv(x + Q,0) ≤ Dv(x,s) and Dv(x,s + Δ) ≤ Dv(x,s).
Dv(x,s) < p
1 + π
1.
These properties partially characterize the structural properties of the optimal value function. The first inequality (C1) implies that the opportunity cost of each outstanding batch order v(x + Q,0) − v(x,s) is decreasing in the inventory level x for any s ≥ 0. This property is also called Q‐difference decreasing (Huh and Janakiraman 2012). The second inequality (C1) states that the marginal value of inventory on hand is decreasing in the age of the outstanding order, which implies that
Lemma 1 employs the backward induction approach on the Bellman equation to establish some structural properties of the optimal profit function. To completely characterize the structure of the joint production control and inventory rationing policy requires the concavity property with respect to the inventory level. We are not able to prove concavity via the backward induction approach. Fortunately, we find that concavity can be proved by using the properties of Lemma 2 and manipulating the HJB equations 8 and 9.
For any s ≥ 0, J(x,s) is concave in x, i.e., D
2
J(x,s) ≤ 0 for all x ≥ 2.
We now characterize the structure of the optimal policy
The optimal production control policy is characterized by a critical stock level (reorder point): The optimal rationing policy can be characterized by the time‐dependent critical levels:
In Theorem 1, part (a) shows that the optimal batch production control policy is of the critical‐level type. Thus, the reorder point policy is optimal.
Part (b) states that for any class n demand, the optimal rationing policy is characterized by the time‐dependent critical level K n (s) when s units of time have elapsed. If the inventory level is below or at K n (s), the marginal value of inventory exceeds the penalty cost plus the lost sales revenue due to rejecting the customer order. In other words, when the inventory level is low, it is more beneficial to reserve inventory in anticipation of future demands from higher priority classes. When the inventory level is high, it is more beneficial to accept more orders from lower priority demand classes so as to increase the total revenue and reduce the inventory holding cost. In particular, the marginal value of inventory is always less than p 1 + π 1, which implies that it is always optimal to accept class 1 customer orders, i.e., K 1(s) = 0.
For any s, K n (s) is increasing in n. This nested threshold structure implies that the higher the inventory level is, the more demands from lower priority classes will be accepted. For any n, K n (s) is decreasing in s, which follows from property (C1). The rationale behind the time‐monotone structure is as follows: When making an inventory allocation (rationing) decision, the manager needs to take into account not only the inventory position (= on‐hand inventory + inventory being processed) but also the status of production. As time goes by, the outstanding order gets closer to completion, so the opportunity cost of on‐hand inventory becomes lower (due to the incoming production order). Thus, given the same inventory position, the state with a production order closer to completion tends to accept more lower priority customer orders.
The threshold type of policy structure characterized by Theorem 2 is very intuitive and easy to implement. It is in line with that of Ha (2000) for systems with unit production and Erlangian production times. However, it is worth mentioning that the analysis of the batch production system is technically more challenging because Ha's approach of aggregating the two‐dimensional state space into a one‐dimensional state space no longer works in our model. In addition, the system with a batch‐ordering restriction normally does not have the concavity property (see Huh and Janakiraman 2012). Without the concavity property, the rationing control policy may not be of the threshold type. Hence, this nice property is a surprise to us and it allows us to have a complete characterization of the rationing policy.
Our analysis benefits from the assumptions of Poisson demand, lost sales, and a single outstanding order. In particular, when allowing multiple outstanding orders, with generally distributed lead times, the model becomes intractable since there may be an infinite number of outstanding orders at any point in time. In the following analysis, we first use a phase method to approximate the production times and then extend the structural analysis to the case that allows multiple outstanding orders.
Approximation with Phase‐Type Distributions
The exact computation of the above model with general production time distribution, although possible, is not easy, as it involves solving a differential equation system. It is known that it is possible to approximate any distribution on non‐negative real numbers by a phase‐type (PH) distribution to any degree of accuracy (Tijms 1994). Using the PH distribution to approximate the general distribution is also called the method of phases in queueing theory. This method is often used in inventory theory to model stochastic lead times (Zipkin 1988, 2001). A typical class of PH distributions is the mixed‐Erlang distribution. Many distributions, such as the exponential distribution, Erlang distribution, and hyper‐exponential distribution, are special cases of the mixed‐Erlang distribution. Therefore, we can use the mixed‐Erlang distributed processing times to approximate general stochastic lead times. The advantage of using the mixed‐Erlang distribution is that the duration of each delivery phase is exponential. Since the demand processes are Poisson processes, the system is memoryless when it is between two delivery phases or two consecutive demand arrivals, which enables us to easily work on the Markovian discrete event system and compute the optimal policies.
Approximating the production time distribution using the mixed‐Erlang distribution is natural for systems with the following operational characteristics: The production process consists of multiple processing steps. The duration of each step is approximately exponentially distributed, and the number of steps to finish the production of each batch is random. Then, each Erlangian phase corresponds to a production step, and the completion of each phase corresponds to the completion of each production step. See Ha (2000) for detailed justifications for using phases to represent partially completed production.
In the following analysis, we assume that the production times follow the following distribution:
Tijms (1994) demonstrates how to approximate a general distribution using the mixed‐Erlang distribution. For example, when 0 ≤ c
τ
≤ 1, where c
τ
is the coefficient of variation in the production time τ, the distribution of τ can be approximated by a mixture of two Erlang distributions with k and k − 1 phases with probabilities ω and 1 − ω, respectively, and the same rate β such that
With mixed Erlangian processing times, the state of the system can be represented by
The phase‐type distribution approximation allows us to formulate the problem as a typical Markov decision problem. Re‐scale the time unit so that γ + Λ + β = 1, where
Note that J(x,k) = J(x + Q,0). If we do not re‐scale the time unit, the equation 11 should be replaced by
The following theorem shows that the approximate model has a similar optimal policy structure as that of the model with general production time distribution.
The optimal production policy is characterized by a critical level: The optimal rationing policy is characterized by the state‐dependent critical levels:
The optimal policies and the resulting expected profits for both the exact model and the approximate model can be computed using the standard value iteration approach (see, e.g., Puterman 1994). More specifically, the computation is started from a system with a truncated state space by limiting the maximum inventory level. We first initialize the value function by assigning zeros to all the states. We then conduct value iterations according to the optimality equation. The iteration is terminated only when a pre‐set level of accuracy is achieved. The size of the state space is enlarged gradually until the profit is no longer sensitive to any increase in the state space. Our results can also be easily extended to the long‐run average profit setting by letting the discount rate go to zero. See, e.g., Ha (2000) and Benjaafar and ElHafsi (2006) for more detailed discussions. Using the value iteration approach, we can also compute the optimal policies and relative value functions under the long‐run average profit criterion.
We next provide an illustrative numerical example to show how to approximate a general distribution with a mixed‐Erlang distribution.
Suppose that the production time τ satisfies a Gamma distribution with shape parameter κ and rate μ, with κ > 1. Then, E[τ] = κ/β and

Probability Density Functions with Gamma Distribution and Its Mixed‐Erlang Approximation

Optimal Rationing Policies with Gamma Distribution and Its Mixed‐Erlang Approximation
Multiple Outstanding Orders: A Tandem MTS System
The preceding analysis relies on the assumption that there is at most one order outstanding at any point in time, i.e., no new order is issued if there is an order in the outstanding order pipeline. We relax this assumption in this section by considering a k‐stage serial inventory system, in which there may exist multiple outstanding orders at the same time. The stages are indexed by j = 0, 1, …, k − 1. The lowest stage storing the on‐hand inventory is represented by stage 0, stage j + 1 ships all its inventory to stage j, j = 0, 1, …, k − 2, and stage k − 1 places batch orders from an outside supplier with infinite supply.
We assume that the duration of each phase is exponentially distributed with a mean 1/μ, which implies that the production time of each order follows a k‐Erlang distribution with a mean k/μ. Due to the memoryless property of the exponential distribution, the orders placed at different times may pile up at some stage. To approximate the exogenous sequential supply system where the replenishment orders do not cross overtime (Zipkin 2000, § 7.4), we assume that all the orders in the same stage, once piled up together, will move simultaneously to the subsequent stage. In other words, an order may be delivered simultaneously with the other orders placed earlier. This is a common treatment in the literature (see, e.g., Johansen 2005, Kaplan 1970, Zipkin 2008).
Let (x, q
1, …, q
k−1) denote the state of the system, where x is the on‐hand inventory level and q
i
represents the size of the outstanding order at stage i, i = 1, …, k − 1. Note that x is a non‐negative integer and q
i
is a non‐negative integer multiple of Q. Similar to the multi‐echelon inventory models (see, e.g., Pang et al. 2012), we can transform the state variable by a vector y = (y
0,y
1, …, y
k−1) such that y
0 = x,
Here, the operators H
i
addresses the order shipments from stage i to stage i − 1, i = 1, …, k − 1, and H
k
addresses the ordering decisions at stage k − 1 (i.e., shipments from the outside supplier or stage k). For convenience, we define Δ
e
V(y) = V(y) −V(y − e) and Δ
i
V(y) = V(y) − V(y − Qe
i
), where e
i
is the k‐dimensional unit vector with the i‐th component being 1, i = 0, 1, …, k − 1, and
Let s = (s 1,⋯,s k−1), where s j = q 1 + … + q j represents the partial sum of the sizes of the outstanding orders (in batches) from stage 1 to stage j, j = 1,⋯,k − 1. Then, y = (x,x + s 1,…,x + s k−1). The following theorem characterizes the structure of the optimal policy.
The optimal inventory replenishment policy is characterized by a state‐dependent reorder point: The optimal rationing control policy can be characterized by state‐dependent rationing threshold levels:
This theorem characterizes the structure of the joint production and inventory rationing policy when there may be multiple orders outstanding at the same time. Note that compared with state y, y + Qe
k−1 has one more batch in the last stage and in the total number of outstanding orders. For i < k − 1, compared with y, y + Qe
i
has one more batch in stage i, but has one less batch in stage i + 1, with the total number of outstanding orders being the same, which implies that in state y + Qe
i
the outstanding orders are closer to their receivers. Part (a) shows that the optimal production control is characterized by a state‐dependent reorder point (in terms of inventory level). The optimal reorder point depends on the status of the production order pipeline, which is different from that derived under the assumption of at most one order outstanding where the optimal reorder point is independent of the delivery status. Note that given the total number of outstanding orders
Part (b) shows that the optimal rationing policy can be characterized by the state‐dependent critical stock levels. The monotonicity of
We now translate the optimal policy structure back into the production status in terms of (i
1, …, i
k−1). Let R(q
1, …, q
k−1) and K
n
(q
1, …, q
k−1) be the respective production and inventory rationing control thresholds corresponding to
For 1 < l ≤ k − 1, if q
l−1 = q
l
= 0, then the following inequalities hold:
Corollary 1 provides further insights into the optimal policy structure. From inequalities 13 we know that
These inequalities have the following implications. (1) The more orders are outstanding, the lower the reorder point is. (2) The effect of having one more batch that has completed k − l phases on the reorder point is stronger than the effect of having one more batch that has completed k − l + 1 phases. That is, the sensitivity of the reorder point to the number of outstanding orders in each phase decreases in the ages of the outstanding orders, where the age of an outstanding order refers to the number of phases it has completed. The reordering decision is most sensitive to the youngest outstanding orders that have completed only one phase.
From inequalities 14, we know that
The above monotone sensitivity is in line with the lost sales inventory models without the batch‐ordering restriction (see, e.g., Huh and Janakiraman 2010, Zipkin 2008) and the periodic‐review inventory‐pricing model (Pang et al. 2012). In the continuous‐review setting, Pang and Chen (2010) present some preliminary analysis for these properties when there are at most three orders outstanding (i.e., k = 3). However, it is not easy to further extend their analysis to the general case where k can be any positive integer. The state‐transformation approach enables us to analyze the structural properties in the context of the inventory rationing model with multiple outstanding orders.
The optimal policy parameters can be obtained by solving the optimality equation 12 using the conventional value iteration approach. Note that such an approach requires remembering the profit values for all the states. As the number of the outstanding orders increases, the state space increases exponentially and the number of iterations required before the algorithm converges may also increase significantly, and the computation effort becomes prohibitive. For more detailed discussions of the computational complexity and convergence of the value iteration algorithm, the reader may refer to Puterman (1994).
The following two examples demonstrate the structure of the optimal policies in a two‐phase and a three‐phase tandem MTS systems, respectively.
Consider a system with a two‐phase Erlang production process (k = 2). The mean of each phase is 1/k (so that the mean of the total production time is 1). The other parameters are γ = 0.01, Q = 10, C = 10 + 5Q, h = 1,

Structure of Optimal Policy (k = 2)
Consider a system with a three‐phase Erlang production process (k = 3). The mean of each phase is 1/k (so that the mean of total production time is 1). The other parameters are the same as those in Example 2. The optimal reorder point and rationing levels are as follows.
It is interesting to know how sensitive the system performance is to the restriction on the number of outstanding orders. To this end, we solved a set of numerical examples to compare the performance of the system that allows at most one order outstanding with that of the system that allows multiple orders outstanding at any time. We examined the systems with two‐, three‐, and four‐phase Erlang production times, respectively. The average production time is 1 and the mean time per phase is 1/k. For ease of comparison, we use the long‐run average profit as the performance measure. Performance Comparison.
The system parameters are still γ = 0.01, Q = 10, C = 10 + 5Q, h = 1,
Performances: One Outstanding Order vs. Multiple Outstanding Orders
Optimal Policies: One Outstanding Order vs. Multiple Outstanding Orders
First, for all k, as the batch size Q increases, the optimal reorder point, rationing levels, and profit difference tend to decrease. This fits the intuition that the larger the order size is, the less frequently the replenishment order is placed and more demand orders are accepted. When Q is small, the reorder point tends to be greater than Q, R > Q, and the profit difference is significant, which implies that the single‐outstanding‐order assumption may lead to a greater loss. When Q is large, the reorder point tends to be smaller than Q, R < Q, and the profit difference becomes close to zero, which implies that it may suffice to allow at most one outstanding order.
Second, comparing the single‐outstanding‐order system and tandem system, it is interesting to observe that given the same production status, especially when the batch size is small, the tandem system tends to have lower rationing levels and reorder points. This may be due to the opportunity to have multiple outstanding orders before the current order is delivered and then the tandem system will be more likely to have higher inventory levels later. The anticipation of having more inventory induces the manager to set lower rationing levels to accept more orders and lower the reorder point to avoid the inventory over‐stocking risk. However, when the batch size is large and thus R < Q, the tandem system only allows at most one order outstanding and the rationing levels are effectively the same as those of the system with at most one order outstanding. This finding confirms the view that when the batch size is sufficiently large, the system with at most one outstanding order provides a good approximation of the tandem system.
Concluding Remarks
This study addresses the inventory rationing problem for a lost sales MTS system with batch ordering and multiple demand classes. We first consider the case with general production times and a single outstanding order and then approximate the production time distribution by the phase‐type distributions. To address the cases with multiple outstanding orders, we consider a MTS tandem system. We introduce a transformation approach that enables us to characterize the structure of the optimal policy and obtain some new structural results. These results provide some new insights into the inventory rationing problems.
Nevertheless, our model is restricted to the assumptions of Poisson demand and fixed batch size. In addition, although we are able to characterize the structure of the optimal policies when there are multiple outstanding orders, it is still unrealistic to compute the optimal policies directly due to the curse of dimensionality. An important future research direction is to use some of the insights provided by the structural analysis to design effective optimal or heuristic algorithms.
More importantly, we have a limited understanding on backlog systems with batch production/ordering. Huh and Janakiraman (2010) show that the (R,nQ) policy is optimal when there is only a single demand class. In the presence of multiple demand classes, the questions as to whether the (R,nQ) policy is still optimal and whether the optimal rationing control is still of threshold type remain open. We aim to address these issues in future research.
Footnotes
Acknowledgments
We sincerely thank Professor Panos Kouvelis (the department editor), the senior editor, and three anonymous referees for their valuable comments and suggestions that helped improve this study. The corresponding author, Houcai Shen, was partly supported by National Natural Science Foundation of China (No. 71071074) and MOE (Ministry of Education in China) Project (No 20120091110059).
