Abstract
We consider a firm purchasing a storable raw material commodity from a spot market with volatile commodity prices and the access to an associated financial derivatives market. The purchased commodity is processed into an end product with uncertain demand and lost sales. The firm aims to integrate the inventory replenishment and financial hedging decisions to maximize the mean‐variance of terminal wealth over a finite horizon. Recognizing time‐inconsistency of mean‐variance criteria, we employ the dynamic programming approach to obtain a time‐consistent policy. Assuming no arbitrage in financial market, we show that the mean‐variance utility functions under the time‐consistent policy have a recursive representation which enables us to readily characterize the structure of the time‐consistent policy. We analyze two types of hedging instruments, vanilla hedges and exotic hedges, and show that inventory and financial hedging decisions can be separated in the presence of forward contracts and a myopic state‐dependent base stock policy is optimal. The optimal hedging policy can be obtained by minimizing the variance of the hedging portfolio, the value of excess inventory and the profit‐to‐go as a function of future price. In the presence of a continuum of option strikes, we show how to construct custom exotic derivatives using forwards and options of all strikes to replicate the profit‐to‐go function. We then show the optimality of the time‐consistent policy under exotic hedge for the initial mean‐variance objective. We further investigate the dynamic interplay of inventories and financial hedge and show that they can be substitutes in a dynamic environment. Finally, we compare the performances in different hedging environments to discuss how financial hedges add value and provide a numerical study.
Introduction
The past decade has witnessed unprecedented volatility in commodity prices, threatening the survival of individual companies. For example, the crude oil price experienced tremendous fluctuations in recent years, falling from a peak of $115 per barrel in June 2014 down to under $35 in February 2016 but then up to about $50 in December 2016. According to a PwC survey of leading manufacturers, a large majority of senior executives said commodity price is crucial to a company's financial performance and many organizations were more proactive in managing commodity risks by focusing on improving operational efficiency and derivative hedging strategies (PwC 2008). For instance, the beverage producer Anheuser‐Busch uses exchange‐traded wheat futures and over‐the‐counter (OTC) aluminum swaps to hedge the risk of higher raw material prices. But financial hedging itself is a double‐edged sword and it needs to be integrated with operational strategies. In 2001, Ford announced $1 billion loss on precious metals inventory and forward‐contract agreements while Hewlett–Packard (HP) was able to cut commodity price risk using procurement risk management strategy based on a contract portfolio approach (Nagali et al. 2008).
Why should firms hedge? Merton H. Miller once said to an oil producer who resisted to use futures market, “When you hold inventory, non‐hedging is gambling. You gambled that the price of oil would not drop and you lost” (Miller 1997). A substantial literature in finance has provided the economic interpretations on why value‐maximizing firms may benefit from reducing profit variability and therefore engage in financial hedges as if they were risk averse. In their seminal paper, Smith and Stulz (1985) show that hedging, as part of firms’ financial policies, can reduce expected tax liability, bankruptcy costs, or compensation paid to risk‐averse managers, and they define hedging as the acquisition of financial assets that reduce the variance of the firm's payoffs. To capture those market imperfections and managerial risk aversion in a tractable fashion, instead of explicitly considering those factors (e.g., tax liability or bankruptcy), concave utility functions are often used to define firms’ decision criteria; see Chod et al. (2010) for a concave exponential function, and Gaur and Seshadri (2005) and Ding et al. (2007) for mean‐variance criteria. As argued by Smith and Stulz (1985) that the primary goal of hedging is to smooth cash flows, minimizing variability of cash flows is a natural objective for firms to develop hedging strategies. Duffie and Richardson (1991) also argue that hedging may reduce the expected costs arising from those market imperfections, and a simple hedging criterion, such as the minimum‐variance criterion, can be quite satisfactory from a practical point of view. Following these arguments, we adopt the mean‐variance criterion to analyze the joint replenishment and financial hedging problem.
The mean‐variance analysis of Markowitz (1952), that measures the risk of a portfolio with its variance, is seen as the cornerstone of modern portfolio theory and has been widely used in both academia and industry due to its simplicity and intuitive appeal (see, e.g., Basak and Chabakauri 2010, 2012, Hull 2008, Markowitz 1959, 1991 and the references therein). Eydeland and Wolyniec (2002) argue that the mean‐variance rule can serve as a proxy for other types of risk minimization rules. In particular, Levy and Levy (2004) show that the efficient sets of mean‐variance and prospect theory almost coincide and therefore one could use the mean‐variance rule to construct efficient portfolios under prospect theory, which justifies the robustness of the mean‐variance rule.
In this study, we address a joint inventory replenishment and financial hedging problem for a firm aiming to maximize mean‐variance of terminal wealth over a finite horizon. We attempt to answer the following questions: (i) What is the structure of the time‐consistent policy? (ii) When are the time‐consistent policies optimal for the initial mean‐variance objective? (iii) How do inventories and financial hedges interplay? and (iv) How do financial hedges add value for the firm?
More specifically, we consider a firm purchasing a storable commodity from a competitive physical spot market with the access to associated financial derivatives. The purchased commodity serves as a raw material which can be processed into an end product with fixed selling prices. The demand of the end product is random and unmet demand is treated as lost sales. The spot price evolves as a Markov process. The physical spot market and the associated financial derivatives market are both assumed to be liquid in that no transaction cost is incurred in trading. Moreover, we assume that there is no arbitrage opportunity in either the physical spot market or the financial market.
Note that the mean‐variance criterion is time‐inconsistent as the variance of terminal wealth evaluated at any point in time is greater than that of a later date, which provide incentives for the firm to deviate from the optimal policy for the initial objective. The time‐inconsistency problem has long been recognized in the economics literature (see, e.g., Caplin and Leahy 2006, Strotz 1956). However, since the time consistency is a basic requirement of rational decision making, Strotz (1956) argues that an investor should always choose the best plan among those that he will actually follow unless pre‐commitment is possible. Following Strotz (1956), Basak and Chabakauri (2010) use the dynamic programming approach to derive the time‐consistent policy for the dynamic mean‐variance portfolio problem. Using a similar approach, we develop a recursive formulation to derive a time‐consistent policy for the commodity inventory system.
We derive the time‐consistent policies in two types of hedging environments. When only exchanged‐traded vanilla derivative contracts, such as forwards and options, are used, we show that, with forward contracts maturing at next period, the joint replenishment and hedging optimization problem can be decomposed into two sub‐problems: a myopic inventory decision problem and a variance‐minimization portfolio optimization problem. We show that a myopic state‐dependent base‐stock policy is optimal. The optimal hedging decisions can be obtained by minimizing the variance associated with the expected future cash flows. We then show how to construct exotic derivative contracts with European calls and puts of all strikes to fully replicate the expected future cash flows, conditional on the next period's spot price. Note that exotic derivatives refer to all those non‐standard and tailor‐made derivative contracts that are mostly traded in OTC markets (Briys et al. 1998). We show that the optimal exotic hedge leads to the optimality of the time‐consistent policy for the initial objective function. We also investigate the interplay between inventories and financial hedges and show that financial hedges increase the inventory level in the last period, which is in line with the finding of the newsvendor literature (see, e.g., Gaur and Seshadri 2005), but may reduce the inventory level in the preceding periods, which implies that inventories and financial hedges may be substitutes. We further compare the performances of different hedging strategies and provide a numerical study to examine these results.
The rest of the study is organized as follows. Section 2 reviews the related literature. Section 3 describes the problem, discuss the time‐inconsistency issue and derive a recursive representation. Section 4 analyzes the structure of the time‐consistent policies under vanilla and exotic hedges. Section 5 studies the dynamic interplay between inventories and financial hedges. Section 6 presents a numerical study. Section 7 concludes the study. All the proofs are in the Appendix.
Related Literature
There exists a substantial literature on financial hedging in finance; see Sanda et al. (2013) for a review on the practice and recent developments of financial hedging in non‐financial firms. As the primary goal of hedging is to reduce cash‐flow variability (Smith and Stulz 1985), the minimum‐variance criterion is commonly used in practice (Duffie and Richardson 1991, Hull 2008). To capture the tradeoff between risk and return, Markowitz (1952) introduces mean‐variance criterion to portfolio risk management. In a complete market setting, Li and Ng (2000) provide a closed‐form solution for the optimal pre‐commitment policy that maximizes the mean‐variance objective at the initial date. However, due to the time‐inconsistency of the mean‐variance criteria, an investor may subsequently find it optimal to deviate from the initial policy if market conditions change in the future unless she is able to pre‐commit (Strotz 1956). Following Strotz (1956), Basak and Chabakauri (2010) employ a dynamic programming approach to derive a time‐consistent policy for a dynamic portfolio optimization problem in an incomplete market. Similarly, Basak and Chabakauri (2012) derive a time‐consistent variance‐minimization hedging strategy for a non‐tradable asset. We adopt a similar approach to derive a time‐consistent policy for a stochastic commodity inventory system where uncertain demand is a non‐tradable risk factor.
There is a growing body of literature on joint operational and financial risk management. See Kleindorfer (2009) and Turcic et al. (2015) for reviews of recent developments. Earlier studies focus on extending conventional models under risk‐neutral criteria to settings under risk‐averse criteria (see, e.g., Agrawal and Seshadri 2000, Bouakiz and Sobel 1992, Eeckhoudt et al. 1995). Recent developments on the joint operational and financial hedging problem focus on single‐period (newsvendor) settings. For example, Gaur and Seshadri (2005) consider a joint purchasing and financial hedging problem for a risk averse retailer with demand being correlated to a market index. They show that financial hedging increases the optimal inventory level, which implies that inventories and financial hedges are complements. Ding et al. (2007) consider a global manufacturer coordinating production and financial hedging strategies in the presence of exchange rate risk. Chod et al. (2010) examine the relationship between the operational flexibility and financial hedging in capacity investment decisions. They show that product flexibility and financial hedging tend to be complements (substitutes) when product demands are positively (negatively) correlated, whereas postponement flexibility is a substitute to financial hedging. Martínez‐de‐Albéniz and Simchi‐Levi (2006) study the mean‐variance tradeoffs for a manufacturer with a portfolio of option contracts.
There are only a few papers in multi‐period settings. With an additive exponential utility function that is time‐consistent, Chen et al. (2007) study an inventory system that integrates pricing, inventory control, consumption and financial hedging decisions. They characterize the optimal policy structure and show that the inventory and pricing decisions can be separated from the financial decisions, extending the separation theorem of Smith and Nau (1995) to an operational context. Note that their analysis implies that using financial hedges does not alter the optimal policy structure for inventory and pricing decisions when compared to the system without financial hedges. Using a similar framework, Geman and Ohana (2008) discuss time‐inconsistency of various risk measures and propose a time‐consistent criterion for a commodity portfolio problem.
More recently, Kouvelis et al. (2013) study a joint replenishment and financial hedging problem for a lost sales commodity inventory system under the criterion of maximizing mean‐variance of the net present value (NPV) of total cash flows over a finite horizon. Also recognizing the time‐inconsistency of such a criterion, they propose a stochastic program framework to derive a time‐consistent policy by solving a sequence of optimization problems. They analyze and compare single‐contract and multi‐contract hedging strategies and show that myopic base stock policies are optimal under forward hedge. Note that the mean‐variance utilities evaluated in consecutive periods do not have a recursive form, which limits its tractability. While considering a similar commodity inventory system, different from Kouvelis et al. (2013), we adopt the criterion of maximizing mean‐variance of terminal wealth. Note that although decision criteria based on NPV are prevalent in operations literature, the criteria based on terminal wealth are more common in portfolio theory (see, e.g., Back 2017, Basak and Chabakauri 2010, Duffie and Richardson 1991 and references therein). Following Basak and Chabakauri (2010), we develop a recursive representation for the mean‐variance utilities evaluated in consecutive periods, which allows us to derive a time‐consistent policy by backward induction. Compared to Kouvelis et al. (2013), our approach has better analytical tractability and computational efficiency. In particular, under exotic hedge, we show that the time‐consistent policy is indeed optimal for the initial objective. To our best knowledge, it is the first attempt to show that a time‐inconsistent criterion can have a time‐consistent optimal policy in the presence of non‐tradable risk. Moreover, we analyze the interplay between inventories and financial hedges and identify a condition under which inventories and financial hedges may be substitutes while Kouvelis et al. (2013) only discuss it in a numerical study.
In summary, our contributions to this literature are twofold. First, we explicitly address time‐inconsistency in the joint inventory and financial hedging problem in the mean‐variance framework. We employ the dynamic programming approach to derive the time‐consistent policies and show that when the financial market is reasonably complete, such as with a continuum of option strikes, the time‐consistent policy is also optimal in the admissible policy set. Second, our analysis sheds new light into effective commodity risk management practices: Inventory is used to hedge demand (quantity) risk while financial derivatives are used to hedge the price risk. The inventory replenishment decisions can be separated from financial hedging decisions and a state‐dependent myopic base stock policy is optimal. The optimal hedging policy consists of a forward hedge to offset the price risk related to the expected leftover inventory of the current period and a portfolio of financial derivatives to hedge the cash flows related to future decisions. Financial hedging does affect inventory policies and often simplifies them. However, optimal financial hedging strategies are heavily dependent on inventory policies as they determine future cash flows. In other words, inventory decisions can be separated from financial hedging decisions, but not the other way around.
The Model
Problem Description
Consider a firm purchasing a storable commodity from a volatile spot market as a price taker. This commodity serves as a raw material which can be processed into an end product with uncertain demand and fixed selling prices. The processing is on demand basis, which implies that only the raw material inventory is held. Assume that there exists a spot market and an associated financial market for financial derivative contracts written on the spot price. The inventory is reviewed periodically and the periods are indexed by t = 0, 1, 2, …, T + 1. Period 0 is the beginning period and period T + 1 is the ending period. The firm serves the demand occurring in period t = 1, …, T, denoted by
Demand in each period may depend on the spot price. Without loss of generality, we assume that the support of demand is
We assume that this commodity can be sold to or bought from the spot market at spot price at the beginning of each period. The replenishment lead time is zero. Assume that the commodity spot market is liquid in the sense that there is no transaction cost and the bid–ask spread is zero. Let S
t
be the spot price at the beginning of period t. Denote by s the realization of the spot price. The spot price process {S
t
, t = 0, 1, …, T} is a Markov process. Let E[·] be the expectation operator under the subjective probability measure and E
t
[·] = E[·¦S
t
= s] be the conditional expectation operator. According to the competitive storage theory, the commodity price satisfies the following no‐arbitrage condition (see, e.g., Routledge et al. 2000, Williams and Wright 1991, Working 1948).
We assume that the derivatives market is frictionless in that the firm can buy or sell as many shares of a derivative contract as desired at the market price without incurring any transactional cost. In addition, all the relevant financial derivatives (e.g., forwards/futures and options) are written on the spot price of the underlying commodity. Let
Following Froot et al. (1993), Gaur and Seshadri (2005) and Chod et al. (2010), we assume that the expected payoff of the financial hedging portfolio is zero, i.e., E t [H t (S t+1)] = 0. Any violation of such an assumption implies that there exist speculative opportunities from financial hedging. From an operational manager's perspective, the main goal of trading financial assets is to hedge the risk in the operating profit due to the commodity price volatility, not to gain profit through speculation in the financial markets. Most firms do not allow managers to engage in such speculative behavior. As the goal of this research is to provide insights into how firms should manage the variations of operating profits using financial hedging, we use this assumption to eliminate the speculative motive in our analysis. Such a treatment is also commonly seen in finance literature (see, e.g., Brown and Toft 2002, Geman and Ohana 2008). Nevertheless, we will relax this assumption to allow non‐zero expected profits (i.e., arbitrage trading opportunities) from financial hedging in section 8.4 where we show that our structural results remain true, and thus our results are robust to this assumption.
Our assumptions on the joint probability space of the demand and spot price are in line with the assumption of “partial complete market” by Smith and Nau (1995). See Chen et al. (2007) and Kouvelis et al. (2013) for more discussion on it. The forward prices are determined by the expected spot price at maturity of the contract. Let f t,τ (s) be the forward price quoted in period t and for delivery in period τ ≥ t given the spot price at t, s. In particular, f t,τ (s) = E t [S τ ] and f t,t = S t .
Mean‐Variance Formulation and Time Inconsistency
Note that in period 0 only hedging decision is made. From period 1 to T, a joint inventory and hedging decision is made. Let x
t
be the inventory level at the beginning of period t and y
t
be the order‐up‐to level in period t such that
Denote by u
t
= (y
t
, H
t
) be a feasible joint strategy in period t and u = (u
0, …, u
T
) an admissible policy. Let
Let W
0 be the initial wealth level of the planning horizon such that W
0 ≥ 0. Let
Note that the term S
t+1
x
t+1, which is equal to the market value of x
t+1 units of inventory, in the operating profit function of period t + 1,
The dynamics of wealth process can be redefined as
In the initial period 0, the firm's goal is to maximize the mean‐variance of terminal wealth accumulated over the whole planning horizon:
As time proceeds, upon receiving new market information and reviewing on‐hand inventory and wealth levels, the firm updates its mean‐variance objective
Time consistency is a basic requirement of rational decision makers. Strotz (1956) argues that, when recognizing time‐inconsistency of mean‐variance criteria, the firm making decisions at any point in time should take account of the strategies it will actually execute in the future, even though those future decisions may not be optimal for its initial objective of Equation 4. Hence, it is rational for the firm to adopt a time‐consistent policy under which the firm optimally chooses the decisions at any point in time to maximize the mean‐variance of terminal wealth evaluated in that time, taking into account that he or she will act optimally in the future with respect to the mean‐variance of terminal wealth to be reevaluated in the future. We then follow Strotz (1956) and Basak and Chabakauri (2010) to employ the dynamic programming approach to derive a time‐consistent policy.
Dynamic Programming Formulation for Time‐Consistent Policies
We now present the approach to derive the time‐consistent policy. In this approach, the firm's dynamic optimization problem can be viewed as an intrapersonal sequential game such that the firm in each period acts as a Stackelberg leader and chooses the best strategy in this period while taking into account its best responses in future periods, and the corresponding sub‐game perfect Nash equilibrium policy is a time‐consistent policy (Basak and Chabakauri 2010, Cui et al. 2017).
More specifically, for each period t, the firm solves the following optimization problems:
Let
The following proposition further develops the recursion 8 as a bi‐level recursive representation.
Proposition 1 shows a bi‐level recursive representation of the value functions under the time‐consistent policy u*, including the recursive representations for the mean‐variance utility functions and the expected operating profit functions. The following proposition further simplifies that the recursive representation 9 and 10 by separating the wealth level.
The optimal mean‐variance utility function with
Note that Equations 12 and 13 are corresponding to Equations 9 and 10 with the separation of wealth level from the optimal mean‐variance utility functions. The separation of the wealth level allows us to separate J t+1 from the optimization problem 10 and focus on the expected single period profit E t [π t (y, s, S t+1)] and the time‐consistency variance adjustment term.
The optimality equation 13 implies that the inventory and financial hedging decisions depend only on the initial price and thus the current inventory decision will not affect next period's decisions. Moreover, the time‐consistent term is divided into two parts: the first one is related to the variance of leftover inventory (driven by demand uncertainty) that is carried over to the next period, and the second one is about the variance of the cash flows related to the spot price: the market value of expected carryover inventory, the period‐(t + 1) expected profit‐go‐to, and the payoff of financial hedging. This decomposition implies that the financial hedges are used to mitigate the price risk (by minimizing the variance of the cash flows related to the spot price of the next period), while the risk due to demand uncertainty (i.e., the variance of leftover inventory) cannot not be financially hedged. In other words, demand risk is hedged with physical inventory and price risk can be hedged by financial derivatives.
The optimal policy can be solved by backward induction from Equations 12 to 13. The optimality equation 13 also indicates that the joint inventory and financial hedging decision can be solved in two sequential steps. First, given any inventory decision y, the optimal hedging strategy is derived by solving the problem 15 from which we can see that the financial hedges should be constructed to offset the cash flow at the beginning of the next period,
Structure of Time‐Consistent Policies
This section studies the structures of the time‐consistent joint replenishment and hedging policies under vanilla hedge and exotic hedges respectively. For convenience, we use superscripts (f) and (e) to indicate forward (and vanilla) hedges and exotic hedges, respectively, for relevant notation.
Hedging with Vanilla Derivatives
Vanilla derivatives include standard forward contracts and all those exchanged‐traded derivatives such as futures and options. The trading mechanisms of futures and forwards are similar though futures are traded on exchanges and forwards are traded in OTC markets. For simplicity, we ignore the difference between the futures and forwards. The following theorem characterizes the structure of the time‐consistent policy under vanilla hedge.
For periods t = 0, 1, …, T, suppose The time‐consistent replenishment policy is characterized by a myopic base‐stock policy and the optimal base‐stock levels derived from the following sub‐problems: The time‐consistent hedging policy is characterized by the hedging portfolios as follows. For each period t, the hedging portfolio consists of a position of shorting In particular, if only forwards maturing at next period are available, then
Theorem 1 characterizes the time‐consistent policy in the presence of forward contracts. Specifically, part (a) shows that a myopic base‐stock policy is optimal and the optimal base stock level is either zero or the unique solution of Equation 17. Part (b) shows that the optimal hedging position of the forward contracts consists of two parts: Short
This theorem implies that, in the presence of the forward contract maturing at the next period, one could make the inventory decision independently, knowing that the risk associated with the expected excess inventory will be eliminated by a forward hedge. Then J
t
can be rewritten as
This separation has an important practical implication. In practice, inventory decisions are made by operations managers while the hedging decisions are made by financial managers. Our result suggests that the management can allow the operations manager to manage inventory decisions separately from financial hedging decisions. But, when making the hedging decisions, the risk manager must take into account the optimal inventory policy to forecast the resulting future cash flow. Therefore, the financial hedges simplify the operations manager's task and help clarify the relationship between operations management and financial risk management within an organization.
The optimality of the myopic base stock policy is ensured by the forwards maturing in next period. Without the forward contract maturing at next period, the risk associated with carryover inventory cannot be perfectly hedged and therefore the determinant of the optimal base stock level
The following corollary follows immediately from Theorem 1.
Suppose the demand is perfectly correlated to the spot price with the specification
If only forwards maturing at next period are available, then the optimal hedging portfolio is If both call and put options contracts maturing at any future period are available and b
t
= 0, i.e.,
Corollary 1 shows that when the demand is perfectly correlated to the spot price, i.e., only price risk exists, the ordering quantity of each period is simply to order the demanded quantity. But v t (s) is still nonlinear in s (due to the term (r t − S t )+), which implies that the price risk cannot be perfectly hedged by forward contracts. For deterministic demand, the price risk can be perfectly hedged by entering the short position in put options.
We next show the effect of the degree of risk aversion, represented by λ, on the inventory policy.
For all t, the optimal inventory decision
Corollary 2 shows that the more risk‐averse firm orders less under forward hedge. In particular, when λ = 0 and r
t
> s,
Hedging with Exotic Derivatives
In derivatives markets, since standard options contracts traded in exchanges normally have limited number of strikes, exotic derivatives are normally structured by financial institutions in OTC markets to meet the precise needs of their clients to develop more sophisticated hedging strategies (Hull 2008). In finance literature, it is common to assume that there exists a continuum of options of all strikes as a proxy to a frictionless financial market (see, e.g., Carr and Madan 2001). Carr and Madan (2001) argue that, the assumption of a continuum of strikes is essentially the counterpart of the standard assumption of continuous trading, which serves as a reasonable approximation to a market environment where there is a large but finite number of European options strikes (e.g., options for S&P 500). In a single‐period hedging model, Carr and Madan (2001) show that a twice‐continuously differentiable payoff function can be replicated by a portfolio of a spectrum of European put and call options with a continuum of strikes, forwards, and risk‐less bonds.
We next follow Carr and Madan (2001) to construct an optimal exotic hedging strategy. To this end, we need to impose the following assumption on spot price.
For all t and any real function v(·) which is twice differentiable almost everywhere, the conditional expectation E[v(S
t+1)¦S
t
= s] is twice differentiable in s almost everywhere.
A common example of commodity price is the geometric Ornstein–Uhlenbeck process:
The following theorem characterizes the optimal policy structure under exotic hedge.
Suppose that there exist forwards and European calls and puts of all strikes, and Assumption 1 holds. The time‐consistent replenishment policy under exotic hedge is identical to that under forward hedge as described in Theorem 1(a): For all t, if r
t
≤ s, the least optimal base‐stock level The time‐consistent hedging policy under exotic hedge is characterized by the hedging portfolios as follows. For t = 0, …, T, the optimal hedging portfolio is to short
Theorem 2 characterizes the structure of the time‐ shows that the optimal ordering policy under exotic hedge is the same as that under forward hedge: a myopic state‐dependent base‐stock policy is optimal.
Then, under the optimal exotic hedging strategy, the Equation 14 is equal to
We next show the optimality of the time‐consistent policy under exotic hedge for initial objective.
The time‐consistent policy under exotic hedge, characterized by Theorem 2, is optimal for the original mean‐variance optimization problem 4 in the initial period 0.
Note that although forward hedges are sufficient to separate the inventory decisions from financial hedges in the same period (Theorem 1) the future inventory decisions, which determine the profit‐to‐go function
Dynamic Interplay between Inventories and Financial Hedges
We now study the dynamic interplay between inventories and financial hedges. To this end, we consider the setting in the absence of financial hedges, which is described as a “no hedge” environment, indicated by the superscript (n). Then under the time‐consistent policy, we have
Suppose there is no financial hedge for period T. If r
T
≤ s, the optimal inventory decision In particular, when r
T
> s,
Part (a) characterizes the optimal inventory decision in period T. Part (b) shows that financial hedging, such as forward hedge, can result in a higher inventory level that no hedging does, which implies that inventories and financial hedges are complements in the last period. This insight is in line with that in newsvendor models (see, e.g., Ding et al. 2007, Gaur and Seshadri 2005).
However, for earlier periods before period T, inventories and financial hedges may no longer be complements. The following proposition identifies a condition that the inventory decisions under forward hedges are less than that under no hedge.
For any period t = 1, …, T − 1, suppose there are no financial hedges and
Proposition 4 shows that the optimal unhedged inventory decision can be greater than that under financial hedges in period t when inequality 28 holds. Note that
Intuitively, the expected profit of each period is a decreasing function of the purchasing price and thus the future cash flows may be negatively correlated to the spot price. To show that such a negative correlation does exist, we consider the following special case.
For period t = 1, …, T, suppose the demand is perfectly negatively correlated with spot price with the form
Note that
The above two propositions shed new light into the relationship between physical inventories and financial hedges in dynamic inventory systems. If only the price risk associated with leftover inventory is of the concern in period T, financial hedges that eliminate the price allow the firm to focus on demand risk and therefore induce the firm to increase inventory level, which implies a complementary relationship. However, in the earlier periods, the firm is concern with not only the price risk associated with the leftover inventory but also that associated with future cash flows, the aggregate price risk may drive the firm to raise inventory level in the absence of financial hedges, which implies a substitute relationship.
We next analyze the effects of financial hedges on several financial performance indicators, namely, mean‐variance utility, mean and variance of terminal wealth.
Let For each t = 1, …, T, the mean‐variance utilities under exotic hedge are greater than that under both forward hedge and no hedge, i.e., For each t = 1, …, T, the mean values of terminal wealth under forward hedge and exotic hedge are identical but the variances of terminal wealth under forward hedge are greater than that under exotic hedge, i.e., In particular, for period T,
Proposition 6 shows that the exotic hedge results in higher mean‐variance utilities than the forward hedge and no hedge. The expected terminal wealth under exotic hedge and forward hedge are identical, which can be explained by the fact that the inventory policy under them are identical and the expected terminal wealth depends only on the inventory policy. In period T, exotic hedge and forward hedge lead to a higher inventory level and therefore a higher expected profit. In earlier periods, since the optimal inventory levels under no hedge may be higher (by Proposition 4), which leads to higher expected profits, it is unclear whether the expected values of terminal wealth evaluated in earlier periods under no hedge are greater or smaller than that under financial hedges. The variances of terminal wealth under exotic hedge are smaller than that under forward hedge, which is because the exotic hedge can fully eliminate the price risk associated with future cash flows while the forward hedge can only partially hedge it. But it is unclear whether the variances of terminal wealth under financial hedges are greater or smaller than that under no hedge.
In summary, in a dynamic commodity inventory system, financial hedging can add value by substituting physical inventories that are costly due to holding costs and more efficiently offsetting the variation driven by commodity spot prices. When the derivatives market is sufficiently complete in the sense that the firm can use exotic contracts to fully hedge the price risk, exotic hedging is more advantageous to no hedge and simple forward hedge.
A Numerical Study
The numerical study compares the best time‐consistent policies and their performances in different hedging environments: (i) forward hedge, (ii) exotic hedge, and (iii) no (financial) hedge.
Specifically, we consider a four‐period setting, i.e., T = 4, with the length of each period being Δ = 1. Assume that the risk‐adjusted spot price process follows the Geometric Mean‐Reverting process 22. We use the calibration of Schwartz (1997) for crude oil price to specify the parameters of the price process as κ = 0.428, η = 2.991, σ = 0.257, γ = 0.002. The demand is expressed as

Ordering Decisions [Color figure can be viewed at

Hedging Decisions [Color figure can be viewed at

Mean‐Variance Performance of Terminal Wealth [Color figure can be viewed at
Figure 1 compares the optimal order‐up‐to levels with financial hedges or no hedge. Recall that the optimal inventory decisions under forward hedge and exotic hedge are identical. Comparing the hedged and unhedged inventory decisions, in the last decision period T = 4 the hedged order‐up‐to levels are greater than the unhedged ones whereas in the earlier periods the hedged ones are smaller. That is, financial hedges drive the firm to order more in the last period but order less in the earlier periods, which confirms our predictions in Propositions 3 and 5 and implies that inventories and financial hedges can be substitutes in dynamic environments.
Figure 2 demonstrates the structure of hedging strategies. Note that the positive (negative) sign indicates a long (short) position. In the last period, the optimal hedging policy should offset the risk associated with the carryover inventory, which implies a short position. Hence the sign is negative. But in the previous period, the hedging positions are all positive, which echoes the preceding analysis that the negative correlation between the spot price S t+1 and profit‐to‐go v t (S t+1) may drive the firm to enter a long position. Moreover, the firm will buy (long) more forwards in earlier period than that in later periods. Under the exotic hedges, the firm tends to enter short positions in call options in the earlier periods and long positions in the later periods, which reveals an opposite pattern to the corresponding forward hedges. Note that the short positions may be due to the convexity of the profit‐to‐go functions and the role of options hedges to offset the convexity. Compared to the optimal forward positions, the numbers of options are relatively smaller, which is driven by the fact that the profit‐to‐go functions decrease almost linearly.
Finally, we compare the means and variances of terminal wealth in different hedging environments. Figure 3 demonstrates the mean and variance performance for periods 0, 2, and 4. The left panels of Figure 3 show that the profit function is decreasing in spot price. The profits under forward hedge and exotic hedge are identical. In the last period, the hedged profit is slightly higher than the unhedged one, but the opposite is observed in the previous periods. This is because the order‐up‐to level under financial hedges in the last period is higher (and thus the profit is closer to the risk neutral one) but in the earlier periods the order‐up‐to levels under financial hedges are lower resulting in lower profits. Also observe that the difference between the hedged and unhedged profits is enlarged when there are more remaining periods, which implies that the value of financial hedging increases in the length of the remaining decision horizon. The right panels of Figure 3 show that in periods 0 and 1 the unhedged variances are greater than that under forward hedge and the later are greater than that under exotic hedge. In the last period, the forward hedge and exotic hedge have the same variance and the unhedged variance is smaller than the hedged variances.
Concluding Remarks
This study addresses a joint inventory and financial hedging decision problem in a dynamic mean‐variance framework for a commodity inventory system with lost sales. Recognizing the time‐inconsistency of mean‐variance criteria, we employ the dynamic programming approach to derive a time‐consistent policy. We characterize the structures of the time‐consistent policies under forward hedge and exotic hedges and compare them to the system in the absence of financial hedges. We show that as long as forward contracts are used the inventory decisions can be separated from the financial hedging decisions and myopic base‐stock policies are optimal. We also show that under exotic hedge the time‐consistent policy is indeed optimal for the original objective of maximizing mean‐variance of terminal wealth over the whole planning horizon in the initial period. Furthermore, we identify conditions under which financial hedges may lead to lower order‐up‐to levels, as contrast to the prediction of the typical one‐period models in the literature that financial hedges may lead to lower order‐up‐to level. We also compare the key financial performance indicators in different hedging environments analytically and examine our results numerically.
Our results shed new light into commodity risk management. Firstly, operational (inventory) decisions and financial hedging decisions interplay with each other. On the one hand, financial hedges allow us to decompose a dynamic inventory decision problem into a sequence of myopic decision problems, which significantly simplifies the inventory decision process. On the other hand, financial hedging decisions rely on inventory strategies, since the future cash flow depends on the inventory policies. In the corporate world, the operational and financial decisions are often made separately, partially because of the difficulty in coordinating the decisions between operational and financial managers. Our model suggests that the operational managers can still enjoy their independence (from the details of the financial decisions) while financial managers to effectively hedge need to better understand the cash flow implications of operational strategies. Secondly, financial hedges can be substitutes to inventories. In the one‐period setting, it is known that the financial hedges and inventory levels are complementary. However, in the earlier periods of a dynamic system, financial hedges may lead to lower inventory levels. That is, in the absence of financial hedges, a firm tends to order more inventory in anticipation of a higher price of purchased materials in the future. Our finding suggests that financial derivatives are better instruments to hedge the future price risk, while the inventory decisions focus on the current demand (quantity) risk which cannot be hedged by financial instruments.
Our model assumes that the financial derivatives are fairly priced so that no profit or loss is expected from financial hedging, which is common in the literature (see, e.g., Chod et al. 2010, Froot et al. 1993, Gaur and Seshadri 2005). In the real world, it is possible that there are non‐zero expected returns from the derivative trading (i.e., E t [H t (S t+1)] ≠ 0), which provides a speculative motive, in addition to the hedging motive, for the firm to use financial derivatives. In this case, using a similar dynamic programming approach, we can also develop a time‐consistent policy which allows us to separate the inventory decisions and financial hedging decisions in the presence of forward hedges. However, it is notable that the time‐consistent policy under the exotic hedge is no longer optimal for the initial objective in problem 4.
A limitation of our model is to assume that the firm has access to capital markets for borrowing and lending any amount of cash with the risk‐free interest rate without concerning bankruptcy or other financial distress costs. Although such an assumption is common in the literature for the sake of analytical tractability, addressing those market imperfections will lead to a more practical model (see e.g., Kouvelis and Zhao 2012), which will be addressed in our future research.
Footnotes
A. Proofs of Statements
Acknowledgments
This research of the second author is partly supported by National Science Foundation of China (NSFC) Grants 71671085 and 71528003, and a RGC Grant from the Research Grants Council of Hong Kong, China (Project No. CityU 11501917). This research of the third author is partially supported by NSFC Grants 71620107002 and 71771100. The authors would like to thank Professor George Shanthikumar, the department editor, the senior editor and two anonymous referees for their constructive comments.
