This paper defines a financial differential game framework to a multi-agent financial Merton Model. Unlike the Merton model based on consumption optimization, we assume that consumption expenditures are determined by agents financial commitments to consumption (and thereby, their savings and investments for future consumption). The consumption price is then defined by the aggregate demand for consumption and supply factors (rather than the utility price that each agent is willing to pay). As a result, the Financial Merton multi-agent model presented in this paper points out that consumption decisions depend on consumers strategic strengths.
Consumption and utility pricing models underlie a broad number of financial models based on the seminal Merton paper [1]. For example, Constantinidis and Duffie [2], Cochrane [3] as well as an extensive list of research papers and books have profusely pointed out to its applications. In particular, the Consumption Capital Assets Pricing Model (CCAPM) has used a consumption rationality [3,4] to define a kernel pricing model based on a single and representative agent summarized by with , a pricing kernel, the return on an initial investment of one dollar and , a filtration, summarizing all the relevant and common past information at time t. This approach, unlike the Arrow-Debreu pricing model [5–10] is utility based and therefore its probability measure is a function of the risk preferences embedded in the definition of its utility [1,10] formally stated in terms of a representative investor, k, endowed with initial wealth and optimizing both her utility of consumption and her investment policy by choosing an investment portfolio, . When the decision to consume is financial, the consumer decision is defined by a financial commitment while the price of consumption is random as it depends on other agents’ financial commitments to consume (as well as competitive, risk and supply factors). In this case, both current consumption and future wealth are random, unlike the conventional Merton model defined by the following problem which we state in its infinite horizon version: where r is a time preference for consumption, is the wealth increment of the investor’s trading strategy , and is the investor’s utility of consumption. This formulation assumes that the consumer is a price taker whose decision to consume has no effect on the prices of consumption; that there are no agents that can significantly affect the market price of consumption and finally there exists “a representative agent” whose decision defines the market demand for consumption. A number of extensions to the Merton’s fundamental model have considered various types of investors (e.g., time inconsistent investors [11]), constraints (e.g., borrowing and consumption constraints [12,13]), as well as a multi-agent framework (e.g., [14–18]). In particular, Karatzas et al. [14] reduce a multi-agent consumption/investment decision problem to the problem of finding an appropriate representative agent. Anufrieva and Dindo [15] focus on asset price dynamics in a market with heterogeneous agents. The agents do not consume and only decide whether to invest in a risk-free bond or in a risky financial asset. Using wealth dynamics as a selection device they characterize the long run asset returns and wealth distributions for a general class of investment behaviors. Gorton et al. [16] incorporate a standard agency problem into a classical asset pricing model to simultaneously endogenize the pricing kernel and the cash flows. Tapiero [17] considers a wealth heterogeneous multi-agent financial pricing CCAPM model, which is based on a distinction between what agents are willing to pay for consumption and what they actually pay, pointing to a growth of inequality among consumers. Mbodji et al. [18] study the Merton problem for the case of two investors sharing a final wealth. Each agent has a different utility function and discount factor.
Unlike previous papers, we consider the Merton problem as a stochastic differential Cournot game resulting in a unique consumption price, a function of aggregate demand for consumption. In a multi-agent and dynamic financial framework, both savings and consumption are then strategic if some consumers can affect the aggregate demand by their investments and consumption policies. In a stochastic differential Cournot game framework, consumers’ wealth matters and provides additional insights on the competitive effects of wealth and savings when the price of consumption is defined by aggregate demand and aggregate supply. Consequently, as with the conventional Cournot model, an agent in our model is strategic in the sense that her decisions alter the market price. However, since an agent does not know what the other agent’s decision is, an additional strategic risk compounds the consumer’s investment risks (and thereby her future wealth). Technically, the solution of the Cournot stochastic differential game is usually addressed with only a few agents, which leads to duopoly and oligopoly market pricing models. In this paper, due to the number of agents, we have defined the “Cournot game” as that of an individual agent (consumer) “gaming the market”. Such an approach is concurrent with the financial approach that views investors as gaming the financial market to “beat its price”. Of course, consumers that are poorly endowed financially have a negligible ability to game the market and are therefore price takers while only appreciably endowed agents are able to sway the market.
We study two cases of the problem characterized by (i) a finite market with some agents setting the price while the others are price takers; (ii) an infinitely large market which is compared with only one monopolistic agent setting the consumption price. In terms of the micro-economic consequences of the multi-agent framework, we show that, given the same level of wealth and trading strategy along with constant risk aversion, the agent (no matter how rich she is), as determined by the conventional Merton model, spends on consumption more in a transient state and less in a steady state than a strategic price-setting agent who is able to affect prices to ensure greater consumption in the long run. In terms of macro-economic consequences, we show that this strategic behavior promotes social inequality – the rich becomes richer and the poor becomes poorer. On the other hand, when the market is extremely large, we find that if the mean rate of return of investments is high – consumer strategies can lead to social equality.
The paper proceeds as follows. A financial, multi-agent, differential Merton game model is defined in Section 2 while Section 3 determines the Nash feedback equilibrium conditions and studies underlying steady-state equilibrium consumption strategies. These strategies are detailed in Section 4 and characterized by consumers with an Increasing Absolute Risk Aversion (IARA). Expected dynamics of consumers’ wealth are discussed in Sections 5. The effects of a trading (investment) strategy, inflation and market volatility on the agent wealth are considered in Section 6. Our results are summarized in Section 7.
The continuous time financial merton model
We consider an agent k, k = 1, …, K whose wealth, Wk(t), is invested proportionately, 0 ≤ sk(t) ≤ 1, for future consumption while its residual, (1 − sk(t))Wk(t), is consumed currently. An agent investment strategy consists then in a portfolio allocation which is distributed between a financial index and bonds in proportions 1 − ak(t), 0 ≤ ak(t) ≤ 1, respectively. This results in the following wealth process: In Eq. (1) Rk(t) is the rate of return on the investment portfolio: where Rf is the fixed rate of return, 𝜆 is the equity investment mean rate of return, and 𝜎 is its volatility while B (t) is a standard Weiner process. That is: With ek = Wk(1 − sk) being an agent’s financial consumption at time t and aggregate expenditure equation (3) reduces to while the price of consumption, 𝜋(t), is by definition: Assume that aggregate supply equals aggregate demand, its price 𝜋∗, is then a function of aggregate expenditure, i.e., 𝜋 = 𝜋∗(E (t)) and . For example, let the Cournot price be , where 𝛾 is the minimal price when the market is an infinitely large and 𝛽 is the price sensitivity to changing consumption. Then from (6) we have 𝛾C (t) + 𝛽 = E (t) or . Thus, and . The utility of consumption of an agent k, is therefore a function: Consequently, at any time t, the consumer chooses a control ek(t) from a set Normal consumption persists unless Wk(t) ≤ w0, where w0 is the minimal wealth preventing the consumers bankruptcy. Given a confidence level 𝜂, the propensity to avoid bankruptcy is set by a downside chance constraint: Equivalently, letting the cumulative probability distribution of Wk(t) be Fek, Wk(. ), Given an agent k initial wealth, Wk(0) = wk, k = 1, …, K, her objective is to define an admissible control process , i.e., 𝛺k-valued process satisfying (8) and (10) from a set of admissible controls 𝛤k that maximizes the agent expected discounted future utility. Agent’s k control, unlike the Merton model, depends on the cumulative effect of all agents, E. Thus, the following problems result for all agents: Below we estimate the multi-agent optimal consumption strategy.
Consumption strategies
Henceforth we omit in our notation, where it is obvious, the time dependence t. Recalling that consumer k is uninformed of others’ initial wealth, the consumer’s value function is: and by the principle of dynamic programming: where an instantaneous expected capital gain/loss from investment and consumption in (13) is With respect to (14), Eq. (13) defines equilibrium feedback expenditures in the following form: The first order condition for optimal consumption expenditure, ek = ek(wk), yields, Equation (16) implies that the optimal individual consumption is determined as in Merton’s model by equating the marginal utility of consuming the current period output, , to the marginal utility of allocating it to capital and enjoying an augmented consumption over the following periods, . The consumer’s utility, however, depends on aggregate consumption, that is, the first term in (16), (the marginal utility), is a component that makes consumer behavior strategic, where contains two terms, and . The first term is a given constant in the conventional Merton’s model while the other, is not present in the Merton’s model and is due to the effects of the aggregate consumption on the resulting price and thereby on the individual consumption of agent k. Note, the weaker the dependence of the price on the aggregate consumption , the smaller the second term becomes in (17) and the closer the multi-agent model is to the conventional Merton one.
The second order condition is: Consider a steady-state Nash solution satisfying (16) and (18). From Eq. (5) we observe that E[dWk] = dE[Wk] = 0, if In other words, a consumption equilibrium may have an expected steady state wealth which is the solution of Eqs. (19) in E[Wk]. If the steady state exists, , it also satisfies (16), i.e., if ak < 1: Comparing (19) and (20) we find and with ak < 1 we have Accordingly, the greater the market volatility the lower the expected wealth . On the other hand, if ak = 1, then from (5) we readily find that the Nash equilibrium steady state wealth is given by , where the optimality conditions (16) reduce to: which with respect to (17) results in a system of equations, Consequently, if , (22) has a feasible interior solution only if and vice versa. Therefore differentiating (22) with respect to wk, and dividing the result by , we obtain Using the Arrow-Pratt index of absolute risk aversion , we then have: Next substituting from (22), (24) is reduced to the following condition for the equilibrium consumption ek, providing equivalence between a utility-based risk aversion and a value function index of risk aversion: For example, let A (ek) = A be a constant risk aversion and Av(wk) be the consumer’s value function absolute risk aversion. Then their relationship is reduced to: i.e., a change in agent’s k financial consumption (expenditure) is proportional to the agent’s index of risk aversion of the value function (as defined by the agent’s Hamilton-Jacobi equation). With respect to and Ak(ek) = A, the value function at a steady state satisfies the ordinary differential equation, whose solution is where C1 > 0 and C2 > 0 are the integration constants determined from the HJB (15).
Given Vk satisfying (28), the Nash equilibrium steady-state controls (as well as, states ) are found from the system of K equations (22) by accounting for : or equivalently: Note, if a steady state exists, , then (or equivalently dividing by , if − A < 0, which always holds unless the agents are risk-loving). That is, the derivative of the left-hand side of (30) with respect to ek is negative, the same holds for the derivative of the right-hand side.
Let K = 2, i.e., equations (30) result in and consider two types of agents: price-setting and price-taking. Unlike price-setting (strategic) consumers and similar to the representative agent defined by the Merton’s model, price takers assume they do not affect the market, . Note, that . Therefore implies, given identical conditions, the left hand side of (29) is smaller for a price-taking agent when we consider Cournot pricing, . We thus observe a number of possibilities. If an agent becomes price-taking, then to balance the decreased left-hand side of (30) she needs to reduce ek when risk-aversion A is either relatively low or, inversely, relatively high, i.e., . Otherwise, moderate risk aversion will imply increased ek by the price taker.
Let all agents be characterized by a constant risk aversion A. If A is either relatively low or high, given the same wealth and trading strategy, price setting agents spend more and are expected to achieve a greater steady-state wealth than agents that are price taking . On the other hand, it is vice versa under a moderate risk aversion.
Proposition 3.1 implies that given the same initial wealth and trading strategy, under relatively low/high risk aversion and Cournot pricing property, price-setting consumers would consume less than price-taking consumers at transient states which will enable them to become richer in the long run and enjoy a greater consumption at a steady state. Of course, price-setting agents are normally endowed with much higher wealth. Therefore, they would not need to hold back their expenditures even at the transient states to grow richer. On the other hand, when risk aversion is moderate the price-setting agents will spend more in the transient state and if they are initially rich may still arrive at a greater wealth in the long run than the price-taking agents.
Let again the Cournot price be . Then and . Consider a general utility function under the Cournot pricing and ak < 1. Then and . Since constraint (10) prevents the consumers bankruptcy, when the number of consumers K and thereby consumption E grow, we readily obtain the following result.
Let V solve (15) in Vk. If K tends to infinity, constraint (10) is not binding, and The wealth consumption equilibrium by consumer k, is then: Otherwise, if the chance constraint (10) is binding, an equilibrium consumption ek is determined by solving .
Proposition 3.2 implies an equilibrium solution when the market is large. In this case, its Jacobian matrix is simplified and the stability of the steady state defined by (19) is straightforwardly determined by differentiating the expected value of (19) with respect to E[Wk] and requiring the result to be negative at its steady state: which holds for Wk > 0 when . This implies that if (34) holds and , then (E[Wk] − E[ek])(1 + 𝜆 + ak(Rf −𝜆)) − E[Wk] > 0, and therefore, and the expected wealth of this consumer tends to . Otherwise if then and the expected wealth of this consumer tends to , as well. Based on Proposition 3.2, these results are summarized as follows.
If either and , or and , then an expected steady state wealth, , exists but it is not necessarily stable.
If , then the steady state is stable and consumer’s wealth is expected to tend to it. Otherwise, it is unstable, if , then the consumers with , will enrich, while those with , will become poorer.
Proposition 3.3 implies that there are four possibilities related to the signs of and . These possibilities determine the evolution of an agent’s wealth and depend on the choice of the Cournot price, 𝜋 = 𝜋(C) and the utility, U (ck). We shall show that all four possibilities occur under the same utility function when the market is large.
Recalling (31), we note that the sufficient optimality condition can be expressed in terms of a consumers risk aversion. Specifically when dividing (31) by and employing the Arrow-Pratt index of absolute risk aversion, we have Therefore, the more an agent is risk averse, the more likely the sufficient condition (31) defining a unique solution of (32) holds. The example below is based on an Increasing Absolute Risk Aversion (IARA) and therefore emphasizes the existence of a solution to agent k consumption expenditure.
Consumption under an IARA
Consider a quadratic consumption utility function , where n is a parameter, . Then , and the interior equilibrium (32) is: Consequently, the dynamic programming (HJB) Eq. (15) is reduced to: When ek = 0, it simplifies into whose solution is: where , are integration constants and Assume next a quadratic form for the value function, . Consequently, an interior solution, ek, is determined by (36) and the HJB equation transforms into: which is solved by comparing the corresponding terms on both sides of (40). Introducing the notations, a kth consumer with current wealth wk = w, ak = a, 𝜒k = 𝜒, , , and , has then a linear wealth consumption expenditure ek = e: Note, when m1 < 1 and m0 < 0, we have e < w, if . On the other hand, when , we can set w0 = 0. Next, e = 0 when . Further, the integration constants A1, A2 in (39) are found by requiring that the value function is continuous at w∗ and its derivative is continuous at this point as well: with This ensures the value function is twice differentiable except for a single point of w = w∗ (where its first derivative is continuous). Since Ito’s Lemma is valid for functions with a second derivative that have a finite number of discontinuities, we conclude the following theorem.
If and , then the consumer’s k value function and the equilibrium expenditure are
If and , then the consumer’s k value function is given by and the equilibrium financial consumption expenditure is:
Otherwise, if and , then: and the financial consumption equilibrium is:
Expected equilibrium and the evolution of wealth
We next study the behavior of an individual consumer, k, and for simplicity omit index k, i.e., Wk = W, ak = a, , and so on. First, we consider the steady state of the expected consumer’s wealth, E[dW] = 0 under an interior equilibrium solution. From (4) and (42): Consequently, the steady state of expected wealth is: This steady state is stable if: Let . Based on (50)–(53), Proposition 5.1 results.
Case: m0 < 0.
If , then a steady state exists and it is stable, ; Otherwise, if , then , i.e., and there is no feasible steady state.
Case: m0 > 0.
If , then , there is no feasible steady state and .
Otherwise, if , a steady state exists, but it is unstable, .
then We next illustrate two types of market equilibria with function L.
Effect of the trading strategy, inflation and market volatility
To illustrate switching between two possibilities m0 < 0 and m0 > 0, we consider numerical examples. According to Proposition 5.1, m0 < 0 may induce a stable equilibrium of social equality while the opposite, m0 > 0, may lead to an unstable equilibrium with a constantly growing inequality. Let 𝛾 = 5, n = 3, r = 1.13, a = 1∕2, 𝜆 = 1.16 and Rf = 1.06, then and . This environment is characterized by a significant discounting rate. Figures 1–2 below illustrate for this case, the effect of uncertainty on a growing disequilibrium in inequalities. Specifically, Fig. 1 shows that 0 < m1 < 1 and m0 > 0, while a negative L observed from Fig. 2(b) illustrates a stronger condition for m1: , that is, the equilibrium conditions resulting in a growing inequality.
Furthermore, as market volatility increases, m1 increases and m0 decreases which results in greater consumption, i.e., greater uncertainty induces greater consumption under the same wealth w. The threshold wealth, , also increases with uncertainty (see Fig. 2(a)). Accordingly, the number of consumers who are expected to become poor with increases with uncertainty and the number of those who grow increasingly rich, , decreases.
In contrast to the previous case, a stable social equality is depicted in Figs 3 and 4. All the parameters are the same, except for the future consumption which is discounted at a smaller rate, with r = 0.13. Then and .
Figure 3 shows 0 < m1 < 1 and m0 < 0, while L is positive (see Fig. 4(b)). As market volatility increases, both m1 and m0 increase, implying that greater uncertainty induces greater consumption under same wealth w. Unlike Fig. 2, the expected steady state wealth to which all consumers tend declines with uncertainty (see Fig. 4(a)).
Conclusions
This paper has provided a financial approach to consumption. In a multi-agent framework, the approach is based on financial outlays devoted to consumption while consumption depends on all agents demands as well as market supplies. As a result, multi-agent consumption is a strategic problem implying a potential extension of the fundamental Merton model to include a number of elements such as: “consumers wealth”, “consumption pricing” as well as social consequences arising by large and small consumption agents along with their effects on economic and social inequalities.
We show that, given the same level of wealth and trading strategy, the agent determined by the conventional Merton model, spends on consumption less in a steady state under relatively low/high risk aversion than a strategic price-setting agent who accounts for the aggregate consumption and thereby for the resulting market price. In such a case, strategic agents initially hold back their expenditures by accounting for the overall aggregate effect of all agents when making their individual decisions. This ensures greater wealth and consumption in the long run compared to the conventional price-taking agents. Moderate risk aversion, on the other hand, may have quite the opposite effect. Taking into account that price-setting agents are typically endowed with much greater wealth, however, even moderate risk aversion may still ensure greater wealth and consumption in both the long run and transient state. As a result, this strategic behavior promotes social inequality – the rich becomes richer and the poor becomes poorer. Notably, higher uncertainty, only deepens the gap.
Under the assumption of a large financial market we have derived policies a consumer might follow under various circumstances. In particular, in a financial market with extremely high mean return rate from risky asset - a stable equilibrium of social equality may exist. When returns are low and/or inflation is high, wealth distribution is unstable with increasingly growing inequality between consumers.
References
1.
MertonR.C., Theory of rational option pricing, Bell Journal of Economics and Management Science4 (1973), 141–183.
2.
ConstantinidisG.M. and DuffieD., Asset pricing with heterogeneous consumers, The Journal of Political Economy104(2) (1996), 219–240.
3.
CochraneJ.H., Assets Pricing, Princeton University Press, Princeton, 2001.
4.
LucasR., Asset prices in an exchange economy, Econometrica46 (1978), 1429–1446.
5.
ArrowK.J., Social Choice and Individual Values, Wiley, New York, 1951.
6.
ArrowK.J., Alternative approaches to the theory of choice in risk-taking situations, Econometrica19(4) (1951), 434–437.
7.
ArrowK.J., Aspects of the theory of risk bearing, YRJO Jahnsson lectures, also in 1971 Essays in the Theory of Risk Bearing, Markham Publishing Company, Chicago (1963).
8.
ArrowK.J. and DebreuG., Existence of an equilibrium for a competitive economy, Econometrica22(3) (1954), 265–290.
9.
DebreuG., The Theory of Value: An Axiomatic Analysis of Economic Equilibrium, Yale University Press, 1959.
10.
DebreuG., Economics under uncertainty, Économie Appliquée (1960).
11.
Marín-SolanoJ. and NavasJ., Consumption and portfolio rules for time inconsistent investors, European Journal of Operational Research201 (2010), 860–872.
12.
FlemingW.H. and ZariphopoulouT., An optimal investment/consumption model with borrowing constraints, Mathematics of Operations Research16 (1991), 802–822.
13.
LehoczkyJ.SethiS. and ShreveS., Optimal consumption and investment policies allowing consumption constraints and bankruptcy, Mathematics of Operations Research8(4) (1983), 613–636.
14.
KaratzasI.LehoczkyJ.P. and ShreveS.E., Existence and uniqueness of multi-agent equilibrium in a stochastic, dynamic consumption/investment model, Mathematics of Operations Research15 (1990), 80–128.
15.
AnufrievaM. and DindoP., Wealth-driven selection in a financial market with heterogeneous agents, Journal of Economic Behavior & Organization73 (2010), 327–358.
16.
GortonG.B.HeP. and HuangL., Agency-based asset pricing, Journal of Economic Theory (2013). doi:10.1016/j.jet.2012.09.017.
17.
TapieroC.S., The financial CCAPM and economic inequalities, Quantitative Finance15(3) (2015), 521–534.