A Sales Forecast Model for Short‐Life‐Cycle Products: New Releases at Blockbuster

Abstract

We develop, in this article, a sales model for movie and game products at Blockbuster. The model assumes that there are three sales components: the first is from consumers who have already committed to purchasing (or renting) a product (e.g., based on promotion of, or exposure to, the product prior to its launch); the second comes from consumers who are potential buyers of the product; and the third comes from either a networking effect on closely tied (as in a social group) potential buyers from previous buyers (in the case of movie rental and all retail products) or re‐rents (in the case of game rental). In addition, we explicitly formulate into our model dynamic interactions between these sales components, both within and across sales periods. This important feature is motivated by realism, and it significantly contributes to the accuracy of our model. The model is thoroughly tested against sales data for rental and retail products from Blockbuster. Our empirical results show that the model offers excellent fit to actual sales activity. We also demonstrate that the model is capable of delivering reasonable sales forecasts based solely on environmental data (e.g., theatrical sales, studio, genre, MPAA ratings, etc.) and actual first‐period sales. Accurate sales forecasts can lead to significant cost savings. In particular, it can improve the retail operations at Blockbuster by determining appropriate order quantities of products, which is critical in effective inventory management (i.e., it can reduce the extent of over‐stocking and under‐stocking). While our model is developed specifically for product sales at Blockbuster, we believe that with context‐dependent modifications, our modeling approach could also provide a reasonable basis for the study of sales for other short‐Life‐Cycle products.

Keywords

sales forecasting retail operations new‐product diffusion buyer behavior DVD movies stochastic model applications

1. Introduction

Forecasting sales is an important task for most firms. This task can in fact be critical for the survival of companies that deal with innovative, very‐short‐life‐cycle products. Blockbuster is one such company in the rentable‐DVD and game‐media industry. With over 6500 stores in 18 countries (4018 in the United States including franchisees), its annual revenue for fiscal year 2009 is over $4.06 billion, of which more than 60% is attributed to DVD rentals (Blockbuster Incorporated 2009). Industry wide, DVD movie rental and retail (i.e., outright product purchases) activity generated over $24.1 billion in sales in 2006, and rental and retail sales for game software exceeded $6.5 billion annually (Blockbuster Incorporated 2006). Our objective in this article was to develop an effective sales forecast model for short‐life‐cycle products, such as those found at Blockbuster.

The primary sales characteristic of products in the DVD and electronic game industry is that they have highly compressed life cycles. The short active life span is inherent in any entertainment media product. It is customary for products in this industry to realize the majority of sales in the first few weeks they are offered. This high initial sales volume is typically followed by quickly declining sales in subsequent weeks. However, it is also not uncommon to find products that have sales trajectories which exhibit either a classical diffusion pattern (Bass 1969, Mahajan et al. 2000, Niu 2002, Rogers 2003), manifested as a later period increase in sales, or some other characteristics that are altogether different, such as post first period sales spikes. Thus, the DVD and electronic game industry can be viewed as one with highly compressed product life cycles that have little correlation to traditional product seasons. Furthermore, as each product is unique and original, no reliable benchmarks can be used to predict sales of specific products. As a result of these difficulties, the industry has developed an innate, deep‐rooted skepticism regarding the use of decision support systems. This backdrop provides a strong motivation for the development of reliable forecasting approaches.

The challenge in forecasting sales in this industry can be illustrated by several generic product examples. For newly released DVD and game products (retail or rental), weekly sales for many titles follow what appears to be a pure exponential‐decay pattern. In Figure 1, the titles “Charlotte's Web” and “Because I Said So” both exhibit steadily declining sales. Notice, however, that there is a subtle difference between the two, in that the latter has a heavier tail for an extended duration, suggesting that some “hidden” sales components may be at play. With somewhat less frequency, there also exist sales patterns that are much different from exponential. In Figure 2, sales for the titles “Aurora Borealis” and “WII Prince of Persia” are interesting examples where the second‐period sales are significantly higher than those of the first. Moreover, while there exist signs of an exponential decay after the second‐period spike, sales in later periods appear to be primarily driven by some “non‐apparent” forces. These disparate sales patterns clearly demonstrate that accurate sales forecasting is difficult.

Figure 1

Exponential Sales Pattern

A further challenge in this environment is that it is important to get the forecast right in the first several periods of product sale. If the company under‐forecasts in the early periods, it would experience lost sales that, due to the competitive nature of the industry, could not be recouped in subsequent periods. In other words, any instance of under‐forecasting in the first few periods would have a negative impact on the profitability of a product. Conversely, any over‐forecasting would result in poor product utilization and hence unnecessary stocking and holding costs. As there are roughly 55 new releases at Blockbuster every week, accurate and timely sales forecasts can significantly reduce operating costs.

Figure 2

Non‐Exponential Sales Pattern

In this article, we will first formulate a sales model for movie and game products at Blockbuster and then, after validation, use the model to develop sales forecasts.

Our model formulation is based on a decomposition of the total sales into three components. The first component is due to consumers who are strongly committed to purchasing the product. The second is due to consumers who are potential buyers (whose purchase decisions could be influenced by previous buyers). The third is due to either a networking effect from previous buyers on their closely tied social groups or re‐rents. This decomposition is motivated by and helps explain the observed diverse sales patterns at Blockbuster. Another important feature of our model is that we allow dynamic interactions between these sales components within each sales period as well as period‐by‐period updates of model parameters.

The sales model is tested against extensive sales data from Blockbuster. It fits actual sales extremely well. With the model validated, we also demonstrate that when used in conjunction with publicly available environmental data, such as theatrical sales, studio, genre, MPAA ratings, etc., it is capable of delivering reasonable sales forecasts. We further develop a method that can be used to adjust and improve the initial forecasts based on actual first‐period sales, which can help Blockbuster to respond to any discrepancies in the initial order. As noted previously, excessive initial over‐ or under‐stocking of products has negative cost consequences. Therefore, our forecasting method has the potential to enhance revenue and reduce cost for Blockbuster.

Our work provides several useful contributions to the extant literature (see section 2). The explicit formulations of three sales components, the interaction between these components, and the dynamic parameter updates, are novel. When coupled with good quality input data, our model can be used to conduct sales forecasts, which could lead to better purchasing and inventory decisions for Blockbuster's retail operations (see section 4.4). Finally, the sales decomposition is quite generic, and therefore, can be used in the modeling of sales for short‐life‐cycle products in other scenarios.

The remainder of this article is organized as follows. In section 2, we review some of the available literature. Sections 3 and 4 are devoted, respectively, to model development and to an empirical study on the performance of our model, using actual sales data from Blockbuster. Section 5 contains concluding remarks.

2. Literature Review

Numerous approaches for forecasting sales have been studied in the literature. Most of the methods are developed for settings that are not specific to short‐life‐cycle products, and a significant number of them are based on the Bass Model (BM; Bass 1969). We will therefore begin with a brief review of the BM, focusing on its main idea and some related variants. This is followed by a summary of other methods. We then conclude this section with a discussion of what we consider as important modeling features for short‐life‐cycle products.

Bass originally envisioned a single large potential‐adopter population. In his model, it is assumed that each potential adopter in this population has an instantaneous adoption (i.e., purchasing) rate that depends on two forces. The first force is due to an intrinsic interest in the given product, independent of the number of previous adopters, and the second is due to a positive influence from previous adopters. This leads to a differential equation (involving two parameters denoted by p and q that correspond to these two forces) for the fraction of potential adopters who would have adopted by time t, for t ≥ 0 (Bass 1969, p. 217, or Niu 2006, p. 679). For a given “large” population size, the solution of this differential equation then yields an S‐shaped cumulative‐sales curve. As this solution is deterministic, a sequence of independent and identically distributed (i.i.d.) error terms is finally added to this model to yield successive sales over given time periods (of same duration).

Bass interprets these aforementioned forces as originating from the distinction between “innovators” (or early adopters) and “imitators” (or late adopters). Interestingly, an earlier 1962 edition of Rogers's text actually provided the motivation behind his formulation (see Bass 1969, p. 215). This primitive notion of innovators vs. imitators can be formalized in a number of different ways. Some examples of these can be found in a review by Hardie et al. (1998), and in related work by Mahajan et al. (1990), Niu (2002, 2006), Rogers (2003), Schmidt and Druehl (2005), Tanny and Derzko (1988), and Van den Bulte and Joshi (2007). As will be seen in section 3 below, the formulation of our model, which extends Niu (2006), is also related to this notion (see the committed‐ and potential‐buyer populations in section 3).

In their review of the general literature, Hardie et al. (1998) surveys eight forecasting methods. They test all eight methods against 19 data sets from a variety of products to determine which methods work better, and under what settings. The products in their data sets are items such as shelf‐stable juices, cookies, salty snacks, and salad dressings. Their results show that the BM performs well compared with other forecasting methods.

Rogers (2003) and Mahajan et al. (1990) present distinctly different approaches on how to treat different populations. Bass conceptualized the innovator and imitator populations as purchasing concurrently throughout the life cycle. In contrast, Rogers (2003) conceptualized these populations as being time sequential, meaning that early purchases were due to one population and later purchases were due to the other. In other words, the two populations do not have concurrent purchase activity. Similar to Rogers (2003), Mahajan et al. (, p. 1990 describe how the innovator and imitator populations coalesce into time sequential purchasing populations. A related comparison of these formulations can also be found in Mahajan et al. (2000), where the authors further discuss the innovators‐vs.‐imitators concept.

Another interesting fact regarding the Bass formulation is that it does not segregate the innovator and imitator populations. In an attempt to better understand the effect of having an explicit segregation of these two populations, Tanny and Derzko (1988) propose and test a “two‐compartment” model. They conclude that their model does not lead to superior empirical performance over the BM.

Continuing the spirit of Tanny and Derzko (1988) and other related work (e.g., Steffens and Murthy 1992), Van den Bulte and Joshi (2007) studied a model that has two distinct market segments, namely influencers and imitators (analogous to innovators and imitators). They assume that these populations make purchases concurrently and that there is an asymmetric influencing effect from the influencers onto the imitators. Both these features (i.e., having two concurrent populations and allowing interactions between the populations) are desirable improvements over previous work.

We now provide a summary of a number of other forecasting models. Several of them are developed for music titles and theatrical movie releases, which are in some regards similar to Blockbuster's products. In particular, they also have short life cycles.

Garber et al. (2004) present a model for the rate of product sales based on the notion of a “localized sales density.” The authors posit that if the density of product purchasers within a geographical area is high enough, then a word‐of‐mouth effect will become very strong, resulting in what they refer to as a “contagion process.” Conversely, if the density of purchasers within a geographical area is low, then momentum behind product sales diminishes rapidly.

Jedidi et al. (1998) analyzed box office releases using an exponential‐decay model. They developed a technique that categorized a new movie as a member of one of four mutually exclusive movie clusters, where the clusters differ in opening strength and decay rates.

Moe and Fader (2001) studied weekly music sales, also using an exponential‐decay model. Similar to Jedidi et al. (1998), they associated each product with a “cluster,” but they developed a more rigorous approach. In Moe and Fader's model, each cluster had two parameters, namely a constant rate of purchase and a market‐penetration level. These parameters were estimated from past performance of products in that cluster. Sales forecasting then involves the classification of to which cluster a new product belongs, and applying the parameters from that cluster.

Sawhney and Eliashberg (1996) modeled an individual's time to see a movie as the sum of two random variables, namely the time to decide to see the movie and the time to act on the decision. The authors also assumed that these time intervals are independent, meaning that the time it takes to decide does not influence how fast an individual will act and vice versa. From their empirical analysis, the authors observed that it was reasonable to assume that the distributions of these time intervals are exponential, Erlang, or gamma. Indeed, the authors had good success at forecasting sales given only two or three actual sales data points.

Eliashberg et al. (2000) developed a model called MOVIEMOD to forecast box office movie performance. In this model, it is assumed that, prior to the release of a movie, individuals (potential consumers) can be in one of six possible “behavioral” states, namely undecided, considerer, rejecter, positive spreader, negative spreader, and inactive (Mahajan et al. 1984). All consumers begin in the undecided state and may evolve over time to other states according to an “interactive” Markov chain (Conlisk 1976). The formulation by Eliashberg et al. (2000) reflected the impact of word‐of‐mouth interactions between individuals, marketing activities, and movie experience. Their model was calibrated using consumer clinic experiments, and the resulting forecasts worked quite well.

Lee et al. (2003) and Ainslie et al. (2005) both utilized a hierarchical Bayesian framework for forecasting sales. In Lee et al. (2003), a model was developed to forecast music album sales prior to an album's release; and in Ainslie et al. (2005), a model was developed to forecast movie theatrical performance when consumers have the ability to choose from one of multiple movies. Both papers allowed for dynamic population sizes and sales updates between periods as actual data becomes available, and this was made possible by the Bayesian framework.

Lastly, in Lilien et al. (1981), Hahn et al. (1994), and Fader et al. (2004), a variety of forecasting techniques for consumable products (for example, peanut butter) were considered. For such products, there existed a significant amount of repeat purchases. Clearly, demands for consumables were quite different from that of innovative, short‐life‐cycle products, such as DVD or game products.

The prior work cited above is clearly valuable in furthering research in sales forecast models. However, our primary focus in this article was on movie and game products, which have short life cycles. For such products, we believe that it is important for the model to capture three main features: (i) formulation of multiple consumer sub‐populations, (ii) allowing interactions between consumer sub‐populations, and (iii) having dynamic parameter updates over time, both within each sub‐population and across different sub‐populations. Feature (i) is similar in spirit to Tanny and Derzko (1988); in our setting, the primary rationale is that some consumers may have already been exposed to the product (or product information) and therefore have formed an opinion that is strong enough to alter their purchase behavior. Both (ii) and (iii) are, similar to the BM, motivated by the fact that existing buyers of a product have a dynamic influence over the entire residual population; moreover, these two features are of particular importance for short‐life‐cycle products because such products typically have rapidly changing sales trajectories. To the best of our knowledge, a sales forecast model that satisfactorily addresses these features does not appear to be available in the literature. This perhaps is due in part to the difficulty in the formulation of features (ii) and (iii).

In this article, we will develop a sales model that incorporates all three features above. The construction of our model relies closely on recent stochastic extensions of the BM by Niu (2002, 2006). The model is presented in the next section.

3. The Model

To facilitate understanding, we begin with some informal motivation. Our basic proposition is that DVD and game sales for both rental and retail products can be effectively modeled by considering three sales components that are derived from two distinct consumer populations.

First, we assume that there is a sales component due to a segment of consumers who have already decided to purchase (or rent) a product after its release (possibly due to prior exposures to the product, e.g., the theatrical release of a movie or prelaunch advertising). We refer to this segment as committed buyers. As committed buyers have already made their decisions a priori, we will further assume that the timing of their purchases is independent of the rest of the population. More specifically, we define each period as a week, and postulate that the trajectory of weekly sales from this segment of consumers decays exponentially over time according to an intrinsic purchase rate. This is illustrated in Figure 3 as the committed‐buyers curve.

Figure 3

Decomposition of Total Sales

Second, we assume that there is a sales component due to a segment of consumers whose purchase timing follows a bell‐shaped diffusion curve. We will refer to this segment of consumers as potential buyers. In contrast to the committed buyers, we assume that successive weekly sales from this segment of consumers depend on a pair of recursively updated purchase rates, both of which are tied to the extents of the intrinsic interest in the product and of the force of the unidirectional influencing effect from an existing buyer. A typical sales trajectory of this segment of consumers is illustrated by the potential‐buyers curve in Figure 3.

Lastly, we assume that there is a tertiary sales component due either to the effect of networking within a closely tied group of consumers in the case of DVD rentals and all retail products, or to re‐rents in the case of game rentals. This gives rise to two model variants, which we will refer to as the Networking Model and the Re‐Rent Model.

In the Networking Model, we assume that there is a networking effect, which explicitly takes into account the fact that current‐period sales may have a stronger impact in the subsequent period on the purchase activity of other potential buyers within their respective social groups. This is reasonable for retail and DVD rental products. The trajectory of this third sales component is shown as the networking/re‐rent effect curve in Figure 3. This curve exhibits no first period sales, a second period sales spike (which is due to the networking effect from those who bought in the first period), and finally an exponential‐looking decay. Note that the final decay is an aggregation of the impacts of the current sales from both the committed buyers and the potential buyers, and the precise trajectory of this decay is dependent on the relative sales strengths of these two populations.

In the Re‐Rent Model, which is for game rentals, we replace the networking effect by a re‐rent effect. Specifically, we postulate that a fraction of a given period's new renters will re‐rent in the subsequent period, and that the impact of re‐rents significantly dominates that of the networking effect, so that the latter could be ignored. These assumptions are reasonable because the intensity or difficulty of a game may entice a substantial number of new renters to rent again. The sales trajectory from this re‐rent effect is analogous to that from the networking effect, and therefore is also displayed as the networking/re‐rent effect curve in Figure 3.

When combined, these three sales components yield the total‐sales curve, which is also shown in Figure 3. In general, the resulting composite sales curve could exhibit some combination of (i) an initial peak (due to a high volume of sales to the committed buyers), (ii) a second period spike (due to the networking or re‐rent effect from the first period), and (iii) a late‐period uptick (due to the sales peak of the potential buyers).

We will present the detailed formulations of the Networking and the Re‐Rent Models in sections 3.1 and 3.2, respectively.

3.1. The Networking Model

The Networking Model formulated here is for DVD and game retail, and for DVD rental products. We will assume that, for each of the periods i = 1,…,N, where N is a fixed horizon, there are three sales components,

S_{ci}

S_{pi}

, and

S_{ni}

. The variable

S_{ci}

denotes sales from the committed‐buyer population,

S_{pi}

from the potential‐buyer population, and

S_{ni}

from the networking effect. All three variables are assumed to be random. In addition, we assume that, for each i, there is an independent random error

ϵ_{i}

, which captures all other sales that are due to unaccounted exogenous influences; furthermore,

ϵ_{i}

is, for all i, normally distributed with mean 0 and variance

δ^{2}

The total sales for period i,

S_{i}

, is given by

S_{ci} + S_{pi} + S_{ni} + ϵ_{i}

. Our aim is to understand

{S_{i}, i = 1, \dots, N}

. In principle, one could attempt to formulate and study a detailed vector process

{(S_{ci}, S_{pi}, S_{ni}, ϵ_{i}), i = 1, \dots, N}

. However, as the components of this vector process are dependent in extremely complicated ways, it seems unlikely for such an approach to yield tractable results.

Now, define

μ_{i} \equiv E [S_{ci} + S_{pi} + S_{ni}]

and

ψ_{i}^{2} \equiv Var [S_{ci} + S_{pi} + S_{ni}]

; and let

σ_{i}^{2} = ψ_{i}^{2} + δ^{2}

. Then, we have

E [S_{i}] = μ_{i}

and

Var [S_{i}] = σ_{i}^{2}

. This suggests that a reasonable approach is to work with an approximate heteroscedastic nonlinear regression model that strives to preserve both the means and the variances of the

S_{i}

's. Therefore, for i = 1,…,N, we will simply make the assumption that

S_{i} = μ_{i} + σ_{i} Z_{i},

where

{Z_{i}, i = 1, \dots, N}

is a sequence of i.i.d. standard normal random variables.

The complete specification of (1) requires that we provide a full set of values for the

μ_{i}

's and the

σ_{i}

's. Define

μ_{ci} \equiv E [S_{ci}]

μ_{pi} \equiv E [S_{pi}]

, and

μ_{ni} \equiv E [S_{ni}]

, and note that

μ_{i} = μ_{ci} + μ_{pi} + μ_{ni}

. The core of our formulation is the development of a set of recursive schemes that iteratively compute, for every i, a reasonable estimate for each of the means

μ_{ci}

μ_{pi}

, and

μ_{ni}

and for the variance

σ_{i}^{2}

, based on the historical estimates of the means of all sales components prior to period i. This approach is similar in spirit to the expected‐history piecewise (i.e., period‐by‐period) diffusion model in Niu 2006, p. 686, section 3.2), and is an extension of the formulation there to the present setting of a vector series of dependent sales components. We will develop the scheme for

{(μ_{ci}, μ_{pi}, μ_{ni}), i = 1, \dots, N}

in 3.1.1 and then in 3.1.2, that for

{σ_{i}^{2}, i = 1, \dots, N}

. A summary of our model will be given in 3.1.3.

3.1.1. Formulation of the Means

We will start with the

μ_{ci}

's. First, we define the market ceiling m as the size of the total population of possible buyers of a product. To simplify discussion, we will focus on DVD movie products (both rental and retail). At the Digital Entertainment Group's website, it is reported that the number of US households with a DVD player is estimated at 88 million (Digital Entertainment Group 2007). Therefore, m will be set at 88,000,000. (The corresponding settings for game products are given in 3.2, Table 1.) Next, we assume that the fraction of the market ceiling that is considered to be committed buyers is

π_{c}

. This implies that the size of the committed‐buyer population is given by

m π_{c}

Table 1
Game Platform Sizes

Platform Console units ( $10^{3}$ )

XBox 360 17,200

PS3 7400

Wii 5600

PS2 35,000

GCube 9000

NDS 5000

PSP 5000

Platform	Console units ( $10^{3}$ )
XBox 360	17,200
PS3	7400
Wii	5600
PS2	35,000
GCube	9000
NDS	5000
PSP	5000

To evaluate

μ_{c 1}

, we will assume that the time to product purchase for every committed buyer is independent and is exponentially distributed at rate

α_{c}

; that is, for all t ≥ 0, the probability for a committed buyer to make a purchase no later than t is given by

F_{c} (t) \equiv 1 - \exp (- α_{c} t)

. As the first period has duration 1, this implies that the fraction of committed buyers who will make their purchase in period 1 is given by

F_{c} (1) = 1 - \exp (- α_{c})

. Therefore,

μ_{c 1} = m π_{c} F_{c} (1)

Beyond period 1, we shall find it convenient to define

r_{ci}

, for i = 1,…,N, as the expected size of the residual committed‐buyer population at the beginning of period i. Observe that, by definition, we have

r_{c 1} = m π_{c}

and hence

μ_{c 1}

can be rewritten as

r_{c 1} F_{c} (1)

. More generally, it should be clear that, for each i,

μ_{ci}

depends on

r_{ci}

, and that

r_{ci}

, in turn, depends on

μ_{c 1}, \dots, μ_{c, i - 1}

, i.e., on the “past”

μ_{ci}

's. We will therefore develop a set of joint recursions for

{(r_{ci}, μ_{ci}), i = 1, \dots, N}

As a generic example, let us consider period 2. Observe that, at the start of period 2, the mean size of residual committed buyers

r_{c 2}

is equal to the mean size of the previous period's residual

r_{c 1}

minus the mean number of sales to committed buyers in that previous period; that is, we have

r_{c 2} = r_{c 1} - μ_{c 1}

. Next, paralleling

μ_{c 1} = m π_{c} F_{c} (1)

, we have

μ_{c 2} = m π_{c} [F_{c} (2) - F_{c} (1)]

, where

F_{c} (2) - F_{c} (1)

is the probability for a committed buyer to make the purchase in period 2. To create a recursion, we will rewrite this expression as

μ_{c 2} = m π_{c} [1 - F_{c} (1)] \frac{F_{c} (2) - F_{c} (1)}{1 - F_{c} (1)} .

Because

m π_{c} = r_{c 1}

m π_{c} F_{c} (1) = μ_{c 1}

, and

r_{c 1} - μ_{c 1} = r_{c 2}

, we have

m π_{c} [1 - F_{c} (1)] = r_{c 2}

. Moreover, as

[F_{c} (2) - F_{c} (1)] / [1 - F_{c} (1)]

is the conditional probability for a committed buyer to make the purchase during period 2, given that it is made after period 1, we have, from the memoryless property of the exponential distribution,

[F_{c} (2) - F_{c} (1)] / [1 - F_{c} (1)] = F_{c} (1)

. Hence,

μ_{c 2} = r_{c 2} F_{c} (1)

Repeating this argument then yields that for 1 ≤ i ≤ N,

μ_{ci} = r_{ci} F_{c} (1),

where

r_{ci} = m π_{c} - \sum_{j = 1}^{i - 1} μ_{cj}

(when i = 1, the sum is interpreted as 0). Note that, to facilitate computation, (3) can also be written as

r_{c 1} = m π_{c}

and, for i ≥ 2,

r_{ci} = r_{c, i - 1} - μ_{c, i - 1} .

We now consider (2) as given and turn to the formulation of the

μ_{pi}

's and the

μ_{ni}

's, jointly. These two sets of means need to be developed together because, as a result of the one‐way influencing effect from those who have already purchased the product, the

S_{pi}

's and the

S_{ni}

's depend not only on each other but also on the

S_{ci}

's.

Paralleling the committed‐buyer population, we will assume that the fraction of the market ceiling m that is considered to be potential buyers is

π_{p}

. This implies that the size of the potential‐buyer population is

m π_{p}

For 1 ≤ i ≤ N, let

r_{pi}

be the expected size of the residual potential‐buyer population at the beginning of period i. Again, we shall find it convenient to work with a set of joint recursions for

{(μ_{ni}, r_{pi}, μ_{pi}), i = 1, \dots, N}

We begin with period 1. We will first set

S_{n 1}

and hence

μ_{n 1}

to 0; this is because, at the start of period 1, no one has yet purchased the product and therefore there is no networking effect. Next, we have

r_{p 1} = m π_{p}

by definition. To evaluate

μ_{p 1}

, we need to develop an analogue of

F_{c} (t)

that describes the timing of product purchases by potential buyers. Our formulation will be based on the stochastic Bass Model (SBM) in Niu (2002). The basic idea is to assume that

S_{p 1}

(and in fact each of the subsequent

S_{pi}

s) results from a run in the time interval [0,1], i.e., in period 1, of an SBM with appropriately chosen parameters. To make our formulation more self‐contained, a brief description of the SBM is given next.

Consider a product that has a potential‐adopter population of size n (the term “adopter” is used here to avoid confusion with the terms “committed buyers” and “potential buyers” in our model). The SBM assumes that each potential adopter will purchase exactly one unit of the product, and that the cumulative number of adopters of the product evolves according to a pure birth process with state‐dependent birth rates. For any given n, the birth rates depend on two parameters, which are denoted by α and β. Specifically, it is assumed that if the current state (i.e., the total number of existing adopters) is j, where 0 ≤ j ≤ n, then the birth rate (i.e., the rate of time to next purchase) of the process is

λ_{nj} \equiv (n - j) (α + \frac{β}{n - 1} j) .

We will refer to an SBM with birth rates (5) as one with specification (n;α,β).

In (5), the first term n − j is the size of the residual potential‐adopter population. The second term is the rate for any of the residual potential adopters to make a purchase. The parameter α is an intrinsic purchase rate, independent of the existing adopters, for any of the potential adopters to make the purchase, given that the purchase has not yet been made; and its magnitude reflects the strength of the primitive appeal of the product. The parameter β is called the induction rate, and it reflects the strength of the total force of influence an existing adopter of the product has on the entire population. An important concept behind (5) is that the total induction force β from every existing adopter is apportioned uniformly to all other members of the entire population, so that each of the other members receives a share of magnitude β/(n − 1) (if a potential adopter has already made the purchase, the exerted influence is ignored). This apportionment is what explains why the existing adoption count j is multiplied by β/(n − 1) in (5).

For an SBM with specification (n;α,β), it is shown in Niu 2002, p. 252) that if n is sufficiently large, then the distribution of time to purchase for a randomly selected potential adopter can be approximated by

F (t) = \frac{1 - e^{- (α + β) t}}{1 + (β / α) e^{- (α + β) t}}, t \geq 0 .

The S‐shaped distribution (6), due originally to Bass (1969), will be used repeatedly in the sequel. To simplify notation, we will henceforth denote it as F(t;α,β).

We now proceed to formulate

S_{p 1}

. Our intent is to model this sales component as the stochastic outcome of a run of an SBM. However, observe that purchases made by potential buyers are influenced by the entire cumulative sales history, which includes not only purchases made by potential buyers but also those made by committed buyers. A generic pattern of the sales history is depicted in Figure 4. It can be seen from this figure that the two sales components from the committed and the potential buyers are interleaved. This interleaving of sales makes the analysis of a faithful formulation of

S_{p 1}

extremely challenging.

Figure 4

Interleaved Sales History

To overcome this difficulty, we will develop a two‐step approximation procedure. In the first step, we will put together a preliminary analysis of sales to potential buyers that does not consider any influencing effect from the interleaved committed‐buyer sales; and then, in the second step, we will make an adjustment to the preliminary analysis that incorporates an incremental sales component which reasonably accounts for the effect of induction from the interleaved sales due to the committed buyers.

We begin with two basic assumptions: From any time epoch t, t ≥ 0, onward, (i) if a potential buyer has not yet made the purchase, then, in the absence of any influence from existing buyers, the distribution of time to purchase for that potential buyer is exponential with rate α; and (ii) if a committed or potential buyer has already made the purchase, then the distribution of time to having a “contact” between that buyer and any one of the remaining potential buyers, and thereby inducing a sale if the latter has not yet made the purchase, is exponential with rate β/(m − 1), where m = 88,000,000. It is important to note that the apportionment of β here is based on the entire market ceiling m, and not just

r_{p 1}

(or

r_{pi}

, in general); this is because random contacts made by an existing buyer are not limited to the potential‐buyer population.

A careful reflection on these two assumptions suggests that, with m given, we can take the viewpoint that the rates α and β are the defining attributes of every potential buyer. In the same vein, we can also view

α_{c}

and β as the defining attributes of every committed buyer. Note that the total induction force β (per existing buyer) is assumed to be the same for all committed and potential buyers. These viewpoints will be very helpful in understanding the ensuing formulation.

Denote by

S_{p 1}^{(1)}

the hypothetical total potential‐buyer sales that would accrue in period 1 if all induction force generated by the committed buyers are ignored. Then, in our first step, we claim that under assumptions (i) and (ii) above,

S_{p 1}^{(1)}

is the random outcome of a run in [0,1] of an SBM with specification

(r_{p 1}; α_{1}, β_{1}^{(1)})

, where

α_{1} = α

and

β_{1}^{(1)} = β \frac{r_{p 1} - 1}{m - 1} .

We hasten to point out that as given in (7),

β_{1}^{(1)}

is a function of

r_{p 1}

, which is the population size of the SBM stated in the claim. According to the definition of the SBM, the induction rate β in (5) is a parameter that is independent of the population size n. Thus, the statement of the claim does not appear to be valid. However, observe that, as

r_{p 1} = m π_{p}

, with m = 88,000,000, the right‐hand side of (7) essentially equals

β π_{p}

, which is independent of

r_{p 1}

. Therefore, although there is a slight abuse of language and notation, the statement of the claim is justified, and we will continue to use this language in the remainder of our formulation.

We also note that the ratio

(r_{p 1} - 1) / (m - 1)

in (7) has a simple interpretation; it can be viewed as a relative‐size adjustment between the potential‐buyer population and the market ceiling. This means that if we limit the scope of our attention to the potential‐buyer population, then the original total influence β from a consumer, which spans the entire market ceiling (minus the given consumer, technically), should be scaled down proportionately. Indeed, we now see that

β_{1}^{(1)}

ought to be at

β π_{p}

when m is large.

The proof of the claim is essentially the same as that for (26) and (27) in Niu (2006). An adaptation is provided here for completeness. At the beginning of period 1, we have

r_{p 1}

potential buyers. As all influences from the committed buyers are to be ignored, assumption (i) (with t = 0) implies that the time to purchase for each of these potential buyers is exponential with rate α. As the minimum of independent exponential random variables is also exponential, it follows that the distribution of time to first purchase by a potential buyer is exponential with parameter

r_{p 1} α

. This is consistent with (5) with the substitutions

n = r_{p 1}

and j = 0. Next, observe that after the first purchase is made by a potential buyer, the status of this individual switches from a potential buyer to an existing buyer. As an existing buyer, this individual then begins to exert an induction force of magnitude β/(m − 1) on every other member in the entire market ceiling, including, in particular, those in the residual potential‐buyer population. Therefore, the rate of the time to purchase for each of the remaining

r_{p 1} - 1

potential buyers is now boosted from α to α + β/(m − 1). This implies that the second inter‐purchase time from the potential‐buyer population is exponential with rate

(r_{p 1} - 1) (α + \frac{β}{m - 1}) .

Observe that if we rewrite β/(m − 1) as

β_{1}^{(1)} / (r_{p 1} - 1)

, that is, if we define

β_{1}^{(1)}

according to (7), then this rate is, again, consistent with (5), now with the substitutions,

n = r_{p 1}

, j = 1, and

β = β_{1}^{(1)}

. Continuation of this argument for

j = 2, \dots, r_{p 1}

then leads to the conclusion that the cumulative sales from the potential buyers in the time interval [0,1] evolves as an SBM with specification

(r_{p 1}; α, β_{1}^{(1)})

. Finally, designating α as

α_{1}

, with the subscript explicitly reflecting the current period index, completes the proof.

Now, for t ≥ 0, define

F_{p 1}^{(1)} (t)

as the probability for a potential buyer to have purchased the product by time t in the absence of any influence from the committed buyers. Then, assuming that

r_{p 1}

is sufficiently large, the above claim implies that

F_{p 1}^{(1)} (t)

can be approximated by

F (t; α_{1}, β_{1}^{(1)})

(see (6)). In addition, let

μ_{p 1}^{(1)} \equiv E [S_{p 1}^{(1)}]

; then, by setting t = 1 in

F_{p 1}^{(1)} (t)

, we obtain

μ_{p 1}^{(1)} = r_{p 1} F (1; α_{1}, β_{1}^{(1)}) .

This completes the first step of our analysis.

The aim of the second step of our analysis is to make an upward adjustment to (8) that takes into account the impact of the dynamic influence from successive purchases made by the committed buyers.

Observe (see Figure 4) that between every pair of consecutive purchases by potential buyers, we have a random number of purchases made by committed buyers. It is stipulated in assumption (ii) that a committed buyer will begin to contribute an induction force of magnitude β/(m − 1) on each member of the potential‐buyer population as soon as that buyer makes the purchase. This has so far been ignored in the preliminary estimate (8). We will now propose a remedy.

Recall that the mean number of committed‐buyer purchases in period 1 is given by

μ_{c 1}

. As the mean number of potential‐buyer purchases in the absence of influence from the committed buyers is

μ_{p 1}^{(1)}

, we see that the average number of committed‐buyer purchases per each purchase made in

S_{p 1}^{(1)}

is given by

μ_{c 1} / μ_{p 1}^{(1)}

Conceptually, successive contributions made by the committed buyers to the induction force on the potential buyers should accrue sequentially over time, one by one. An analysis at this level of detail appears to be intractable. The key approximation in our remedy is that we will lump all committed‐buyer purchases between a pair of consecutive (hypothetical) purchases in

S_{p 1}^{(1)}

into a single “batch” and pretend that the entire batch arrives at the same time as the first of the pair. This viewpoint is illustrated in Figure 5.

Figure 5

Batching of Committed‐Buyer Sales

Now, according to the formulation of

S_{p 1}^{(1)}

, each time a potential buyer makes a purchase, a new incremental contribution to the induction force on each of the remaining potential buyers, of magnitude

β_{1}^{(1)} / (r_{p 1} - 1)

, is created. The batching proposed above suggests that a reasonable approach to account for the corresponding contributions from purchases made by the interleaved committed buyers is to boost the contribution

β_{1}^{(1)} / (r_{p 1} - 1)

by the factor

1 + μ_{c 1} / μ_{p 1}^{(1)}

, each time a purchase counted in

S_{p 1}^{(1)}

is made. This is tantamount to saying that we could boost

β_{1}^{(1)}

β_{1}

, defined by

β_{1} = β_{1}^{(1)} (1 + \frac{μ_{c 1}}{μ_{p 1}^{(1)}}),

and consider an accelerated SBM with specification

(r_{p 1}; α_{1}, β_{1})

. We will therefore formally assume that

S_{p 1}

can be modeled as the outcome of a run in [0,1] of this new SBM. Moreover, let

F_{p 1} (t)

, for t ≥ 0, be the probability for a potential buyer to have purchased the product by time t; then, assuming that

r_{p 1}

is large, the specification of this SBM implies that

F_{p 1} (t) = F (t; α_{1}, β_{1})

(which is now seen as the desired analogue of

F_{c} (t)

). Paralleling (8), we finally arrive at

μ_{p 1} = r_{p 1} F (1; α_{1}, β_{1}) .

It is interesting to note that although

S_{p 1}^{(1)}

is only a hypothetical count, we have used it constructively as a stepping stone in the formulation of

S_{p 1}

We now go beyond period 1 and consider an arbitrary period i, i ≥ 2. We will continue to model sales in a period based on the SBM. However, with i ≥ 2, there would exist an existing sales history at the beginning of period i. Therefore, we need to develop a scheme that dynamically updates the specifications of the successive SBMs to properly reflect any given “expected history” (Niu 2006, p. 686, section 3.2).

We begin with the formulation of the networking effect. Our inclusion of a networking effect in the model is motivated by the fact that sales to the committed and the potential buyers in a period could have a stronger near‐term impact on some members (e.g., those in the same social group) of the residual potential‐buyer population than the force already built into the induction rate β. This naturally suggests that we allow the sales component

S_{ni}

to depend on

S_{c, i - 1}

and

S_{p, i - 1}

, i.e., on the number of purchases made by the committed and the potential buyers in the previous period. Our specific assumption will be that each of the purchases counted in

S_{c, i - 1} + S_{p, i - 1}

generates (independently of everything else) an average of ν potential‐buyer purchases during period i. As

E [S_{c, i - 1} + S_{p, i - 1}] = μ_{c, i - 1} + μ_{p, i - 1}

, this implies that, for i ≥ 2, we have

μ_{ni} = (μ_{c, i - 1} + μ_{p, i - 1}) ν .

Next, we consider

r_{pi}

. Although

r_{pi}

is by definition an expected count, we will tacitly pretend that it is the actual size of the potential‐buyer population at the start of period i. (Ultimately, we are interested in a mean purchase count, for which a justification can be provided by conditioning on the actual count.) This will allow us to properly initiate an SBM with population size

r_{pi}

Paralleling

r_{p 1}

, we would have

r_{pi} = r_{p, i - 1} - μ_{p, i - 1}

if there were no networking effect. To accommodate networking, we will assume that

S_{ni}

is drawn from the residual potential‐buyer population and set aside as a reserve just before the onset of period i, and that the actual purchases from this reserve are to take place in the course of period i. Therefore, we will actually have

r_{pi} = r_{p, i - 1} - μ_{p, i - 1} - μ_{ni} .

Note that, with

r_{p 1} = m π_{p}

, (12) can also be written as

r_{pi} = m π_{p} - \sum_{j = 1}^{i - 1} (μ_{pj} + μ_{nj}) - μ_{ni},

which is an analogue of (3).

With

r_{pi}

given, the formulation of

S_{pi}

is similar to that for

S_{p 1}

. Again, we will start with

S_{pi}^{(1)}

, which is defined as the total number of purchases that would be made in period i by potential buyers, if there were no committed‐buyer sales and networking effect in that period. As in period 1, we will show that

S_{pi}^{(1)}

is an SBM with a pair of intrinsic purchase rate and induction rate, denoted by

α_{i}

and

β_{i}^{(1)}

, which properly reflect the past sales history. The formulation of

α_{i}

and

β_{i}^{(1)}

is similar to (31) and (32) in Niu (2006). We will provide an adaptation here.

Observe that at the start of period i, the expected total number of existing purchases is given by

\sum_{j = 1}^{i - 1} (μ_{cj} + μ_{pj} + μ_{nj})

. We will, again, treat this expected history as an actual count (see Niu 2006, p. 684, paragraph after (30), for a reasonable justification). It then follows from assumptions (i) and (ii) that, at the start of period i, the total rate for any one of the

r_{pi}

potential buyers to make a purchase is

α + \frac{β}{m - 1} \sum_{j = 1}^{i - 1} (μ_{cj} + μ_{pj} + μ_{nj}) .

(The first term here is due to assumption (i); and the second, assumption (ii).) This implies that the distribution of the resulting time to first purchase (a minimum of exponential random variables) from the residual potential‐buyer population is exponential with a parameter given by

r_{pi}

times the rate in (13). A comparison of this parameter with

λ_{n 0}

in (5) suggests that (13) may qualify as the intrinsic purchase rate of an SBM with population size

r_{pi}

Denote the rate in (13) by

α_{i}

; and, to facilitate computation, rewrite this rate as

α_{i} = α_{i - 1} + \frac{β}{m - 1} (μ_{c, i - 1} + μ_{p, i - 1} + μ_{n, i - 1}) .

Continuing beyond the time to first purchase, we next claim that the total purchase

S_{pi}^{(1)}

is distributed as the outcome of a run that starts at time i − 1 and evolves for a duration of 1 of an SBM with specification

(r_{pi}; α_{i}, β_{i}^{(1)})

, where

α_{i}

is as given in (14) and

β_{i}^{(1)} = β \frac{r_{pi} - 1}{m - 1} .

The proof of this claim is identical to that for the corresponding claim for

S_{p 1}^{(1)}

. Details are therefore omitted.

The rest of the analysis for period i is similar to that for period 1. The only remaining difference is that we now have a networking effect. We will therefore be brief.

For t ≥ 0, define

F_{pi}^{(1)} (t)

as the probability for a residual potential buyer at the start of period i to have purchased the product by time (i − 1) + t in the absence of any influence from both the committed buyers and the networking effect in that period. For sufficiently large

r_{pi}

F_{pi}^{(1)} (t)

can be approximated by

F (t; α_{i}, β_{i}^{(1)})

. Let

μ_{pi}^{(1)} \equiv E [S_{pi}^{(1)}]

; then, we have

μ_{pi}^{(1)} = r_{pi} F (1; α_{i}, β_{i}^{(1)}) .

To determine a reasonable boost to (15), we will make the additional assumption that the cumulative number of purchases in period i that are due to the networking effect is governed by an independent Poisson process at rate

μ_{ni}

. This roughly means that purchases due to networking are spread out evenly in period i. It follows that between every pair of consecutive sales counted in

S_{pi}^{(1)}

, we would approximately have an average of

(μ_{ci} + μ_{ni}) / μ_{pi}^{(1)}

purchases that are due either to committed buyers or to the networking effect. Therefore, we should boost

β_{i}^{(1)}

by the factor

1 + (μ_{ci} + μ_{ni}) / μ_{pi}^{(1)}

, and we will model

S_{pi}

as the outcome of a run in the time interval [i − 1,i] of an SBM with specification

(r_{pi}; α_{i}, β_{i})

, where

α_{i}

is given by (14) and

β_{i} = β_{i}^{(1)} (1 + \frac{μ_{ci} + μ_{ni}}{μ_{pi}^{(1)}}) .

Finally, for t ≥ 0, let

F_{pi} (t)

be the probability for a residual potential buyer at the start of period i to have purchased the product by time (i − 1) + t; then, we approximately have

F_{pi} (t) = F (t; α_{i}, β_{i})

, and therefore

μ_{pi} = r_{pi} F (1; α_{i}, β_{i}) .

3.1.2. Formulation of the Variances

In this subsection, we will develop a recursive scheme that yields a set of approximations for the

σ_{i}^{2}

's (see (1)). These variances not only quantify the extent of variability of the successive sales but also will allow us to estimate the parameters in our model using the method of maximum likelihood.

Recall that the total sales

S_{i}

is given by

S_{ci} + S_{pi} + S_{ni} + ϵ_{i}

. For i ≥ 1, define

S_{i}^{(1)} \equiv S_{ci} + S_{pi} + S_{ni};

and recall further that we had denoted the variance of

S_{i}^{(1)}

ψ_{i}^{2}

. As the

ϵ_{i}

's have variance

δ^{2}

and are independent of the other sales components, our analysis will focus on the

ψ_{i}^{2}

s. Observe that, while

S_{ci}

and

S_{ni}

are independent,

S_{pi}

depends on both

S_{ci}

and

S_{ni}

in very complicated ways. Our analysis of

ψ_{i}^{2}

, therefore, will be based on a host of simplifying assumptions that are intended to approximate this dependence. The idea is to approximately decompose the count

S_{pi}

into a set of more‐manageable sub‐counts.

We begin with the observation that, as

β_{i} \geq β_{i}^{(1)}

, the birth rate for every state (see (5)) in the defining SBM for

S_{pi}

is no less than (or no slower than) the corresponding birth rate for the same state in the defining SBM for

S_{pi}^{(1)}

. By coupling the inter‐purchase times in these two SBMs, it follows (similar to, e.g., Ross 1996, p. 412, Example 9.2(c)) that

S_{pi}

and

S_{pi}^{(1)}

can be constructed on the same probability space to satisfy

S_{pi} \geq S_{pi}^{(1)}

with probability 1.

Under this construction, if we define

S_{pi}^{(2)} \equiv S_{pi} - S_{pi}^{(1)},

which is a non‐negative random variable, then we have the decomposition

S_{pi} = S_{pi}^{(1)} + S_{pi}^{(2)}

. Now, in this decomposition, the first variable

S_{pi}^{(1)}

is, as defined in 3.1.1, independent of both

S_{ci}

and

S_{ni}

; therefore, the second variable

S_{pi}^{(2)}

can be interpreted as the number of potential‐buyer sales in period i that are due to the influence from both the accumulation of purchases from the committed buyers and the networking effect. These two variables are, of course, dependent by definition. Observe, however, that a portion of this dependence, physically speaking, could be considered as a consequence of the fact that both

S_{pi}^{(1)}

and

S_{pi}^{(2)}

are drawn concurrently from the same residual potential‐buyer population. This suggests that if

r_{pi}

is sufficiently large, then the extent of this dependence may not be too significant. Given this motivation, we will make the approximating assumption that

S_{pi}^{(1)}

and

S_{pi}^{(2)}

are independent.

We will next decompose

S_{pi}^{(2)}

further. As suggested above, we will take the viewpoint that sales counted in

S_{pi}^{(2)}

are “generated” by the influence of those counted in

S_{ci}

and

S_{ni}

. Define

μ_{pi}^{(2)} \equiv E [S_{pi}^{(2)}]

; then, as the mean of

S_{ci} + S_{ni}

μ_{ci} + μ_{ni}

, the average number of sales counted in

S_{pi}^{(2)}

per each sale counted in

S_{ci} + S_{ni}

is given by

γ_{i} \equiv \frac{μ_{pi}^{(2)}}{μ_{ci} + μ_{ni}},

where, from (20), (18), and (16),

μ_{pi}^{(2)} = r_{pi} [F (1; α_{i}, β_{i}) - F (1; α_{i}, β_{i}^{(1)})]

. This motivates the approximation that

S_{pi}^{(2)} = \sum_{k = 1}^{S_{ci} + S_{ni}} X_{k},

where the

X_{k}

's are assumed to be i.i.d. random variables (non‐negative and integer valued) with mean

γ_{i}

With the decomposition of

S_{pi}^{(2)}

in (22), (19) becomes

S_{i}^{(1)} = S_{ci} + S_{ni} + S_{pi}^{(1)} + \sum_{k = 1}^{S_{ci} + S_{ni}} X_{k},

which can be rewritten as

S_{i}^{(1)} = \sum_{k = 1}^{S_{ci} + S_{ni}} (1 + X_{k}) + S_{pi}^{(1)} .

Note that all of the variables in the approximate representation (23) are independent, and have a known mean. In addition, it is easily seen that upon taking expectations, (23) yields

E [S_{i}^{(1)}] = μ_{ci} + μ_{ni} + μ_{pi}

; that is, we have maintained the original mean in this representation.

To compute the variance of

S_{i}^{(1)}

, i.e.,

ψ_{i}^{2}

, we will further make the blanket assumption that all of the variables in (23) are cumulative counts in independent Poisson processes with rates given by their respective means (for

S_{ni}

, this assumption has already been made in 3.1.1). Consider

S_{pi}^{(1)}

, for example; although the birth rates in its defining SBM are state dependent, this Poisson assumption is not unreasonable, because if

r_{pi}

is large, then within the short time span of a week, the birth rates do not change substantially. For the remaining variables in (23), the corresponding assumptions can be loosely justified in similar manners.

As the variance of a Poisson random variable is equal to its mean, the Poisson assumption above allows us to calculate the variance of the approximation in (23) in terms of the known means. This calculation is now straightforward, and is worked out in section B of Online Appendix S1. The result is:

ψ_{i}^{2} = (μ_{ci} + μ_{ni}) [1 + 3 γ_{i} + 2 γ_{i}^{2}] + μ_{pi}^{(1)} .

Finally, as

σ_{i}^{2} = ψ_{i}^{2} + δ^{2}

, we arrive at the variance approximation

σ_{i}^{2} = (μ_{ci} + μ_{ni}) (1 + 3 γ_{i} + 2 γ_{i}^{2}) + μ_{pi}^{(1)} + δ^{2} .

It is perhaps interesting to observe that if

S_{ci} + S_{ni}

and

S_{pi}

were independent, we would have

\begin{matrix} ψ_{i}^{2} & = Var [S_{ci} + S_{ni}] + Var [S_{pi}] \\ = μ_{ci} + μ_{ni} + μ_{pi}^{(1)} + μ_{pi}^{(2)}, \end{matrix}

which can be written as

(μ_{ci} + μ_{ni}) (1 + γ_{i}) + μ_{pi}^{(1)}

. The difference between (24) and (26) is

2 (μ_{ci} + μ_{ni}) γ_{i} (1 + γ_{i})

. Hence,

(μ_{ci} + μ_{ni}) γ_{i} (1 + γ_{i})

is the approximate covariance between

S_{ci} + S_{ni}

and

S_{pi}

3.1.3. Model Summary

We now provide a summary of the Networking Model. In this model, it is assumed that sales in successive time periods are given by the heteroscedastic nonlinear regression equation defined in (1). The mean

μ_{i}

and the standard deviation

σ_{i}

in (1) are specified via a set of recursions.

To compute the means in (1), we decomposed

μ_{i}

into the sum

μ_{ci} + μ_{pi} + μ_{ni}

, and developed a recursion for each of these component means, given in (2), (18), and (11), respectively. The core component of our analysis is that for potential buyers, i.e., for

μ_{pi}

. It is accomplished in two steps. In the first step, we estimated

μ_{pi}

using an SBM, assuming that there were no influences from either the committed buyers or networking. We then, in the second step, boosted this preliminary estimate using an accelerated version of the SBM in the first step that reasonably incorporates the left‐out influence.

An important aspect of the accelerated SBM is that the intrinsic purchase rate

α_{i}

and the induction rate

β_{i}

in its specification, given in (14) and (17), respectively, explicitly formalize the intuitive concept that as we move from period to period, the purchase rate of the residual potential buyers should be properly updated to reflect the growing influence from the accumulation of existing buyers, the cross influence between different consumer sub‐populations, and the dynamically changing sizes of the residual sub‐populations. As discussed at the end of section 2, the incorporation of these features in a sales forecast model is of particular relevance to short‐life‐cycle products.

To compute the standard deviations in (1), we proposed a Poisson approximation for the total sales in each period, given in (23), and used this approximation to estimate the variance of the total sales, which is given in (25). A desirable feature of the estimates for the

σ_{i}^{2}

s is that they depend only on the historical means of the sales components.

With m set exogenously, the full specification of our model requires seven parameters, namely

π_{c}

π_{p}

α_{c}

, α, β, ν, and δ. Given the actual sales

{s_{i}, i = 1, \dots, N}

of a product title, these parameters can be estimated via the maximum likelihood procedure, using the log‐likelihood function

- \sum_{i = 1}^{n} \ln (σ_{i}) - \frac{1}{2} \sum_{i = 1}^{n} {(\frac{s_{i} - μ_{i}}{σ_{i}})}^{2} .

3.2. The Re‐Rent Model

We now move on to the formulation of the Re‐Rent Model. The structure of this model is basically the same as that of the Networking Model. Therefore, we will mainly discuss features that are either new or different, and our discussion will be brief.

The Re‐Rent Model is developed here for game rental products. Specifically, for each time period i, we assume that the total sales is given by the sum

S_{i} = S_{ci} + S_{pi} + S_{ri} + ϵ_{i}

, where the variables

S_{ci}

S_{pi}

, and

ϵ_{i}

are defined as before, but now the new variable

S_{ri}

, which replaces

S_{ni}

, denotes sales from the re‐rent effect. Again, we will work with (1), and estimate

μ_{i} \equiv E [S_{i}]

and

σ_{i}^{2} \equiv ψ_{i}^{2} + δ^{2}

, where

ψ_{i}^{2} \equiv Var [S_{ci} + S_{pi} + S_{ri}]

, based on recursive estimates of the mean sales

μ_{ci} \equiv E [S_{ci}]

μ_{pi} \equiv E [S_{pi}]

, and

μ_{ri} \equiv E [S_{ri}]

The market ceiling for a game product is denoted by m. Unlike DVD movie products, which have a common market ceiling, the value of m now depends on the game hardware. For each hardware platform, we will set m according to the total number of console units for that platform. The sizes of different platforms are listed in Table 1. The listed values came from the website of PVC Gaming News and Reviews (PCVSConsole.com 2007).

There are two reasons for not aggregating game rentals across all platforms. The first is that certain titles may be released only on specific gaming platforms; and the second is that some gaming platforms are more advanced, and therefore elicit different renter behavior (such as a higher degree of product fanaticism) than others.

Apart from setting m differently, the formulation and analysis of the committed‐buyer population is the same as that for the Networking Model. Therefore, the recursion for

(r_{ci}, μ_{ci})

in (4) and (2) applies to the Re‐Rent Model without any modification.

For the potential‐buyer population, there are, however, a few necessary modifications in the formulation. All of the modifications are due to the reasonable assertion (which we make) that a repeat rental by a renter will not generate any incremental influence beyond the induction force that has already been activated by that renter at the time of the first rental.

Consider period i, i ≥ 1. We will assume that a fraction η of the renters counted in

S_{c, i - 1} + S_{p, i - 1}

(note that we could have included

S_{r, i - 1}

here; this is not done because the observed data indicate that the likelihood of having multiple re‐rents is negligible) will rent again in period i; or alternatively, each new renter in period i − 1 has a probability η of re‐renting in period i. This implies that

μ_{ri} = (μ_{c, i - 1} + μ_{p, i - 1}) η,

which is analogous to (11).

Next, observe that in contrast to the Networking Model, where each purchase due to the networking effect comes from a new potential buyer (i.e., one who has not yet made a purchase), the re‐rents in the model here, by definition, do not draw from the residual potential‐buyer population. It follows that (12) and (14) should be modified to

r_{pi} = r_{p, i - 1} - μ_{p, i - 1}

and

α_{i} = α_{i - 1} + \frac{β}{m - 1} (μ_{c, i - 1} + μ_{p, i - 1}),

respectively. With the new recursions (29) and (30), the functional forms of (15) and (16) stay the same in the Re‐Rent Model. The only remaining modification in the recursion for

(μ_{ri}, r_{pi}, μ_{pi})

is that (17) becomes

β_{i} = β_{i}^{(1)} (1 + \frac{μ_{ci}}{μ_{pi}^{(1)}}) .

That

μ_{ri}

does not appear in (31) is because the influence from those who rent a second time has already been accounted for when

α_{i - 1}

is updated to

α_{i}

(namely, the expected count

μ_{c, i - 1} + μ_{p, i - 1}

in (30)).

Note that apart from boosting the total expected sales

μ_{i}

, via

μ_{ri}

, re‐rents do not participate in the formulation of rentals from the potential‐buyer population (see (29)–(31)). Consequently, the impact of the re‐rent effect in the model is “softer” than that of the networking effect in the Networking Model.

We now consider the

σ_{i}^{2}

s. The first observation is that the ratio in (21) should be modified to

γ_{i} \equiv \frac{μ_{pi}^{(2)}}{μ_{ci}} .

This again is because re‐rents do not activate any new influence. Next, for i ≥ 1, define

S_{i}^{(1)} \equiv S_{ci} + S_{pi} + S_{ri}

; then, paralleling the development in section 3.1.2, we have the approximate representation

S_{i}^{(1)} = \sum_{k = 1}^{S_{ci}} (1 + X_{k}) + S_{ri} + S_{pi}^{(1)},

where

S_{ci}

S_{ri}

, and

S_{pi}^{(1)}

are independent random variables with means

μ_{ci}

μ_{ri}

, and

μ_{pi}^{(1)}

, respectively, and

{X_{k}, k = 1, 2, \dots}

is an independent sequence of i.i.d. random variables with mean

γ_{i}

(now given by (32)). Finally, a calculation similar to the one in section B of Online Appendix S1 yields that

σ_{i}^{2} = μ_{ci} (1 + 3 γ_{i} + 2 γ_{i}^{2}) + μ_{ri} + μ_{pi}^{(1)} + δ^{2} .

Note that in this case, the approximate covariance between

S_{ci}

and

S_{pi}

(both are independent of

S_{ri}

) is

μ_{pi}^{(2)} (1 + γ_{i})

4. An Empirical Study

The core of our empirical study is a thorough analysis of 352 product titles (170 DVD rental titles, 98 DVD retail titles, 69 game rental titles, and 15 game retail titles), using the models developed in section 3. All of these titles are new products released in the 10‐week time frame from April 2, 2007, through June 9, 2007. The actual sales data were collected from the Blockbuster order management system for the US operations.

An important goal of our empirical work was to test the Networking and the Re‐Rent models against the available data sets, while the models were still under development. The benefit of this approach is that it allowed us to benchmark the validity of various proposed features of the models. For this iterative calibration, we utilized version 7.1 of the Premium Solver (Frontline Systems Inc., Incline Village, NV, USA) add‐in for Excel (Standard Evolutionary Solver), which incorporates a genetic algorithm for solving non‐linear optimization problems, and a desktop version of sas 9.1, (SAS Institute Inc., Cary, NC, USA) which is useful in reporting the necessary measures of statistical significance. This process resulted in a set of seven fitted parameters that characterizes the sales trajectory of each of the 352 products, along with a corresponding set of parameter p‐values and t‐statistics.

To make parameter predictions for other product titles, we collected a set of “environmental” data that we believe should have a reasonable impact on product sales. Environmental data for DVD movie products (e.g., theatrical sales, MPAA rating, etc.) were obtained from the website 2007; and environmental data for game products (e.g., ESRB rating, genre, replayability rating, etc.) were obtained from the website 2007 Using the collected environmental data and the fitted parameters for the entire set of 352 test titles, we next developed a suite of regression equations (with the environmental information as independent variables) that are individually designed for each parameter (the dependent variable) in our models. The econometric modeling package stata, version 10 (StataCorp LP, College Station, TX, USA), is used for these regressions. The fitted regression equations are then used to make parameter predictions, and thereby explicit sales forecasts, for a separate set of product titles.

Finally, to improve the performance of our forecasts, we further developed a method that makes an adjustment to the initial predictions based on the actual first‐period sales. We believe that the resulting procedure should be of interest to industry practitioners, as it could offer improved ordering or replenishment decisions.

The remainder of this section is organized as follows. In section 4.1, we describe the parameter‐fitting procedure in detail; in section 4.2, we discuss the regression models that are used to make reasonable initial predictions of parameter values for new products; and in section 4.3, we present a method that significantly improves the initial forecasts using actual first‐period sales.

4.1. Fitting Parameters

As mentioned above, both the Premium Solver and sas are used in our model fits. The Evolutionary Solver embedded in the Premium Solver is particularly helpful because it is capable of producing near‐optimal solutions, even though the log‐likelihood functions in our models (see (27)) are rather nonlinear. We also extensively relied on the graphing tools in Excel itself; and this provided valuable visualizations of the sales trajectories. However, the Premium Solver does not provide standard errors of the parameter estimates. To generate this important information, we used sas. Thus, these two programs nicely complement each other.

Our approach is to first use the Premium Solver to generate, for each product, an initial solution, which typically is near the globally optimal solution, and then to run sas with this initial solution. In addition to providing the needed statistical‐significance information, the sas run also serves to confirm the (near) optimality of the Solver output. Sample sas programs for both the Networking Model, and the Re‐Rent Model can be found in section C of Online Appendix S1.

As we have 352 different products, it is natural to expect that there would exist many different types of sales trajectories. Therefore, we opted to first perform our iterative model development on a smaller subset of 12 test titles. This test set was chosen because the sales characteristics of these titles covered a diverse range of patterns. The rationale is that if our models are calibrated against this test set, then the resulting models would have a good chance of performing well for the remaining products. This turns out to be true, and in fact, some aspects of our model formulation were actually motivated by the patterns observed in this set of test titles.

The fitted parameters and their standard errors for the titles in the test set are given in Table 2. For most titles, the asymptotic standard errors, which are shown in parentheses below the fitted parameters, are quite reasonable. Note that in a few cases, the fitted values of β, ν, and η are given as 0.000000. This suggests that the induction rate, the networking effect, or the re‐rent effect, respectively, did not play a significant role in the sales for these products.

Table 2
Fitted Parameters for the 12 Test Titles

Product title Parameters

$π_{c}$
$π_{p}$

$α_{c}$
α β ν η δ

DVD rental 0.001404 0.003591 0.567677 0.007821 24.0521 0.135438 – 817.442

Volver (0.000082) (0.000066) (0.039625) (0.000640) (1.45651) (0.017930) (156.689)

DVD rental 0.000086 0.000443 0.543802 0.003558 133.655 0.881185 – 148.304

Bottom Feeders (0.000008) (0.000022) (0.058730) (0.000716) (17.8914) (0.038518) (30.7368)

DVD rental 0.020911 0.022135 0.375590 0.017086 2.47180 0.041747 – 13,432.4

The Good Shepherd (0.003433) (0.002582) (0.066114) (0.004001) (0.629676) (0.041431) (2473.22)

DVD retail 0.000305 0.000337 1.78100 0.140463 0.000035 0.109329 – 180.071

Apocalypto (0.000109) (0.000140) (0.545588) (0.261581) (614.155) (0.059829) (88.8102)

DVD retail 0.000040 0.000165 28.0417 0.372491 0.000000 0.000000 – 234.842

The Queen (0.000005) (0.000018) (1.33755) (0.051576) (0.000000) (0.000000) (64.3186)

DVD retail 0.000003 0.000011 0.579185 0.003359 9053.56 0.000001 – 4.99154

Dora: Shy Rainbow (0.000003) (0.000007) (4.66660) (0.111529) (3884.29) (0.000001) (2.52671)

Game retail: PS2 0.000008 0.000024 1.71062 0.106908 3509.02 0.911623 – 0.000001

Spiderman 3 (0.000011) (0.000010) (0.587182) (0.383103) (13,362.2) (0.056219) (0.000001)

Game retail: X360 0.000032 0.000082 38.1106 0.217663 2684.74 0.000011 – 41.9622

Guitar Hero II Bundle (0.000183) (0.000226) (1.76614) (4.00499) (64,205.2) (0.244179) (54.5710)

Game retail: NDS 0.000032 0.000292 2.21656 0.001413 230.322 1.27663 – 7.29972

Pokemon Pearl Version (0.000005) (0.000012) (0.315157) (0.000706) (23.6890) (0.056990) (2.85709)

Game rental: PS2 0.006902 0.002353 0.091251 0.007782 72.4746 – 0.000000 596.514

Spiderman 3 (0.000322) (0.000366) (0.009157) (0.001407) (6.64502) (0.000000) (171.568)

Game rental: WII 0.003962 0.004121 0.255803 0.013392 2.53456 – 0.190903 388.494

The Bigs (0.001858) (0.012460) (0.110266) (0.060486) (14.0837) (0.109473) (89.6458)

Game rental: PS2 0.000104 0.000269 0.488631 0.005530 395.316 – 1.44858 37.0466

Brave—Spirit Dancer (0.000009) (0.000008) (0.072266) (0.002821) (42.2423) (0.025943) (10.7086)

Product title	Parameters
DVD rental	0.001404	0.003591	0.567677	0.007821	24.0521	0.135438	–	817.442
Volver	(0.000082)	(0.000066)	(0.039625)	(0.000640)	(1.45651)	(0.017930)		(156.689)
DVD rental	0.000086	0.000443	0.543802	0.003558	133.655	0.881185	–	148.304
Bottom Feeders	(0.000008)	(0.000022)	(0.058730)	(0.000716)	(17.8914)	(0.038518)		(30.7368)
DVD rental	0.020911	0.022135	0.375590	0.017086	2.47180	0.041747	–	13,432.4
The Good Shepherd	(0.003433)	(0.002582)	(0.066114)	(0.004001)	(0.629676)	(0.041431)		(2473.22)
DVD retail	0.000305	0.000337	1.78100	0.140463	0.000035	0.109329	–	180.071
Apocalypto	(0.000109)	(0.000140)	(0.545588)	(0.261581)	(614.155)	(0.059829)		(88.8102)
DVD retail	0.000040	0.000165	28.0417	0.372491	0.000000	0.000000	–	234.842
The Queen	(0.000005)	(0.000018)	(1.33755)	(0.051576)	(0.000000)	(0.000000)		(64.3186)
DVD retail	0.000003	0.000011	0.579185	0.003359	9053.56	0.000001	–	4.99154
Dora: Shy Rainbow	(0.000003)	(0.000007)	(4.66660)	(0.111529)	(3884.29)	(0.000001)		(2.52671)
Game retail: PS2	0.000008	0.000024	1.71062	0.106908	3509.02	0.911623	–	0.000001
Spiderman 3	(0.000011)	(0.000010)	(0.587182)	(0.383103)	(13,362.2)	(0.056219)		(0.000001)
Game retail: X360	0.000032	0.000082	38.1106	0.217663	2684.74	0.000011	–	41.9622
Guitar Hero II Bundle	(0.000183)	(0.000226)	(1.76614)	(4.00499)	(64,205.2)	(0.244179)		(54.5710)
Game retail: NDS	0.000032	0.000292	2.21656	0.001413	230.322	1.27663	–	7.29972
Pokemon Pearl Version	(0.000005)	(0.000012)	(0.315157)	(0.000706)	(23.6890)	(0.056990)		(2.85709)
Game rental: PS2	0.006902	0.002353	0.091251	0.007782	72.4746	–	0.000000	596.514
Spiderman 3	(0.000322)	(0.000366)	(0.009157)	(0.001407)	(6.64502)		(0.000000)	(171.568)
Game rental: WII	0.003962	0.004121	0.255803	0.013392	2.53456	–	0.190903	388.494
The Bigs	(0.001858)	(0.012460)	(0.110266)	(0.060486)	(14.0837)		(0.109473)	(89.6458)
Game rental: PS2	0.000104	0.000269	0.488631	0.005530	395.316	–	1.44858	37.0466
Brave—Spirit Dancer	(0.000009)	(0.000008)	(0.072266)	(0.002821)	(42.2423)		(0.025943)	(10.7086)

The fit statistics for each of the 12 test titles are provided in Table 3. It can be seen that the

R^{2}

values range from 0.9209 to 0.9992, with 10 of the 12 titles above 0.9500. Thus, our models fit the sales data for these titles exceedingly well. Note that the sum of squared errors (SSE) vary widely from 241 to 3432E + 6; this, however, is not unexpected, as there exist significant differences in the sales volumes of these products.

Table 3

Fit Statistics for the 12 Test Titles

Product title	Fit statistics
Product title	$R^{2}$	SSE ( $10^{6}$ )
DVD rental	0.9962	13.0414
Volver
DVD Rental	0.9810	0.4239
Bottom Feeders
DVD rental	0.9931	3432.2610
The Good Shepherd
DVD retail	0.9992	0.470865
Apocalypto
DVD retail	0.9866	0.795008
The Queen
DVD retail	0.9209	0.001156
Dora: Shy Rainbow
Game retail: PS2	0.9988	0.000241
Spiderman 3
Game retail: X360	0.9671	0.025339
Guitar Hero II Bundle
Game retail: NDS	0.9670	0.002154
Pokemon Pearl Version
Game rental: PS2	0.9961	7.303532
Spiderman 3
Game rental: WII	0.9398	2.514553
The Bigs
Game rental: PS2	0.9829	0.038447
Brave—Spirit Dancer

We now summarize the results for the entire set of 352 titles. First and foremost, we found that overall, the

R^{2}

values were consistently high. Specifically, the

R^{2}

values ranged from a low of 0.3587 to a high of 0.9999, with an average of 0.9578. Only 34 of the 352 titles resulted in an

R^{2}

of <0.9000. This is illustrated in Figure 6. Note, in particular, that the

R^{2}

values are systematically high for titles with a large first‐week sales. In fact, the only cases where we have low

R^{2}

values are for titles with low first‐week sales. Thus, our models provide exceptionally robust fits, especially when the sales volume is not “low.”

Figure 6

R^{2}

‐Values for all 352 Data Sets

As discussed at the beginning of section 3 (see Figure 3, in particular), the key feature in our models is a decomposition of the total sales into three components. To see how the sales components work together effectively to cover a diverse range of sales patterns, we next present the detailed model fits for two examples, one for the Networking Model and the other for the Re‐Rent Model. We also provide three additional examples in section D.1 of Online Appendix S1.

Our first example is “Aurora Borealis.” This is a DVD rental, and hence the Networking Model is used. The fitted sales are plotted in Figure 7. It can be seen that the sales trajectory exhibits a second‐period spike, which is followed by a rapid decay, and then a slight late‐period increase. This is clearly a very challenging trajectory. Nevertheless, our model produced an

R^{2}

of 0.9321, with a fairly even set of Z‐scores. The decomposed sales components are also shown in the figure. Each of the three sales components plays a strong role. In particular, it can be seen that the networking effect is responsible for the sales spike in the second period, and the growth in potential‐buyer sales is responsible for the late‐period increase in total sales. We note in addition that in our model tests, it was observed that a second‐period spike tended to occur more frequently for titles with a low sales volume. This seems to be related to the fact that for smaller titles, there would be less “prelaunch exposure” and consequently, sales might rely more heavily on the networking effect.

Figure 7

Aurora Borealis

In Figure 8, the fitted sales for our second example, “XBox 360 Forza Motorsport 2,” is plotted. This is a game rental, and hence the Re‐Rent Model is used. The sales trajectory of this game exhibits a sharp drop from the first to the second period, and this is followed by a nearly linear decay pattern. Despite the peculiarity of this trajectory, our model managed to produce an

R^{2}

of 0.9814. An examination of the decomposed sales components in the figure shows that the sharp initial drop in total sales can be attributed to the fact that the size of the committed‐buyer population is dominant. Moreover, it is very interesting to observe that the strength of the committed‐buyer sales in the first period generated a significant spike in the sales to the potential buyers in the same period; this is a consequence of the intra‐period boost to the induction force on the potential buyers from purchases made by the committed buyers, as explicitly formalized in the second term in (31) (with i = 1). Note, however, that after the initial sales spike, which is a significant “drain” on the potential‐buyer population, subsequent sales to the potential buyers return to a traditional diffusion curve. Finally, we also see that the re‐rent effect for this title is negligible.

Figure 8

XBox 360 Forza Motorsport 2

These two detailed examples (together with the ones in Online Appendix S1) clearly indicate that our models are capable of offering an improved understanding of the sources of variability in the sales trajectories of these short‐life‐cycle products.

We next report a number of empirical facts regarding the fitted parameters for all 352 data sets. The parameter

π_{c}

, which determines the size of the committed‐buyer population, ranged from 0.00000 to 0.01970, with 98% of the values lower than 0.00010; and the corresponding purchase rate,

α_{c}

, landed between 0.00001 and 5.00000 for 87% of the titles. In contrast, the parameter

π_{p}

, which determines the size of the potential‐buyer population, ranged from 0.00000 to 0.05000 for 96% of the titles; this reflects the fact that, for the same title, the number of potential buyers, in general, tends to be larger than that of committed buyers. Finally, the parameters α and β were found to be typically between 0.00000 to 0.25000 (for 95% of the titles) and between 0 and 4000 (for 93% of the titles), respectively. Overall, the wide ranges of these parameter values underscore the difficulty with forecasting sales for short‐life‐cycle products. We also found that the values for

π_{c}

and

π_{p}

are significantly higher for rental products than for retail products. This is not surprising as the data used here are from Blockbuster, which has a very large market share in the rental arena, but a relatively small market share in retail products. Lastly, we note that the parameter

α_{c}

is consistently larger than α. This, again, is not surprising as

α_{c}

is the purchase rate for those who have already decided to purchase or rent the product upon its release.

Finally, we comment on the relative strengths of different parameters. The parameters

π_{c}

π_{p}

, and

α_{c}

were found to be consistently stronger, which is evidenced both by their average t‐statistics, all of which exceeded 4.2, and by their individual t‐statistics, which exceeded 1.5 in 67% of the cases. While the remaining parameters were also strong in an overall sense, as evidenced by their average t‐statistics, which were at 1.4 or higher, the percentages of parameters with a t‐statistic exceeding 1.5 dropped to as low as 33%. Thus, taken together, these results offer solid support for our models. A tabulated summary of these statistics is provided in Table 3 in section D.1 of Online Appendix S1.

4.2. Predicting Parameters

As discussed in section 4.1, the Networking and the Re‐Rent Models have been shown to be capable and robust enough to describe the DVD and game sales in the Blockbuster environment. In particular, the sales trajectory for each of the 352 titles has been characterized by a set of seven parameters; that is, for any given set of parameters, the successive expected total sales of a title can be explicitly computed via the recursions developed in section 3. Our goal now is to estimate, for a given new product, these seven parameters prior to any sales activity, based solely on environmental data. For this purpose, we used the parameter characterizations to develop a set of regression equations that, depending on the product type (e.g., movies or games), relate each of the parameters to an appropriate set of environmental variables. Our approach in developing these regressions involved three basic steps, which we summarize next.

First, for each product category, we conducted a correlation analysis, using the fitted parameters for titles in that category, to determine which environmental variables are likely to have a reasonable impact on the parameter values. In our correlation analysis, we also examined how strongly the variables in any proposed set of environmental variables are related to each other; and this was helpful in avoiding possible multi‐collinearity issues.

Second, for each of the parameters, we excluded from consideration all titles with a “low” significance level for that parameter. Our specific criterion is that if the p‐value for a fitted parameter of a title is higher than 0.05, then that title is not used in the analysis for that parameter. Note that this implies that a particular title could be used in the analysis of some of the parameters, but is excluded from the analysis of the other parameters. This is not unreasonable as an insignificant parameter is not expected to have a substantial impact on the values of the remaining fitted parameters.

Lastly, for each parameter, we carefully experimented with various functional forms for the regression equation, using the fitted values from the selected subset of titles, until one with reasonable fit is found for that parameter.

As DVD movie rental is of primary interest at Blockbuster, we will discuss in the remainder of this section results for this product category only (based on the Networking Model).

For movie rental, we identified, after much effort, nine environmental variables that seemed to be most relevant in predicting parameters. These are: (i) four‐week gross theatrical sales, (ii) presence of significant awards, (iii) large studio, (iv) medium studio, (v) large genre appeal, (vi) medium genre appeal, (vii) MPAA rating of UNRATED, (viii) MPAA rating of R, and (ix) MPAA rating of PG‐13. Except for four‐week gross theatrical sales, which is a numeric variable, all other variables in this list are non‐numeric, and are therefore formulated as indicator variables.

In the interest of brevity, we will now further limit our discussion to the parameter

π_{c}

. Details for the remaining parameters are provided in section D.2 of Online Appendix S1.

Our starting point is the belief that the extent of consumer awareness of a movie title should have an influence on the size of the committed‐buyer population. Moreover, it seems reasonable to expect that the 4‐week gross theatrical sales of a movie, which we denote by y, serve as a good proxy for consumer awareness. Indeed, we found a strong positive correlation between

π_{c}

and y. Continuing further, we observed that the relationship between the

π_{c}

values and ln (y) appears to be S‐shaped. (The transformation ln (y) is used here because the order of magnitudes of the y values for different movie titles varied over an extremely wide range.) A plot of this relationship is shown in Figure 9. The shape of this relationship suggests that: (i) when ln (y) is low, the

π_{c}

s are not sensitive to changes in ln (y); (ii) as ln (y) grows to a moderate range (say 12 < ln (y) < 18), the

π_{c}

s become very responsive to ln (y); and finally (iii) when ln (y) is extremely large, the

π_{c}

s would eventually approach a ceiling. These observations are clearly consistent with common‐sense intuition.

Figure 9

π_{c}

vs. ln (y)

Motivated by the further belief that as the only numeric variable, ln (y) would offer a stronger differentiating power than the other environmental variables, our strategy for constructing the regression for

π_{c}

is to first develop a backbone functional relation between

π_{c}

and ln (y) only, and then to fine tune this backbone relation by incorporating into it some of the other environmental variables listed above. This strategy, in fact, is also employed, in section D.2 of Online Appendix S1, in the construction of the regression models for all other parameters.

Our specific choice of the functional form in the backbone relation is where b and c are parameters that are to be estimated, and, for brevity, we have omitted the statement of a normally distributed error term. It can be shown that, as a function of ln (y), (34) is S‐shaped, with the parameter b serving as a control on the overall rate of growth and c serving as a control on the “midrange” steepness, or responsiveness. Thus, this functional form is well suited for modeling the data displayed in Figure 9. Indeed, this is confirmed by the curve labeled as “Predicted

π_{c}

with Environmental Variable y Only” in Figure 10; note that the horizontal axis in this figure, which is different from that in Figure 9, is an index of the movie titles in ascending order of y.

Figure 10

Predicted

π_{c}

vs. Fitted

π_{c}

To improve upon (34), we implemented a multiplicative adjustment to each of the shape parameters b and c using two additional environmental variables, large studios and the presence of significant awards (both of which are indicator variables). These two environmental variables, which we shall denote by r and s, respectively, were selected due to a combination of the extents of their correlations with

π_{c}

and experimentation. Our final regression model for

π_{c}

where d, e, and f (together with the original b and c) are parameters that are to be estimated from data.

Finally, to assess the performance of (35), we used the environmental data alone to predict the

π_{c}

values for the subset of titles whose significance levels for

π_{c}

are ≤0.05. The resulting predicted

π_{c}

s are shown in the curve “Predicted

π_{c}

with All Environmental Variables” in Figure 10. It can be seen that the adjusted predictions offer significant improvements over the original predictions. We also see in this figure that the predicted

π_{c}

s match the fitted

π_{c}

s quite well. The

R^{2}

value for this adjusted fit is 0.8760.

4.3. Forecasting Sales

Using the regression models developed in section 4.2 and in section D.2 of Online Appendix S1, we are now able to forecast sales for a DVD movie title prior to its release, based solely on publicly available environmental data. To assess the performance of our method, we selected a new test set of 11 movie titles and conducted sales forecasts. The titles in this test set are chosen randomly from new releases during the weeks of July 16, 2007 and July 23, 2007; thus, they have not previously been used in any way in our model development.

The performance statistics of our sales forecasts for these 11 titles (based on environmental data) are given in Table 4 under the section heading “Initial Forecast Statistics” (the section labeled “Adjusted Forecast Statistics” will be discussed later in this subsection). The reported statistics are mean absolute deviation (MAD), mean absolute percent deviation (MAPD), and mean squared error (MSE). Of these standard statistics, MAPD is most informative, as it is scaled by the size of the sales volume of a title. It can be seen that our sales forecasts are reasonable for some titles (e.g., an MAPD of 21.87% for “The Number 23”), but are poor for several titles (e.g., an MAPD of 225.81% for “Avenue Montaigne”). Note that we are unable to compare the forecast accuracy of our approach against that of the method employed at Blockbuster as their method is proprietary.

Table 4
Forecast Statistics for Test Data Set

Rental DVD title Initial forecast statistics Adjusted forecast statistics

MAD MAPD (%) MSE ( $10^{6})$ MAD MAPD(%) MSE (
$10^{6}$
)

Premonition 58,416 28.39 5673.2 22,207 16.75 675.23

Factory Girl 41,927 111.43 2403.4 13,439 35.72 312.95

The Number 23 32,102 21.87 1419.0 46,677 31.79 3520.9

Zodiac 41,229 35.32 6783.6 32,479 27.82 2149.1

The Hills Have Eyes 2 21,700 29.56 1304.5 14,386 19.60 289.10

Slow Burn 24,128 44.99 1429.9 11,662 21.74 328.59

Perfume: Story of a Murder 67,590 161.68 22,306.4 15,111 36.15 310.81

Nomad 42,780 129.30 9093.6 15,857 47.93 414.05

The Host 31,113 222.22 2462.0 2607 18.62 12.806

Tabunfire: Dynamite Warrior 3511 98.05 20.621 1434 40.05 3.0479

Avenue Montaigne 6382 225.81 89.947 412 14.57 0.3327

Rental DVD title	Initial forecast statistics	Adjusted forecast statistics
Premonition	58,416	28.39	5673.2	22,207	16.75	675.23
Factory Girl	41,927	111.43	2403.4	13,439	35.72	312.95
The Number 23	32,102	21.87	1419.0	46,677	31.79	3520.9
Zodiac	41,229	35.32	6783.6	32,479	27.82	2149.1
The Hills Have Eyes 2	21,700	29.56	1304.5	14,386	19.60	289.10
Slow Burn	24,128	44.99	1429.9	11,662	21.74	328.59
Perfume: Story of a Murder	67,590	161.68	22,306.4	15,111	36.15	310.81
Nomad	42,780	129.30	9093.6	15,857	47.93	414.05
The Host	31,113	222.22	2462.0	2607	18.62	12.806
Tabunfire: Dynamite Warrior	3511	98.05	20.621	1434	40.05	3.0479
Avenue Montaigne	6382	225.81	89.947	412	14.57	0.3327

Given that our sales model is solidly supported by the empirical results in section 4.1, the performance of the initial sales forecasts should be viewed as a reflection of the predictive power of the environmental variables. In general, if for example, data on product preorders (see, e.g., Moe and Fader 2002, Hui et al. 2008), which has higher power than the environmental data that we have used, were available, it could have been used to improve our forecasts. As such data are not available, we will develop in the remainder of this subsection a procedure that adjusts the initial forecasts based on the actual first‐period sales. The use of first‐period sales allows us to reasonably gauge the “best‐case” performance of our forecasting scheme, in the sense that this is similar to having preorder information.

As noted in section 4.2, the environmental variable y has the strongest correlation with

π_{c}

; this, in fact, is the case with parameters

π_{p}

and β as well. Observe, in addition, that these three parameters play significant roles in the characterization of the sales trajectory of a title, in that

π_{c}

and

π_{p}

, together, control the overall sales magnitude, and that β controls the rate of growth in cumulative sales. Our strategy, therefore, is to first use the actual first‐period sales to adjust the predicted values of these three parameters only, and then to use the resulting adjusted predictions for

π_{c}

π_{p}

, and β, together with the original predictions for the remaining four parameters, to produce an adjusted sales forecast.

We begin with the adjustment for β. From section D.2.4 of Online Appendix S1, the backbone regression equation for β is

β = a (1 - \exp (\frac{- b}{(\ln (y {))}^{c}})),

where a, b, and c are parameters to be estimated from data. The idea is to replace y in (36) by the actual first‐period sales. This is quite sensible because the actual first‐period sales serve as a more‐current proxy than y for the extent of consumer awareness of a movie title. The revised version of (36) is then used to produce a new prediction for β. Note that, to avoid possibly unwarranted complexity, we do not rely on any of the other environmental variables in this adjusted prediction.

To adjust the predictions for

π_{c}

and

π_{p}

, we will do a simple scaling. Observe that if we let β equal the adjusted prediction and fix the values of

α_{c}

, α, ν, and δ according to their original predictions, then the sales forecast can be viewed as a function of

π_{c}

and

π_{p}

only. This suggests that if we further fix the ratio

r \equiv π_{c} / π_{p}

according to the original predictions as well, then the sales forecast can be scaled up or down by a joint adjustment to

π_{c}

and

π_{p}

that preserves the given ratio r. Our method therefore is to manipulate the values of

π_{c}

and

π_{p}

in this manner until the sales forecast for the first period matches the actual sales for that period, and then to take the resulting values for

π_{c}

and

π_{p}

as their adjusted predictions.

The performance of the adjusted sales forecasts for all 11 test titles is reported in Table 4, under the section heading “Adjusted Forecast Statistics.” It can be seen that the MAPDs improved uniformly. The dramatic reduction in the MAPDs for “The Host” and “Avenue Montaigne” is particularly noteworthy.

Finally, we illustrate the forecast adjustments for two of the test titles, “Premonition” and “Factory Girl.” Corresponding results for the remaining nine titles are provided in section D.3 of Online Appendix S1.

Forecasting results for “Premonition” are plotted in 11. The initial sales forecast matches the actual sales trajectory reasonably well. However, the forecast has a slight sales spike in the second period, which is absent in the actual sales trajectory. The spike is removed in the adjusted sales forecast, and this can be attributed primarily to the adjustment in the predicted β.

Figure 11

Sales Forecasts for “Premonition"

The sales forecasts for “Factory Girl” is shown in Figure 12. The initial sales forecast for the first period, in this case, is substantially lower (over 50,000) than the actual sales for that period. Moreover, the sales forecast is almost linear between periods one and fourteen, which is dramatically different from the actual sales. The trajectory of the adjusted sales forecast is seen to offer significant improvements both in magnitude and in shape. Thus, all three adjustments to the initial predictions for

π_{c}

π_{p}

, and β have contributed positively for this title.

Figure 12

Sales Forecasts for “Factory Girl"

4.4. Financial Impact on Blockbuster's Retail Operations

In this subsection, we outline the potential benefits of improved forecasts on Blockbuster's retail operations. We will first describe the forecasting method currently in use at Blockbuster, and then illustrate its potential financial impact.

The prevalent forecasting tools at Blockbuster, and this industry in general, are typically confined to rudimentary heuristic methods that are dependent on both historical analogies and personal expertise. At Blockbuster, the management adopts an approach that, similar to the Delphi method, uses a combination of theatrical performance, recent performance of like titles, and the existence of competing titles. This forecasting method is more of an art than a science. Consequently, the results are typically not reproducible and have an inherent degree of error, which is often significant. Therefore, to avoid lost sales, most companies in this industry prefer to err on the side of having inventories that are higher than their sales projections.

The extent of over‐stocking (which is due to a lack of confidence in the forecasting method) at Blockbuster can be illustrated by the set of sample products listed in Table 5. Observe that stocking levels for these products are typically higher than first‐week demands. This is particularly alarming for rental products, as they can be rented multiple times in a week. Moreover, these discrepancies vary dramatically. For rental products, in one instance, the quantity purchased for consumer rentals is as high as three times what was rented; and for retail products, in one instance, the quantity purchased for sales to customers is 26 times what was sold in the first week.

Table 5
Sample of Rental and Retail Products

Format Title Quantity purchased First week sales

Rental: DVD Deja Vu 766,700 987,323

Rental: DVD The Fountain 37,125 65,397

Rental: DVD Dora: Shy Rainbow 6200 4363

Rental: Game (X360) Spiderman 3 33,000 26,621

Rental: Game (WII) Mario Party 8 5400 4672

Rental: Game (PS3) MLB 07 4200 1386

Retail: DVD Deja Vu 58,590 16,183

Retail: DVD The Fountain 4200 2260

Retail: DVD Dora: Shy Rainbow 3930 149

Retail: Game (X360) Spiderman 3 1572 412

Retail: Game (WII) Mario Party 8 600 210

Retail: Game (PS3) MLB 07 600 64

Format	Title	Quantity purchased	First week sales
Rental: DVD	Deja Vu	766,700	987,323
Rental: DVD	The Fountain	37,125	65,397
Rental: DVD	Dora: Shy Rainbow	6200	4363
Rental: Game (X360)	Spiderman 3	33,000	26,621
Rental: Game (WII)	Mario Party 8	5400	4672
Rental: Game (PS3)	MLB 07	4200	1386
Retail: DVD	Deja Vu	58,590	16,183
Retail: DVD	The Fountain	4200	2260
Retail: DVD	Dora: Shy Rainbow	3930	149
Retail: Game (X360)	Spiderman 3	1572	412
Retail: Game (WII)	Mario Party 8	600	210
Retail: Game (PS3)	MLB 07	600	64

The cost of over‐stocking can be roughly estimated as follows. For retail products, the average product cost is about $15. If we assume that there are 20 new titles per week for 52 weeks in a year, and that the average size of over‐stocking is 250 pieces per title (conservatively assuming only 1000 pieces purchased and 25% over‐stocking), then product costs for over‐stocking would exceed $4 million annually. For rental products, we can extrapolate from the six titles in Table 5. If we assume that the ideal scenario is for all stores to have sufficient products to meet 100% of first week sales, and that there are 1.3 rents per week per copy (which comes from historical data), then, against this ideal scenario, the company would end up with an average excess of 3108 pieces per title. Now, with about 35 new titles per week for 52 weeks, and still assuming a product cost of $15 per piece, this equates to about $84 million annually.

These cost estimates indicate that there exists significant opportunities for cost reduction in Blockbuster's retail operations (a similar estimate can be made to show significant revenue loss due to under‐stocking). Our method clearly has the potential to enhance revenue and reduce cost for Blockbuster. For example, after an initial order (which could be based on a combination of both our initial forecast and expert opinion at Blockbuster) is placed, our adjusted forecast, which takes into account the actual first‐period sales can be used to revise the ordering decision for subsequent periods. That is, our method can help the operations manager to quickly respond to any mismatch in the initial order, and thereby improve the net profit for the remainder of the life cycle of a product.

5. Conclusion

In this article, we have used the actual sales history of 352 different new product releases from Blockbuster to develop and test two versions of a sales model. Our model integrates: (i) multiple consumer sub‐populations, (ii) cross interactions between the sub‐populations, and (iii) dynamic updates, both within and across time periods, for the purchase rates of the consumers in the sub‐populations.

Using publicly available environmental data and actual first‐period sales, we demonstrated that the model is capable of delivering reliable sales forecasts for DVD movie rental. We also discussed the potential for the model to make a significant financial impact on Blockbuster's retail operations.

The key feature in our sales model, namely the demand decomposition exhibited in Figure 3, is by no means particular to just new product releases at Blockbuster. This modeling approach, which we believe is a useful contribution of this article, can be adapted to suit other short‐ (and possibly long‐) life‐cycle products. The formulation can also be integrated into other models for operational decisions (see, e.g., Bassamboo et al. 2009, Debo et al. 2006, Ho et al. 2002, Kumar and Swaminathan 2003). We are currently pursuing further research in such models in the context of online movie rental operations.

Our work suggests two possible managerial implications. The sales decomposition suggests that managers could try to “conceptually” manipulate the relative sizes of different market segments. For example, even though explicit identifications of the segments are not possible, in‐store or online advertising could be used in an attempt to boost the size of the committed‐buyer population. The sales trajectory of a product can be expected to have lower variability if the size of this population is relatively larger.

Our sales model has been shown to be accurate (see Figure 6). The performance of sales forecasts based on our model, however, will further depend on the availability of quality input data. This suggests that Blockbuster could narrow down its forecasting effort to that of building reliable methods for predicting model parameters. In this article, the sales forecasts (see section 4.3) are based on environmental information. In general, if independent forecasts from in‐house experts and/or preorder or other relevant data are available, they could be built into the parameter‐prediction models to produce improved forecasts.

Finally, we note that we did not consider the effects of price and advertising in our formulation. This is due to the fact that product offerings at Blockbuster typically have a fixed price, and that Blockbuster relies primarily on studio advertising (which can be taken as a form of environmental information). These effects could have significant impacts on the sales trajectory of products in other arenas, and are therefore worthy of further study.

References

Ainslie

Dreze

Zufryden

. 2005. Modeling movie life cycles and market share. Mark. Sci. 24: 508–517.

Bass

F. M.

1969. A new product growth model for consumer durables. Manage. Sci. 15: 215–227.

Bassamboo

Kumar

Randhawa

R. S.

. 2009. Dynamics of new product introduction in closed rental systems. Oper. Res. 57: 1347–1359.

Blockbuster Incorporated . 2006. Annual Report, Form 10‐k. Available at http://www.blockbuster.com (accessed date October 6, 2007).

Blockbuster Incorporated . 2009. Annual Report, Form 10‐k. Available at http://www.annualreports.com/company/2838 (accessed date August 7, 2010).

Conlisk

1976. Interactive Markov chains. J. Math. Soc. 4: 157–185.

Debo

L. G.

Toktay

L. B.

Van Wassenhove

L. N.

. 2006. Joint life‐cycle dynamics of new and remanufactured products. Prod. Oper. Manag. 15: 498–513.

Digital Entertainment Group . 2007. DEG Year End 2006. Home Entertainment Sales Update. Available at http://www.dvdinformation.com/News/press/CES010807.htm (accessed date October 6, 2007).

Eliashberg

Jonker

J.‐J.

Sawhney

Weirenga

. 2000. MOVIEMOD: An implementable decision support system for pre‐release market evaluation of motion pictures. Mark. Sci. 19: 226–243.

10.

Fader

P. S.

Hardie

B. G. S.

Huang

C. Y.

. 2004. A dynamic changepoint model for new product sales forecasting. Mark. Sci. 23: 50–65.

11.

GameInformer.com. 2007. Reviews 2007. Available at http://www.gameinformer.com (accessed date December 26, 2007).

12.

Garber

Goldenberg

Barak

Muller

. 2004. From density to destiny: Using spatial dimension of sales data for early prediction of new product success. Mark. Sci. 23: 419–428.

13.

Hahn

Park

Krishnamurthi

Zoltners

A. A.

. 1994. Analysis of new product diffusion using a four‐segment trial‐repeat model. Mark. Sci. 13: 224–247.

14.

Hardie

B. G. S.

Fader

P. S.

Wisniewski

. 1998. An empirical comparison of new product trial forecasting models. J. Forecast. 17: 209–229.

15.

T.‐H.

Savin

Terwiesch

. 2002. Managing demand and sales dynamics in new product diffusion under supply constraints. Manage. Sci. 48: 187–206.

16.

Hui

S. K.

Eliashberg

George

E. I.

. 2008. Modeling DVD preorder and sales: An optimal stopping approach. Mark. Sci. 27: 1097–1110.

17.

IMDBPro.com.

2007. Available at http://www.imdbpro.com (accessed date December 12, 2007).

18.

Jedidi

Krider

R. E.

Weinberg

C. B.

. 1998. Clustering at the movies. Mark. Lett. 9: 393–405.

19.

Kumar

Swaminathan

J. M.

. 2003. Diffusion of innovations under supply constraints. Oper. Res. 51: 866–879.

20.

Lee

Boatwright

Kamakura

W. A.

. 2003. A Bayesian model for prelaunch sales forecasting of recorded music. Manage. Sci. 49: 179–196.

21.

Lilien

G. L.

Rao

Kalish

. 1981. Bayesian estimation and control of detailing effort in a repeat purchase diffusion environment. Manage. Sci. 27: 493–506.

22.

Mahajan

Muller

Kerin

R. A.

. 1984. Introduction strategy for new products with positive and negative word‐of‐mouth. Manage. Sci. 5: 1389–1404.

23.

Mahajan

Muller

Srivastava

R. K.

. 1990. Determination of adopter categories by using innovation diffusion models. J. Mark. Res. 27: 37–50.

24.

Mahajan

Muller

Wind

. 2000. New‐product diffusion models: From theory to practice. Mahajan

Muller

Wind

, eds. New‐Product Diffusion Models. Kluwer Academic Publishers, Norwell, MA, 3–24.

25.

Moe

W. W.

Fader

P. S.

. 2001. Modeling hedonic portfolio products: A joint segmentation analysis of music compact disc sales. J. Mark. Res. 38: 376–385.

26.

Moe

W. W.

Fader

P. S.

. 2002. Using advanced purchase orders to forecast new product sales. Mark. Sci. 21: 347–364.

27.

Niu

S.‐C.

2002. A stochastic formulation of the Bass model of new‐product diffusion. Math. Prob. Eng. 8: 249–263.

28.

Niu

S.‐C.

2006. A piecewise‐diffusion model of new‐product demands. Oper. Res. 54: 678–695.

29.

PCVSConsole.com. 2007. Platform Comparison

2007. Market Analyst Sales Predictions. Available at http://forum.pcvsconsole.com/viewthread.php?tid=16280 (accessed date November 15, 2007).

30.

Rogers

E. M.

2003. Diffusion of Innovations, 5th edn. Free Press New York.

31.

Ross

S. M.

1996. Stochastic Processes, 2nd edn. John Wiley, New York, NY.

32.

Sawhney

M. S.

Eliashberg

. 1996. A parsimonious model for forecasting gross box‐office revenues of motion pictures. Mark. Sci. 15: 113–131.

33.

Schmidt

G. M.

Druehl

C. T.

. 2005. Changes in product attributes and costs as drivers of new product diffusion and substitution. Prod. Oper. Manag. 14: 272–285.

34.

Steffens

P. R.

Murthy

D. N. P.

. 1992. A mathematical model for new product diffusion: The influence of innovators and imitators. Math. Comput. Model. 16: 11‐26.

35.

Tanny

S. M.

Derzko

N. A.

. 1988. Innovators and imitators in innovation diffusion modeling. J. Forecast. 7: 225–234.

36.

Van den Bulte

Joshi

Y. V.

. 2007. New product diffusion with influentials and imitators. Mark. Sci. 26: 400–421.

Product title	Parameters
Product title	$π_{c}$	$π_{p}$	$α_{c}$	α	β	ν	η	δ
DVD rental	0.001404	0.003591	0.567677	0.007821	24.0521	0.135438	–	817.442
Volver	(0.000082)	(0.000066)	(0.039625)	(0.000640)	(1.45651)	(0.017930)		(156.689)
DVD rental	0.000086	0.000443	0.543802	0.003558	133.655	0.881185	–	148.304
Bottom Feeders	(0.000008)	(0.000022)	(0.058730)	(0.000716)	(17.8914)	(0.038518)		(30.7368)
DVD rental	0.020911	0.022135	0.375590	0.017086	2.47180	0.041747	–	13,432.4
The Good Shepherd	(0.003433)	(0.002582)	(0.066114)	(0.004001)	(0.629676)	(0.041431)		(2473.22)
DVD retail	0.000305	0.000337	1.78100	0.140463	0.000035	0.109329	–	180.071
Apocalypto	(0.000109)	(0.000140)	(0.545588)	(0.261581)	(614.155)	(0.059829)		(88.8102)
DVD retail	0.000040	0.000165	28.0417	0.372491	0.000000	0.000000	–	234.842
The Queen	(0.000005)	(0.000018)	(1.33755)	(0.051576)	(0.000000)	(0.000000)		(64.3186)
DVD retail	0.000003	0.000011	0.579185	0.003359	9053.56	0.000001	–	4.99154
Dora: Shy Rainbow	(0.000003)	(0.000007)	(4.66660)	(0.111529)	(3884.29)	(0.000001)		(2.52671)
Game retail: PS2	0.000008	0.000024	1.71062	0.106908	3509.02	0.911623	–	0.000001
Spiderman 3	(0.000011)	(0.000010)	(0.587182)	(0.383103)	(13,362.2)	(0.056219)		(0.000001)
Game retail: X360	0.000032	0.000082	38.1106	0.217663	2684.74	0.000011	–	41.9622
Guitar Hero II Bundle	(0.000183)	(0.000226)	(1.76614)	(4.00499)	(64,205.2)	(0.244179)		(54.5710)
Game retail: NDS	0.000032	0.000292	2.21656	0.001413	230.322	1.27663	–	7.29972
Pokemon Pearl Version	(0.000005)	(0.000012)	(0.315157)	(0.000706)	(23.6890)	(0.056990)		(2.85709)
Game rental: PS2	0.006902	0.002353	0.091251	0.007782	72.4746	–	0.000000	596.514
Spiderman 3	(0.000322)	(0.000366)	(0.009157)	(0.001407)	(6.64502)		(0.000000)	(171.568)
Game rental: WII	0.003962	0.004121	0.255803	0.013392	2.53456	–	0.190903	388.494
The Bigs	(0.001858)	(0.012460)	(0.110266)	(0.060486)	(14.0837)		(0.109473)	(89.6458)
Game rental: PS2	0.000104	0.000269	0.488631	0.005530	395.316	–	1.44858	37.0466
Brave—Spirit Dancer	(0.000009)	(0.000008)	(0.072266)	(0.002821)	(42.2423)		(0.025943)	(10.7086)

Rental DVD title	Initial forecast statistics			Adjusted forecast statistics
Rental DVD title	MAD	MAPD (%)	MSE ( $10^{6})$	MAD	MAPD(%)	MSE ( $10^{6}$ )
Premonition	58,416	28.39	5673.2	22,207	16.75	675.23
Factory Girl	41,927	111.43	2403.4	13,439	35.72	312.95
The Number 23	32,102	21.87	1419.0	46,677	31.79	3520.9
Zodiac	41,229	35.32	6783.6	32,479	27.82	2149.1
The Hills Have Eyes 2	21,700	29.56	1304.5	14,386	19.60	289.10
Slow Burn	24,128	44.99	1429.9	11,662	21.74	328.59
Perfume: Story of a Murder	67,590	161.68	22,306.4	15,111	36.15	310.81
Nomad	42,780	129.30	9093.6	15,857	47.93	414.05
The Host	31,113	222.22	2462.0	2607	18.62	12.806
Tabunfire: Dynamite Warrior	3511	98.05	20.621	1434	40.05	3.0479
Avenue Montaigne	6382	225.81	89.947	412	14.57	0.3327

A Sales Forecast Model for Short‐Life‐Cycle Products: New Releases at Blockbuster

Abstract

Keywords

1. Introduction

3. The Model

3.1.1. Formulation of the Means

Table 1 Game Platform Sizes Platform Console units ( 10 3 ) XBox 360 17,200 PS3 7400 Wii 5600 PS2 35,000 GCube 9000 NDS 5000 PSP 5000

3.1.3. Model Summary

3.2. The Re‐Rent Model

4. An Empirical Study

4.1. Fitting Parameters

References

Table 1
Game Platform Sizes

Platform Console units ( $10^{3}$ )

XBox 360 17,200

PS3 7400

Wii 5600

PS2 35,000

GCube 9000

NDS 5000

PSP 5000