Abstract
The increasing pervasiveness of social networks allows users to share purchase behaviors with their online friends. In this study, we examine optimal pricing strategies of a monopolistic firm using an analytical model that accounts for behavioral observational learning in social networks. We show that a seller could potentially control the information available to future customers and induce behavioral observational learning, using an information‐revealing pricing strategy. This result suggests that offering introductory discounts is not always an effective method to boost purchases in social networks. It could prevent the behavioral observational learning that would increase future customers' willingness to pay.
Introduction
And therein lay the secret to all fads: the herd instinct. People wanted to look like everybody else. That was why they bought white bucks and pedal pushers and bikinis. Willis, C. (1996) p. 33
The rapid growth of social networks has been changing the way consumers interact with businesses. In the canonical models of observational learning (Banerjee 1992, Bikhchandani et al. 1992), people make their decisions on whether to dine on the basis of how many consumers are already in a restaurant. The essence of observational learning is that an individual's decision is affected by the observation of others' choices because of its informational content. Social networking sites are making people's behaviors more observable to their friends. Socially shared purchases have become a mainstream activity of consumers and one of the top drivers for online sales. For example, Amazon encourages consumers to socially share their purchases across Facebook, Twitter, and e‐mail. Recently, the increasing pervasiveness of location‐acquisition technologies has allowed users to “check in” at physical venues and share the locations with their online friends. Consumers can use location‐based services (e.g., Foursquare, Facebook Place, or Google+) to navigate and engage with venues by receiving relevant coupons or ads based upon the location. Introducing socially shared purchases is a relatively new feature of e‐commerce that can drive both significant referral visits and conversion uplift. The use of social sharing content to show friend purchase activity leads to significant observational learning on Facebook and other social networking websites.
The widespread adoption of social networks adds an important dimension to prior observational learning literature. As a result of the new marketing technology of social networks, a striking difference has arisen: In the previous observational learning story (e.g., Banerjee 1992), people can observe all of the choices made by people before them, including many anonymous consumers. This is not an appropriate assumption in social networks. In the current practice, people are connected by a social network, and they observe only their friends' choices. Acemoglu et al. (2011) explored an observational learning model over a general social network. However, a seller's pricing strategies are absent from the model. On the other hand, a large stream of literature has been devoted to investigating sellers' pricing strategies in the absence of observational learning. Kumar and Sethi (2009) examined an optimal control model of dynamic pricing for web content providers. Gupta et al. (2011) studied investment incentives for network infrastructure owners under different pricing strategies. August and Niculescu (2013) examined how a software firm optimally determines its pricing in a setting where it can harness customer error reporting. Sellers' other strategies, such as the introduction of product upgrades (Ji et al. 2011, Mehra et al. 2014) and introduction of agency model for digital goods (Tan and Carrillo 2017, Tan et al. 2016), have also been considered in the literature.
In this study, we extend the literature by examining the interaction of pricing strategies and observational learning in social networks. We propose a behavioral inference rule that is cognitively simple for a consumer's complex decision making in social networks and relax a key assumption that has been implicitly accepted in prior observational learning literature: A consumer can observe the choices of all others who have made their choices ahead of her.
When designing its optimal pricing policy, a monopolistic firm could potentially control the process of observational learning using different pricing strategies. In our context, a firm faces the following two trade‐offs of using different pricing strategies: (i) In standard static monopoly pricing, a monopolistic firm faces a downward sloping demand curve. Setting a higher price implies a higher revenue per unit (positives associated with an increase in price), but on the other hand, the quantity demanded decreases with the price (negatives associated with an increase in price). This is the conventional static trade‐off discussed in the pricing literature. (ii) In our study, we focus on the second tradeoff associated with positives and negatives of pricing strategies in a dynamic setting. If the price is too high, then very few early consumers will adopt the product, and the monopolistic firm will not be able to use observational learning to boost its sales in the second period. On the other hand, if the price is too low, the effect of observational learning is also limited because late consumers would know that their friends purchased the product due to the low price instead of its high quality.
Therefore, if the price is set sufficiently high (information revealing) so that prior friends receiving different private signals make different purchase decisions, a subsequent consumer is able to infer the product quality through the decisions of her friends. Conditional on an information‐revealing pricing strategy, the subsequent consumer is more likely to infer that the product quality is high when the number of her friends who adopt the product is larger (Lemma 2 in section 4.3). If the firm charges such a low price (information pooling) in the first period that almost all consumers adopt the product, this price would not reveal any quality information of the product to future consumers. In this case, observational learning will play no role in increasing the future consumers' willingness to pay.
In this study, we develop a two‐period model and compare the optimal pricing policy under rational observational learning with that under behavioral observational learning. An important feature of our model is that the intensity of observational learning is endogenously determined by different pricing strategies. We find that the information‐revealing pricing strategy is optimal only in extreme market conditions (i.e., the ex ante uncertainty of product quality is very high or consumers have very precise private information on product quality) under rational learning. However, under behavioral learning, the use of information‐revealing pricing strategy is optimal in a wide range of market conditions. This result differs from that in the seeding literature (e.g., Dou et al. 2013, Ho et al. 2012). Introductory discounts could prevent observational learning, and hence are not always an effective method to boost purchases.
Literature Review
There are two fundamental supporting pillars in our research: (i) optimal pricing of experience goods, and (ii) social learning. Therefore, our study is closely related to these two streams of literature.
The Literature on Optimal Pricing of Experience Goods
A central topic of our study is the optimal pricing strategies. In a seminal paper, Shapiro (1983) examined a monopolist's optimal pricing strategies for non‐durable experience goods. Villas‐Boas (2006) extended the model to a duopoly setting with dynamic competition. Dudine et al. (2006) investigated dynamic monopoly pricing of storable goods in an environment where demand changes over time. In a continuous‐time model, Bergemann and Valimaki (2006) analyzed the optimal price path for a monopolist that sells an experience good over time to a population of heterogeneous buyers with independent private valuations. They found that the optimal price trajectory critically depends on whether the market is a niche or mass market. Jing (2011a) explored the effectiveness of two common pricing strategies—behavior‐based price discrimination and price commitment in a two‐period model of non‐durable experience goods. Jing (2011b,c) examined the impact of social learning on dynamic pricing of durable goods in a two‐period model. Fainmesser and Galeotti (2016) developed a model in which a monopoly sells a network good and price discriminates based on information about consumers' influence and consumers' susceptibility to influence.
The focus of our study is also the optimal pricing strategy of a monopolistic firm. However, our study differs from this stream of literature in two aspects. First, the prior literature has largely focused on consumer self‐learning through consumption in a setting of non‐durable goods, where the product is repeatedly purchased and learning occurs during consumption (e.g., Bergemann and Valimaki 2006, Jing 2011a, Shapiro 1983, Villas‐Boas 2006). In contrast, we focus on consumers' observational learning from friends' decisions. It is worth noting that Jing (2011b,c) and Fainmesser and Galeotti (2016) also considered the impact of other consumers' purchase decisions. However, our underlying learning mechanism is quite different from theirs. In Jing (2011b,c), the underlying learning mechanism behind consumer interaction is not modeled explicitly, and the speed of consumer social learning is controlled by an exogenous parameter of learning intensity. The underlying mechanism in Fainmesser and Galeotti (2016) is network effects (payoff externalities): A consumer's payoff depends positively on the number of other people who consume the product. In contrast, our learning mechanism is explicitly modeled in the framework of observational learning (based on information externalities 1 ): Consumers infer product quality through their friends' decisions. More importantly, the intensity of observational learning in our model is endogenously controlled by the firm's pricing strategies (information‐pooling strategy vs. information‐revealing strategy).
Second, our paper studies the interaction between observational learning and optimal pricing in a unique social network context, which has been overlooked in the previous optimal pricing literature. We propose a behavioral inference rule that is cognitively simple for consumers' complex decision making in social networks, and demonstrate that the optimal pricing strategy under behavioral observational learning is different from that under conventional rational observational learning.
The Literature on Social Learning and Diffusion
In the prior literature, there are two important forms of social learning: word‐of‐mouth communication (Chevalier and Mayzlin 2006, Samiei and Tripathi 2014) and observational learning (Banerjee 1992). In general, consumers' purchase decisions can be influenced by friends' opinions (word of mouth), and/or friends' purchase behaviors (observational learning). Chen et al. (2011) differentiated between word of mouth and observational learning using a natural experiment on Amazon. In our study, we focus on the second underlying mechanism: consumers' knowledge of the product is only through observing the actions of their friends, that is, observational learning.
A handful of empirical studies have explored observational learning in different contexts. Duan et al. (2009) empirically examined herd behavior and informational cascades in the context of online software adoption. Zhang and Liu (2012) studied observational learning in microloan markets. Qiu et al. (2014a) estimated a structural model of restaurant discovery and observational learning using data from a major location‐based social networking website in China. Our present work differs from these prior studies in two aspects. First, most of the previous studies focused exclusively on the consumer side and examine whether and how a consumer is influenced by observational learning. In this study, we explore how a monopolistic seller can induce observational learning, using pricing strategies to maximize its profit. Second, we relax the assumption of rational observational learning, in which a consumer can observe all prior consumers' decisions, and propose a more realistic behavioral inference rule in a context of social networks. We also compare the optimal pricing policy under our behavioral inference rule with that under rational learning. Our analytic model provides a framework for further empirical work on observational learning and optimal pricing in social networks.
In the full‐rationality theory of observational learning (Banerjee 1992, Bikhchandani et al. 1992), besides realizing that the previous movers' actions reflect these movers' own signals, each consumer also takes into account that these predecessors themselves also infer from still earlier actions. Such a high rationality requirement seems unrealistic or extreme in certain circumstances (Eyster and Rabin 2010), especially when a consumer does not have complete information on the structure of social networks. An extensive literature shows that people have limited cognitive abilities to process information and provides evidence that consumers use decision rules that are cognitively simple (Kahneman 2003, Lacetera et al. 2012). The question of how decision‐makers' cognitive constraints limit their ability to process available information has received increasing interest in the behavioral operations management literature (Bendoly et al. 2010). Recently, behavioral issues in supply chain pricing have also been discussed (Katok et al. 2014).
It is also instructive to contrast our model with the literature on diffusion (Jiang and Jain 2012). Ho et al. (2012) further extended the two‐segment diffusion model and examine the role of introductory discounts in boosting product purchases. In this study, we do not assume that social contagion is at work in every instance and try to open the black box of the diffusion mechanism. A consumer makes an inference about product quality by observing other people's choices. The adoption behaviors of her friends are informative signals that reflect the product quality, but do not directly enter her utility function.
Comparison of Different Learning Mechanisms
A Motivating Example
We begin our analysis by describing an illustrative example to show the difference between behavioral observation learning and rational observational learning. In Appendix S3, we conduct a laboratory experiment and empirically show the use of observational learning mechanism in reality. A basic (implicit) assumption of our study is that consumers use observational learning to make their purchase decisions. Although it is a common assumption in the prior observational learning literature, in social network settings, it is not clear whether consumers will actually use observational learning in their process of decision making. In other words, a consumer can simply talk to his/her friends who have adopted the product and ask them if they found the quality to be high or low. This is particularly plausible in the context of social networks. Therefore, in our experiment, we want to investigate under which conditions consumers are more likely to employ the observational learning mechanism instead of asking their friends directly.
More specifically, in our experiment, we are interested in two factors: (i) the price of a product, and (ii) the strength of social ties between consumers. Our experimental results showed that consumers are more likely to employ the observational learning mechanism instead of talking to their friends directly when the price is lower (the benefit of asking friends' opinions is smaller) and when the strength of social tie is weaker (the cost of asking friends/acquaintances' opinions is higher). In reality, the observational learning mechanism is more likely to be used when your ordinary Facebook friends (your acquaintances) share their purchases of small ticket items. In particular, Gee et al. (2016) found that weak ties (acquaintances) comprise a majority of a person's social network, so the use of the observational learning mechanism is likely to be common in practice.
In our motivating example, we consider consumers decide whether to buy a tablet accessory, such as a case with keyborad or an iPad mount. We choose this specific product category because our analytical model focuses more on the vertical quality dimension in which consumers share very similar opinions than the horizontal differentiation (tastes). 2 Moreover, in our laboratory experiment (see Appendix S3), we find that consumers are more likely to employ observational learning mechanism when the product is a small ticket item, such as tablet accessory.
The quality of the product is unknown to all consumers and is represented by a binary random variable: V ∈ {V
H
, V
L
}, and V
H
> V
L
> 0. We assume a common prior on the probability that the product is of high quality:
There are 100 consumers in this town, and they are connected by a social network. Using the social network service, consumers can see if any of their friends have recently bought the product. Consumers make sequential decisions: 50 consumers decide whether to buy the product in week 1, 50 consumers decide whether to buy the product in week 2. Before a consumer makes her decision, she receives a binary private signal independently and identically drawn from a Bernoulli distribution. The signal can be either S
H
or S
L
, and without loss of generality, we further assume that the signals are informative with the following values:
We focus on a simple social network illustrated in Figure 1. Each node represents a consumer, and each link represents a symmetric social connection in the social network. Consumer i is our foacl consumer who makes her decision in week 2. As shown in Figure 1, she has eight friends, and one of her friends has adopted the product (A). The rest non‐adopters (NA) are either consumers in week 2 or consumers who did not adopt the product in week 1. We assume that all consumers are risk neutral, and a consumer purchases the product if and only if her expected utility from the product is greater than or equal to the price,

A Simple Social Network of Consumer i [Color figure can be viewed at
Rational Observational Learning
Under the classical framework of rational observational learning, consumers try to infer product quality by observing all prior consumers' decisions, including many anonymous consumers (e.g., Banerjee 1992, Bikhchandani et al. 1992). In our example, consumer i needs to know all week 1 consumers' decisions to implement rational observational learning proposed in the literature. Suppose the vector of all week 1 consumers' decision is d = {d
1, d
2
, ···, d
j
, ···, d
50}, where d
j
indicates whether consumer j adopts the product (adopt: A; not adopt: NA). Under rational learning, consumer i's expost posterior is as follows:
Cosumer i will adopt the product if the expected utility is greater than the price:
In essence, the classical rational observational learning requires consumers to have extraordinary capability to acquire and process information: they need to have complete knowledge of all previous consumers' decisions and extract useful information from these decisions. However, in reality, consumers have limited time and attention to acquire and process information, and the cognitive capacity constraint limits the amount of information that they can acquire and process (Kahneman 2003). Behavioral observational learning based on friends' decisions can be thought of as reflecting a type of bounded rationality in which consumers are simply unable to acquire and make inferences from all prior consumers' decisions. In other words, behavioral observational learning is a simple and resonable decision rule based on friends' decisions. 3
Behavioral Observational Learning
In this section, we propose a behavioral inference rule based on a focal consumer's friends' decisions. Now we consider consumer i's choice under our behavioral inference rule. When an early consumer in week 1 receives S
H
, her posterior belief is as follows:
Let θ be the fraction of consumers who adopt the product over all consumers. If the product quality is V
H
, then the fraction of consumers in week 1 who receive S
H
is 3/4 because
On the basis of the information, we assume that consumer i performs a hypothesis test between two point hypotheses:
If H 0 is not rejected, consumer i thinks the product quality is V H . If H 0 is rejected, consumer i believes the quality is V L . According to the Neyman–Pearson lemma (Casella and Berger 2002), the likelihood‐ratio test is uniformly most powerful (UMP) for testing simple hypotheses, so it is a test that has the highest power among all competitors. We use the following likelihood‐ratio test as consumers' decision rule:
Do not reject H
0, if Λ > C; Reject H
0, if Λ
where Λ is the ratio of the likelihood function, which will be specified later (a higher value of Λ means that the observed data is more likely to occur under the null hypothesis as compared with the alternative), and c is a constant, which is chosen to obtain a specified significance level, and 0 < c < 1. In our study, we choose a c to obtain a 5% significance level, which is very typical for hypothesis testing. Our results remain qualitatively similar if we choose a c to obtain a 1% or 10% significance level. The significance level reflects consumers' tolerance for type‐I error. Setting a 5% significance level in our decision rule means that the probability of rejecting the null hypothesis given that it is true is at or below 5%. Essentially, in the decision‐making process, consumers care about the number of friends who have purchased the product and the total number of friends. In the following paragraphs, we show that consumers adopt a cut‐off decision rule: If the number of friends who have purchased the product is greater than a threshold value, then they will decide to purchase the product. The threshold value is an increasing function of the total number of friends.
Our behavioral inference rule is intuitive, plausible, and tractable. First, the simple cut‐off decision rule is a cognitively simple and heuristic way for consumers to solve learning problems in a complex real‐world environment. The limited observability of consumers significantly complicate Bayesian updating and require extraordinary analytical and computational capabilities for a fully rational approach (Bala and Goyal 1998). It is highly impractical and cognitively demanding for an individual to observe all prior consumers' decisions and adopt Bayesian learning to do the required calculations (Kahneman 2003). Instead, we propose a fairly intuitive cut‐off decision rule, where consumers observe only their friends' decisions. The essence of our behavioral inference rule is consistent with the idea of procedural rationality proposed by Simon (1990): Procedural rationality is not an optimizing technique, but a heuristic method for arriving at satisfactory solutions with modest amounts of computation.
Second, the proposed behavioral inference rule based on a cut‐off strategy simply says that a consumer's decision depends on the relative popularity of a product. A growing stream of literature has discussed and proposed similar heuristic learning mechanisms to simplify Bayesian learning (Ellison and Fudenberg 1993, Eyster and Rabin 2010). For instance, Ellison and Fudenberg (1993) examined a non‐Bayesian social learning model in which people use a simple rule‐of‐thumb method based on the relative popularity of technologies to decide which technology they want to adopt. It is worth noting that an alternative naive learning mechanism is proposed by Golub and Jackson (2010). In their model, individuals average their estimates or beliefs with those of their friends. The key assumption is that individuals report their signals and opinions truthfully to their friends. This naive learning mechanism is not suitable for our context, because in observational learning, people observe the decisions of their friends instead of the beliefs of their friends. Although our cut‐off rule looks different from the rule proposed by Golub and Jackson (2010) on the surface, the spirit is similar: People who use the cut‐off rule actually average the decisions of their friends.
Additionally, the implication of our cut‐off decision rule is supported by empirical evidence in prior literature. Duan et al. (2009) empirically found that the relative popularity of products determines the timing and direction of observational learning. Finally, the optimal pricing problem in the presence of Bayesian observational learning is highly complex. Our behavioral inference rule makes the model analytically tractable.
We assume that consumer i receives S
L
, and the information set of consumer i is given by:
If the product quality is V
H
, then θ = θ
0 = 3/8. The likelihood of observing I in reality when the product quality is V
H
is given by:
Λ is the ratio of the likelihood function when consumer i receives S
L
and n
i
of her friends adopted the product:
The intuition for this decision rule is straightforward. The information that consumer i. observes is that she receives a private signal S
L
, and n
i
of her friends adopted the product. A higher value of Λ suggests that the observed data is more likely to occur under V = V
H
as compared with V = V
L
. Therefore, the decision rule for consumer i is a simple cut‐off strategy: When
Suppose that all other things being equal, but consumer i receives S
H
instead. Therefore, the information set of consumer i is I = {n
i
friends adopted the product and private signal is S
H
}. Following owing the same procedure, we can obtain the decision rule:
It means that the threshold value when consumer i receives S H is smaller than that when consumer i receives S L . In other words, when consumer i receives a high signal S H , she needs a smaller number of friends' adoption to justify that the product quality is high.
The aforementioned motivating example is oversimplified in an important way. The price charged by the firm is fixed. Some natural questions arise: What happens when prices are instead control variables? Could the seller manipulate the price to induce more purchases? In the next section, given that consumers adopt the behavioral inference rule, we study the optimal pricing strategy in the presence of observational learning in social networks.
Optimal Pricing under Different Learning Mechanisms in Social Networks
Model Setup
In this section, we study how a firm can use an information‐revealing pricing strategy to control the process of observational learning in social networks. Assume that a firm faces a unit measure of consumers, C = [0, 1], who are embedded in a social network. The degree of consumer i is the number of consumer i's friends, and each consumer has degree n. The equal degree assumption makes our model analytically tractable. This assumption is in line with the extant literature that has looked at similar modeling abstractions, such as Jackson (2008) and Qiu et al. (2016). Actually, in a large body of prior experimental studies on social networks, the equal degree assumption represents a balanced social network in reality and is a useful metaphor for many market environments (Carpenter et al. 2012, Charness et al. 2007, Qiu et al. 2014b, and Rosenkranz and Weitzel 2012). However, we do believe that relaxing equal degree assumption is important because it is difficult to envision that equal degree can fully mirror the circumstances of the environment of interest. We conduct additional numerical analyses to relax this assumption and introduce degree heterogeneity in Appendix S5.
Each consumer decides whether to adopt a product. The product is an experience good. It is common knowledge that the quality of the product is represented by a binary random variable: V ∈ {V H , V L }, and the true value of V is initially unknown to the buyers and the seller. In other words, we assume there is no asymmetric information between the buyers and the seller. This is a standard assumption in the literature on optimal pricing in the presence of learning (Bergemann and Valimaki 2006, Jing 2011a, Welch 1992). Another reason for adopting this assumption is to maintain the focus of observational learning. If the seller has some private information about the product quality, she might be able to signal this quality information through prices in a signaling game (Campbell 2015). As we focus on consumers' observational learning, we improve tractability by assuming that the seller has no private information. This assumption also captures the market information structure for experience products in reality. Li and Wang (2014) demonstrated that for experienced goods, such as music, video games, movies, and books, the firm and consumers are equally informed after the product launch because of open access to online product reviews.
We assume that all consumers are risk neutral, and the payoff function is V − P, where P is the price set by the firm. The pricing strategies will be specified later. All consumers share a common prior belief about the product quality:
It is worth noting that the binary signal structure is widely used in the observational learning literature in different fields, such as economics (Bikhchandani et al. 1992, Guarino et al. 2011), finance (Welch 1992), marketing (Iyer et al. 2007, Zhang et al. 2015), and information systems (Li and Wang 2014). One of the major advantages of a binary private signal structure is the analytical tractability. Smith and Sorensen (2000) showed that the binary signal structure can be extended to a signal with many possible realizations, but at significant algebraic cost. By assuming that the private information is either “high” or “low,” we are able to completely characterize the optimal pricing policy and focus on two pricing strategies that will be highlighted in our study: information‐revealing and pooling pricing strategies. In reality, the binary signal structure can be a simplified but useful approximation of the actual environment. A large stream of experimental literature has adopted the binary signal assumption (Alevy et al. 2007, Cipriani and Guarino 2005, Goeree et al. 2007). In real‐world situations, private quality information could come from a variety of sources. For instance, in digital music markets, a binary private signal can be interpreted as a consumer's perceived quality (positive or negative) when she observes the song title, the album title, the artist name, the album artwork, and any other visible information. A consumer can also form her own private opinions (positive or negative) on quality by reading online product reviews (such as Yelp.com) or third‐party professional reports (Chen and Xie 2005, Zhang et al. 2015).
We consider a dynamic model with two time periods. Assume that the firm is a monopoly and the marginal cost is a constant, which can be normalized to 0 without loss of generality. The monopolistic firm is risk neutral and maximizes the sum of profits in the two periods. To ease exposition, we ignore discounting in our model like Jing (2011b,c). At the beginning of period 1, the firm announces its pricing strategy,
The firm faces two groups of consumers: an initial group of early consumers and a group of late consumers. In period 1, fraction λ of all consumers, C 1 = [0, λ], arrive and decide whether to adopt the product solely on the basis of the private signals they receive. We label them as early consumers. In period 1, consumer i's net payoff from adopting the product is V − P 1. Early consumers adopt the product if the expected valuation is greater than the first period price. Following Banerjee (1992), we assume that early consumers are myopic and do not consider the strategic behavior of delaying the decision‐making process to obtain more information. In other words, the two groups of consumers act in an exogenously given order (Bikhchandani et al. 1992).
In period 2, fraction 1 − λ of all consumers, C 2 = [λ, 1], make inferences about the quality and decide whether to adopt the product. We call these later decision makers late consumers. We consider two scenarios where late consumers follow the rational inference rule or the behavioral inference rule. As is standard in the observational learning literature (Bikhchandani et al. 1992, Smith and Sorensen 2000, Zhang et al. 2015), we assume a flat common prior α = 1/2 and the fraction λ = 1/2 without loss of generality. The analytical insights remain the same when λ ∈ (0, 1) and α ∈ (0, 1). In Appendix S2, we provide additional numerical analyses and show that our results are robust when we vary the value of exogenous parameters. Another reason we set α = 1/2 is that the role of observational learning is highlighted when the prior uncertainty is high. If α → 0 or 1, consumers are certain about the product quality and the impact of observational learning is insignificant. Setting the common prior, α, to be 1/2 is consistent with our assumption that the product is an experience good.
In our study, we assume that the proportion of early consumers in period 1, λ, is an exogenous parameter and is public information. First, this assumption is widely adopted in the prior economics and marketing literature for analytical tractability. For instance, Sun (2012) assumed a unit mass of early consumers enter the market in the first period, and another unit mass of late consumers enter the market in the second period. In particular, the seminal papers of observational learning (Banerjee 1992, Bikhchandani et al. 1992) have two working assumptions: (i) a sequence of consumers enter the market in an exogenous order (the sequence of decisions or the position of a consumer in the decision‐making queue is fixed and exogenous), which is equivalent to our assumption that the proportion of early consumers in period 1 is exogenous and fixed; and (ii) consumers have precise information about their position in the sequence of decisions (common knowledge), which is equivalent to our assumption that the proportion of early consumers in period 1 is public information shared by all consumers. These two working assumptions have been adopted by almost all observational learning models in diferent contexts in the past two decades (e.g., Acemoglu et al. 2011, Cao et al. 2011, Eyster and Rabin 2010, Guarino et al. 2011, Hendricks et al. 2012, Herrera and Hörner 2013, Moretti 2011, Moscarini et al.1998, Smith and Sorensen 2000, Zhang et al. 2015). Our assumption is consistent with the literature tradition and is an acceptable trade‐off for analytical tractability. It is worth noting that the exogenous λ (the proportion of early consumers in period 1) can be motivated by awareness effects. For example, in an observational learning model, Herrera and Hörner (2013) assumed that consumers randomly arrive and make decisions. Similarly, in our context, we can interpret the exogenous λ as some sort of awareness effects: a proportion of λ (randomly chosen) consumers becomes aware of the existence of the product and make their decisions in time period 1, and the rest are aware of the product in time period 2. The value of λ may reflect factors, such as firm‐side advertising, which are abstracted away in this model.
Second, our assumption is not only typical in consumer learning models, it is also realistic in many scenarios. In our model, when early consumers make their decisions, they do not need to know the proportion, λ. Only the firm and late consumers need to know it. In general, it is relatively easy for firms to obtain an estimate of λ by conducting consumer surveys (asking when consumers are likely to make their purchase decisions). However, consumers may also be able to have some rough ideas on λ from multiple information sources in different contexts. For instance, consumers may infer λ from survey results and reports in various popular consumer magazines, such as PC Magazine, PC World, and Consumer Reports, and a number of third‐party websites, such as CNET.com. Another example is an online market for music named Amie Street Music where users can listen to a sample of a song before they buy it at no monetary cost. They can also observe how many of other consumers have listened to the sample (potential consumers who are interested in the song) and the trend (Newberry 2016). These pieces of information can help consumers figure out λ. Similarly, in Amazon or Apple's App Store, the observable sales rank or download rank (over time) may help consumers infer the proportion of early consumers in period 1 (Chevalier and Goolsbee 2003, Garg and Telang 2013). 4
Optimal Pricing Strategy under Rational Learning
We start with the rational learning scenario in which a subsequent consumer can observe all prior consumers' decisions. The key feature of rational observational learning is that a late consumer first observes the adoption decisions of all early consumers; then, upon the actions of all early consumers and her private signal, she infers the product quality using Bayesian updating and makes her own adoption decision. In this baseline scenario, there are two types of learning processes: (1) Private learning, in which early consumers learn from their private signals; and (2) rational observational learning, in which late consumers learn from all early consumers' decisions as well as their private signals. In period 1, if an early consumer receives a high signal, S
H
, then her posterior belief becomes
The firm faces an optimal pricing problem: maximizing the expected profits by setting appropriate prices P 1 and P 2. The following lemma characterizes the possible locally optimal pricing strategies for the firm. All the proofs can be found in Appendix S1.
Under rational observational learning and credible price commitment, the possible locally optimal pricing strategies for the firm are Strategy 1
Here, we provide a proof sketch of the lemma. In period 1, an early consumer either receives a high signal or a low signal. The willingness to pay of a consumer receiving a high signal is
Notice that
If
Under rational learning and credible price commitment, (i) if Optimal Pricing Strategy under Rational Learning
A proof sketch is provided as follows. According to Lemma 1, we have three possible pricing strategies. We compare the firm's profits under three different pricing strategies and the result follows. The proposition implies that when the private signal is sufficiently noisy (
To provide some additional intuition, we conduct a numerical analysis and visualize Proposition 1 in Figure 2. For simplicity, we assume there are 10,000 consumers in both period 1 and period 2. The profits of pricing strategies 1, 2, and 3 under rational learning are represented by π 1, π 2, and π 3, respectively. We define the difference between the profits generated by the information‐revealing pricing strategy and the pooling pricing strategy as π 3 − max{π 1, π 2}. Figure 2 depicts the contour lines of the profit difference for different values of the precision of the private signal, q, and the ratio V H /V L . Note that the contour level‐0 line means the profit difference is 0. Therefore, the whole region can be divided by this contour line. When the profit difference is less than 0, the firm should use the pooling pricing strategy. When the profit difference is greater than 0, the firm should use the information‐revealing pricing strategy.

Profit Comparison of Different Pricing Strategies under Rational Learning [Color figure can be viewed at
As for result (ii) in Proposition 1, we increase the value of q and examine its impact on the parameter space of V
H
/V
L
in which the information‐revealing pricing strategy is optimal. When q = 0.6, the information‐revealing strategy is never optimal no matter what the value of V
H
/V
L
is; when q = 2/3, the information‐revealing strategy is preferred when
The implication of result (iii) is similar: When V H /V L = 2, the information‐revealing strategy is never optimal no matter what the value of q is; when V H /V L = 3, the information‐revealing strategy is preferred when q ∈ [0.8, 1]; and when V H /V L = 5; the information‐revealing strategy is preferred when q ∈ [0.7, 1]. Similarly, we can see that when V H /V L is larger, the information‐revealing pricing strategy is optimal in a wider range of the parameter space of q. In other words, the use of information‐revealing pricing strategy is optimal in a wider range of market conditions when V H /V L is larger.
Note that in Figure 2, our parameter choice,
Optimal Pricing Strategy under Behavioral Learning
Under behavioral observational learning, we relax the assumption of rational learning, in which a late consumer can observe all early consumers' decisions. In this section, a late consumer first observes the decisions of her friends; then, upon the actions of her friends and her private signals, they follow the behavioral inference rule proposed in section 3.3. For simplicity, we consider a regular social network, where every consumer has the same degree n (Jackson 2008). Following Galeotti and Goyal (2009), we assume that consumers believe that the friendship formation is a random sampling process. This assumption avoids complicated inference. Each consumer randomly draws n consumers as her friends from all consumers. In other words, each consumer chooses a n‐sized sample: she makes n draws, and each draw is independent from each other. Note that n measures the level of social interactions, and a higher n means a denser social network. Consumer i observes the number of her friends who adopted the product, n i .
As is discussed in the section of rational learning, the firm can use two types of pricing strategies depending on whether all early consumers adopt the product in period 1. If a pricing strategy is used such that all early consumers adopt the product in period 1, a late consumer will learn nothing from her friends' decisions, and the pricing strategy is information pooling. If a pricing strategy is used such that some early consumers adopt the product in period 1 but some do not, a late consumer will be able to infer the product quality from her friends' decisions, and the pricing strategy is information revealing.
Similar to the rational learning scenario, there are two types of learning processes under behavioral observational learning: (1) Private learning, in which early consumers learn from their private signals; and (2) behavioral observational learning, in which late consumers learn from early friends' decisions as well as their private signals. In the following lemma, we show that a late consumer's inference rule under behavioral observational learning is a simple cut‐off strategy when an information‐revealing pricing strategy is used. When a pooling pricing strategy is adopted, a late consumer will focus only on her private signal because no quality information is revealed in period 1.
Under behavioral learning, a late consumer's inference rule when the firm adopts an information‐revealing pricing strategy is as follows: A late consumer i who receives S
H
will infer that the product quality is V
H
if n
i
is greater than a threshold
We provide a proof sketch as follows. If the firm adopts a pooling pricing strategy, a late consumer learns nothing from friends' deicsions. If the firm adopts a revealing pricing strategy (
Following a similar argument in the proof sketch of Lemma 1, we can obtain the following result: Under behavioral learning and credible price commitment, the possible locally optimal pricing strategies for the firm are strategy 1,
A natural question arises: When does the use of information‐revealing pricing strategy increase the profits of the firm? We first provide a numerical example, presented in Figure 3, to offer some intuition. We set the number of friends of each consumer to n = 10. Note that n should be small relative to the number of consumers in period 1. That is the key point of relaxing the assumption in rational learning: people observe only their friends' decisions instead of all of the choices made by prior consumers. For the behavioral decision rule based on hypothesis testing, we choose a c to obtain a 5% significance level. Like the numerical analysis in rational learning, we assume there are 10,000 consumers in both period 1 and period 2. Similarly, the profits under pricing strategies 1, 2, and 3 are represented by π 1, π 2, and π 3, respectively. Figure 3 depicts the contour lines 5 of the profit difference π 3 − max{π 1, π 2} for different values of the precision of the private signal, q, and the ratio V H /V L . This numerical example shows that when q and V H /V L are large, the use of information‐revealing pricing strategy increases the profits of the firm. When q becomes larger, the private signal is more informative. When V H /V L is large, the uncertainty of product quality is high. Under both cases, it is more valuable to observe other consumers' choices. In other words, observational learning is more effective in increasing the consumers' willingness to pay. Thus, pricing strategy 3 dominates the pooling pricing strategy. More importantly, Figure 3 shows that under behavioral learning, the firm should adopt the information‐revealing pricing strategy in a wide range of market conditions.

Profit Comparison of Different Pricing Strategies under Behavioral Learning
We also examine the effect of network density, n, on the choice of optimal pricing strategies. In Figure 4, we depict the contour lines of the profit difference, π
3 − max{π
1, π
2}, when the network density varies (n = 10, 20, 30), and indicate the region where the information‐revealing or pooling pricing strategy is optimal. Recall that in reality, the network density n should be small relative to the total number of consumers. We find that the use of information‐revealing pricing strategy is optimal in a wider range of market conditions when the network is denser (given that n is relatively small). However, when n is large, an increase in n makes the information revealing strategy less optimal. In our current parameter setup, we find that the upper limit of the network density is

The Effect of Network Density on the Choice of Optimal Pricing Strategies under Behavioral Learning (n is Small) [Color figure can be viewed at

The Effect of Network Density on the Choice of Optimal Pricing Strategies under Behavioral Learning (n is Large) [Color figure can be viewed at
There are two underlying trade‐off forces that make
In particular, it is worth noting that in reality, the first underlying force usually dominates the second one because the degree n should be very small relative to the total number of consumers in the first period. For example, the number of potential consumers for a product could be in millions, but the social network degree for an ordinary consumer is just in hundreds. In Figure 4, n = 10, and the total number of consumers in the first period is 10,000. Therefore, the first force dominates, and we find that the use of information‐revealing pricing strategy is optimal in a wider range of market conditions when the network is denser (it holds as long as
In the following proposition, we generalize the numerical results and describe the optimal pricing strategies under different market conditions, followed by a discussion of the managerial implications. For notational simplicity, let Under behavioral learning and credible price commitment, the information‐revealing pricing strategy 3, {P
1 = μ
H
V
H
+ (1 − μ
H
)V
L
, P
2 = V
H
} is optimal for the firm if (i)
A proof sketch is provided as follows. We only need to compare the profits under the three different pricing strategies mentioned earlier. For pricing strategies 1 and 2 that do not induce observational learning, late consumers do not update the prior belief by observing the decisions of their friends. For pricing strategy 3 that induces observational learning, late consumers use the behavioral inference rule to make their decisions. From Lemma 2, we know how to compute
Proposition 2 characterizes under what market conditions information‐revealing or pooling pricing strategy is optimal in the presence of behavioral learning. There are two trade‐offs in choosing the pooling or information‐revealing pricing strategy: (1) The monopoly faces a downward sloping demand curve. We assume that the monopolist firm cannot make price discrimination to gain more profit. In other words, the firm can charge only one price. By raising P
1, the firm will lose some business in period 1. (2) The trade‐off associated with observational learning and information pooling. In social networks, consumers can observe the decisions of their friends. Hence, the firm can control the information available to late consumers by charging different prices. It is the crucial trade‐off on which our model focuses. If the firm charges
Optimal Pricing Strategy under Rational Learning Vs. Behavioral Learning
In this section, we compare optimal pricing strategy under rational learning with that under behavioral learning and obtain several important insights on what the corresponding optimal pricing strategies are. In Figures 2 and 3, we find that the information‐revealing pricing strategy is optimal only in extreme market conditions under rational learning. However, under behavioral learning, the use of information‐revealing pricing strategy is optimal in a wide range of market conditions. For instance, when q = 0.75, the information‐revealing pricing strategy is never optimal under rational learning no matter how large V H /V L is. However, under behavioral learning, the information‐revealing pricing strategy is optimal when V H /V L ≥ 1.6. In other words, behavioral learning makes the information‐revealing pricing strategy optimal for the seller with a much lower threshold value of V H /V L .
Why do the optimal pricing polices differ under rational learning and behavioral learning? The key underlying intuition is that under rational learning, if the firm adopts the information‐revealing pricing strategy, a late consumer in period 2 can observe the decisions of all consumers in period 1, and hence will discover the true product quality. However, in our context, perfect learning is not consistent with the profit maximization objective of the firm except when V H /V L is high. Recall that as is standard in the literature (Bergemann and Valimaki 2006, Jing 2011a, Welch 1992), there is no asymmetric information about product quality between the buyers and the seller in our model. In other words, the firm is not completely certain about the product quality. When V H /V L is high, the firm can charge a high price if consumers know that the product quality is V H . Therefore, perfect learning benefits the firm in the case that V H /V L is high. However, when V H /V L is moderate or low, the benefit of perfect learning is lower. On the other hand, the risk of perfect learning comes from the fact that it may reveal the true product quality if the product quality is V L . Therefore, we can obtain: (i) under rational learning, information pooling is optimal when V H /V L is moderate or low.
Under behavioral learning, the region in which information pooling is optimal is smaller than that under rational learning. In contrast with rational learning, behavioral learning is not perfect learning, but still better informs consumers about product quality. In other words, under behavioral learning, information‐revealing strategy will help consumers know the product quality more precisely, but it will not reveal the product quality perfectly. Therefore, the risk of learning (the low quality is revealed) is lower, and it makes the information‐revealing strategy optimal for the seller in a wider range of market conditions with a much lower threshold value of V H /V L . We can obtain: (ii) Under behavioral learning, information pooling is optimal only when V H /V L is low. Combined (i) with (ii), we know that the region in which information pooling is optimal under behavioral learning is a proper subset of the region in which information pooling is optimal under rational learning (shown in Figures 2 and 3).
Our model has implications for how a manager can develop more effective pricing strategies in social networks. If a firm insists the full rationality assumption in which a consumer can observe all of her predecessors' choices, the pooling pricing strategy will be adopted in most of the cases. Our analytic insights suggest that in a more realistic rationality assumption, the information‐revealing pricing strategy can increase the firm's profits, and hence should be more widely adopted. It is worth noting that the managerial implication of our model is different from that of the seeding literature. Introductory discounts and a free demonstration to a targeted group of consumers are widely used as methods to boost adoption. Jing (2011b) studied seller‐induced learning, such as sponsoring test use, offering product demonstration and training seminars, in a durable goods market. Ho et al. (2012) showed that a firm can amplify social contagion and accelerate product purchases by offering introductory discounts. However, in our model, a low price in period 1 implies a pooling pricing strategy, which results in no information revelation to the new consumers in networks. Offering introductory discounts is not always an effective method to boost purchases. It could be detrimental to social contagion and product adoption. Similarly, Dey et al. (2013) examined the effect of consumer learning on the design of free software trials and found that a free trial may not be optimal in many practical situations. In our model, consumers make inferences about the quality according to the actions of their friends. Introductory discounts actually prevent observational learning that could increase the new consumers' willingness to pay. Thus, our model suggests that introductory discounts might not be an optimal pricing strategy under the circumstances in which behavioral observational learning plays an important role.
Conclusions and Future Research Directions
In the present study, we explored the optimal pricing in the presence of behavioral observational learning. The monopolist controls the speed of observational learning, using different pricing strategies. Our study offers some important managerial implications. We find that local merchants can benefit from the informative pricing strategy that results in more observational learning in social networks. Surprisingly, introductory discounts could prevent observational learning and reduce the monopoly profits under some circumstances.
Several additional steps can be taken to examine observational learning and optimal pricing in social networks. First, the firm in our model is assumed to be a monopoly. A future research direction would be to examine the competition effects, using the standard Hotelling model (Cheng et al. 2011). Second, following the literature (Niculescu and Wu 2014), we retained the credible price commitment framework. It would be interesting to relax this assumption in our future research. Third, following the prior literature on observational learning (Acemoglu et al. 2011, Banerjee 1992, Bikhchandani et al. 1992, Smith and Sorensen 2000), we assumed that all consumers share the same opinion on vertical quality in our model. However, we do realize that horizontal taste difference (consumers have different tastes) is a very important dimension of consumer preference. Chen and Xie (2005) differentiate between two types of consumers: “taste‐driven” and “quality‐driven” consumers. Essentially, we assume that the consumers in our model are quality‐driven consumers. A future research direction of our model is to allow products to differ in two dimensions: quality (vertical) dimension and taste (horizontal) dimension. Fourth, our analytical model focused on the channel of observational learning; however, in reality, consumers might contact their friends who adopted the product to figure out how well they liked it (word of mouth). It would be natural to examine different channels of social contagion, such as observational learning and word of mouth, in a unified analytical model. Finally, we did not allow price discrimination in the present model. We could relax this assumption and consider the use of price discrimination on the basis of social network structures, such as targeting consumers having high centrality measures.
Footnotes
Acknowledgments
The authors thank the department editor, the senior editor, and three anonymous reviewers for their detailed and constructive comments. The authors also thank Maxwell Stinchcombe, Thomas Wiseman, De Liu, Ying‐Yu Chen, and the participants of the 2012 INFORMS Conference on Information Systems and Technology (CIST) for helpful feedback.
1
The detailed difference between payoff externalities and information externalities can be found in Moretti (2011) and Qiu et al. (
).
2
An assumption of our model is that all consumers share the same utility function.
3
It is worth noting that even if consumer i in our example wants to conduct rational learning based on friends' decisions, it would be very complicated because she does not know other early consumers' decisions. She could form an expectation on all other early consumers' decisions, but there are 2 N possibilities, where N is the number of early consumers in week 1 whose decisions are not observed by consumer i. It is a high rationality requirement to compute the expected utility based on all possible cases.
4
We may also consider an example of consumers deciding whether to buy a newly released smartphone, such as the newest model of iPhone. Consumers make this decision when their existing service contracts expire, so that there is an exogenous sequence of actions determined by contracts. In this context, each consumer may observe the choices of her friends, neighbors, and coworkers.
