Abstract
The study investigates the flagging behavior as specific type of bystander intervention against uncivil user comments in comments sections on news sites. Two experimental studies examine the effects of intervention information, characteristics of response comments, and the type of victim attacked in a comment on flagging behavior, that is on reporting a comment to professional moderators. Our results indicate that intervention information is a promising strategy to motivate flagging. Flagging is based on responsibility attribution to professional moderators but not on self-responsibility perception. Type of victim and characteristics of other users’ posted responses to preceding comments (public disagreement and politeness) shape deviance perceptions of the situation and influence flagging behavior.
Comments that violate the community standards by threatening democratic rights and stereotyping social groups are among the most problematic challenges in comments sections of online newspapers. User engagement against such uncivil comments is strongly requested by most online newspapers to complement professional moderation. To act against inappropriate comments, most comments sections feature flagging buttons. These allow users to report violations of their usage policy to professional moderators. However, scientific knowledge on their use is scarce (Crawford and Gillespie, 2016; Goodman, 2013; on further social buttons, e.g. Stroud et al., 2017). This article aims to shed light on this kind of user engagement by integrating knowledge from bystander research. It addresses three influential factors and thus also extends research on bystander intervention in computer-mediated interactions.
Bystander research has compiled comprehensive knowledge on bystander behavior in offline situations (e.g. Fischer et al., 2011; Kalch and Meitz, 2015; Meitz et al., 2014; Latané and Nida, 1981; Niesta Kayser et al., 2010). Similar to offline situations harassments, threat of violence or stereotyping may be included in online comments. Users can flag such comments as worth of deletion by a moderator. Despite the moral similarity, bystander behavior in comment sections is framed by several particularities different from offline bystander behavior: (1) Offline attacks as well as cyberbullying attacks are directed against physically or virtually present individuals. In contrast, inappropriate comments often insult or devaluate a generalized group of victims (e.g. migrants, religious groups in general, Reich, 2011). (2) Intervention strategies in the online context and individual responsibility are ambiguous and less clear (Obermaier et al., 2016). (3) The audience of a comment is potentially larger but also anonymous and only virtually present. However, user responses that succeed a comment give indication of other users’ assessment of it. Thus, Study 1 addresses the influence of the type of victim, of information on intervention options and attributed responsibility, and of (dis)agreeing user responses on intervening by flagging an uncivil comment. Study 2 adds evidence on the influence of the style of writing of user responses.
User comments and their regulation
Comments sections have become a common feature of online news sites. Editors and journalist expect that comments sections promote user loyalty, that user comments complement the professional coverage as well as they provide feedback to journalists (Meyer and Carey, 2014). Reading comments is widespread among Internet users, and the readers perceive comments sections as relevant and interesting. Even if the number of those posting comments is still small compared to more traditional online practices (Bergstrom and Wadbring, 2015), comments sections offer new possibilities to lay communicators to participate in public discourse. However, the professionals retain the decision-making power over the user-generated content on their platforms (Nip, 2006). Challenging to readers and professionals is that not all comments adhere to the usage policies. Among the inappropriate posts, uncivil and impolite user comments are the main reasons for complaint and the main issue of moderation by professionals (Coe et al., 2014; Reich, 2011; Stroud et al., 2015). Incivility in online discussions is defined as “set of behaviors that threaten democracy, deny people their personal freedoms, and stereotype social groups” (Papacharissi, 2004: 267). In close relation to incivility is impoliteness. It includes name-calling, aspersion, pejorative speak, vulgarity, and further behaviors of not adhering to an etiquette (Papacharissi, 2004; also e.g. Brown and Levinson, 1987). Such posts may compromise the image of the newspaper (Reich, 2011), lessen the engagement of the users, and put at risk the benefits expected from deliberate online discussions. They may lead to attitude polarization, reduce open-mindedness, and increase negative intergroup emotions (Anderson et al., 2014; Borah, 2014; Hwang and Kim, 2018).
As part of a “long Western tradition of media regulation” (Crawford and Gillespie, 2016: 412), many popular German, US-American, and Western European news sites rely on users’ engagement in sanctioning inappropriate comments complementary to professional moderation (Naab, 2016a, 2016b; for an overview Goodman, 2013). One of these sanctions is flagging content. It is less constructive than writing response comments or rating a comment and does not allow for a discussion between the writer of a comment and its flagger. Depending on the design of the respective platform, flagging buttons are accompanied by further tools to engage in regulation, different sorts of content can be flagged, flaggers are asked for no, little, or detailed articulation of their reasons, and flags are processed differently (for a detailed overview, see Crawford and Gillespie, 2016). Flagging often leads to deletion by professional moderators and is thus the most consequential sanction of users. In many popular German online news sites, such as SPIEGEL ONLINE, or US-American news sites, such as The New York Times or The Guardian, flags are originally thought to report violations of usage policies by users, but they also indicate individual disagreement to specific views, are used for retribution, harassment, or to direct attention (Crawford and Gillespie, 2016). So far, research has rarely examined quantity and determinants of flag use in comments sections.
Study 1
Following the model of helping by Latané and Darley (1970) knowledge how to act is a crucial step that determines bystander intervention (Banyard, 2008). In offline bystander research, this is, for example, shown in studies that find higher intervention rates with subjects who had medical competence when facing a medical incident (Cramer et al., 1988). Most comments sections provide information on community standards and intervention options in their netiquette rules (Pankoke-Babatz and Jeffrey, 2002), albeit with differing obtrusiveness and they may be easily ignored. Thus, they are formal guidelines that justify sanctions, but are assumed to be relatively weak means to encourage active engagement. When information is recognized by users, it can enhance their skills and increase the quality of user engagement. For example, a more detailed invitation to leave comments results in comments of higher quality (Freelon, 2015; Sukumaran et al., 2011). Furthermore, explicit appeals to user engagement are effective means to increase regulation behavior. For example, intervention against an act of child grooming in an online chat was stronger when participants were informed that adherence to the community standards depended on the users’ surveillance than when informed that the chat was under computer surveillance (Palasinski, 2012). Emphasizing the options and intervention responsibility of users seems worthwhile to increase flagging:
H1. Information about user responsibility and intervention strategies will increase flagging behavior.
Many of the offline and online phenomena examined under the perspective of bystander behavior relate to helping certain individuals (Blair et al., 2005; Dooley et al., 2009; Kalch and Meitz, 2015; Meitz et al., 2014). Uncivil user comments differ in that they are often directed against social groups, ideas, and values in general (e.g. Reich, 2011). In general, bystander also intervenes against deviant behavior targeted against social values instead of particular victims (Brauer and Chekroun, 2005). However, bystander intervention is more likely when victims are socially stronger connected to the bystander (Levine and Crowther, 2008; Palasinski, 2012). De-personalized comments attacking social groups in general are assumed to induce less social connection to and relevance of the victims compared to attacks against specific individuals (Oliver et al., 2012) and thus result in decreased intervention behavior compared to comments attacking specific individuals:
H2. An uncivil comment that is directed against specific individual victims will more likely cause flagging than an uncivil comment directed against a social group in general.
For offline helping behavior, extensive evidence exists about effects of other bystanders on helping decisions (Fischer et al., 2011; Latané and Nida, 1981; for online communication, e.g. Obermaier et al., 2016). In comments sections, an indicator of others’ reactions and their evaluations of the situation are the existence and direction of one or more responses to a comment (Rim and Song, 2016).
In situations when norms are not salient, people tend to adjust to perceived group norms indicated by other users’ behavior (Stroud et al., 2015; Sukumaran et al., 2011). Disagreement by others in a response comment may indicate that other users perceive the uncivil comment as deviant. This should increase flagging because a user can feel certain to be in line with the norms of the forum. At the same time, helping behavior is less likely when other witnesses already assumed responsibility (Latané and Darley, 1970). A disagreeing response may indicate that other users recognized their responsibility, and thus reduce the need to intervene for a subsequent reader (e.g. Scaffidi Abbate et al., 2014). In contrast, agreement in a response may increase ambiguity about the deviance of the uncivil comment, which may decrease intervention likelihood (e.g. Fischer et al., 2011). At the same time, response agreement may underline the need for flagging because the norm violation doubled and thus increase flagging behavior (Fischer et al., 2011). Given the different potential explanations, we ask the following research question (RQ):
RQ1. Does flagging behavior differ when an uncivil comment receives agreement, disagreement, or no response?
Method
Design and participants
A 2 × 2 × 3 between-subjects design, varying intervention information (available vs. unavailable), type of victim (individuals vs. social group), and type of response (agreement vs. disagreement vs. none), was carried out. Students of a communication class at a German university were extensively trained in social science methods and distributed invitations to participate in the online study via mailing lists, e-mails, and postings in social communities. No compensation was given. Participants were randomly assigned to one of the 12 conditions and answered an online questionnaire. In total, 308 participants conducted the survey; however, 26 indicated to never or very rarely read user comments on news sites and social networks. To strengthen the external validity of the results, the analyses refer to the 282 participants who indicate basic usage of user comments (Mage = 23.54, SD = 9.34, 89 males [32%], 169 females [60%], and 10 people did not indicate gender).
In order to avoid confounding results for flagging behavior, the manipulation of type of victim and type of response was tested separately with 29 students (Mage = 22.62, SD = 3.60, 5 males [17%], 23 females [77%], and 2 missings) from various academic areas and years not included in the final sample.
Stimuli
A mockup news site including a comments section was created and integrated into an online questionnaire. Like regular news sites, it consisted of a news article followed by a comments section. Number and type of the provided comments varied due to the experimental condition. Participants were asked to read the article as well as the comments as they would normally do. In the instruction, participants were informed that the comments section provided all regular opportunities for usage and was fully functional. All buttons had mouse-over effects to highlight their functionality. The layout and style followed typical German news sites. The participants were informed that the comments were written by former participants and that their comments might be visible to future participants. This was to increase external validity by simulating the situation of interacting publicly. After the experiment, participants were fully debriefed.
The participants read an article of the politics section since politics is known to attract many but also worse user comments (Goodman, 2013). It reported on successive adoption by homosexual couples. This was a controversial issue in 2014 in Germany, allowed for counter-arguing, and opposing comments appeared realistic (Coe et al., 2014). The argumentation in the stimulus article balanced pro- and contra-argumentation lines and included personal experiences of an exemplary homosexual couple.
In the intervention condition, as example of news sites that introduce comment sections in detail, the comments section included a note about the usage policies, an explication of tools to intervene against uncivil comments (flagging, positive and negative evaluations, commenting), and an explicit reference that users should take responsibility and actively participate in sanctioning uncivil comments. It also linked to a more comprehensive netiquette. The non-intervention group represents news sites that give very few information to users about commenting behavior and thus included but a brief note introducing the beginning of the comments section.
The first comment of the discussion part was unobtrusive and identical in all conditions. It was followed by an uncivil comment that either attacked the homosexual couple serving as exemplar within the article (individual victims) or attacked homosexuals in general (social group). The uncivil comment included defamation and discrimination of homosexuals, profanity, and an implicit threat of violence. Depending on the experimental condition, the uncivil comment received either a response indicating disagreement by another user, indicating agreement by another user, or no response (Appendix 1). Below each comment, three buttons were provided to enable a positive evaluation of the comment (green thumbs up button), a negative evaluation (red thumbs down button), and flagging (red button labeled “Report”). Additionally, each comment was followed by a response field. At the end of the comments section, a field was included to enable commenting on the article without replying to any of the prior comments.
Measures
The mockup site captured if the participants clicked (1) or did not click (0) the flagging button in order to measure flagging behavior of the uncivil comment (86 participants, 30.5% flagged). Perceived deviance of the uncivil comment was measured with six items (e.g. “This user comment was clearly offending,” “This user comment was harmless” [reverse coded], “This user comment impinged personality rights”) on a 7-point scale (1 = strongly disagree, 7 = strongly agree) based on the definition of incivility by Papacharissi (2004, M = 6.37, SD = .86, α = .76). General flagging frequency was measured by asking participants to indicate how often they flag comments if possible (M = 3.26, SD = 2.34). Attitudes toward homosexuals were measured with seven items adapted from Herek and McLemore (2011) and additionally enhanced by three items referring to adoption rights of homosexuals (M = 6.37, SD = 0.87, α = .87). High values represent positive attitudes toward homosexuals. To check for perceived differences in intervention manipulation, respondents were asked to indicate if information on the intervention strategies against comments were given, if a netiquette was provided, and if users were able to flag (1 = yes, 2 = no, 3 = I am not able to remember). To control for message perceptions, people were asked to indicate how “believable” (M = 5.76, SD = 1.31) and “comprehensible” (M = 6.06, SD = 1.22) the article was (1 = strongly disagree, 7 = strongly agree). Additionally, participants were asked to indicate their agreement with the item “The comments section could have been published in a similar vein on an online news site” to control for the authenticity of the comments section (1 = strongly disagree, 7 = strongly agree, M = 5.51, SD = 1.47).
To check for a successful manipulation of the type of victim and type of response, 29 students not included in the final sample were asked to indicate how much the two versions of the uncivil comment attacked homosexuals in general or the homosexual couple described in the article (7-point semantic differential). Additionally, they indicated how much the response comments agreed or disagreed with the uncivil comment (7-point semantic differential).
Results
A treatment check was conducted before testing the hypotheses. Paired sample t-tests showed that the manipulations of type of victim and type of response were successful: The comment referring to individual victims was perceived as more offensive against the homosexual couple described in the article (M = 2.89, SD = 2.41) than the comment attacking homosexuals in general (M = 1.64, SD = 1.13), t(27) = 2.72, p = .011. The response arguing against the uncivil comment was perceived as disagreeing more intensely with the uncivil comment (M = 6.71, SD = 1.15) than the response supporting the uncivil comment (M = 1.07, SD = 0.26), t(28) = −25.03, p < .001. Regarding the intervention manipulation, significantly more participants in the intervention condition remembered that a netiquette was provided (n = 95, 62.1%) than in the non-intervention condition, n = 2, 1.7%, χ2(2) = 105.62, p < .001, and more participants remembered information on the intervention strategies against uncivil comments (n = 97, 63.4%) than in the non-intervention condition, n = 39, 33.3%, χ2(2) = 33.13, p < .001. In a similar vein, in the intervention condition, more participants remembered that they could flag the comments (n = 119, 77.8%) compared to participants in the non-intervention condition, n = 73, 62.4%, χ2(2) = 9.21, p = .010. However, in both groups, the percentage of participants that remembered an opportunity to flag was relatively high indicating that the obtrusive “Report” button itself served as a signal. The experimental groups neither differed with respect to perceived comprehensibility, F(11, 269) = 0.97, p = .472, and believability of the article, F(11, 269) = 0.94, p = .484, nor perceived authenticity of the comments section, F(11, 269) = 1.50, p = .133.
A binary logistic regression model was computed to analyze the influences on flagging (H1, H2, and RQ). Intervention information (available vs. unavailable), type of victim (individual victims vs. social group), and type of user response (agreement vs. disagreement vs. none) were entered as predictor variables. Intervention information and type of victim were dummy coded for analysis and type of response was transformed into two categorical dummy variables. For both variables, the none-response group was set 0 as a baseline. For the response agreement dummy variable, agreement was assigned the value 1 and disagreement the value 0. For the response disagreement variable, disagreement was assigned the value 1 and agreement the value 0. Flagging behavior was entered as dependent variable. General flagging frequency, perceived deviance, and attitudes toward homosexuals were entered as controls (Table 1).
Logistic regression analysis of the effects of intervention information, type of victim, and type of response on flagging behavior.
SE: standard error.
n = 282; victim: 0 = social group, 1 = individual victims; intervention information: 0 = non-intervention, 1 = intervention; response disagreement: 1 = disagreement, 0 = agreement, 0 = no response; response agreement: 1 = agreement, 0 = disagreement, 0 = no response.
In line with H1, flagging was more likely when intervention information was presented. However, there is no main effect of type of victim (H2). In line with existing bystander research, general flagging frequency increases flagging behavior. However, the three-way interaction between intervention information, response disagreement, and type of victim is significant indicating that particular combinations of all three characteristics influence flagging likelihood. In order to interpret this interaction, cross tabs were computed (Figure 1).

Flagging behavior in conditions of response direction, type of victim, and intervention information.
Providing intervention significantly increases flagging behavior in nearly all groups (individual victims and response disagreement, χ2(1) = 8.17, p = .006; individual victims and response agreement, χ2(1) = 6.06, p = .019; victimized social group and no response, χ2(1) = 10.87, p < .001; individual victims and no response, χ2(1) = 4.98, p = .042). The effect is not visible for an uncivil comment that attacks a social group and received response agreement, χ2(1) = 2.99, p = .111, or response disagreement, χ2(1) = 0.73, p = .442.
Discussion
The results strongly support the idea of giving obtrusive intervention information and emphasizing the need for user intervention in comments sections to increase user engagement. Intervention information increased flagging behavior in most cases of user responses. If the uncivil comment did not receive a response by others, people assumedly derive a feeling of own responsibility since obviously nobody else took care of a fulfillment of the usage policies. Compared to other situations that require bystander action, user comments are low in their level of danger, making intervention decisions more ambivalent and questionable. Expectations about user engagement address the individual responsibility of the users and reduce ambivalence whether an incident truly needs intervention, influencing the emergency awareness.
The positive effect of providing intervention information diminishes if the uncivil comment addresses an abstract social group and already received response disagreement but even so if response agreement was posted. Since user comments usually address abstract social groups, this case seems particularly relevant. Offending abstract social groups may be experienced as less deviant compared to attacks against individual victims. When another user posted a disagreeing response, this may have decreased perceived deviance because he/she has thus already clarified the standards. A response disagreement may be evaluated as adequate care taking, reducing the feeling of responsibility for the bystander. At the same time, response agreement may call deviance into question because someone else obviously did not perceive the comment as deviant. This may reduce the felt inappropriateness of the comment and reduce perceived appropriateness to counteract. This points to the necessity to further investigate the deviance of response agreement and disagreement as well as the perceived responsibility as mediating factors of flagging behavior (study 2).
Flagging behavior for the uncivil comment was also increased by general flagging frequency. This strengthens the arguments derived from offline bystander research that felt competence and experience increase intervention in computer-mediated interactions, too.
Regarding the news sites, two implications stand out: (1) Active users of comments sections are an important source for news organizations for critically monitoring incivility. (2) Providing guidance through detailed intervention information on norms and expectations of user engagement have the potential to motivate the large group of passive readers to actively support professional moderators.
Study 2
Bystander behavior in study 1 is shown to be less likely if an abstract social group is attacked, in particular if a disagreeing response has already been posted. This may be explained by varying degrees of perceived responsibility. Empirical research on offline bystander situations indicates that bystanders in high emergency situations feel responsible to help irrespective of other bystanders, whereas they do not do so in low emergency situations (Fischer et al., 2006, 2011). On the one hand, when a de-personalized social group is attacked and response disagreement is already given (as in study 1), users may no longer feel self-responsible to act and the previous response by others may be perceived as adequate intervention, similar to bystander behavior in low emergency situations. On the other hand, response agreement is more problematic and indicates a deviant situation not taken care of so far. We therefore assume that response agreement increases self-responsibility that in turn increases bystander behavior.
H1. Perceived self-responsibility mediates the effect of response direction (agreement vs. disagreement) on flagging behavior.
Despite taking self-responsibility to intervene in a perceived emergency, bystanders may also transfer responsibility to other people (Thornberg, 2007). Since most comments sections are supervised by professional moderators (Goodman, 2013), comment readers may allocate need for action to them. In contrast to offline bystander situations, in comments sections users may feel self-responsible but their engagement has limited consequence. The flagging tool enables readers to report the need of intervention to moderators, but the readers are not themselves able to delete or modify the other users’ comments. A more deviant situation is assumed to increase perceptions of professional responsibility and as a consequence flagging likelihood should increase.
H2. Attribution of responsibility to professional moderators mediates the effect of response direction (agreement vs. disagreement) on flagging behavior.
As agreeing responses reinforce an uncivil comment, it is plausible to assume that response agreement is perceived as more deviant than response disagreement. Aside from the response direction that gives an indication of the deviance of an uncivil comment, the style of the response may also add to the perceived deviance of the situation. User responses to uncivil comments may strongly differ according to the level of argumentation and style of writing (Ziegele et al., 2014). Among other factors, interaction may vary in its level of politeness (Brown and Levinson, 1987). Research on how impoliteness of a comment influences participation in online comments shows inconsistent findings (Borah, 2014; Ng and Detenber, 2005). The influence of politeness on flagging and other negative sanctions has not been studied yet. An impolite response agreement to an uncivil comment should be perceived as more deviant than an uncivil comment alone and result in a higher level of responsibility than an impolite response disagreement to an uncivil comment.
H3. Response politeness moderates the effect of response direction on perceived self-responsibility, attributed professional responsibility, and on flagging behavior.
Method
Design and participants
A 2 × 2 between-subjects design, varying response direction (agreement vs. disagreement) and politeness of the response (impolite vs. polite), was used. Participants were undergraduate students from a German university. Participation was voluntary and they did not receive course credit for participation. They were randomly assigned to one of the four conditions and answered the online questionnaire. After the experiment, participants were fully debriefed. In all, 159 participants filled the questionnaire. As in study 1, participants were excluded who indicated to never or rarely read user comments on news sites and social network sites (n = 13) leaving 146 participants (Mage = 26.16, SD = 11.50, 39 males [27%], 92 females [63%], and 15 people did not indicate gender) for the analyses.
Stimuli
Participants were instructed to read a mockup news site including a manipulated comments section. Article and page set up were identical to study 1. All groups in study 2 received the intervention information and the uncivil comment attacking the social group of homosexuals in general. Depending on the experimental condition, the uncivil comment received a polite response disagreement, an impolite response disagreement, a polite response agreement, or an impolite response agreement (Appendix 1).
Measures
Flagging behavior (79 participants, 44.9% flagged the uncivil comment) as dependent user reaction was measured as in study 1. Perceived self-responsibility was measured with three items adapted from Fischer et al. (2006, e.g. “I felt personally responsible to intervene against that comment,” M = 3.47, SD = 1.75, α = .92) on a 7-point scale (1 = strongly disagree, 7 = strongly agree). Attributed professional responsibility was measured using two items (e.g. “Professional moderators are responsible to delete uncivil comments,” M = 5.55, SD = 1.50, α = .75, r = .60, p < .001).
To check for a successful manipulation, participants were asked to indicate how much the response comment agreed with the uncivil comment. Response politeness was measured with one item about perceptions of inappropriate language used in the response comment. As in study 1, we measured general flagging frequency (M = 5.83, SD = 2.45) and attitudes toward homosexuals (M = 6.09, SD = 1.16, α = .92). Believability (M = 5.66, SD = 1.42), comprehensibility (M = 5.94, SD = 1.41), and authenticity of the comments section (M = 5.43, SD = 1.63) were measured to control for perceptions of the setting.
Results
In order to test whether the manipulation for response direction was successful, an analysis of variance (ANOVA) was conducted. As intended, the response that disagreed with the uncivil comment was perceived as less supportive (M = 2.03, SD = 1.73) than the response that agreed with the uncivil comment (M = 6.25, SD = 1.60), F(1, 142) = 241.89, p < .001,
To test for the assumed moderation and mediation effects (H1–H3), an indirect logistic regression model was computed using PROCESS for SPSS (20.000 Bootstrap samples), model 8 (Hayes, 2013). Self-responsibility and professional responsibility were entered as mediators. Flagging behavior was entered as dependent variable and response direction (agreement vs. disagreement) as independent variable, politeness of response (polite vs. impolite) was entered as moderator for the direct effect of response direction on flagging behavior as well as on perceived self-responsibility and attributed professional responsibility. Attitudes toward homosexuals, authenticity of the comments section, and general flagging frequency were entered as controls (Figure 2).

Indirect effects of response direction, response politeness, and responsibility perceptions on flagging behavior.
A significant indirect effect of response direction through attributed professional responsibility on flagging behavior is visible. As suggested, response disagreement decreases professional responsibility attribution, and thus flagging likelihood, but only in conditions of an impolite response B = −.38, standard error (SE) = .24, 95% confidence interval (CI) = [−.95, .04]. In the impolite condition, users tend to render responsibility to professional moderators and in turn flag more often when a response agrees compared to a response that disagrees with the preceding comment.
A second indirect effect is visible for the interaction of response direction and politeness: A response that is polite and disagrees with the uncivil comment increases attributed professional responsibility and in turn increases flagging behavior, B = .43, SE = .26, 95% CI = [.05, 1.06]. Perceived self-responsibility does not have a mediating effect.
Discussion
Bystander intervention depends on characteristics of the course of the discussion. Response agreement to an uncivil and impolite comment by other users increases the attribution of responsibility to professional moderators that in turn increases flagging to inform moderators. An agreeing response represents an unsolved deviant situation which involves the need for intervention. When additionally impolite language is used, agreement with an uncivil comment seems to increase the deviance and decrease the ambiguity of the situation so that subsequent users more easily interpret the uncivil comment as incident that needs intervention and alert professional moderators through flagging. A disagreeing response by another user reduces attributed responsibility. This may be the case because such reaction might be perceived as a sufficient sanction. In contrast, a polite response does not reduce attributed responsibility, and thus flagging. One possible explanation may be that a polite regulation against an uncivil comment is not perceived as an adequate reaction so that further intervention is in need.
The effect of response direction on bystander intervention is not mediated by perceived self-responsibility. This may be a result of the limited consequences of user engagement against inappropriate comments compared to the sanctions of professionals.
The results point to the direction that users indeed interpret flagging as a means to indicate violations of usage policies and as an expression of a need for consistent sanctions. Polite discursive replies of other users are not always perceived as sufficient. Although further research is needed, this might indicate why online discussions often take an increasingly aggressive course. The flagging option and an intervention by professionals may help to prevent such developments in online discussion.
General discussion
The two experimental studies provide an initial step to investigate the user engagement against uncivil comments referring to bystander decision-making processes. Three parallels to existing bystander research emerged: (1) The provision of detailed information increased flagging likelihood. This emphasizes the importance of knowledge and responsibility attribution as crucial steps for helping (Latané and Darley, 1970). Even if flagging is neither complex nor difficult, thorough communication and transparent handling by news organization can motivate active user engagement. A dialogue between professional moderators and users seems important to elaborate on how users can use flags effectively and to negotiate shared norms and mechanisms of social control in a comments section. Future research needs to investigate the effects of different types of information to differentiate motivational- and knowledge-based effects. (2) In line with bystander research on cyberbullying (Obermaier et al., 2016) and offline violent behavior (Fischer et al., 2011), the results show that increased deviance of the situation, introduced through the inappropriateness of a comment and succeeding responses, increases professional responsibility attributions (study 2) and makes bystander interventions more likely (study 1 and study 2). This also raises the question of the effects of accumulated incivility in comments and replies and of long-term effects of a habituation to incivility. (3) Even if there was no overall difference between individual victims and victimized social group on flagging (study 1), it becomes visible that in specific combinations of comment characteristics abstract social victims induced less flagging behavior. The question, under which circumstances anonymous bystanders help anonymous victims and advocate social values in general, becomes increasingly relevant, given the high number of online discourse formats that allow for fake-profiles or hide personal information. Future research needs to compare the effects of varying levels of anonymity on users’ decision processes.
Both studies have some relevant limitations: They examined flagging behavior in laboratory experimental settings. There is a clear need for field studies on flagging behavior. Furthermore, the findings need to be confirmed for intervention to help other victimized groups, may these be further groups referred to in a news article as well as active users in the comments section. Additionally, the samples of the studies are rather young and well-educated. German users of comments sections are well-educated, too. Findings with regard to their age are inconsistent (Hölig and Hasebrink, 2015; Springer et al., 2015). This may point to a potential bias and limited generalizability of the results. However, bystander research has shown that sociodemographic variables only have little influence on intervention behavior (Fischer et al., 2011); albeit it has to be tested whether this holds true for online bystander intervention on inappropriate comments.
The levels of perceived incivility of flagging comments depend on the attitudes and cultural norms in a society that refer to the respective topic under discussion. Attitudes toward homosexuals in the sample here were quite positive. In social groups that are more critical against homosexuals, perceptions of incivility of the comment as well as flagging behavior may be different and should be addressed in future research. This is especially in need since bystander intervention may vary with greater intergroup contact and less prejudice (Abbott and Cameron, 2014).
Both studies provided users with basic feedback tools, which are most prevalent in usual German news sites and stated that flagging will lead to supervision by professionals. However, professional moderation and means for users to sanction are in a steady modification also altering the relevance and meaning of flagging tools. Research will have to regard further intervention options such as dislike buttons and regulating replies. These may be used substitutively or complementary to flagging. This is further stressed by the fact that commenting on news coverage on Facebook gains relevance, which does not feature a flagging button but users engage in replying and liking. Such limited professional moderation after publication strengthens the general need for user engagement. It has to be tested whether determinants of flagging transfer to further sanctioning means. Given the current debates of regulation of inappropriate comments on Facebook, this seems a valuable asset.
Conclusion
Blocking discussions on controversial topics and deleting comments is a comprehensible reaction of news sites to violations of usage policies. Implementation of a flagging tool involves users in the process of regulation. Such efforts do not only save resources, but also integrate users in the negotiation of shared values. It seems worth to establish concertive control through users as a pillar of control complementary to professional moderation. The presented studies contributed to a deeper understanding of flagging in specific and user engagement in more general.
Footnotes
Appendix 1
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
