Diffusion of Measurement Invariance Assessment in Cross-National Empirical Marketing Research: Perspectives from the Literature and a Survey of Researchers

Abstract

The authors examine (1) the extent to which cross-national marketing scholars report measurement invariance (MI) assessment results and (2) what cross-national marketing scholars think about MI assessment in general. In Study 1, the authors analyze all cross-national empirical articles (243) published in 15 well-respected and peer-reviewed marketing journals from 2000 to 2005. Although the results indicate a steady growth of published cross-national empirical marketing research and assessment of MI, only 28% of the studies undertook the procedure. In Study 2, the authors analyze responses from 86 cross-national empirical marketing scholars regarding their knowledge about, attitudes toward, and use of MI assessment. The results indicate that the relatively low utilization of MI assessment is due to low MI knowledge and the sophistication of the techniques. The authors conclude with suggested implications for the field of international marketing and a discussion of future research directions.

Keywords

cross-national measurement equivalence cross-national research culture methodology

Globalization continues to drive rapid growth of international trade (Holt, Quelch, and Taylor 2004), global corporations, and nonlocal consumption alternatives (Alden, Steenkamp, and Batra 2006). Therefore, it is not surprising that the number of studies examining cross-national marketing topics is growing (e.g., Griffith, Myers, and Harvey 2006). Although such research provides valuable insights for both academics and practitioners, several scholars have emphasized the importance of minimizing the possibility of underlying biases in cross-national empirical research due to faulty data collection or analysis. Recommended approaches to avoid such problems include controlling for biases before or during data collection (e.g., Craig and Douglas 2000; Griffith and Schuster 2002; Van Herk, Poortinga, and Verhallen 2005) and assessing the measurement invariance (MI) of data already collected (e.g., Mullen 1995; Steenkamp and Baumgartner 1998).

With respect to the latter recommendation, several scholars have argued that the validity of cross-national data analyses could be questioned if MI is not established and reported (Hui and Triandis 1985; Sekaran 1983; Singh 1995; Van de Vijver and Leung 1997). Yet the results of MI are often not included in published studies (Aulakh and Kotabe 1993; Hult et al., in press; Malhotra and Agrarwal 1996; Sin and Cheung 1999). Through content analysis of consumer studies published between 1991 and 1996, Sin and Cheung (1999) find that researchers often take steps to minimize bias before data collection (e.g., by conducting double-back translation of surveys), but few report MI assessment thereafter.

More recently, Steenkamp and Baumgartner (1998) and Myers and colleagues (2000) have stressed the importance of conducting MI assessment in cross-national empirical marketing research and have provided step-by-step procedures. Following their call, in Study 1, we content-analyzed 243 cross-national empirical studies published between 2000 and 2005 in 15 top marketing journals for the presence or absence, methodology (if present or reported), and proper use or misuse of MI assessment. In Study 2, we conducted a survey of scholars who published cross-national empirical marketing research during this same period to understand more completely the results of the content analysis reported in Study 1. Before reporting the results of both studies, we offer a brief review of focal issues in MI assessment in cross-national marketing.

MI in Cross-National Empirical Marketing Research

Cross-national researchers frequently collect data in multiple countries. However, this draws attention to a potential source of bias: Namely, it is possible that observed differences in the results are not due to the manipulations or relationships of interest but rather to systematic cultural differences in interpretation and/or responses. For example, consumers may not interpret a given construct similarly across cultures (Myers et al. 2000; Singh 1995), or they may vary cross-culturally in their tendency to respond to certain scale items (Hui and Triandis 1989; Riordan and Vandenberg 1994). As a result of potential biases, scholars have argued that construct, metric, and scalar equivalence should be examined before results are compared across cultural or national boundaries (e.g., Mullen 1995; Steenkamp and Baumgartner 1998; Van Herk, Poortinga, and Verhallen 2005).

Over the years, researchers have recommended various procedures for establishing cross-national equivalence. For example, some have suggested assessing construct equivalence by comparing factors across independent samples using exploratory factor analyses (Reynolds and Harding 1983). Others have recommended methods such as visually checking patterns for possible invariance (Van de Vivjer and Leung 1997) or comparing Cronbach's alpha across groups. Moreover, researchers have proposed score standardization and ipsatization (Cunningham, Cunningham, and Green 1977) procedures to remedy possible biases. In addition, optimal scaling can help researchers investigate response set biases (Mullen 1995). Unfortunately, these methods either lack statistical power (e.g., exploratory factor analysis) or run the risk of eliminating all possible differences between groups, including those related to the research topic (e.g., standardization and ipsatization; Van de Vijver and Leung 1997).

To address these limitations, researchers have recently proposed more detailed cross-national MI procedures. These approaches often apply multigroup confirmatory factor analysis (CFA) or multigroup measurement model testing, using structural equations to diagnose both construct- and scale-related biases. For example, Mullen (1995) recommends use of multigroup LISREL to analyze the equality of the error covariance matrices in the measurement model. Singh (1995) pays special attention to construct equivalence and suggests use of latent variable structural equations to assess factor loading equivalence across cultures. Cheung and Rensvold (1999) extend Byrne, Shavelson, and Muthén's (1989) procedure of examining factorial invariance through CFA. They maintain that factorial invariance has two requirements: (1) that item responses load on the same constructs across groups and (2) that factor loadings are not significantly different from each other. Furthermore, Myers and colleagues (2000) note that examination of the covariance matrix for errors associated with items in the measurement model is also necessary.

Steenkamp and Baumgartner (1998) recommend a multigroup CFA approach to assessing MI, including the following: (1) configural—to determine whether the basic meanings and structure of research constructs are understood and conceptualized similarly in the different groups (e.g., countries, cultures); (2) metric—to determine whether scale intervals are perceived similarly across the groups; (3) scalar—to determine whether there is systematic response bias due to cross-group differences; and (4) two additional steps with even stricter requirements for determining MI. They also emphasize the importance of matching the level of invariance assessment with the research purpose.

Configural invariance is sufficient if the research purpose is to explore cross-nationally the basic structure of the construct. However, configural, metric, and scalar invariance are required for comparisons of means across countries. Finally, further complexities may arise depending on the specific analysis conducted by each research project—for example, whether standardized or unstandardized regression coefficients are compared. Moreover, the CFA approach has been extended to cross-national empirical studies that deploy emic or country-specific measures (Baumgartner and Steenkamp 1998). Many researchers find multigroup CFA to be a valuable diagnostic tool for evaluating MI (Myers et al. 2000).

Because several scholars note that enhanced validity accompanies establishment of MI (e.g., Steenkamp and Baumgartner 1998), it is important to investigate how MI tests have been received by the cross-national marketing academic community. With this in mind, in Study 1, we examine all empirical cross-national empirical marketing studies in 15 top area journals from 2000 through 2005.

Study 1

Sample

Empirical marketing articles with cross-national content published from 2000 to 2005 in 15 peer-reviewed journals constituted the sample for this study (see Table 1). The selection of journals was based on existing journal rankings and journals used in prior marketing content analyses (e.g., Nakata and Huang 2005). We selected the period 2000–2005 because it followed the publication of important methodological studies involving cross-national MI (e.g., Mullen 1995; Singh 1995; Steenkamp and Baumgartner 1998). As such, this time frame should effectively capture the assimilation and application of the most recently introduced MI assessment approach—namely, the multigroup CFA approach. The adoption of other, more traditional MI approaches has been well documented by prior content analysis research (e.g., Hult et al., in press; Sin and Cheung 1999).

Table 1

Definitions for Coding Scheme

Category	Definition	Measurement	Reliability	Authors
Research area	What is the main topic of the published article?	0 = Consumer behavior 1 = Strategy 2 = Methodology	100%	Reynolds, Simintiras, and Diamantopoulos (2003)
Methodology	What is the research methodology used?	0 = Comparative (examine differences or similarities between countries) 1 = Cross-cultural (examine the cross-national generalizability of a theory or a model)	77.5%
Country	How many countries examined?	0 = Two countries 1 = Three to ten 2 = More than ten countries	92.2%
Data collection design	What was the means of data collection?	0 = Survey 1 = Experiment 2 = Secondary data	96.2%
Single item	Does this research use single item measure?	0 = No 1 = Yes	77.5%
Journal	In which journals did the article appear?	0 = Journal of Marketing 1 = Journal of Marketing Research 2 = Journal of Consumer Research 3 = Journal of Business Research 4 = Journal of Retailing 5 = Marketing Science 6 = Journal of the Academy of Marketing Science 7 = Marketing Letters 8 = Journal of Advertising 9 = Journal of Advertising Research 10 = Journal of International Business Studies 11 = International Marketing Review 12 = Management International Review 13 = International Journal of Research in Marketing 14 = Journal of International Marketing 15 = European Journal of Marketing	100%	Nakata and Huang (2005)
MI assessed	Did study assess MI?	0 = No 1 = Yes	89.4%
MI approach	What MI approach was used?	0 = Other approaches 1 = CFA approaches	89.4%
Purpose	What is the purpose of the study?	0 = Explore basic meaning and structure of a construct (configural invariance) 1 = Compare means (metric and scalar invariance) 2 = Relate focal construct to other constructs in a nomological network (metric invariance) 3 = Compare standardized measures of association (factor variance invariance)	96.6%	Steenkamp and Baumgartner (1998)
Level of MI assessment	Of the studies that assessed MI, on which level was MI assessed?	0 = Equality of covariance matrices and mean vectors 1 = Configural invariance 2 = Metric invariance 3 = Scalar invariance 4 = Factor covariance invariance 5 = Factor variance invariance 6 = Error variance invariance	94.0%	Steenkamp and Baumgartner (1998)
Fit	Is the level of assessed purpose of the study? MI appropriate for the	0 = No 1 = Yes	94.9%

Using the title and abstract of each article published in the stated period, two researchers identified articles to be included, following these criteria: (1) analysis of multicountry samples, (2) inclusion of an empirical component, and (3) use of self-report data collected by a survey or in an experiment. Studies using only objective data (e.g., foreign direct investment, per capita income) were excluded. This process produced a sample of 243 articles.

Coding Procedure

The coding scheme is described in Table 1. Two researchers coded articles independently. We assessed intercoder reliability (91.6% agreement) using Perreault and Leigh's (1989) coding criteria. We report indexes for individual coding categories in Table 1. Differences were resolved through discussion.

Column 2 of Table 1 shows specification of the dimensions. If MI was assessed in the study, it was placed in one of two categories. We label the first category “CFA approaches.” These procedures use CFA or similar tools to assess MI (e.g., Mullen 1995; Singh 1995; Steenkamp and Baumgartner 1998). Although CFA approaches can vary in their specific analysis, they all assess MI by testing a multigroup measurement model, which in general is viewed as a high-quality diagnostic tool for evaluating MI (Myers et al. 2000). We refer to the second MI category as “other approaches.” These include use of other analytical efforts to assess MI, such as exploratory factor analysis (Reynolds and Harding 1983), variance checks for floor or ceiling effects (Van de Vivjer and Leung 1997), and score standardization (Cunningham, Cunningham, and Green 1977).

The three dimensions of “purpose,” “level of MI assessment,” and “fit” address Steenkamp and Baumgartner's (1998) recommendation that the level of invariance assessment match the research purpose. For studies using CFA approaches other than Steenkamp and Baumgartner's (1998) method, coders assessed the level of MI by matching the analyses reported in these studies with the different levels of Steenkamp and Baumgartner's methodology.

Results

As noted previously, our screening process identified 243 empirical international articles in 15 major journals from 2000 to 2005. Table 2 presents the number and percentage of articles published in each year. It shows that from 2000 to 2002, the number of cross-national articles grew steadily. Although a sharp decline occurred in 2003, the number of cross-national articles published since 2003 has increased. Table 3 depicts the distribution of articles across the 15 journals included in this study.

Table 2

Cross-National Empirical Marketing Articles per Year

Year	Number of Articles	Percentage
2000	34	13.99
2001	42	7.28
2002	49	20.16
2003	36	14.81
2004	39	16.05
2005	43	17.70
Total	243	100.00

Table 3

Cross-National Empirical Marketing Articles per Journal

Journal	Number of Articles	Percentage
Journal of Marketing	5	2.06
Journal of Marketing Research	4	1.65
Journal of Consumer Research	9	3.70
Journal of Business Research	28	11.52
Journal of Retailing	4	1.65
Marketing Science	1	.41
Journal of the Academy of Marketing Science	4	1.65
Marketing Letters	2	.82
Journal of Advertising	4	1.65
Journal of Advertising Research	4	1.65
Journal of International Business Studies	51	20.99
International Marketing Review	45	18.52
Management International Review	22	9.05
International Journal of Research in Marketing	14	5.76
Journal of International Marketing	25	10.29
European Journal of Marketing	21	8.64
Total	243	100.00

Cross-National MI Assessment

In this section, we review our data set of 243 cross-nationally focused marketing articles to determine (1) whether these studies reported MI testing following data collection; (2) if so, which MI assessment technique was reported; and (3) whether the assessment technique reported fits the research purpose. Table 4 presents our overall findings regarding the use of MI assessment from 2000 to 2005. Of 243 articles, 67 (28%) reported assessing MI.

Table 4

MI Assessment: Overview

	MI Reported
	CFA Approaches			MI Not Reported
	Fit	Not Fit	Other Approaches	MI Not Reported
Number of articles	41	14	12	176
Percentage	16.87	5.76	4.94	72.43

Among studies that reported assessment of MI, 82% employed CFA. Non-CFA methods used in the other studies included (1) exploratory factor analysis (Begley and Tan 2001; Mehta 2001; Neelankavil 2000; Tsang 2002), (2) generalizability theory (Cronbach et al. 1972; Sharma and Weathers 2003), (3) Cronbach's alpha (e.g., Mattila and Patterson 2004; Souchon et al. 2003), (4) profile analysis (carried out by displaying item means for each country and visually checking for equivalence; Morris and Pavett 1992; Souchon et al. 2003), and (5) face validity (Deshpandé, Farley, and Webster 2000). We now turn to a more detailed discussion of CFA use.

According to Steenkamp and Baumgartner (1998), it is essential to assess MI at a level that matches the research purpose. Thus, we also determined whether the MI assessment level matched the study's stated research purpose. For example, if the researchers were interested in examining nomological structure across national samples, determination of metric equivalence is sufficient. In contrast, if the general linear model is to be used, scalar equivalence is needed.

Of 55 CFA studies, 41 reported MI assessment at the level recommended, given research objectives and analyses, whereas 14 other studies did not report MI assessment at the appropriate level. For example, a few studies only reported configural invariance when scalar invariance should have been analyzed and reported as well. Several other studies employed analysis of variance to examine cross-national differences without reporting scalar invariance. In these studies, the potential for bias remains. Overall, our content analysis suggests that the validity of many cross-national empirical marketing studies could be enhanced with more consistent and complete investigation of MI.

Table 5 displays the extent of MI assessment in different journals. Journal of the Academy of Marketing Science ranks first with all four cross-national empirical articles published from 2000 to 2005 reporting MI. Among journals emphasizing cross-national research, International Journal of Research in Marketing has the highest percentage of studies (57%, 8 of 14) that report MI. Journal of International Marketing is second with 48% (12 of 25). We found lower percentages of MI reporting in journals that publish relatively few cross-national studies, including Journal of Marketing Research (0%, 0 of 4) and Journal of Consumer Research (11%, 1 of 9). However, among journals that publish relatively more cross-national studies, there are some with relatively low levels of reported MI assessment (e.g., Journal of International Business Studies, International Marketing Review).

Table 5

MI Assessment per Journal

	MI Reported		MI Not Reported
Journal	CFA Approaches	Other Approaches	MI Not Reported
Journal of Marketing	2 (40%)	0 (0%)	3 (60%)
Journal of Marketing Research	0 (0%)	0 (0%)	4 (100%)
Journal of Consumer Research	1 (11.11%)	0 (0%)	8 (88.89%)
Journal of Business Research	7 (25%)	0 (0%)	21 (75%)
Journal of Retailing	1(25%)	1 (25%)	2 (50%)
Marketing Science	0 (0%)	0 (0%)	1 (100%)
Journal of the Academy of Marketing Science	3 (75%)	1 (25%)	0 (0%)
Marketing Letters	0 (0%)	0 (0%)	2 (100%)
Journal of Advertising	0 (0%)	0 (0%)	4 (100%)
Journal of Advertising Research	0 (0%)	0 (0%)	4 (100%)
Journal of International Business Studies	5 (9.80%)	5 (9.80%)	41 (80.39%)
International Marketing Review	5 (11.11%)	3 (6.67%)	37 (82.22%)
Management International Review	4 (18.18%)	1 (4.55%)	17 (77.27%)
International Journal of Research in Marketing	8 (57.14%)	0 (0%)	6 (42.86%)
Journal of International Marketing	11 (44.00%)	1 (4.00%)	13 (52.00%)
European Journal of Marketing	8 (38.10%)	0 (0%)	13 (61.90%)
Total	55 (22.63%)	12 (4.94%)	176 (72.43%)

Table 6 presents information regarding MI assessment for studies with different research topics. Our analysis shows that the likelihood of reporting CFA approaches varies significantly across different research areas (χ² = 7.86, d.f. = 2, p = .02). Specifically, 23.3% (21 of 90) of the studies in consumer behavior, 20% (29 of 145) of the studies in strategy, and 62.50% (5 of 8) of the studies in methodology reported the use of CFA approaches. Overall, our review indicates that the majority (72.4%) of cross-national empirical marketing studies between 2000 and 2005 did not report MI.

Table 6

MI Assessment by Research Topic

	MI Reported		MI Not Reported
Research Topic	CFA Approaches	Other Approaches	MI Not Reported
Consumer behavior	21 (23.33%)	3 (3.33%)	66 (73.33%)
Strategy	29 (20.00%)	9 (6.21%)	107 (73.79%)
Methodology	5 (62.50%)	0 (0%)	3 (37.50%)
Total	55 (22.63%)	12 (4.94%)	176 (72.43%)

Possible Reasons for Nonadoption of MI Assessment

At least two limitations of the MI assessment technique may have hindered its adoption. First, multigroup analysis might prove daunting when data from a large number of countries are collected (Baumgartner 2004). Multigroup CFA has sample size requirements (Bagozzi and Yi 1989; Bollen 1989; Myers et al. 2000). Thus, obtaining sufficient numbers of respondents in each country site could prove problematic, leading to poor model fit. Second, studies using single-item measures cannot be tested for MI with multigroup CFA (Mullen 1995).

To examine the first possibility, we checked whether MI assessment differed depending on the number of countries included in a study (see Table 7). This analysis showed that the likelihood of reporting CFA approaches varied significantly depending on the number of countries investigated (χ² = 5.44, d.f. = 2, p = .033). Among 21 studies that collected data in more than ten countries, only one reported MI using CFA. This finding suggests that the difficulty of multigroup CFA increases with the number of countries involved. A study conducted by Van Birgelen and colleagues (2002) is the only one with data from ten or more countries to report MI. However, the authors obtained 68 or more observations in each country—a sufficiently large sample in all countries for multigroup CFA.

Table 7

MI Assessment by Number of Countries investigated

	MI Reported
Number of Countries	CFA Approaches	Other Approaches	MI Not Reported
Two countries	30 (27.52%)	3 (2.75%)	77 (70.64%)
Three to ten countries	24 (21.24%)	7 (6.19%)	82 (72.57%)
More than ten countries	1 (4.76%)	2 (9.52%)	18 (85.71%)
Total	55 (22.63%)	12 (4.94%)	176 (72.43%)

To examine the second possible reason for the relatively limited number of cross-national empirical marketing studies reporting MI, we coded each study in the sample on whether single or multiple items were used to compare constructs of interest. Our analysis identified seven studies that used single-item measures. In these studies, MI could not be and was not assessed using multigroup CFA. Combined, these two reasons exemplify challenges that may help explain the relatively limited use of multigroup CFA analyses to establish MI.

Conclusion

In Study 1, we addressed the behavioral component of the question, How are proposed MI approaches received by cross-national empirical marketing scholars? As such, we attempted to learn more about what cross-national empirical marketing scholars actually do with regard to MI testing. Notably, we found an increase in published cross-national empirical marketing research during the examined period and a paralleling trend of increased MI reporting. Although this trend is encouraging, only 28% of all articles reviewed reported MI. Furthermore, of the articles that did report MI results, 25.5% did not report MI assessment at the recommended level.

Given the results of Study 1, it is reasonable to question why MI results are not reported in the majority of the cross-national empirical studies under examination. In part, the answer can be found by better understanding what cross-national empirical marketing scholars think about MI assessment: Were the authors unaware of the necessity of MI assessment? Did they lack the statistical knowledge to conduct MI assessment? Did they decide to leave out MI assessment because it was detrimental to their results? Were they advised by reviewers or editors not to report MI assessment results? To answer these questions, we conducted Study 2.

Study

2 Sample

We collected data using an online questionnaire. The initial sample consisted of all the authors of the 243 articles analyzed in Study 1. Excluding overlaps (i.e., same author across articles) and authors for whom we could not find a valid e-mail address, we identified 335 unique marketing scholars. We sent these scholars an e-mail, inviting them to participate in an online survey about cross-national MI assessment. We sent a follow-up e-mail after approximately four weeks. This procedure resulted in a response rate of 26% and 86 usable surveys.

Measures

In the online questionnaire, we asked respondents to rate their knowledge of MI assessment with three items on seven-point scales. We also asked respondents to select the MI approaches they believed to be capable of establishing cross-national MI (from a list of ten approaches that we had identified in Study 1). Following this, we asked respondents whether they had reported MI assessments in their own cross-national empirical studies and, if so, which approaches they had employed. We also asked them about their reasons for not reporting MI results as well as for their feedback from the reviewers and editors regarding nonreporting of MI assessment. Finally, we asked about their general attitudes toward MI assessment.

Results

MI Knowledge

Although all scholars who participated in Study 2 had published cross-national empirical research (i.e., data were collected in more than one country), their self-reported knowledge on MI assessment was relatively low (M = 4.51 on a seven-point scale, where 7 = “in-depth knowledge”). In addition, reported experience with conducting MI tests was limited. For example, 17.4% of the respondents had never assessed MI, and 30.2% had assessed MI only once or twice (see Table 8). Consequently, almost 50% of the sample had no or limited experience with the method.

Table 8

Experience on MI Assessment

	Number of Respondents	Percentage
Never	15	17.40
1–2 times	26	30.20
3–5 times	30	34.90
6–10 times	5	5.80
11–20 times	7	8.10
More than 20 times	2	2.30
Don't know	1	1.20
Total	86	100.00

Notably, respondents did not view MI assessment as particularly important (M = 4.12 on a seven-point scale, where 7 = “very important”). However, MI knowledge was positively associated with respondents’ ratings of importance. Respondents who reported higher levels of MI assessment training were more likely to believe that establishment of MI is critical to the validity of a cross-national empirical study (standardized β = .47, t = 4.88, p < .001, adjusted R² = .21). These results imply that one driver of the limited MI assessment reporting in cross-national empirical studies may be insufficient knowledge.

Rating of Alternative MI Assessment Approaches

Table 9 presents respondents’ perceived validity and preferences of different MI assessment approaches. Almost 56% of all respondents (48 of 86) believed that the CFA approach was capable of ensuring the validity of cross-national data. The item response theory (IRT) approach (21%) and face validity approach (17%) were second and third, respectively. Notably, 14% of all respondents viewed none of the ten MI approaches as valid. Instead, they proposed other methods, such as multigroup causal analysis or maximum difference scaling, as possible means to assess MI.

Table 9

MI Assessment Approaches

MI Approaches	Approaches Believed to Be Valid	Approaches Employed
CFA	48 (55.81%)	42 (82.35%)
Exploratory factor analysis	14 (16.28%)	14 (27.45%)
Generalizability theory	10 (11.63%)	4 (7.84%)
IRT	18 (20.93%)	4 (7.84%)
Rasch's (1960) measurement theory	6 (6.98%)	3 (5.88%)
Score standardization	6 (6.98%)	8 (15.69%)
Optimal scaling	7 (8.14%)	3 (5.88%)
Compare Cronbach's alpha	8 (9.30%)	13 (25.49%)
Profile analysis	7 (8.14%)	3 (5.88%)
Face validity	15 (17.44%)	11 (21.57%)
None of the methods listed below are capable of establishing the validity of cross-national MI.	12 (13.95%)	—

In confirmation of Study 1's findings, a high percentage of cross-national empirical marketing scholars had not reported MI assessment results in their published cross-national empirical marketing research. Indeed, 58% of the respondents stated that they had not reported MI assessment results in all of their published cross-national empirical research, and 15% said that they had never included such information. When reported, CFA was the most frequently mentioned MI assessment method (82.35%), followed by exploratory factor analysis (27.45%) and Cronbach's alpha (25.49%). Significantly, the IRT approach, though believed to be a valid method by approximately 21% of the sample, was reportedly used by only 7.84%.

Reasons for Not Reporting MI Assessment Results

As we noted previously, approximately 58% of all respondents stated that they did not report MI assessment results in all of their cross-national empirical research. Respondents offered three explanations for this: (1) The data were not conducive to MI assessment (32%), (2) MI assessment was not viewed as necessary (32%), and (3) familiarity with MI assessment methodology was insufficient. In addition, 10% of the respondents conducted MI assessment but did not report the results (see Table 10). Of the scholars in our sample who did not include MI assessment in their study, 72% stated that neither reviewers nor editors mentioned the need for such information during the review process.

Table 10

Reasons for Not Reporting MI

	Number of Respondents	Percentage
I (we) didn't know about measurement invariance assessment.	7	14
I (we) didn't have enough familiarity with measurement invariance techniques to conduct measurement invariance assessment.	14	28
The data in my (our) study weren't conducive to measurement invariance assessment.	16	32
I (we) conducted measurement invariance assessment but didn't report it.	5	10
I (we) didn't think that measurement invariance assessment was necessary.	16	32
I (we) didn't believe in the validity of measurement invariance assessment.	2	4

In conclusion, Study 2 implies that the relatively slow growth of MI assessment in cross-national empirical research, as identified in the Study 1 literature review, may be due to a lack of MI knowledge. The technological sophistication of the MI approaches also might hinder their use. As one respondent noted, MI assessment tools “are so complex and time-consuming that the focus of our research work could end up deviating from substantive issues.”

General Discussion

Cross-national empirical marketing researchers, by definition, collect data in more than one country. Often, these data are then compared to determine the extent of national or cultural similarities and differences. This can be problematic, however, if perceptions of the measurement scale are dissimilar. To ensure valid analyses, cross-national empirical researchers have proposed several methods of MI assessment (e.g., Steenkamp and Baumgartner 1998).

Given the role of MI in establishing cross-national comparative validity, it is important to know how frequently MI is assessed and whether it is analyzed correctly. It is also important to know what cross-national empirical marketing scholars think about MI. Our findings are surprising. In general, we found that MI assessment has been repeatedly recommended in the cross-national empirical marketing literature (e.g., Craig and Douglas 2000; Mullen 1995; Myers et al. 2000; Steenkamp and Baumgartner 1998). Yet only 28% of the published cross-national empirical marketing studies from 2000 to 2005 reported doing this, and of these, approximately 75% did so appropriately, according to our review. With regard to our second purpose, we found that the low adoption rate may be due in part to cross-national empirical marketing scholars’ lack of MI knowledge coupled with the perceived sophistication of the different MI approaches.

Consideration of the current distribution channels of academic technology suggests possible strategies to increase awareness and understanding of cross-national MI assessment approaches. First, conferences and colloquia could provide vehicles for wider exposure to MI techniques. Second, special journal issues on MI assessment and related approaches might generate further discussion. Third, the gatekeepers in the marketing discipline (e.g., editors, reviewers) might consider making MI assessment (or some other generally accepted approach set) an important criterion during the evaluation of cross-national empirical manuscripts.

Our findings also suggest that cross-national empirical marketing scholars should continue to conduct research on this topic to further help disseminate the assessment of MI in cross-national empirical research. The slow adoption of MI assessment approaches found in Study 1 might be partially due to the sheer number of possible approaches. To illustrate, cross-national empirical marketing scholars can choose among multigroup CFA (Mullen 1995; Myers et al. 2000; Singh 1995; Steenkamp and Baumgartner 1998), the IRT approach (De Jong, Steenkamp, and Fox 2007; Lord and Novick 1969), measurement theory (Ewing, Salzberger, and Sinkovics 2005; Rasch 1960), and so on. The problem is that there are no agreed-on standards. This no doubt contributes to uncertainty among researchers, reviewers, and editors. Further research should compare different approaches and identify contingent factors that support the use of one method or another.

The respondents in Study 2 repeatedly called for a simple MI assessment approach. To date, most methodological studies on MI are somewhat complex. A straightforward manual might be helpful for cross-national empirical marketing scholars. Method simplification is also important for cross-national marketing managers who need to understand and adopt MI assessment approaches as well. For example, “a checklist for establishing data equivalence,” as Hult and colleagues (in press) propose, might be helpful.

With this said, it is unclear whether failure to assess MI always leads to false results. One respondent pointed out that it would be useful to show that MI assessment has a significant impact on a researcher's results and conclusions. Therefore, the potential for bias exists without assessing MI. However, researchers in the future should empirically investigate the question whether the failure to address MI is a fatal flaw or a study limitation, specifically the extent to which nonassessment of MI has produced inaccurate hypotheses tests and potentially incorrect conclusions. A meta-analysis might be an appropriate means to investigate this issue.

Despite repeated calls to report MI assessment, our research reveals a somewhat surprising reality—namely, limited reports of MI in cross-national empirical marketing articles and a lack of MI knowledge among cross-national marketing scholars. This reality raises questions about the validity of many cross-national empirical marketing studies. Establishing methodological standards for all published cross-national empirical marketing articles (e.g., reporting MI assessment) would increase confidence in and respect for the field. This effort would require collective efforts on the parts of scholars, conference organizers, reviewers, and editors. However, the effort required to make MI assessment standard practice would surely yield returns that far exceed the initial investment. A few international marketing journals (e.g., Journal of International Marketing, International Journal of Research in Marketing) already have led the way. We hope that our results will help convince others to follow.

References

Alden

Dana L.

, Steenkamp

Jan-Benedict E.M.

, and Batra

Rajeev

(2006), “Consumer Attitudes Toward Marketing Globalization: Antecedent, Consequent and Structural Factors,” International Journal of Research in Marketing, 23(3), 227–39.

Aulakh

Preet S.

, and Kotabe

Masaaki

(1993), “An Assessment of Theoretical and Methodological Development in International Marketing: 1980–1990,” Journal of International Marketing, 1(2), 5–28.

Bagozzi

Richard P.

, and Yi

Youjae

(1989), “On the Use of Structural Equation Models in Experimental Design,” Journal of Marketing Research, 26(August), 271–84.

Baumgartner

Hans

(2004), “Issues in Assessing Measurement Invariance in Cross-National Research,” paper presented at Sheth Foundation/Sudman Symposium on Cross-Cultural Survey Research, University of Illinois (October 1).

Baumgartner

Hans

, and Steenkamp

Jan-Benedict E.M.

(1998), “Multiple-Group Latent Variable Models for Varying Numbers of Items and Factors with Cross-National and Longitudinal Applications,” Marketing Letters, 9(1), 21–35.

Begley

Thomas M.

, and Tan

Wee-Liang

(2001), “The Socio-Cultural Environment for Entrepreneurship: A Comparison Between East Asian and Anglo-Saxon Countries,” Journal of International Business Studies, 32(3), 537–53.

Bollen

Kenneth

(1989), Structural Equations with Latent Variables. New York: John Wiley & Sons.

Byrne

B.M.

, Shavelson

R.J.

, and Muthén

Bengt

(1989), “Testing for the Equivalence of Factor Covariance and Mean Structures: The Issue of Partial Measurement Invariance,” Psychological Bulletin, 105(3), 456–66.

Cheung

Gordon W.

, and Rensvold

Roger B.

(1999), “Testing Factorial Invariance Across Groups: A Reconceptualization and Proposed New Method,” Journal of Management, 25(1), 1–27.

10.

Craig

C. Samuel

, and Douglas

Susan P.

(2000), International Marketing Research, 2d ed. New York: John Wiley & Sons.

11.

Cronbach

Lee J.

, Gleser

Goldine C.

, Nanda

Harinder

, and Rajaratnam

Nageswari

(1972), The Dependability of Behavioral Measurements: Theory of Generalizability for Scores and Profiles. New York: John Wiley & Sons.

12.

Cunningham

William

, Cunningham

Isabella C.M.

, and Green

Robert T.

(1977), “The Ipsative Process to Reduce Response Set Bias,” Public Opinion Quarterly, 41(Fall), 379–94.

13.

De Jong

Martijn

, Steenkamp

Jan-Benedict E.M.

, and Fox

Jean-Paul

(2007), “Relaxing Measurement Invariance in Cross-National Consumer Research Using a Hierarchical IRT Approach,” Journal of Consumer Research, 34(2), 260–78.

14.

Deshpandé

Rohit

, Farley

John U.

, and Webster

Frederick E.

Jr. , (2000), “Triad Lessons: Generalizing Results on High Performance Firms in Five Business-to-Business Markets,” International Journal of Research in Marketing, 17(4), 353–62.

15.

Ewing

Michael T.

, Salzberger

Thomas

, and Sinkovics

Rudolf R.

(2005), “An Alternative Approach to Assessing Cross-Cultural Measurement Equivalence in Advertising Research,” Journal of Advertising, 34(1), 17–36.

16.

Griffith

David A.

, Myers

Matthew B.

, and Harvey

Michael G.

(2006), “An Investigation of National Culture's Influence on Relationship and Knowledge Resources in Interorganizational Relationships Between Japan and the United States,” Journal of International Marketing, 14(3), 1–32.

17.

Griffith

David A.

, and Schuster

Camille

(2002), “Before Measurement Equivalence: Ensuring Conceptual Equivalence,” in American Marketing Association Summer Educators’ Conference Proceedings, Vol. 13, Jack

A. Lingren

, and Kehoe

William J.

, eds. Chicago: American Marketing Association, 315.

18.

Holt

Douglas B.

, Quelch

John A.

, and Taylor

Earl L.

(2004), “How Global Brands Compete,” Harvard Business Review, 82(9), 68–75.

19.

Hui

C. Harry

, and Triandis

Harry C.

(1985), “Multidimensional Scaling and Item Response Theory,” in Cross-Cultural and National Studies in Social Psychology, Diaz Buerrero

, ed. Amsterdam: North Holland, 17–31.

20.

Hui

C. Harry

, and Triandis

Harry C.

(1989), “Measurement in Cross-Cultural Psychology: A Review and Comparison of Strategies,” Journal of Cross-Cultural Psychology, 16(2), 131–52.

21.

Hult

, Tomas

, Ketchen

David J.

Jr. , Griffith

David A.

, Finnegan

Carol A.

, Gonzalez-Padron

Tracy L.

, Harmancioglu

Nukhet

(in press), “Data Equivalence in Cross-Cultural International Business Research: An Assessment and Guidelines,” Journal of International Business Studies, forthcoming.

22.

Lord

Frederic M.

, and Novick

Melvin R.

(1968), Statistical Theories of Mental Test Scores. Reading, MA: Addison-Wesley.

23.

Malhotra

Naresh K.

, and Agarwal

James

(1996), “Methodological Issues in Cross-Cultural Marketing Research,” International Marketing Review, 13(5), 7–33.

24.

Mattila

Anna S.

, and Patterson

Paul G.

(2004), “The Impact of Culture on Consumers’ Perceptions of Service Recovery Efforts,” Journal of Retailing, 80(3), 196–206.

25.

Mehta

Rajiv

(2001), “Leadership and Cooperation in Marketing Channels: A Comparative Empirical Analysis of the USA, Finland and Poland,” International Marketing Review, 18(6), 633–66.

26.

Morris

Tom

, and Pavett

Cynthia M.

(1992), “Management Style and Productivity in Two Cultures,” Journal of International Business Studies, 23(1), 169–79.

27.

Mullen

Michael R.

(1995), “Diagnosing Measurement Equivalence in Cross-National Research,” Journal of International Business Studies, 26(3), 573–96.

28.

Myers

Matthew B.

, Calantone

Roger J.

, Page

Thomas J.

Jr. , and Taylor

Charles R.

(2000), “An Application of Multiple-Group Causal Models in Assessing Cross-Cultural Measurement Equivalence,” Journal of International Marketing, 8(4), 108–121.

29.

Nakata

Cheryl

, and Huang

Yili

(2005), “Progress and Promise: The Last Decade of International Marketing Research,” Journal of Business Research, 58(5), 611–18.

30.

Neelankavil

James P.

(2000), “Determinants of Managerial Performance: A Cross-Cultural Comparison of the Perceptions of Middle-Level Managers in Four Countries,” Journal of International Business Studies, 31(1), 121–40.

31.

Perrault

William D.

Jr. , and Leigh

Laurence E.

(1989), “Reliability of Nominal Data Based on Qualitative Judgments,” Journal of Marketing Research, 26(May), 135–48.

32.

Rasch

George

(1960), Probabilistic Models for Some Intelligence and Attainment Tests. Copenhagen: Danish Institute for Educational Research.

33.

Reynolds

Cecil R.

, and Harding

Richard E.

(1983), “Outcome in Two Large Sample Studies of Factorial Similarity Under Six Methods of Comparison,” Educational and Psychological Measurement, 43(3), 723–28.

34.

Reynolds

N.L.

, Simintiras

A.C.

, and Diamantopoulos

(2003), “Theoretical Justification of Sampling Choices in International Marketing Research: Key Issues and Guidelines for Researchers,” Journal of International Business Studies, 34(1), 80–89.

35.

Riordan

Christine M.

, and Vandenberg

Robert J.

(1994), “A Central Question in Cross-Cultural Research: Do Employees of Different Cultures Interpret Work-Related Measures in an Equivalent Manner?” Journal of Management, 20(3), 643–71.

36.

Sekaran

Uma

(1983), “Methodological and Theoretical Issues and Advancements in Cross-Cultural Research,” Journal of International Business Studies, 14(2), 61–73.

37.

Sharma

Subhash

, and Weathers

Danny

(2003), “Assessing Generalizability of Scales Used in Cross-National Research,” International Journal of Research in Marketing, 20(3), 287–95.

38.

Sin

Leo Y.M.

, and Cheung

Gordon W.H.

(1999), “Methodology in Cross-Cultural Consumer Research: A Review and Critical Assessment,” Journal of International Consumer Marketing, 11(4), 75–96.

39.

Singh

Jagdip

(1995), “Measurement Issues in Cross-National Research,” Journal of International Business Studies, 26(3), 597–619.

40.

Souchon

Anne L.

, Diamantopoulos

Adamantios

, Holzmüller

Hartmut H.

, Axinn

Catherine N.

, Sinkula

James M.

, Simmet

Heike

, and Durden

Geoffrey R.

(2003), “Export Information Use: A Five-Country Investigation of Key Determinants,” Journal of International Marketing, 11(3), 106–127.

41.

Steenkamp

Jan-Benedict E.M.

, and Baumgartner

Hans

(1998), “Assessing Measurement Invariance in Cross-National Consumer Research,” Journal of Consumer Research, 25(1), 78–90.

42.

Tsang

Eric W.K.

(2002), “Sharing International Joint Venturing Experience: A Study of Some Key Determinants,” Management International Review, 42(2), 183–205.

43.

Van Birgelen

Marcel

, de Ruyter

, de Jong

, and Wetzels

Martin

(2002), “Customer Evaluations of After-Sales Service Contact Modes: An Empirical Analysis of National Culture's Consequences,” International Journal of Research in Marketing, 19(1), 43–64.

44.

Van der Vijver

Fon

, and Leung

Kwok

(1997), Methods and Data Analysis for Cross-Cultural Research. London: Sage Publications.

45.

Van Herk

Hester

, Poortinga

Ype H.

, and Verhallen

Theo M.M.

(2005), “Equivalence of Survey Data: Relevance for International Marketing,” European Journal of Marketing, 39(3-4), 351–64.