Zum Hauptinhalt springen

A Comprehensive Psychometric Examination of the Lesbian, Gay, and Bisexual Knowledge and Attitudes Scale for Heterosexuals (LGB-KASH).

Raju, D ; Beck, L ; et al.
In: Journal of homosexuality, Jg. 66 (2019), Heft 8, S. 1104
Online academicJournal

A Comprehensive Psychometric Examination of the Lesbian, Gay, and Bisexual Knowledge and Attitudes Scale for Heterosexuals (LGB-KASH) 

Heterosexual attitudes toward individuals with other sexual orientations and identities, such as lesbian, gay, bisexual, transgender, intersex, asexual, etc. (LGBTQIA+), have been conceptualized as multidimensional and wide-ranging. With the objective of measuring these attitudes, Worthington and colleagues developed the Lesbian, Gay, and Bisexual Knowledge and Attitudes Scale for Heterosexuals (LGB-KASH). In this article, 540 college students were administered the LGB-KASH; these data were examined for their psychometric properties of the instrument using a sequential approach. The first step included descriptive analysis, including internal consistency measures and association rules, followed by the evaluation of dimension structure with factor analysis (confirmatory/exploratory), and, finally, item analysis using non-parametric (Mokken scale analysis) and parametric approaches (item response theory). Specific to the LGB-KASH factor structure, the current structure was not supported by these data; however, alternative subsets of items that could be used to revise the original instrument to obtain a more psychometrically sound measure are suggested.

Keywords: LGBTQIA+; psychometric analysis; association rule analysis; item response theory (IRT); Mokken scale analysis

In recent decades, numerous social, political, economic, and other gains have been achieved within and for the LGBTQIA+ community (lesbian, gay, bisexual, trans*, queer/questioning, and all other non-heteronormative and non-gender-normative identities). Increased awareness of and improved social attitudes toward these individuals have motivated these changes. Yet there is still plenty of work to be done to increase cultural competency toward LGBTQIA+ identified individuals.

It is useful to acknowledge the ever changing and evolving language of LGBTQIA+. LGBTQIA+ is a common acronym for the collective of individuals who identify as lesbian, gay, bisexual, trans*, queer/questioning, intersex, asexual, and others who do not claim normative gender and heterosexual identities. For the purpose of this study, the term queer will be used as the umbrella term to refer to individuals who may in any way identify as non-normative concerning their gender or sexuality. Even the LGBTQIA+ acronym may seemingly grant priority to certain identities within this broad population. It would be irresponsible to ignore that this term has long held a derogatory meaning and could potentially trigger unpleasant feelings; however, the term queer has been reclaimed as a term of empowerment in recent decades, especially since the development of academic "queer theory." This theory uses the term to challenge the categorization of gender and sexuality and expands the discussion beyond the commonly referenced "gay" and "lesbian" identities to be inclusive of all identities that deviate from the normative or dominant. Terms involving the "homo-" prefix (e.g., homosexual, homophobia, homonegativity) will be avoided unless it is used to mirror language used in previous research, as it is indicative of the restrictive notion that the individuals referenced have a romantic or sexual attraction toward or participate in sexual activity with someone of the same sex or gender. Also, this prefix maintains a negative connotation from previous decades in which homosexuality was still classified as a mental disorder by the American Psychiatric Association (APA).

Although queer is often used as a broad and encompassing term when discussing both sexual orientation and gender identity, the terms and language used within and about the LGBTQIA+ community is immense, and there are many other more specific terms that could be discussed. Due to the nature of this study attempting to create a measure that is more inclusive of non-normative gender identities, two terms that warrant further clarification are trans*/transgender and cisgender. Trans* or transgender is another umbrella term that refers to people whose self-identification, anatomy, appearance, manner, expression, behavior, and/or other perceptions differ from conventional or cultural expectations of congruent gender expression and designated birth sex. It should be noted that not everyone whose appearance or behavior is gender-atypical will self-identify as transgender. The asterisk (*) on trans* is alluding to the common technique used when searching through databases where an asterisk on the end of a prefix will return all the terms that include that prefix. For example, trans* could refer to transgender, transsexual, transman, or transwoman. but in general it is considered inclusive of all non-cisgender identities. Cisgender is a term that refers to someone who identifies either by nature or by choice with the sex and gender they were assigned at birth and who conforms to the mainstream gender-based societal expectations. This is also often referred to as "gender normative." This highlights the difficulty in trying to stay current in defining and measuring attitudes with this quickly evolving minority population.

Minority stress model

A recent meta-analysis shows that lesbian, gay, and bisexual individuals experience a higher rate of mental disorders among other negative outcomes such as increased distress, higher rates of suicide attempts, and a greater likelihood of substance abuse (Meyer, [18]). This research is careful and deliberate when explaining that an LGBTQIA+ identity itself is not a disorder or considered negative or causal of such outcomes, but the minority stress model proposes that these results can be attributed to the excess social stressors related to the stigma and prejudice of the dominant-minority conflict. Dentato ([6]) generally defined minority stress as "the relationship between minority and dominant values and resultant conflict with the social environment experienced by minority group members." In other words, the minority stress model asserts that attitudes and behaviors from the dominant group create explicit and implicit biases, prejudices, and discrimination against minority group members. For almost two decades, an adapted version of the minority stress model has been proposed to help explain several of these negative outcomes experienced by members of the queer community (Meyer, [16], [17]).

Attitude–behavior relationship

Researchers have been measuring attitudes for decades and have consistently found a relationship between one's attitude and their behaviors (see Cialdini, Petty, & Cacioppo, [5], for a review). The cyclical relationship between attitudes, the creation of stereotypes and stigmas, and the consequent behaviors are all contributing factors to the aforementioned negative outcomes as explained by the minority stress model. It is important to have valid and reliable measures of such attitudes and behavior from the dominant culture. Furthermore, understanding the personal treatment of LGBTQIA+ individuals has broader and more systematic implications such as the shifting of social norms and their influence on policy as well as meeting physical and mental health care needs of LGBTQIA+ individuals by culturally sensitive providers. Professional organizations such as the American Psychological Association's (American Psychological Association, [3]) guidelines for practice with LGBTQIA+ individuals and the U.S. Department of Health and Human Services have set goals to advance LGBT health in Healthy People 2020. The ability to evaluate the care administered is an integral part of determining if practice guidelines and goals are being met.

To evaluate the competency of professionals, improve mental and physical health outcomes for LGBTQIA+ individuals, and assess individual and social attitudes, it is imperative to have a reliable, culturally sensitive, and relevant assessment tool for both public and research use. In a previous study, Beck and colleagues (2014) examined the potential utility of a campus Safe Zone program as an intervention to increase the cultural competency of training clinicians in providing mental health services to individuals from the queer community. These researchers noted difficulty formulating a pre-/post-test design due to limitations in measurement options. Specifically, they noted most of the current measures assess attitudes toward specific identities that may fall under the queer label such as the Riddle Scale (Riddle, [24]), the Nungesser Homosexual Attitudes Inventory–Revised (Shidlo, [28]), the Homosexuality Attitude Scale (Kite & Deaux, [11]), the Attitudes Toward Lesbians and Gay Men Scale (Herek, [9]), and the Lesbian, Gay, and Bisexual Knowledge and Attitudes Scale for Heterosexuals (LGB-KASH; Worthington, Dillon, & Becker-Schutte, [38]). Unfortunately, these fail to capture the full spectrum of the community within one tool. The LGB-KASH is to date the most evolved and nuanced measure of attitudes toward the LGB population and, subsequently, was used in the study. However, due to its psychometric properties and lack of a more contemporary conceptualization of sexuality, Beck and colleagues (2014) cautioned that it could still present significant problems.

Current study: LGB-KASH

The aim of the current study was to assess the psychometric properties and, if warranted, provide recommendations for improved constructs for the LGB-KASH (Worthington et al., [38]), as well as, if warranted, provide recommendations for improving this instrument. The measure contains 28 items and includes five scales: Hate, LGB Knowledge, Religious Conflict, LGB Civil Rights, and Internalized Affirmativeness. Item scores consist of ratings on a 1 to 6 ordinal Likert scale (1 = very uncharacteristic of me or my views; 6 = very characteristic of me or my views). Table 1 shows the contents of the 28 items composing the five subscales/dimensions of the LGB-KASH. Higher scores in the items composing the Hate and Religious Conflict scales are associated with negative perceptions (more hate toward and conflict with non-heterosexual individuals), whereas higher scores in the items composing the LGB Knowledge, LGB Civil Rights, and Internalized Affirmativeness scales are associated with more openness toward and acceptance of non-heterosexual individuals.

Table 1. Descriptive statistics for items and scales and measures of internal consistency (N = 540).

Dimension (Cronbach's alpha)Item labelItem ContentMean (SD)Cronbach's alpha if item is droppedCorrected Item-Dimension Correlation*
Hate (0.74)1.59(0.68)
H1It is important to me to avoid LGB individuals.1.64 (1.00)0.660.64
H2LGB people deserve the hatred they receive.1.30(0.82)0.720.46
H3I would be unsure what to do or say if I met someone who is openly LGB.1.74(1.16)0.690.55
H4I sometimes think about being violent toward LGB people.1.17(0.55)0.740.37
H5Hearing about a hate crime against a LGB person would not bother me.1.93(1.30)0.740.40
H6I would feel self-conscious greeting a known LGB person in a public place.1.76 (1.15)0.680.56
Knowledge (0.81)1.72(0.80)
K1I am knowledgeable about the history and mission of the PFLAG organization.1.37(0.82)0.760.66
K2I am knowledgeable about the significance of the Stonewall Riot to the Gay Liberation Movement.1.67(1.12)0.740.68
K3I am familiar with the work of the National Gay and Lesbian Task Force.1.60(1.01)0.760.63
K4I could educate others about the history and symbolism behind the "pink triangle."1.46(0.94)0.770.60
K5I feel qualified to educate others about how to be affirmative regarding LGB issues.2.55(1.36)0.820.49
LGB Rights (0.89)4.43(1.38)
C1Health benefits should be available equally to same sex partners as to any other couple.4.87(1.44)0.870.75
C2Hospitals should acknowledge same sex partners equally to any other next of kin.4.65(1.58)0.860.79
C3I think marriage should be legal for same sex couples.3.95(1.95)0.870.74
C4It is wrong for courts to make child custody decisions based on a parent's sexual orientation.4.44(1.64)0.870.71
C5It is important to teach children positive attitudes toward LGB people.4.25(1.63)0.870.71
Rel. conflict (0.7)2.87(0.95)
R1I conceal my negative views toward LGB people when I am with someone who doesn't share my views.2.89(1.72)0.640.53
R2I keep my religious views to myself in order accept LGB people.2.88(1.58)0.700.31
R3I try not to let my negative beliefs about LGB people harm my relationships with the LGB individuals I know.3.52(1.84)0.620.57
R4I have difficulty reconciling my religious views with my interest in being accepting of LGB people.2.23(1.37)0.690.33
R5I can accept LGB people even though I condemn their behavior.3.62(1.66)0.640.53
R6I conceal my positive attitudes toward LGB people when I am with someone who is homophobic.2.39(1.37)0.710.22
R7I have conflicting attitudes or beliefs about LGB people.2.55(1.48)0.680.39
Internalized Affirmative (0.8)2.42(1.24)
I1I have had sexual fantasies about members of my same sex.1.61(1.29)0.780.49
I2Feeling attracted to another person of the same sex would not make me uncomfortable.2.36(1.72)0.780.51
I3I would display a symbol of gay pride (pink triangle, rainbow, etc.) to show support of the LGB community.2.35(1.68)0.720.68
I4I have close friends who are LGB.3.28 (1.93)0.780.51
I5I would attend a demonstration to promote LGB civil rights.2.50(1.69)0.710.73

  • 12 Note. *Correlations between each item and the subscale scores computed without the item,
  • 13 LGB = lesbian/gay/bisexual.

The instrument was developed in a series of four studies reported in the same article that consisted of: item generation, preliminary item screening, and exploratory examination of structure (study 1, = 422); estimation of internal consistency and testing of measurement model structure (study 2, = 574); estimation of test-retest reliability and convergent validity (study 3, = 45); and examination of sensitivity to differences across sexual orientation identities (study 4, = 190). It is noteworthy that study 2 was conducted using confirmatory factor analysis and tested three hypothetical structure models for the 28 items, but none of these models resulted in satisfactory fit to the data by a number of fit indices. Among these three models, the model that provided the best fit was a 5-factor model with intercorrelated factors. Worthington and colleagues ([38]) concluded that this 5-factor model provided a fair fit to the data, which was selected as the preliminary structure for the instrument. Further, these researchers also concluded that there was the possibility of improvement in confirmatory goodness-of-fit via model re-specification, and that with respect to the stability and validity of the instrument.

In order to meet the study aim (see Figure 1), this comprehensive psychometric evaluation included: (1) descriptive analysis including internal consistency measures and association rules (a data mining technique); (2) evaluation of dimension structure with factor analysis (confirmatory and exploratory factor analysis); and (3) item analysis with non-parametric (Mokken scale analysis) and parametric ordinal item response theory (IRT) methodologies. This final step ensured the robustness of the recommendations provided.

Graph: Figure 1. Data analysis steps used in the study.

Participants/procedure

This psychometric study had institutional review board approval from a large university from the southeastern United States. Six hundred college students in an Introduction to Psychology course were enrolled and administered questionnaires. Items were in random order, and a quality control indicator was included to ascertain if participants were responding randomly. The students participated in research as part of the course curriculum. The study was posted on the course Web page. Overall, the sample was 19 years old on average (SD = 1.2 years) and predominately identified as White/Caucasian (79.5%; Black or African American: 13.6%, Hispanic or Latina/Latino: 1.2%, Mixed or Other: 4.9%). Approximately two thirds of the participants identified as female (66.8%), one third as male (32.5%), but just under 1% identified as trans* or other (0.7%). As expected, a large majority of the participants identified as heterosexual or straight (87.1%), while the remaining 13% endorsed asexual (8.2%), gay or lesbian (1.0%), bisexual (1.7%), or queer, other, or a combination of sexual orientation identities (1.7%). Although 600 students responded to the questionnaire, a final sample of 540 was obtained after excluding questionnaires with incorrect answers to the quality control item, excluding LGBTQ respondents, and those with seven (at least 75%) or more missing item responses.

Data analysis

A combination of classical psychometric methods, a data mining approach, and parametric and non-parametric IRT methods was used for testing, exploring the instrument structure, and examining the usefulness of each item. Combining two analytical paradigms in performing item analysis (parametric and non-parametric) offers advantages in quality of the resulting instrument because classical psychometric analysis focuses on estimating and testing the dimensional structure of the items, whereas the IRT methods focus on the performance of the items in classifying subjects along an underlying latent continuum (Kelly, Kallen, & Suarez-Almazor, [10]; Kyngdon, [12]; Waugh & Chapman, [37]). The analysis strategy used in this study is summarized in Figure 1. The analyses were conducted using SAS software (version 9.4) and the R environment for statistical computing (version 3.1.2).

Step 1: Descriptive analysis

Descriptive statistics such as mean, standard deviation, and median were computed for each item and scale. Cronbach's alpha (internal consistency measure), change in alpha when an item is dropped, and corrected item-dimension correlations (correlations between each item and the dimension/subscale scores computed without the item) were estimated. Association rule analysis was conducted on the data. This is an exploratory data mining technique for any type of categorical data that provides sets of rules for co-occurring situations. The a priori association rule algorithm searches for the most frequently occurring item set in the dataset (Agrawal, Imieliński, & Swami, [2]). All 28 items were used to generate the association rules. The simplest association rules consist of two variables with their respective category that are related, an antecedent variable and a consequent variable (but no causality is implied). For example, gender (female) ≤LGTBQ+ acceptance (yes) implies that when gender is female (antecedent variable), then "yes" they accept LGTBQ+ individuals (consequent variable). The strength of these associations is measured by three quantities that are referred to in the data mining literature as support, confidence, and lift. A support = 60%, confidence = 90%, and lift = 2.1 in the above example can be interpreted as follows: the set (gender (female) = >LGTBQ+ acceptance (yes)) occurs 60% (support) in the complete dataset. Among the respondents who were female, 90% (confidence) accepted LGTBQ+ individuals, and respondents who are females are twice or 2.1 (lift) times more likely than males to accept LGTBQ+ individuals. The generated rules are descriptive/exploratory in nature and can be used after psychometric analysis in editing or removal of the item (Swiger, Raju, Breckenridge-Sproat, & Patrician, [31]). The association analysis was conducted with the R package arules (Hahsler, Grun, & Hornik, [8]).

Step 2: Analysis of factor structure

A confirmatory factor analysis (CFA) was conducted to test the factor structure of the LGB-KASH using a structural equation model fitted with maximum likelihood, and the Satorra-Bentler scaled test statistic and robust standard errors (Rosseel, [26]) were used to correct for non-normality if present. The model fit was assessed using the ratio of the χ2 statistic by its degrees of freedom (χ2/df) as well as other commonly used indices such as comparative fit index (CFI), root mean square error of approximation (RMSEA), and standardized root mean square residual (SRMR). The typically used values of χ2/df smaller than two, CFI values greater than 0.95, RMSEA values smaller than 0.06, and SRMR values smaller than 0.08 are all indicative of good fit (Tabachnick & Fidell, [32]). The CFA was fitted in R using the lavaan package (Rosseel, [26]). Because the original factor structure was not supported by the CFA, the underlying structure was explored using an oblique-rotated exploratory factor analysis (EFA) with principal axes extraction and squared multiple correlations as estimates of the prior communalities, and restricted to factors with eigenvalues ≥1. The package psych (Revelle, [23]) was used to perform EFA analysis.

Step 3: Item performance evaluation with IRT methods

Non-parametric methods and parametric models were used to examine item performance.

Non-parametric method

This involved the employment of several sub-steps.

Mokken Scale Analysis (MSA). MSA (Mokken, [19]; Sijtsma & Molenaar, [29]) is a non-parametric approach to analysis of survey instruments with the objective of obtaining scales that measure a single trait each, allow person-ordering at the trait level by means of the total scale score, and may provide a hierarchical ordering of items according to their increasing level of agreeability with or endorsement of the latent trait. Given a set of items, the approach consists of four steps: (1) assessment of dimensionality, (2) assessment of monotonicity, (3) assessment of scale properties, and (4) assessment of whether a hierarchical ordering of items is feasible. As opposed to other IRT methods, MSA does not require fitting of models.

MSA step 1

Dimensionality assessment is akin to cluster analysis of the items, and it is conducted by a computer algorithm that searches for optimal groupings of items with a minimal level of scalability, as measured by Loevinger's item scalability coefficient (Molenaar, [20]), a non-parametric measure computed from inter-item covariances, ranging from 0 to 1, with values closer to 1 indicating items that better separate on a unidimensional trait (low trait total scores from high trait total scores). For a set of items comprising one scale, the overall scalability coefficient H (after homogeneity) for the scale uses the information from the individual item scalability coefficients to summarize the strength of the relationship between the total score and the trait. A higher overall H coefficient reflects a more accurate person ordering on the trait levels by means of the total scale score. For scales to be used in practice, the following rules of thumb are suggested: 0.3 ≤ H < .4 means a weak scale; 0.4 ≤ H < 0.5 a scale of medium strength; H ≥ 0.5 a strong scale; and H < 0.3 means the items are not scalable (Mokken, [19]).

MSA step 2

Monotonicity assessment examines whether higher overall trait levels (estimated by the total scale scores) correspond, probabilistically, to higher item scores (i.e., as one moves from the lower levels of the latent trait to the higher levels, the probability of obtaining higher item scores should not decrease) for each item. It can be conducted in two sub-steps. First, response function plots are examined for each item to visually assess whether increases in the item score correspond, on average, to increases in the total scale score (Figure 2a). If sections of the curves are not monotonically increasing (Figure 2b), then the second sub-step consists of testing statistically, using non-parametric methods, whether the local decreases are significantly different from a non-increasing trajectory. Statistically significant difference is indicative of a violation of the monotonicity assumption, but also can be affected by small sample size. The sample size of 540 was considered sufficiently powered to detect relevant deviations from the monotonicity assumption with significance tests.

Graph: Figure 2. Hypothetical item response functions for an ordinal item with responses ranging from 1 to 6 in a scale composed of four items (total scale score ranging from 4 to 24). Panel A: For this particular item, the monotonicity assumption holds. Higher overall trait levels (estimated by the total scale scores) correspond, probabilistically, to higher item scores, i.e., as one moves from the lower levels of the latent trait to the higher levels, the probability of giving higher item scores is monotonically increasing. Panel B: Violation of the monotonicity assumption. For this particular item, higher overall trait levels do not necessarily correspond, probabilistically, to higher item scores.

MSA step 3

Reliability assessment was conducted by computing measures of internal consistency such as Cronbach's alpha, Guttman's lambda-2, or the Molenaar Sijtsma statistic (Molenaar & Sijtsma, [21]) and ensuring that these measures reached acceptable levels.

Unidimensionality, monotonicity, and internal consistency allow ordering persons by their level of a trait, based on their total scale scores; however, they do not assess whether the items constitute a hierarchical scale. The assessment of hierarchical order of items was conducted in two sub-steps. First, for each scale, a plot with the items' summary response functions was examined to visually determine if the individual item functions intersect (they may overlap, but not cross), as in Figure 3a. For scales with curves that cross (Figure 3b), non-parametric methods were used to determine whether the crossing sections were significantly different from overlapping trajectories. Statistically significant differences indicate items might not form a hierarchical scale. For this study, the implementation of MSA was conducted using the mokken package in R (van der Ark, [35], [36]). For step 1, the computer algorithm searched for groups of items with a lower bound scalability with coefficients of 0.5.

Graph: Figure 3. Hypothetical summary response functions for four ordinal items with individual item responses ranging from 1 to 6 (total scale score ranging from 4 to 24). Panel A: Hierarchical ordering of items with respect to endorsement of the latent trait. On average, item 1 provides the highest endorsement of the latent trait, followed by item 2, then item 3, and item 4. Panel B: The set of four items do not constitute a hierarchical scale.

Parametric IRT models

These IRT methods model agreeability with or endorsement of a latent trait based on the responses to a set of items (Bond & Fox, [4]). Given a set of items assumed to measure the same construct (i.e., unidimensional of Hate), IRT models assist in estimating the values of the latent trait and also examining the properties of the individual items (Lord, [13]). Although there are several different types of IRT models, the two types of polytomous IRT models used in this study were graded response model (GRM) (Samejima, [27]) and partial credit model (PCM; Masters, [15]) because they are used for items with ordered or Likert-type responses. Both divide the responses into different blocks, also known as thresholds. These thresholds are used in modeling conditional probabilities of trait endorsement. It is important to check more than one technique to evaluate the model fit., Raju and Colleagues ([22]) provided a full explanation of these methods. The parametric IRT models generate statistics that indicate whether an item fits the model or not. A chi-squared statistic with infit and outfit statistics indicating "bad" items is generated for each item with higher infit and outfit values indicating bad item fit. GRM and PCM analyses were conducted using the R packages ltm (Rizopoulos, [25]) and eRm (Mair & Hatzinger, [14]), respectively.

Results

Sample

A total of 540 questionnaires were included in the present analyses. Approximately 1.34% of data were missing, which were assumed to be missing at random. Low missing percent is considered manageable (Acuna & Rodriguez, [1]) and therefore imputed using the chain equations method (Van Buuren, [33]) with one of the best (Stekhoven & Buehlmann, [30]) ensemble technique random forest (Doove, van Buuren, & Dusseldorp, [7]) as the underlying prediction algorithm. The imputation was conducted using the R package mice (Van Buuren & Groothuis-Oudshoorn, [34]). Due to the small percent of individual item data missing, a single imputation was deemed sufficient prior to conducting the main analyses.

Item analysis

Descriptive analysis

Descriptive statistics for items and scales and measures of internal consistency are shown in Table 1. Cronbach's alpha was adequate for all dimensions/subscales (≥0.7). However, six items had corrected correlations ≤0.4, and removal of individual items on two dimensions (Hate and Religious Conflict) decreased internal consistency of the remaining items to levels below the commonly recommended 0.7 threshold. Table 2 shows the top association rules for the item pairs, ranked by lift, with support of at least 50%. These top rules were all found in the Knowledge scale. Exploring the first rule, 54% (support) of the total respondents indicated that they were not knowledgeable about the significance of the Stonewall riot and that they were not familiar with the work of the National Gay and Lesbian Task Force. Next, among respondents who indicated they were not knowledgeable about the significance of the Stonewall riot, 86% (confidence) indicated they were not familiar with the work of National Gay and Lesbian Task Force. Finally, respondents who indicated they were not knowledgeable about the significance of the Stonewall riot were 1.34 (lift) times more likely to indicate they were not familiar with the work of the National Gay and Lesbian Task Force, compared to those who indicated having knowledge of the Stonewall riot.

Table 2. Top five association rules by descending order of lift with support ≥.50 and lift ≥1.10. (N = 540).

Left-hand side (Antecedent)Right-hand side (Consequent)SupportConfidenceLift
Rules
1) I am NOT AT ALL knowledgeable about the significance of the Stonewall Riot to the Gay Liberation Movement= >I am NOT AT ALL familiar with the work of the National Gay and Lesbian Task Force0.540.861.34
2) I am NOT AT ALL knowledgeable about the history and mission of the PFLAG organization= >I am NOT AT ALL familiar with the work of the National Gay and Lesbian Task Force0.610.801.25
3) I am NOT AT ALL knowledgeable about the significance of the Stonewall Riot to the Gay Liberation Movement= >I am NOT AT ALL knowledgeable about the history and mission of the PFLAG organization0.600.941.23
4) I am NOT AT ALL knowledgeable about the significance of the Stonewall Riot to the Gay Liberation Movement= >I am NOT AT ALL knowledgeable about the history and mission of the PFLAG organization0.570.901.21
5) I am NOT AT ALL familiar with the work of the National Gay and Lesbian Task Force= >I could NOT AT ALL educate others about the history and symbolism behind the "pink triangle."0.560.871.17

Analysis of factor structure

Results from the CFA are presented in Table 3. Although all hypothesized path coefficients were significantly different from 0 at the 0.01 (uncorrected) significance level, and only four items had standardized paths ≤0.4 in magnitude, the CFA model did not provide adequate fit to the item data, as indicated by the χ2/df ratio and the other three fit indices. Based on these results, the hypothesized 5-dimension structure of the 28-item set was not supported by the data. Thus the underlying structure of the item data was explored with an EFA, whose results are shown in Table 4. Only three factors had eigenvalues ≥1, and this 3-factor solution explained 93% of the common variance (43% of the total variance), which suggested three underlying latent variables. However, the EFA results suggested that two hypothesized original dimensions, Hate and LGB Civil Rights, lay on the same latent variable (Factor 1), but their scale scores are in opposite directions (lower endorsement of Hate corresponds to higher endorsement of LGB civil rights). As per the CFA results, the estimated correlation between these two hypothesized dimensions is large at −0.77 (see Table 3). The EFA results also suggest some items of the Internalized Affirmativeness dimension lay in Factor 1 as well. As per the CFA results, the estimated correlation between the Internalized Affirmativeness and LGB Civil Rights dimensions is large at 0.74 (see Table 3). These findings further suggest the hypothesized 5-dimension structure for the 28 items is not stable.

Table 3. Confirmatory factor analysis of the original LGB-KASH structure (N = 540).

FactorItemStandardized Path CoefficientR2
1H10.770.59
1H20.530.28
1H30.650.42
1H40.400.16
1H50.500.25
1H60.670.45
2K10.770.60
2K20.780.61
2K30.740.54
2K40.660.43
2K50.550.31
3C10.790.63
3C20.840.70
3C30.810.65
3C40.750.56
3C50.790.62
4R10.60.36
4R20.320.10
4R30.730.53
4R40.40.16
4R50.670.45
4R60.230.05
4R70.520.27
5I10.440.19
5I20.490.24
5I30.850.73
5I40.600.35
5I50.900.82
Inter-factor correlations:
F2F3F4F5
 F1−0.075−0.7660.217−0.554
 F20.283−0.3020.527
 F3−0.3350.741
 F4−0.543

  • 14 Fit statistics:
  • 15 χ2/df = 1117.5/340 = 3.29, p < .0001
  • 16 CFI = 0.844
  • 17 RMSEA (90%CI) = 0.065 (.061,.069); Ho: RMSEA < .05; p < .0001
  • 18 SRMR = 0.087
  • 19 Note. p < 0.01 for all path coefficients. χ2/df: Goodness of fit χ2 statistic divided by its degrees of freedom; CFI: Comparative Fit Index; RMSEA: Root Mean Square Error of Approximation; SRMR: Standardized Root Mean Square Residual; see Table 1 for item labels.

Table 4. Exploratory factor analysis of the 28 LGB-KASH items (N = 540).

ItemFactor1Factor2Factor3h2
H1−0.720.49
H2−0.530.26
H3−0.580.33
H4−0.450.22
H5−0.490.25
H6−0.60.37
K10.750.54
K20.790.58
K30.690.49
K40.690.44
K50.540.37
C10.750.58
C20.800.64
C30.680.65
C40.690.51
C50.730.62
R10.620.41
R20.490.24
R30.690.53
R4−0.380.25
R50.590.40
R60.08
R7−0.430.340.39
I10.310.19
I20.320.25
I30.490.390.59
I40.470.39
I50.520.390.68
Eigenvalues7.452.731.57
Inter-Factor Correlations:
Factor1Factor2Factor3
Factor110.2183−0.1770
Factor20.21831−0.2126
Factor3−0.1770−0.21261

20 Note. h2: communality, proportion of common variance explained by the factor solution. Loadings <0.3 in absolute value not shown. See Table 1 for item labels.

Item performance evaluation with non-parametric IRT methods

Prior to analysis, the items composing the original Hate and Religious Conflict scales were reverse-coded so that all 28 items were in the same direction (higher scores associated with positive perceptions). The results of the MSA procedure are shown in Table 5. All the items were used in the first step of MCA. The algorithm selected 19 out of the 28 items forming five scales. The results suggest these 19 items were scalable, and each scale, respectively, allowed reasonable ordering of persons by their level of the latent trait based on their total scale scores. The first Mokken scale (scale 1 in Table 5) was composed of nine items (H1, H2, H3, C1, C2, C5, I3, I4, I5) belonging to three (H1, H2, H3) of the original scales. The second Mokken scale was composed of four (K1, K2, K3, K4) out of five items from the original Knowledge scale. The remaining three Mokken scales were composed of two items each, belonging to the original LGB Civil Rights, Religious Conflict, and Internal Affirmativeness, respectively. All but two of the Religious Conflict items were dropped. With the exception of the fifth Mokken scale, all scales had internal consistency measures >0.7. The final step in MSA with hierarchical scales indicates that two additional items (C1 and I4) can also be removed. A more conservative MSA with hierarchical assessment suggests 17 items.

Table 5. Results of Mokken scale analysis.

Internal consistency measures
ScaleItemScalability coefficientsHMonotonicity assessmentCronbach's alphaGuttman's L-2MS statist icHierarchy assessmentRankingMean score
1H10.540.59non sig0.870.880.90OK25.35
1H20.50non signon sig15.70
1H60.51non signon sig35.24
1C10.60non sigsigremoved4.87
1C20.60non sigsig44.65
1C50.61non signon sig54.25
1I30.68OKsig72.35
1I40.51OKsigremoved3.28
1I50.68OKnon sig62.50
2K10.620.58OK0.820.820.83non sig41.37
2K20.61OKnon sig11.67
2K30.56OKnon sig21.60
2K40.52OKnon sig31.46
3C30.620.62non sig0.740.740.76OK23.95
3C40.62OKOK14.44
4R30.590.59non sig0.730.730.74OK13.48
4R50.59non sigOK23.37
5I10.640.64OK0.690.690.74OK21.61
5I20.64OKOK12.36

21 Note. H = overall scalability coefficient; MS = Molenaar Sijtsma; OK = no deviation found; non sig = deviation found but nonsignificant; sig = significant deviation found; removed = item removed to obtain a hierarchical scale; Ranking = order of items in the hierarchy of endorsement to the latent trait; Mean item scores are shown for reference. Items in the original Hate (H) and Religious Conflict (R) scales were reverse-coded prior to the Mokken analysis. See Table 1 for item labels.

Item performance evaluation with parametric IRT methods

The EFA shown in Table 4 was the starting point used to assess dimensionality of the item set. The eight items with negative loadings on Factor 1 were reverse-coded prior to analysis, so all items in this factor were in the same direction (positive perceptions). Next, the two IRT models (PCM and GRM) were fitted individually to the items composing each of the three factors. Table 6 shows the results of the PCM analysis. High values of infit or outfit statistics with goodness-of-fit tests p < 0.001 indicated seven ill-fitting items. The result suggested dropping three items from the original Hate scale, three items from the original Religious Conflict scale, and one item from the original Internalized Affirmative scale. The results of the GRM approach were analogous to those of the PCM approach. The item information curves produced by the GRM analysis for the first factor (16 items) are shown in Figure 4. The items with the lowest contributing information were the same items with worst fit statistics in the PCM analysis. Similar results were obtained for the remaining two factors. Again, these findings further suggest the original hypothesized 5-dimension structure for the 28 items are not stable.

Table 6. Results of the partial credit model analysis.

FactorItemOutfitInfitChi-squarep value
1H1−0.22−3.43510.070.66
1H2−0.28−0.94485.370.89
1H3*2.67−0.58708.900.00*
1H4*3.100.051072.340.00*
1H5*5.821.72910.600.00*
1H60.15−1.29531.520.40
1C1−4.25−4.45311.211
1C2−4.29−5.33314.441
1C3−4.17−4.48331.251
1C40.87−1.54565.310.10
1C5−2.56−4.79417.141
1R4*6.754.61882.030.00*
1R7*2.823.47642.800.00*
1I3−1.60−0.74448.130.99
1I4*7.757.55943.640.00*
1I5−2.95−3.24401.481
2K1−3.99−3.45235.311
2K2−2.02−3.08358.991
2K3−2.64−1.88336.121
2K4−2.65−2.00304.401
2K5−0.52−0.15443.800.69
2I1−0.930.48384.731
2I20.640.80488.670.16
3R1−4.00−5.35381.001
3R20.85−1.30539.360.19
3R3−5.84−6.56327.541
3R5−3.28−3.55413.871
3R6*3.381.63640.70.00*

22 *Ill-fitting items; see Table 1 for item labels.

Graph: Figure 4. Item information curves for the first factor (16 items) generated by the graded response model. Items with higher peak contribute more to the overall aggregate information than items with lower peak. The highest peaked item was: "Hospitals should acknowledge same-sex partners equally to any other next of kin" (C2). Some of the lowest peaked items that did not contribute to the overall information are: "I have difficulty reconciling my religious views with my interest in being accepting of LGB people" (R4), "Hearing about a hate crime against a LGB person would not bother me" (H5), "I sometimes think about being violent toward LGB people" (H4), and "I have conflicting attitudes or beliefs about LGB people" (R7).

Discussion

The psychometric examination of the 28-item five-factor LGB-KASH instrument presented here resulted in a lack of support of the original instrument structure: the CFA did not result in at least fair fit; the EFA suggested three underlying dimensions instead of five; and the non-parametric and parametric ordinal IRT procedures suggested dropping at least seven items (25%). All these results suggest instability in the originally proposed structure by Worthington et al. ([38]). The selected item sets from the IRT procedures presented in this article could be used to revise the original instrument in subsequent studies to obtain a more psychometrically sound measure.

Comparing the parametric and non-parametric IRT approaches is not straightforward in this case since the parametric analysis began with an assessment of dimensionality based on an EFA, while the non-parametric approach began with a clustering of items. In terms of the original items retained by each procedure, the MSA approach retained 19 items, while the parametric procedures retained 21. The main result difference between the two approaches is that the MSA procedure removed all but two of the seven Religious Conflict items, while the parametric IRT models removed only three of these items. One approach in improving the scale could be to remove or modify items common in both the MSA and IRT approaches.

The data confirm the need for an updated and comprehensive measure of attitudes toward the LGBTQIA+. In addition to paying close attention to overall measure and structure reliability of the instrument, attention should be paid to item content as well. As mentioned above, the quality of the items in the LGB-KASH have been called into question in the scientific literature as well. In a rapidly changing social climate, it is important to ensure any instrument assessing attitudes is general enough to not be outdated or obsolete in a relatively short time period. Items also need to have some degree of generalizability to ensure that respondents from diverse backgrounds and beliefs glean the same understanding from what is being asked. Finally, ensuring that items are simplified, capturing targeted information, as opposed to "loaded" items that may assess attitudes, multiple constructs also affects instrument validity and subsequent utility.

The quality of an instrument is integral to the integrity of findings being reported. The need to assess attitudes as it relates to the LGBTQIA+ population is rapidly expanding. In addition to assessing individual attitudes and beliefs from the U.S. population at large, recent developments in mental health treatment and health care as a whole are in need of metrics for evaluating competency of practitioners. For those interested in studying behavior change with targeted interventions, or making policies, there also needs to be measures that are consistent and reliable across time points.

The present study is not without limitations. A limitation of this instrument is that it does not distinguish between attitudes toward gay men, bisexuals, and lesbians separately; in that sense, all these groups are lumped together. Yet it may be that some people hold differing opinions and attitude toward such individuals. Further, data for the present study were provided by a convenience sample belonging to a particular population (college students) in a particular geographic location (the Deep South), so their answers to the items may differ from those obtained from the original normative adult population. Furthermore, the alternative instrument structures resulting from the IRT analyses presented here are exploratory and in need of validation in separate datasets. That being said, the psychometric analysis framework provided in this study (see Figure 1) can be used as a psychometric analysis template in analyzing and improving the instrument.

Conclusion

This study adds to the literature by providing a comprehensive psychometric evaluation of the LGB-KASH, arguably the current "gold standard" instrument for measuring attitudes toward LGBTQIA+ individuals. Current societal and policy efforts endorse provision of person-centered and culturally competent health care, workplace, and educational environments. Attitudes toward LGBTQIA+ individuals, nonetheless, remain complex and multidimensional in the United States, as evidenced in the current political debate and ongoing reactions to the historic ruling of the Supreme Court of the United States legalizing same-sex marriage. In order to examine the degree of acceptance of diversity versus stigma and prejudice in health care, workplace, and educational environments, a generalized and psychometrically sound instrument is needed to measure attitudes toward LGBTQIA+ individuals.

The results of this article strongly suggest the original LGB-KASH instrument structure is unstable and unreliable; however, the selected item sets from the psychometric procedures presented here could be used to revise the original instrument in subsequent studies and thus obtain a more psychometrically sound measure. Moreover, mixed-method approaches including cognitive interviewing and focus groups might identify additional dimensions for the development of new items to improve assessment of LGBTQIA+ attitudes. Finally, this article provides a step-by-step approach in performing a comprehensive psychometric analysis that may be useful to other research groups in the development or revision of measures.

Disclosure statement

No potential conflict of interest was reported by the authors.

Footnotes 1 Color versions of one or more of the figures in the article can be found online at www.tandfonline.com/wjhm. References Acuna, E., & Rodriguez, C. (2004). The treatment of missing values and its effects in the classifier accuracy. In D. Banks, F. McMorris, P. Arabie, & W. Gaul (Eds.), Classification, clustering, and data mining applications (pp. 639 – 647). Berlin, Heidelberg : Springer-Verlag. 2 Agarwal, R., Imieliński, H., & Swami, A. (1993). Mining association rules between sets of items in large databases. ACM SIGMOD conference on management of data (pp. 207 – 216), Washington, D.C. 3 American Psychological Association. (2011). Practice guidelines for LGB clients: Guidelines for psychological practice with lesbian, gay, and bisexual clients. Retrieved from http://apa.org/pi/lgbt/resources/guidelines.aspx 4 Bond, T., & Fox, C. (2001). Applying the Rasch model: Fundamental measurement in the human sciences. New York, NY : Lawrence Erlbaum Associates. 5 Cialdini, R. B., Petty, R. E., & Cacioppo, J. T. (1981). Attitude and attitude change. Annual Review Of Psychology, 32, 357 – 404. doi: 10.1146/annurev.ps.32.020181.002041 6 Dentato, M. P. (2012, April). The minority stress perspective. American Psychological Association Psychology and AIDS Exchange Newsletter. Retrieved from http://www.apa.org/pi/aids/resources/exchange/2012/04/minority-stress.aspx 7 Doove, L. L., van Buuren, S., & Dusseldorp, E. (2014). Recursive partitioning for missing data imputation in the presence of interaction effects. Computational Statistics & Data Analysis, 72, 758 – 770. doi: 10.1016/j.csda.2013.10.025 8 Hahsler, M., Grun, B., & Hornik, K. (2005). arules - A computational environment for mining association rules and frequent item sets. Journal of Statistical Software, 14 (15). doi: 10.18637/jss.v014.i15 9 Herek, G. M. (1984). Attitudes toward lesbians and gay men: A factor analytic study. Journal of Homosexuality, 10 (1–2), 39 – 51. doi: 10.1300/J082v10n01_03 Kelly, P. A., Kallen, M. A., & Suarez-Almazor, M. E. (2007). A combined-method psychometric analysis recommended modification of the multidimensional health locus of control scales. Journal of Clinical Epidemiology, 60 (5), 440 – 447. doi: 10.1016/j.jclinepi.2006.08.005 Kite, M. E., & Deaux, K. (1986). Attitudes toward homosexuality: Assessment and behavioral consequences. Basic and Applied Social Psychology, 7, 137 – 162. doi: 10.1207/s15324834basp0702_4 Kyngdon, A. (2004). Comparing factor analysis and the Rasch model for ordered response categories: An investigation of the scale of gambling choices. Journal of Applied Measurement, 5 (4), 398 – 418. Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillside, NJ : Erlbaum. Mair, P., & Hatzinger, R. (2007). Extended Rasch modeling: The eRm package for the application of IRT models in R. Journal of Statistical Software, 20 (9). doi: 10.18637/jss.v020.i09 Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149 – 174. doi: 10.1007/BF02296272 Meyer, I. H. (1995). Minority stress and mental health in gay men. Journal of Health and Social Behavior, 36 (1), 38 – 56. Meyer, I. H. (2003). Prejudice, social stress, and mental health in lesbian, gay and bisexual populations: Conceptual issues and research evidence. Psychological Bulletin, 129, 674 – 697. doi: 10.1037/0033-2909.129.5.674 Meyer, I. H. (2013). Prejudice, social stress, and mental health in lesbian, gay, and bisexual populations: Conceptual issues and research evidence. Psychology Of Sexual Orientation And Gender Diversity, 1 (S), 3 – 26. doi: 10.1037/2329-0382.1.S.3 Mokken, R. J. (1971). A theory and procedure of scale analysis. Berlin, Germany : De Gruyter. Molenaar, I. W. (1991). A weighted Loevinger H-coefficient extending Mokken scaling to multicategory items. Kwantitatieve Methoden, 12 (37), 97 – 117. Molenaar, I. W., & Sijtsma, K. (1988). Mokken's approach to reliability estimation extended to multicategory items. Kwantitatieve Methoden, 9 (28), 115 – 126. Raju, D., Su, X., & Patrician, P. (2014). Using item response theory models to evaluate the practice environment scale. Journal of Nursing Measurement, 22 (2), 323 – 341. doi: 10.1891/1061-3749.22.2.323 Revelle, W. (2016) psych: Procedures for personality and psychological research. Evanston, Illinois, USA : Northwestern University. Version = 1.6.6. Retrieved from http://CRAN.R-project.org/package=psych Riddle, D. (1994). The Riddle scale. In D. Riddle (Ed.), Alone no more: Developing a school support system for gay, lesbian and bisexual youth. St Paul, MN : Minnesota State Department. Rizopoulos, D. (2006). ltm: An R package for latent variable modelling and item response theory. Journal of Statistical Software, 17 (5). doi: 10.18637/jss.v017.i05 Rosseel, Y. (2012). lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48 (2), 1 – 36. doi: 10.18637/jss.v048.i02 Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement, 34 (4, Pt. 2), 100. Shidlo, A. (1994). Internalized homophobia: Conceptual and empirical issues in Measurement. In B. Greene & G. M. Herek (Eds.), Lesbian and Gay psychology: Theory, research, and clinical applications (pp. 176 – 205). Thousand Oaks, CA : Sage Publications, Inc. Sijtsma, K., & Molenaar, I. W. (2002). Introduction to nonparametric item response theory. Thousand Oaks, CA : Sage. Stekhoven, D. J., & Buehlmann, P. (2012). MissForest - non-parametric missing value imputation for mixed-type data. Bioinformatics, 28 (1), 112 – 118. doi: 10.1093/bioinformatics/btr597 Swiger, P. A., Raju, D., Breckenridge-Sproat, S., & Patrician, P. A. (2017). Adaption of the practice environment scale for military nurses. Journal of Advanced Nursing. doi: 10.1111/jan.13276. [Epub ahead of print]. Tabachnick, B. G., & Fidell, L. S. (2013). Using multivariate statistics (6th ed.). Boston, MA : Pearson Education. Van Buuren, S. (2012). Flexible imputation of missing data. Boca Raton, FL : Chapman & Hall/CRC. Van Buuren, S., & Groothuis-Oudshoorn, K. (2011). mice: Multivariate imputation by chained equations in R. Journal of Statistical Software, 45 (3). doi: 10.18637/jss.v045.i03 van der Ark, L. A. (2007). Mokken scale analysis in R. Journal of Statistical Software, 20 (11), 1–19. van der Ark, L. A. (2012). New developments in Mokken scale analysis in R. Journal of Statistical Software, 48 (5), 1–67. Waugh, R. F., & Chapman, E. S. (2005). An analysis of dimensionality using factor analysis (true-score theory) and Rasch measurement: What is the difference? Which method is better? Journal of Applied Measurement, 6 (1), 80 – 90. doi: 10.1177/0255761417689925 Worthington, R. L., Dillon, F. R., & Becker-Schutte, A. M. (2005). Development, reliability, and validity of the Lesbian, Gay, and Bisexual knowledge and attitudes scale for heterosexuals (LGB-KASH). Journal of Counseling Psychology, 52 (1), 104 – 118. doi: 10.1037/0022-0167.52.1.104

By Dheeraj Raju; Lisa Beck; Andres Azuero; Casey Azuero; David Vance and Rebecca Allen

Reported by Author; Author; Author; Author; Author; Author

Titel:
A Comprehensive Psychometric Examination of the Lesbian, Gay, and Bisexual Knowledge and Attitudes Scale for Heterosexuals (LGB-KASH).
Autor/in / Beteiligte Person: Raju, D ; Beck, L ; Azuero, A ; Azuero, C ; Vance, D ; Allen, R
Link:
Zeitschrift: Journal of homosexuality, Jg. 66 (2019), Heft 8, S. 1104
Veröffentlichung: Philadelphia : Routledge ; <i>Original Publication</i>: New York, Haworth Press., 2019
Medientyp: academicJournal
ISSN: 1540-3602 (electronic)
DOI: 10.1080/00918369.2018.1491705
Schlagwort:
  • Bisexuality
  • Female
  • Homosexuality, Female
  • Homosexuality, Male
  • Humans
  • Male
  • Psychometrics
  • Students
  • Transgender Persons
  • Young Adult
  • Attitude
  • Heterosexuality psychology
  • Sexual and Gender Minorities psychology
Sonstiges:
  • Nachgewiesen in: MEDLINE
  • Sprachen: English
  • Publication Type: Journal Article
  • Language: English
  • [J Homosex] 2019; Vol. 66 (8), pp. 1104-1125. <i>Date of Electronic Publication: </i>2018 Aug 17.
  • MeSH Terms: Attitude* ; Heterosexuality / *psychology ; Sexual and Gender Minorities / *psychology ; Bisexuality ; Female ; Homosexuality, Female ; Homosexuality, Male ; Humans ; Male ; Psychometrics ; Students ; Transgender Persons ; Young Adult
  • Contributed Indexing: Keywords: LGBTQIA+; Mokken scale analysis; association rule analysis; item response theory (IRT); psychometric analysis
  • Entry Date(s): Date Created: 20180627 Date Completed: 20190619 Latest Revision: 20190619
  • Update Code: 20231215

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

oder
oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

oder
oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

xs 0 - 576
sm 576 - 768
md 768 - 992
lg 992 - 1200
xl 1200 - 1366
xxl 1366 -