Heterosexual attitudes toward individuals with other sexual orientations and identities, such as lesbian, gay, bisexual, transgender, intersex, asexual, etc. (LGBTQIA+), have been conceptualized as multidimensional and wide-ranging. With the objective of measuring these attitudes, Worthington and colleagues developed the Lesbian, Gay, and Bisexual Knowledge and Attitudes Scale for Heterosexuals (LGB-KASH). In this article, 540 college students were administered the LGB-KASH; these data were examined for their psychometric properties of the instrument using a sequential approach. The first step included descriptive analysis, including internal consistency measures and association rules, followed by the evaluation of dimension structure with factor analysis (confirmatory/exploratory), and, finally, item analysis using non-parametric (Mokken scale analysis) and parametric approaches (item response theory). Specific to the LGB-KASH factor structure, the current structure was not supported by these data; however, alternative subsets of items that could be used to revise the original instrument to obtain a more psychometrically sound measure are suggested.
Keywords: LGBTQIA+; psychometric analysis; association rule analysis; item response theory (IRT); Mokken scale analysis
In recent decades, numerous social, political, economic, and other gains have been achieved within and for the LGBTQIA+ community (lesbian, gay, bisexual, trans*, queer/questioning, and all other non-heteronormative and non-gender-normative identities). Increased awareness of and improved social attitudes toward these individuals have motivated these changes. Yet there is still plenty of work to be done to increase cultural competency toward LGBTQIA+ identified individuals.
It is useful to acknowledge the ever changing and evolving language of LGBTQIA+. LGBTQIA+ is a common acronym for the collective of individuals who identify as lesbian, gay, bisexual, trans*, queer/questioning, intersex, asexual, and others who do not claim normative gender and heterosexual identities. For the purpose of this study, the term queer will be used as the umbrella term to refer to individuals who may in any way identify as non-normative concerning their gender or sexuality. Even the LGBTQIA+ acronym may seemingly grant priority to certain identities within this broad population. It would be irresponsible to ignore that this term has long held a derogatory meaning and could potentially trigger unpleasant feelings; however, the term queer has been reclaimed as a term of empowerment in recent decades, especially since the development of academic "queer theory." This theory uses the term to challenge the categorization of gender and sexuality and expands the discussion beyond the commonly referenced "gay" and "lesbian" identities to be inclusive of all identities that deviate from the normative or dominant. Terms involving the "homo-" prefix (e.g., homosexual, homophobia, homonegativity) will be avoided unless it is used to mirror language used in previous research, as it is indicative of the restrictive notion that the individuals referenced have a romantic or sexual attraction toward or participate in sexual activity with someone of the same sex or gender. Also, this prefix maintains a negative connotation from previous decades in which homosexuality was still classified as a mental disorder by the American Psychiatric Association (APA).
Although queer is often used as a broad and encompassing term when discussing both sexual orientation and gender identity, the terms and language used within and about the LGBTQIA+ community is immense, and there are many other more specific terms that could be discussed. Due to the nature of this study attempting to create a measure that is more inclusive of non-normative gender identities, two terms that warrant further clarification are trans*/transgender and cisgender. Trans* or transgender is another umbrella term that refers to people whose self-identification, anatomy, appearance, manner, expression, behavior, and/or other perceptions differ from conventional or cultural expectations of congruent gender expression and designated birth sex. It should be noted that not everyone whose appearance or behavior is gender-atypical will self-identify as transgender. The asterisk (*) on trans* is alluding to the common technique used when searching through databases where an asterisk on the end of a prefix will return all the terms that include that prefix. For example, trans* could refer to transgender, transsexual, transman, or transwoman. but in general it is considered inclusive of all non-cisgender identities. Cisgender is a term that refers to someone who identifies either by nature or by choice with the sex and gender they were assigned at birth and who conforms to the mainstream gender-based societal expectations. This is also often referred to as "gender normative." This highlights the difficulty in trying to stay current in defining and measuring attitudes with this quickly evolving minority population.
A recent meta-analysis shows that lesbian, gay, and bisexual individuals experience a higher rate of mental disorders among other negative outcomes such as increased distress, higher rates of suicide attempts, and a greater likelihood of substance abuse (Meyer, [
Researchers have been measuring attitudes for decades and have consistently found a relationship between one's attitude and their behaviors (see Cialdini, Petty, & Cacioppo, [
To evaluate the competency of professionals, improve mental and physical health outcomes for LGBTQIA+ individuals, and assess individual and social attitudes, it is imperative to have a reliable, culturally sensitive, and relevant assessment tool for both public and research use. In a previous study, Beck and colleagues (2014) examined the potential utility of a campus Safe Zone program as an intervention to increase the cultural competency of training clinicians in providing mental health services to individuals from the queer community. These researchers noted difficulty formulating a pre-/post-test design due to limitations in measurement options. Specifically, they noted most of the current measures assess attitudes toward specific identities that may fall under the queer label such as the Riddle Scale (Riddle, [
The aim of the current study was to assess the psychometric properties and, if warranted, provide recommendations for improved constructs for the LGB-KASH (Worthington et al., [
Table 1. Descriptive statistics for items and scales and measures of internal consistency (N = 540).
Dimension (Cronbach's alpha) Item label Item Content Mean ( Cronbach's alpha if item is dropped Corrected Item-Dimension Correlation* Hate (0.74) 1.59(0.68) H1 It is important to me to avoid LGB individuals. 1.64 (1.00) 0.66 0.64 H2 LGB people deserve the hatred they receive. 1.30(0.82) 0.72 0.46 H3 I would be unsure what to do or say if I met someone who is openly LGB. 1.74(1.16) 0.69 0.55 H4 I sometimes think about being violent toward LGB people. 1.17(0.55) 0.74 0.37 H5 Hearing about a hate crime against a LGB person would not bother me. 1.93(1.30) 0.74 0.40 H6 I would feel self-conscious greeting a known LGB person in a public place. 1.76 (1.15) 0.68 0.56 Knowledge (0.81) 1.72(0.80) K1 I am knowledgeable about the history and mission of the PFLAG organization. 1.37(0.82) 0.76 0.66 K2 I am knowledgeable about the significance of the Stonewall Riot to the Gay Liberation Movement. 1.67(1.12) 0.74 0.68 K3 I am familiar with the work of the National Gay and Lesbian Task Force. 1.60(1.01) 0.76 0.63 K4 I could educate others about the history and symbolism behind the "pink triangle." 1.46(0.94) 0.77 0.60 K5 I feel qualified to educate others about how to be affirmative regarding LGB issues. 2.55(1.36) 0.82 0.49 LGB Rights (0.89) 4.43(1.38) C1 Health benefits should be available equally to same sex partners as to any other couple. 4.87(1.44) 0.87 0.75 C2 Hospitals should acknowledge same sex partners equally to any other next of kin. 4.65(1.58) 0.86 0.79 C3 I think marriage should be legal for same sex couples. 3.95(1.95) 0.87 0.74 C4 It is wrong for courts to make child custody decisions based on a parent's sexual orientation. 4.44(1.64) 0.87 0.71 C5 It is important to teach children positive attitudes toward LGB people. 4.25(1.63) 0.87 0.71 Rel. conflict (0.7) 2.87(0.95) R1 I conceal my negative views toward LGB people when I am with someone who doesn't share my views. 2.89(1.72) 0.64 0.53 R2 I keep my religious views to myself in order accept LGB people. 2.88(1.58) 0.70 0.31 R3 I try not to let my negative beliefs about LGB people harm my relationships with the LGB individuals I know. 3.52(1.84) 0.62 0.57 R4 I have difficulty reconciling my religious views with my interest in being accepting of LGB people. 2.23(1.37) 0.69 0.33 R5 I can accept LGB people even though I condemn their behavior. 3.62(1.66) 0.64 0.53 R6 I conceal my positive attitudes toward LGB people when I am with someone who is homophobic. 2.39(1.37) 0.71 0.22 R7 I have conflicting attitudes or beliefs about LGB people. 2.55(1.48) 0.68 0.39 Internalized Affirmative (0.8) 2.42(1.24) I1 I have had sexual fantasies about members of my same sex. 1.61(1.29) 0.78 0.49 I2 Feeling attracted to another person of the same sex would not make me uncomfortable. 2.36(1.72) 0.78 0.51 I3 I would display a symbol of gay pride (pink triangle, rainbow, etc.) to show support of the LGB community. 2.35(1.68) 0.72 0.68 I4 I have close friends who are LGB. 3.28 (1.93) 0.78 0.51 I5 I would attend a demonstration to promote LGB civil rights. 2.50(1.69) 0.71 0.73
- 12 Note. *Correlations between each item and the subscale scores computed without the item,
- 13 LGB = lesbian/gay/bisexual.
The instrument was developed in a series of four studies reported in the same article that consisted of: item generation, preliminary item screening, and exploratory examination of structure (study 1, n = 422); estimation of internal consistency and testing of measurement model structure (study 2, n = 574); estimation of test-retest reliability and convergent validity (study 3, n = 45); and examination of sensitivity to differences across sexual orientation identities (study 4, n = 190). It is noteworthy that study 2 was conducted using confirmatory factor analysis and tested three hypothetical structure models for the 28 items, but none of these models resulted in satisfactory fit to the data by a number of fit indices. Among these three models, the model that provided the best fit was a 5-factor model with intercorrelated factors. Worthington and colleagues ([
In order to meet the study aim (see Figure 1), this comprehensive psychometric evaluation included: (
Graph: Figure 1. Data analysis steps used in the study.
This psychometric study had institutional review board approval from a large university from the southeastern United States. Six hundred college students in an Introduction to Psychology course were enrolled and administered questionnaires. Items were in random order, and a quality control indicator was included to ascertain if participants were responding randomly. The students participated in research as part of the course curriculum. The study was posted on the course Web page. Overall, the sample was 19 years old on average (SD = 1.2 years) and predominately identified as White/Caucasian (79.5%; Black or African American: 13.6%, Hispanic or Latina/Latino: 1.2%, Mixed or Other: 4.9%). Approximately two thirds of the participants identified as female (66.8%), one third as male (32.5%), but just under 1% identified as trans* or other (0.7%). As expected, a large majority of the participants identified as heterosexual or straight (87.1%), while the remaining 13% endorsed asexual (8.2%), gay or lesbian (1.0%), bisexual (1.7%), or queer, other, or a combination of sexual orientation identities (1.7%). Although 600 students responded to the questionnaire, a final sample of 540 was obtained after excluding questionnaires with incorrect answers to the quality control item, excluding LGBTQ respondents, and those with seven (at least 75%) or more missing item responses.
A combination of classical psychometric methods, a data mining approach, and parametric and non-parametric IRT methods was used for testing, exploring the instrument structure, and examining the usefulness of each item. Combining two analytical paradigms in performing item analysis (parametric and non-parametric) offers advantages in quality of the resulting instrument because classical psychometric analysis focuses on estimating and testing the dimensional structure of the items, whereas the IRT methods focus on the performance of the items in classifying subjects along an underlying latent continuum (Kelly, Kallen, & Suarez-Almazor, [
Descriptive statistics such as mean, standard deviation, and median were computed for each item and scale. Cronbach's alpha (internal consistency measure), change in alpha when an item is dropped, and corrected item-dimension correlations (correlations between each item and the dimension/subscale scores computed without the item) were estimated. Association rule analysis was conducted on the data. This is an exploratory data mining technique for any type of categorical data that provides sets of rules for co-occurring situations. The a priori association rule algorithm searches for the most frequently occurring item set in the dataset (Agrawal, Imieliński, & Swami, [
A confirmatory factor analysis (CFA) was conducted to test the factor structure of the LGB-KASH using a structural equation model fitted with maximum likelihood, and the Satorra-Bentler scaled test statistic and robust standard errors (Rosseel, [
Non-parametric methods and parametric models were used to examine item performance.
This involved the employment of several sub-steps.
Mokken Scale Analysis (MSA). MSA (Mokken, [
Dimensionality assessment is akin to cluster analysis of the items, and it is conducted by a computer algorithm that searches for optimal groupings of items with a minimal level of scalability, as measured by Loevinger's item scalability coefficient (Molenaar, [
Monotonicity assessment examines whether higher overall trait levels (estimated by the total scale scores) correspond, probabilistically, to higher item scores (i.e., as one moves from the lower levels of the latent trait to the higher levels, the probability of obtaining higher item scores should not decrease) for each item. It can be conducted in two sub-steps. First, response function plots are examined for each item to visually assess whether increases in the item score correspond, on average, to increases in the total scale score (Figure 2a). If sections of the curves are not monotonically increasing (Figure 2b), then the second sub-step consists of testing statistically, using non-parametric methods, whether the local decreases are significantly different from a non-increasing trajectory. Statistically significant difference is indicative of a violation of the monotonicity assumption, but also can be affected by small sample size. The sample size of 540 was considered sufficiently powered to detect relevant deviations from the monotonicity assumption with significance tests.
Graph: Figure 2. Hypothetical item response functions for an ordinal item with responses ranging from 1 to 6 in a scale composed of four items (total scale score ranging from 4 to 24). Panel A: For this particular item, the monotonicity assumption holds. Higher overall trait levels (estimated by the total scale scores) correspond, probabilistically, to higher item scores, i.e., as one moves from the lower levels of the latent trait to the higher levels, the probability of giving higher item scores is monotonically increasing. Panel B: Violation of the monotonicity assumption. For this particular item, higher overall trait levels do not necessarily correspond, probabilistically, to higher item scores.
Reliability assessment was conducted by computing measures of internal consistency such as Cronbach's alpha, Guttman's lambda-2, or the Molenaar Sijtsma statistic (Molenaar & Sijtsma, [
Unidimensionality, monotonicity, and internal consistency allow ordering persons by their level of a trait, based on their total scale scores; however, they do not assess whether the items constitute a hierarchical scale. The assessment of hierarchical order of items was conducted in two sub-steps. First, for each scale, a plot with the items' summary response functions was examined to visually determine if the individual item functions intersect (they may overlap, but not cross), as in Figure 3a. For scales with curves that cross (Figure 3b), non-parametric methods were used to determine whether the crossing sections were significantly different from overlapping trajectories. Statistically significant differences indicate items might not form a hierarchical scale. For this study, the implementation of MSA was conducted using the mokken package in R (van der Ark, [
Graph: Figure 3. Hypothetical summary response functions for four ordinal items with individual item responses ranging from 1 to 6 (total scale score ranging from 4 to 24). Panel A: Hierarchical ordering of items with respect to endorsement of the latent trait. On average, item 1 provides the highest endorsement of the latent trait, followed by item 2, then item 3, and item 4. Panel B: The set of four items do not constitute a hierarchical scale.
These IRT methods model agreeability with or endorsement of a latent trait based on the responses to a set of items (Bond & Fox, [
A total of 540 questionnaires were included in the present analyses. Approximately 1.34% of data were missing, which were assumed to be missing at random. Low missing percent is considered manageable (Acuna & Rodriguez, [
Descriptive statistics for items and scales and measures of internal consistency are shown in Table 1. Cronbach's alpha was adequate for all dimensions/subscales (≥0.7). However, six items had corrected correlations ≤0.4, and removal of individual items on two dimensions (Hate and Religious Conflict) decreased internal consistency of the remaining items to levels below the commonly recommended 0.7 threshold. Table 2 shows the top association rules for the item pairs, ranked by lift, with support of at least 50%. These top rules were all found in the Knowledge scale. Exploring the first rule, 54% (support) of the total respondents indicated that they were not knowledgeable about the significance of the Stonewall riot and that they were not familiar with the work of the National Gay and Lesbian Task Force. Next, among respondents who indicated they were not knowledgeable about the significance of the Stonewall riot, 86% (confidence) indicated they were not familiar with the work of National Gay and Lesbian Task Force. Finally, respondents who indicated they were not knowledgeable about the significance of the Stonewall riot were 1.34 (lift) times more likely to indicate they were not familiar with the work of the National Gay and Lesbian Task Force, compared to those who indicated having knowledge of the Stonewall riot.
Table 2. Top five association rules by descending order of lift with support ≥.50 and lift ≥1.10. (N = 540).
Left-hand side (Antecedent) Right-hand side (Consequent) Support Confidence Lift Rules 1) I am = > I am 0.54 0.86 1.34 2) I am = > I am 0.61 0.80 1.25 3) I am = > I am 0.60 0.94 1.23 4) I am = > I am 0.57 0.90 1.21 5) I am = > I could 0.56 0.87 1.17
Results from the CFA are presented in Table 3. Although all hypothesized path coefficients were significantly different from 0 at the 0.01 (uncorrected) significance level, and only four items had standardized paths ≤0.4 in magnitude, the CFA model did not provide adequate fit to the item data, as indicated by the χ
Table 3. Confirmatory factor analysis of the original LGB-KASH structure (N = 540).
Factor Item Standardized Path Coefficient 1 H1 0.77 0.59 1 H2 0.53 0.28 1 H3 0.65 0.42 1 H4 0.40 0.16 1 H5 0.50 0.25 1 H6 0.67 0.45 2 K1 0.77 0.60 2 K2 0.78 0.61 2 K3 0.74 0.54 2 K4 0.66 0.43 2 K5 0.55 0.31 3 C1 0.79 0.63 3 C2 0.84 0.70 3 C3 0.81 0.65 3 C4 0.75 0.56 3 C5 0.79 0.62 4 R1 0.6 0.36 4 R2 0.32 0.10 4 R3 0.73 0.53 4 R4 0.4 0.16 4 R5 0.67 0.45 4 R6 0.23 0.05 4 R7 0.52 0.27 5 I1 0.44 0.19 5 I2 0.49 0.24 5 I3 0.85 0.73 5 I4 0.60 0.35 5 I5 0.90 0.82 Inter-factor correlations: F2 F3 F4 F5 F1 −0.075 −0.766 0.217 −0.554 F2 0.283 −0.302 0.527 F3 −0.335 0.741 F4 −0.543
- 14 Fit statistics:
- 15 χ
2 /df = 1117.5/340 = 3.29, p < .0001 - 16 CFI = 0.844
- 17 RMSEA (90%CI) = 0.065 (.061,.069); Ho: RMSEA < .05; p < .0001
- 18 SRMR = 0.087
- 19 Note. p < 0.01 for all path coefficients. χ
2 /df: Goodness of fit χ2 statistic divided by its degrees of freedom; CFI: Comparative Fit Index; RMSEA: Root Mean Square Error of Approximation; SRMR: Standardized Root Mean Square Residual; see Table 1 for item labels.
Table 4. Exploratory factor analysis of the 28 LGB-KASH items (N = 540).
Item Factor1 Factor2 Factor3 h2 H1 −0.72 0.49 H2 −0.53 0.26 H3 −0.58 0.33 H4 −0.45 0.22 H5 −0.49 0.25 H6 −0.6 0.37 K1 0.75 0.54 K2 0.79 0.58 K3 0.69 0.49 K4 0.69 0.44 K5 0.54 0.37 C1 0.75 0.58 C2 0.80 0.64 C3 0.68 0.65 C4 0.69 0.51 C5 0.73 0.62 R1 0.62 0.41 R2 0.49 0.24 R3 0.69 0.53 R4 −0.38 0.25 R5 0.59 0.40 R6 0.08 R7 −0.43 0.34 0.39 I1 0.31 0.19 I2 0.32 0.25 I3 0.49 0.39 0.59 I4 0.47 0.39 I5 0.52 0.39 0.68 Eigenvalues 7.45 2.73 1.57 Inter-Factor Correlations: Factor1 Factor2 Factor3 Factor1 1 0.2183 −0.1770 Factor2 0.2183 1 −0.2126 Factor3 −0.1770 −0.2126 1
20 Note. h
Prior to analysis, the items composing the original Hate and Religious Conflict scales were reverse-coded so that all 28 items were in the same direction (higher scores associated with positive perceptions). The results of the MSA procedure are shown in Table 5. All the items were used in the first step of MCA. The algorithm selected 19 out of the 28 items forming five scales. The results suggest these 19 items were scalable, and each scale, respectively, allowed reasonable ordering of persons by their level of the latent trait based on their total scale scores. The first Mokken scale (scale 1 in Table 5) was composed of nine items (H1, H2, H3, C1, C2, C5, I3, I4, I5) belonging to three (H1, H2, H3) of the original scales. The second Mokken scale was composed of four (K1, K2, K3, K4) out of five items from the original Knowledge scale. The remaining three Mokken scales were composed of two items each, belonging to the original LGB Civil Rights, Religious Conflict, and Internal Affirmativeness, respectively. All but two of the Religious Conflict items were dropped. With the exception of the fifth Mokken scale, all scales had internal consistency measures >0.7. The final step in MSA with hierarchical scales indicates that two additional items (C1 and I4) can also be removed. A more conservative MSA with hierarchical assessment suggests 17 items.
Table 5. Results of Mokken scale analysis.
Internal consistency measures Scale Item Scalability coefficients Monotonicity assessment Cronbach's alpha Guttman's L-2 MS statist ic Hierarchy assessment Ranking Mean score 1 H1 0.54 0.59 non sig 0.87 0.88 0.90 OK 2 5.35 1 H2 0.50 non sig non sig 1 5.70 1 H6 0.51 non sig non sig 3 5.24 1 C1 0.60 non sig sig removed 4.87 1 C2 0.60 non sig sig 4 4.65 1 C5 0.61 non sig non sig 5 4.25 1 I3 0.68 OK sig 7 2.35 1 I4 0.51 OK sig removed 3.28 1 I5 0.68 OK non sig 6 2.50 2 K1 0.62 0.58 OK 0.82 0.82 0.83 non sig 4 1.37 2 K2 0.61 OK non sig 1 1.67 2 K3 0.56 OK non sig 2 1.60 2 K4 0.52 OK non sig 3 1.46 3 C3 0.62 0.62 non sig 0.74 0.74 0.76 OK 2 3.95 3 C4 0.62 OK OK 1 4.44 4 R3 0.59 0.59 non sig 0.73 0.73 0.74 OK 1 3.48 4 R5 0.59 non sig OK 2 3.37 5 I1 0.64 0.64 OK 0.69 0.69 0.74 OK 2 1.61 5 I2 0.64 OK OK 1 2.36
21 Note. H = overall scalability coefficient; MS = Molenaar Sijtsma; OK = no deviation found; non sig = deviation found but nonsignificant; sig = significant deviation found; removed = item removed to obtain a hierarchical scale; Ranking = order of items in the hierarchy of endorsement to the latent trait; Mean item scores are shown for reference. Items in the original Hate (H) and Religious Conflict (R) scales were reverse-coded prior to the Mokken analysis. See Table 1 for item labels.
The EFA shown in Table 4 was the starting point used to assess dimensionality of the item set. The eight items with negative loadings on Factor 1 were reverse-coded prior to analysis, so all items in this factor were in the same direction (positive perceptions). Next, the two IRT models (PCM and GRM) were fitted individually to the items composing each of the three factors. Table 6 shows the results of the PCM analysis. High values of infit or outfit statistics with goodness-of-fit tests p < 0.001 indicated seven ill-fitting items. The result suggested dropping three items from the original Hate scale, three items from the original Religious Conflict scale, and one item from the original Internalized Affirmative scale. The results of the GRM approach were analogous to those of the PCM approach. The item information curves produced by the GRM analysis for the first factor (16 items) are shown in Figure 4. The items with the lowest contributing information were the same items with worst fit statistics in the PCM analysis. Similar results were obtained for the remaining two factors. Again, these findings further suggest the original hypothesized 5-dimension structure for the 28 items are not stable.
Table 6. Results of the partial credit model analysis.
Factor Item Outfit Infit Chi-square 1 H1 −0.22 −3.43 510.07 0.66 1 H2 −0.28 −0.94 485.37 0.89 1 H3* 2.67 −0.58 708.90 0.00* 1 H4* 3.10 0.05 1072.34 0.00* 1 H5* 5.82 1.72 910.60 0.00* 1 H6 0.15 −1.29 531.52 0.40 1 C1 −4.25 −4.45 311.21 1 1 C2 −4.29 −5.33 314.44 1 1 C3 −4.17 −4.48 331.25 1 1 C4 0.87 −1.54 565.31 0.10 1 C5 −2.56 −4.79 417.14 1 1 R4* 6.75 4.61 882.03 0.00* 1 R7* 2.82 3.47 642.80 0.00* 1 I3 −1.60 −0.74 448.13 0.99 1 I4* 7.75 7.55 943.64 0.00* 1 I5 −2.95 −3.24 401.48 1 2 K1 −3.99 −3.45 235.31 1 2 K2 −2.02 −3.08 358.99 1 2 K3 −2.64 −1.88 336.12 1 2 K4 −2.65 −2.00 304.40 1 2 K5 −0.52 −0.15 443.80 0.69 2 I1 −0.93 0.48 384.73 1 2 I2 0.64 0.80 488.67 0.16 3 R1 −4.00 −5.35 381.00 1 3 R2 0.85 −1.30 539.36 0.19 3 R3 −5.84 −6.56 327.54 1 3 R5 −3.28 −3.55 413.87 1 3 R6* 3.38 1.63 640.7 0.00*
22 *Ill-fitting items; see Table 1 for item labels.
Graph: Figure 4. Item information curves for the first factor (16 items) generated by the graded response model. Items with higher peak contribute more to the overall aggregate information than items with lower peak. The highest peaked item was: "Hospitals should acknowledge same-sex partners equally to any other next of kin" (C2). Some of the lowest peaked items that did not contribute to the overall information are: "I have difficulty reconciling my religious views with my interest in being accepting of LGB people" (R4), "Hearing about a hate crime against a LGB person would not bother me" (H5), "I sometimes think about being violent toward LGB people" (H4), and "I have conflicting attitudes or beliefs about LGB people" (R7).
The psychometric examination of the 28-item five-factor LGB-KASH instrument presented here resulted in a lack of support of the original instrument structure: the CFA did not result in at least fair fit; the EFA suggested three underlying dimensions instead of five; and the non-parametric and parametric ordinal IRT procedures suggested dropping at least seven items (25%). All these results suggest instability in the originally proposed structure by Worthington et al. ([
Comparing the parametric and non-parametric IRT approaches is not straightforward in this case since the parametric analysis began with an assessment of dimensionality based on an EFA, while the non-parametric approach began with a clustering of items. In terms of the original items retained by each procedure, the MSA approach retained 19 items, while the parametric procedures retained 21. The main result difference between the two approaches is that the MSA procedure removed all but two of the seven Religious Conflict items, while the parametric IRT models removed only three of these items. One approach in improving the scale could be to remove or modify items common in both the MSA and IRT approaches.
The data confirm the need for an updated and comprehensive measure of attitudes toward the LGBTQIA+. In addition to paying close attention to overall measure and structure reliability of the instrument, attention should be paid to item content as well. As mentioned above, the quality of the items in the LGB-KASH have been called into question in the scientific literature as well. In a rapidly changing social climate, it is important to ensure any instrument assessing attitudes is general enough to not be outdated or obsolete in a relatively short time period. Items also need to have some degree of generalizability to ensure that respondents from diverse backgrounds and beliefs glean the same understanding from what is being asked. Finally, ensuring that items are simplified, capturing targeted information, as opposed to "loaded" items that may assess attitudes, multiple constructs also affects instrument validity and subsequent utility.
The quality of an instrument is integral to the integrity of findings being reported. The need to assess attitudes as it relates to the LGBTQIA+ population is rapidly expanding. In addition to assessing individual attitudes and beliefs from the U.S. population at large, recent developments in mental health treatment and health care as a whole are in need of metrics for evaluating competency of practitioners. For those interested in studying behavior change with targeted interventions, or making policies, there also needs to be measures that are consistent and reliable across time points.
The present study is not without limitations. A limitation of this instrument is that it does not distinguish between attitudes toward gay men, bisexuals, and lesbians separately; in that sense, all these groups are lumped together. Yet it may be that some people hold differing opinions and attitude toward such individuals. Further, data for the present study were provided by a convenience sample belonging to a particular population (college students) in a particular geographic location (the Deep South), so their answers to the items may differ from those obtained from the original normative adult population. Furthermore, the alternative instrument structures resulting from the IRT analyses presented here are exploratory and in need of validation in separate datasets. That being said, the psychometric analysis framework provided in this study (see Figure 1) can be used as a psychometric analysis template in analyzing and improving the instrument.
This study adds to the literature by providing a comprehensive psychometric evaluation of the LGB-KASH, arguably the current "gold standard" instrument for measuring attitudes toward LGBTQIA+ individuals. Current societal and policy efforts endorse provision of person-centered and culturally competent health care, workplace, and educational environments. Attitudes toward LGBTQIA+ individuals, nonetheless, remain complex and multidimensional in the United States, as evidenced in the current political debate and ongoing reactions to the historic ruling of the Supreme Court of the United States legalizing same-sex marriage. In order to examine the degree of acceptance of diversity versus stigma and prejudice in health care, workplace, and educational environments, a generalized and psychometrically sound instrument is needed to measure attitudes toward LGBTQIA+ individuals.
The results of this article strongly suggest the original LGB-KASH instrument structure is unstable and unreliable; however, the selected item sets from the psychometric procedures presented here could be used to revise the original instrument in subsequent studies and thus obtain a more psychometrically sound measure. Moreover, mixed-method approaches including cognitive interviewing and focus groups might identify additional dimensions for the development of new items to improve assessment of LGBTQIA+ attitudes. Finally, this article provides a step-by-step approach in performing a comprehensive psychometric analysis that may be useful to other research groups in the development or revision of measures.
No potential conflict of interest was reported by the authors.
By Dheeraj Raju; Lisa Beck; Andres Azuero; Casey Azuero; David Vance and Rebecca Allen
Reported by Author; Author; Author; Author; Author; Author