Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Scale construction decisions  





2 Scale construction method  





3 Multi-Item and Single-Item Scales  





4 Data types  





5 Composite measures  





6 Comparative and non comparative scaling  





7 Comparative scaling techniques  





8 Non-comparative scaling techniques  





9 Scale evaluation  





10 See also  





11 References  





12 Further reading  





13 External links  














Scale (social sciences)






فارسی
Lietuvių
Português
Suomi
 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


In the social sciences, scaling is the process of measuring or ordering entities with respect to quantitative attributes or traits. For example, a scaling technique might involve estimating individuals' levels of extraversion, or the perceived quality of products. Certain methods of scaling permit estimation of magnitudes on a continuum, while other methods provide only for relative ordering of the entities.

The level of measurement is the type of data that is measured.

The word scale, including in academic literature, is sometimes used to refer to another composite measure, that of an index. Those concepts are however different.[1]

Scale construction decisions[edit]

Scale construction method[edit]

Scales constructed should be representative of the construct that it intends to measure.[2] It is possible that something similar to the scale a person intends to create will already exist, so including those scale(s) and possible dependent variables in one's survey may increase validity of one's scale.

  1. Begin by generating at least ten items to represent each of the sub-scales. Administer the survey; the more representative and larger the sample, the more credibility one will have in the scales.
  2. Review the means and standard deviations for the items, dropping any items with skewed means or very low variance.
  3. Run an exploratory factor analysis with oblique rotation on items for the scales - it is important to differentiate them based on their loading on factors to create sub-scales that represents the construct. Request factors with eigenvalues (for calculating eigenvalue for each factor square the factor loading's and sum down the columns) greater than 1. It is easier to group the items by targeted scales. The more distinct the other items, the better the chances the items will load better in one's own scale.
  4. “Cleanly loaded items” are those items that load at least .40 on one factor and more than .10 greater on that factor than on any others. Identify those in the factor pattern.
  5. “Cross loaded items” are those that do not meet the above criterion. These are candidates to drop.
  6. Identify factors with only a few items that do not represent clear concepts, these are “uninterpretable scales.” Also identify any factors with only one item. These factors and their items are candidates to drop.
  7. Look at the candidates to drop and the factors to be dropped. Is there anything that needs to be retained because it is critical to one's construct. For example, if a conceptually important item only cross loads on a factor to be dropped, it is good to keep it for the next round.
  8. Drop the items, and run a confirmatory factor analysis asking the program to give only the number of factors after dropping the uninterpretable and single-item ones. Go through the process again starting at Step 3. Here various test reliability measures could also be taken.
  9. Keep running through the process until one get “clean factors” (until all factors have cleanly loaded items).
  10. Run the Alpha in the statistical program (asking for the Alpha's if each item is dropped). Any scales with insufficient Alphas should be dropped and the process repeated from Step 3. [Coefficient alpha=number of items2 x average correlation between different items/sum of all correlations in the correlation matrix (including the diagonal values)]
  11. Run correlational or regressional statistics to ensure the validity of the scale. For better practices, keep the final factors and all loadings of yours and similar scales selected in the Appendix of the created scale.

Multi-Item and Single-Item Scales[edit]

In most practical situations, multi-item scales are more effective in predicting outcomes compared to single items. The use of single-item measures in research is advised cautiously, their use should be limited to specific circumstances. [3][4]

Criterion Multi-item scale Single-item scale
Construct concreteness Abstract Concrete
Construct dimensionality/complexity Multidimensional, moderately complex Unidimensional or extremely complex
Semantic redundancy Low High
Primary role of construct Dependent or independent variable Moderator or control variable
Desired precision High Low
Monitoring changes Appropriate Problematic
Sampled population Homogenous Diverse
Sample size Large Limited

Table: Criteria for Assessing the Potential Use of Single-Item Measures[4]

Data types[edit]

The type of information collected can influence scale construction. Different types of information are measured in different ways.

  1. Some data are measured at the nominal level. That is, any numbers used are mere labels; they express no mathematical properties. Examples are SKU inventory codes and UPC bar codes.
  2. Some data are measured at the ordinal level. Numbers indicate the relative position of items, but not the magnitude of difference. An example is a preference ranking.
  3. Some data are measured at the interval level. Numbers indicate the magnitude of difference between items, but there is no absolute zero point. Examples are attitude scales and opinion scales.
  4. Some data are measured at the ratio level. Numbers indicate magnitude of difference and there is a fixed zero point. Ratios can be calculated. Examples include: age, income, price, costs, sales revenue, sales volume, and market share.

Composite measures[edit]

Composite measures of variables are created by combining two or more separate empirical indicators into a single measure. Composite measures measure complex concepts more adequately than single indicators, extend the range of scores available and are more efficient at handling multiple items.

In addition to scales, there are two other types of composite measures. Indexes are similar to scales except multiple indicators of a variable are combined into a single measure. The index of consumer confidence, for example, is a combination of several measures of consumer attitudes. A typology is similar to an index except the variable is measured at the nominal level.

Indexes are constructed by accumulating scores assigned to individual attributes, while scales are constructed through the assignment of scores to patterns of attributes.

While indexes and scales provide measures of a single dimension, typologies are often employed to examine the intersection of two or more dimensions. Typologies are very useful analytical tools and can be easily used as independent variables, although since they are not unidimensional it is difficult to use them as a dependent variable.

Comparative and non comparative scaling[edit]

With comparative scaling, the items are directly compared with each other (example: Does one prefer PepsiorCoke?). In noncomparative scaling each item is scaled independently of the others. (Example: How does one feel about Coke?)

Comparative scaling techniques[edit]

Non-comparative scaling techniques[edit]

Scale evaluation[edit]

Scales should be tested for reliability, generalizability, and validity. Generalizability is the ability to make inferences from a sample to the population, given the scale one have selected. Reliability is the extent to which a scale will produce consistent results. Test-retest reliability checks how similar the results are if the research is repeated under similar circumstances. Alternative forms reliability checks how similar the results are if the research is repeated using different forms of the scale. Internal consistency reliability checks how well the individual measures included in the scale are converted into a composite measure.

Scales and indexes have to be validated. Internal validation checks the relation between the individual measures included in the scale, and the composite scale itself. External validation checks the relation between the composite scale and other indicators of the variable, indicators not included in the scale. Content validation (also called face validity) checks how well the scale measures what is supposed to measured. Criterion validation checks how meaningful the scale criteria are relative to other possible criteria. Construct validation checks what underlying construct is being measured. There are three variants of construct validity. They are convergent validity, discriminant validity, and nomological validity (Campbell and Fiske, 1959; Krus and Ney, 1978). The coefficient of reproducibility indicates how well the data from the individual measures included in the scale can be reconstructed from the composite scale.

See also[edit]

References[edit]

  1. ^ Earl Babbie (1 January 2012). The Practice of Social Research. Cengage Learning. p. 162. ISBN 978-1-133-04979-1.
  • ^ McDonald, Roderick P. (2013-06-17). Test Theory: A Unified Treatment. Psychology Press. ISBN 978-1-135-67531-8.
  • ^ Diamantopoulos, Adamantio; Sarstedt, Marko; Fuchs, Christoph (2012). "Guidelines for choosing between multi-item and single-item scales for construct measurement: a predictive validity perspective". Journal of the Academy of Marketing Science. 40 (3): 434–449. doi:10.1007/s11747-011-0300-3. hdl:1959.13/1052296.
  • ^ a b Fuchs, Christoph; Diamantopoulos, Adamantios (2009). "Using single-item measures for construct measurement in management research: Conceptual issues and application guidelines" (PDF). Die Betriebswirtschaft. 69 (2).
  • ^ U.-D. Reips and F. Funke (2008) "Interval level measurement with visual analogue scales in Internet-based research: VAS Generator." doi:10.3758/BRM.40.3.699
  • Further reading[edit]

    External links[edit]


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Scale_(social_sciences)&oldid=1222842926"

    Categories: 
    Questionnaire construction
    Psychometrics
    Scales
    Index numbers
    Survey methodology
    Hidden categories: 
    Articles lacking in-text citations from December 2020
    All articles lacking in-text citations
    Articles needing additional references from December 2020
    All articles needing additional references
    Articles needing cleanup from April 2024
    All pages needing cleanup
    Articles containing how-to sections
     



    This page was last edited on 8 May 2024, at 07:08 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki