Jump to content

Main menu Navigation ●Main page ●Contents ●Current events ●Random article ●About Wikipedia ●Contact us ●Donate Contribute ●Help ●Learn to edit ●Community portal ●Recent changes ●Upload file

●Create account ●Log in ●Create account ● Log in Pages for logged out editors learn more ●Contributions ●Talk

(Top) 1 Versions and authors 2 Principle 3 Usage in lexicostatistics and glottochronology 4 Swadesh 100 original final list 5 Swadesh 207 list 6 Shorter lists 6.1 Sign languages 7 See also 8 Notes 9 References 10 External links

Swadesh list

●አማርኛ ●العربية ●অসমীয়া ●Asturianu ●Авар ●বাংলা ●閩南語 / Bân-lâm-gú ●Башҡортса ●Беларуская ●Беларуская (тарашкевіца) ●Brezhoneg ●Català ●Чӑвашла ●Deutsch ●Eesti ●Ελληνικά ●Español ●Esperanto ●Euskara ●فارسی ●Français ●Frysk ●Galego ●贛語 ●客家語/Hak-kâ-ngî ●한국어 ●Bahasa Indonesia ●Interlingue ●Italiano ●עברית ●ქართული ●Кыргызча ●Lombard ●Magyar ●Македонски ●Malagasy ●Bahasa Melayu ●閩東語 / Mìng-dĕ̤ng-ngṳ̄ ●日本語 ●Nordfriisk ●Олык марий ●Polski ●Português ●Ripoarisch ●Română ●Русиньскый ●Русский ●Slovenščina ●Svenska ●தமிழ் ●Thuɔŋjäŋ ●Türkçe ●Українська ●Vahcuengh ●Vepsän kel’ ●粵語 ●中文 Edit links ●Article ●Talk ●Read ●Edit ●View history Tools Actions ●Read ●Edit ●View history General ●What links here ●Related changes ●Upload file ●Special pages ●Permanent link ●Page information ●Cite this page ●Get shortened URL ●Download QR code ●Wikidata item Print/export ●Download as PDF ●Printable version In other projects ●Wiktionary Appearance From Wikipedia, the free encyclopedia

This article may be too technical for most readers to understand. Please help improve ittomake it understandable to non-experts, without removing the technical details. (November 2023) (Learn how and when to remove this message)

The Swadesh list (/ˈswɑːdɛʃ/) is a compilation of tentatively universal concepts for the purposes of lexicostatistics. Translations of the Swadesh list into a set of languages allow researchers to quantify the interrelatedness of those languages. The Swadesh list is named after linguist Morris Swadesh. It is used in lexicostatistics (the quantitative assessment of the genealogical relatedness of languages) and glottochronology (the dating of language divergence). Because there are several different lists, some authors also refer to "Swadesh lists".

Versions and authors[edit]

Morris Swadesh created several versions of his list. He started^[1] with a list of 215 meanings (falsely introduced as a list of 225 meanings in the paper due to a spelling error^[2]), which he reduced to 165 words for the Salish-Spokane-Kalispel language. In 1952, he published a list of 215 meanings,^[3] of which he suggested the removal of 16 for being unclear or not universal, with one added to arrive at 200 words. In 1955,^[4] he wrote, "The only solution appears to be a drastic weeding out of the list, in the realization that quality is at least as important as quantity. Even the new list has defects, but they are relatively mild and few in number." After minor corrections, the final 100-word list was published posthumously in 1971^[5] and 1972.

Other versions of lexicostatistical test lists were published e.g. by Robert Lees (1953), John A. Rea (1958:145f), Dell Hymes (1960:6), E. Cross (1964 with 241 concepts), W. J. Samarin (1967:220f), D. Wilson (1969 with 57 meanings), Lionel Bender (1969), R. L. Oswald (1971), Winfred P. Lehmann (1984:35f), D. Ringe (1992, passim, different versions), Sergei Starostin (1984, passim, different versions), William S-Y. Wang (1994), M. Lohr (2000, 128 meanings in 18 languages). B. Kessler (2002), and many others. The Concepticon,^[6] a project hosted at the Cross-Linguistic Linked Data (CLLD) project, collects various concept lists (including classical Swadesh lists) across different linguistic areas and times, currently listing 240 different concept lists.^[7]

Frequently used and widely available on the internet, is the version by Isidore Dyen (1992, 200 meanings of 95 language variants). Since 2010, a team around Michael Dunn has tried to update and enhance that list.^[8]

Principle[edit]

In origin, the words in the Swadesh lists were chosen for their universal, culturally independent availability in as many languages as possible, regardless of their "stability". Nevertheless, the stability of the resulting list of "universal" vocabulary under language change and the potential use of this fact for purposes of glottochronology have been analyzed by numerous authors, including Marisa Lohr 1999, 2000.^[9]

The Swadesh list was put together by Morris Swadesh on the basis of his intuition. Similar more recent lists, such as the Dolgopolsky list (1964) or the Leipzig–Jakarta list (2009), are based on systematic data from many different languages, but they are not yet as widely known nor as widely used as the Swadesh list.

Usage in lexicostatistics and glottochronology[edit]

Lexicostatistical test lists are used in lexicostatistics to define subgroupings of languages, and in glottochronology to "provide dates for branching points in the tree".^[10] The task of defining (and counting the number) of cognate words in the list is far from trivial, and often is subject to dispute, because cognates do not necessarily look similar, and recognition of cognates presupposes knowledge of the sound laws of the respective languages.

Swadesh 100 original final list[edit]

Swadesh's final list, published in 1971,^[5] contains 100 terms. Explanations of the terms can be found in Swadesh 1952^[3] or, where noted by a dagger (^†), in Swadesh 1955. Note that only this original sequence clarifies the correct meaning which is lost in an alphabetical order, e.g., in the case "27. bark" (originally without the specification here added).

I (first person singular pronoun)

you (second person singular pronoun; 1952 thou &ye)

we (1955: inclusive)

this

that

who? (“?” not 1971)

what? (“?” not 1971)

not

all (of a number)

many

one

two

big

long (not wide)

small

woman

man (adult male human)

person (individual human)

fish (noun)

bird

dog

louse

tree (not log)

seed (noun)

leaf (botanics)

root (botanics)

bark (of tree)

skin (1952: person’s)

flesh (1952 meat, flesh)

blood

bone

grease (1952: fat, organic substance)

egg

horn (of bull etc., not 1952)^†

tail

feather (large, not down)

hair (on head of humans)

head (anatomic)

ear

eye

nose

mouth

tooth (front, rather than molar)

tongue (anatomical)

claw (not in 1952)^†¹

foot (not leg)

knee (not 1952)^†

hand

belly (lower part of body, abdomen)

neck (not nape)

breasts (female; 1955 breast)^†

heart

liver

drink (verb)

eat (verb)

bite (verb)

see (verb)

hear (verb)

know (facts)

sleep (verb)

die (verb)

kill (verb)

swim (verb)

fly (verb)

walk (verb)

come (verb)

lie (on side, recline)

sit (verb)

stand (verb)

give (verb)

say (verb)^†

sun

moon (not 1952)^†

star

water (noun)

rain (noun, 1952 verb)

stone

sand

earth (soil)

cloud (not fog)

smoke (noun, of fire)

fire

ash(es)

burn (verb intransitive)

path (1952 road, trail; not street)

mountain (not hill)

red (color)

green (color)

yellow (color)

white (color)

black (color)

night

hot (adjective; 1952 warm, of weather)

cold (of weather)

full^†

new

good

round (not 1952)^†

dry (substance)

name

^ "Claw" was only added in 1955, but again replaced by many well-known specialists with (finger)nail, because expressions for "claw" are not available in many old, extinct, or lesser known languages.

The 110-item Global Lexicostatistical Database list uses the original 100-item Swadesh list, in addition to 10 other words from the Swadesh–Yakhontov list.^[11]

Swadesh 207 list[edit]

The most used list nowadays is the Swadesh 207-word list, adapted from Swadesh 1952.^[3]

In Wiktionary ("Swadesh lists by language"), Panlex^[12]^[13] and in Palisto's "Swadesh Word List of Indo-European languages",^[14] hundreds of Swadesh lists in this form can be found.

you (singular)

they (singular)

you (plural)

they (plural)

this

that

here

there

who

what

where

when

how

not

all

many

some

few

other

one

two

three

four

five

big

long

wide

thick

heavy

small

short

narrow

thin

woman

man (adult male)

man (human being)

child

wife

husband

mother

father

animal

fish

bird

dog

louse

snake

worm

tree

forest

stick

fruit

seed

leaf

root

bark (of a tree)

flower

grass

rope

skin

meat

blood

bone

fat (noun)

egg

horn

tail

feather

hair

head

ear

eye

nose

mouth

tooth

tongue (organ)

fingernail

foot

leg

knee

hand

wing

belly

guts

neck

back

breast

heart

liver

to drink

to eat

to bite

to suck

to spit

to vomit

to blow

to breathe

to laugh

to see

to hear

to know

to think

to smell

to fear

to sleep

to live

to die

to kill

to fight

to hunt

to hit

to cut

to split

to stab

to scratch

to dig

to swim

to fly

to walk

to come

to lie (as in a bed)

to sit

to stand

to turn (intransitive)

to fall

to give

to hold

to squeeze

to rub

to wash

to wipe

to pull

to push

to throw

to tie

to sew

to count

to say

to sing

to play

to float

to flow

to freeze

to swell

sun

moon

star

water

rain

river

lake

sea

salt

stone

sand

dust

earth

cloud

fog

sky

wind

snow

ice

smoke

fire

ash

to burn

road

mountain

red

green

yellow

white

black

night

day

year

warm

cold

full

new

old

good

bad

rotten

dirty

straight

round

sharp (as a knife)

dull (as a knife)

smooth

wet

dry

correct

near

far

right

left

with

and

because

name

Shorter lists[edit]

The Swadesh–Yakhontov list is a 35-word subset of the Swadesh list posited as especially stable by Russian linguist Sergei Yakhontov around the 1960s, although the list was only officially published in 1991.^[15] It has been used in lexicostatistics by linguists such as Sergei Starostin. With their Swadesh numbers, they are:^[16]

you (singular)

this

who

what

one

two

fish

dog

louse

blood

bone

egg

horn

tail

ear

eye

nose

tooth

tongue

hand

know

die

give

sun

moon

water

salt

stone

wind

fire

year

full

new

name

Holman et al. (2008) found that in identifying the relationships between Chinese dialects the Swadesh–Yakhontov list was less accurate than the original Swadesh-100 list. Further they found that a different (40-word) list (also known as the ASJP list) was just as accurate as the Swadesh-100 list. However, they calculated the relative stability of the words by comparing retentions between languages in established language families. They found no statistically significant difference in the correlations in the families of the Old versus the New World.

The ranked Swadesh-100 list, with Swadesh numbers and relative stability, is as follows (Holman et al., Appendix. Asterisked words appear on the 40-word list):

22 *louse (42.8)

12 *two (39.8)

75 *water (37.4)

39 *ear (37.2)

61 *die (36.3)

1 *I (35.9)

53 *liver (35.7)

40 *eye (35.4)

48 *hand (34.9)

58 *hear (33.8)

23 *tree (33.6)

19 *fish (33.4)

100 *name (32.4)

77 *stone (32.1)

43 *tooth (30.7)

51 *breasts (30.7)

2 *you (30.6)

85 *path (30.2)

31 *bone (30.1)

44 *tongue (30.1)

28 *skin (29.6)

92 *night (29.6)

25 *leaf (29.4)

76 rain (29.3)

62 kill (29.2)

30 *blood (29.0)

34 *horn (28.8)

18 *person (28.7)

47 *knee (28.0)

11 *one (27.4)

41 *nose (27.3)

95 *full (26.9)

66 *come (26.8)

74 *star (26.6)

86 *mountain (26.2)

82 *fire (25.7)

3 *we (25.4)

54 *drink (25.0)

57 *see (24.7)

27 bark (24.5)

96 *new (24.3)

21 *dog (24.2)

72 *sun (24.2)

64 fly (24.1)

32 grease (23.4)

73 moon (23.4)

70 give (23.3)

52 heart (23.2)

36 feather (23.1)

90 white (22.7)

89 yellow (22.5)

20 bird (21.8)

38 head (21.7)

79 earth (21.7)

46 foot (21.6)

91 black (21.6)

42 mouth (21.5)

88 green (21.1)

60 sleep (21.0)

7 what (20.7)

26 root (20.5)

45 claw (20.5)

56 bite (20.5)

83 ash (20.3)

87 red (20.2)

55 eat (20.0)

33 egg (19.8)

6 who (19.0)

99 dry (18.9)

37 hair (18.6)

81 smoke (18.5)

8 not (18.3)

4 this (18.2)

24 seed (18.2)

16 woman (17.9)

98 round (17.9)

14 long (17.4)

69 stand (17.1)

97 good (16.9)

17 man (16.7)

94 cold (16.6)

29 flesh (16.4)

50 neck (16.0)

71 say (16.0)

84 burn (15.5)

35 tail (14.9)

78 sand (14.9)

5 that (14.7)

65 walk (14.4)

68 sit (14.3)

10 many (14.2)

9 all (14.1)

59 know (14.1)

80 cloud (13.9)

63 swim (13.6)

49 belly (13.5)

13 big (13.4)

93 hot (11.6)

67 lie (11.2)

15 small (6.3)

Sign languages[edit]

In studying the sign languages of Vietnam and Thailand, linguist James Woodward noted that the traditional Swadesh list applied to spoken languages was unsuited for sign languages. The Swadesh list results in overestimation of the relationships between sign languages, due to indexical signs such as pronouns and parts of the body. The modified list is as follows, in largely alphabetical order:^[17]

animal

bad

because

bird

black

blood

child

count

day

die

dirty

dog

dry

dull

dust

earth

egg

grease

father

feather

fire

fish

flower

good

grass

green

heavy

how

hunt

husband

ice

kill

laugh

leaf

lie

live

long

louse

man

meat

mother

mountain

name

narrow

new

night

not

old

other

person

play

rain

red

correct

river

rope

salt

sea

sharp

short

sing

sit

smooth

snake

snow

stand

star

stone

sun

tail

thin

tree

vomit

warm

water

wet

what

when

where

white

who

wide

wife

wind

with

woman

wood

worm

year

yellow

full

moon

brother

cat

dance

pig

sister

work

Part of a serieson
Linguistics
Outline History Index
General linguistics Diachronic Lexicography Morphology Phonology Pragmatics Semantics Syntax Syntax–semantics interface Typology
Applied linguistics Acquisition Anthropological Applied Computational Conversation analysis Corpus linguistics Discourse analysis Distance Documentation Ethnography of communication Ethnomethodology Forensic History of linguistics Interlinguistics Neurolinguistics Philology Philosophy of language Phonetics Psycholinguistics Sociolinguistics Text Translating and interpreting Writing systems
Theoretical frameworks Formalist Constituency Dependency Distributionalism Generative Glossematics Functional Cognitive Construction grammar Functional discourse grammar Grammaticalization Interactional linguistics Prague circle Systemic functional Usage-based Structuralism
Topics Autonomy of syntax Compositionality Conservative and innovative language Descriptivism Etymology Iconicity Internet linguistics LGBT linguistics Origin of language Orthography Philosophy of linguistics Prescriptivism Second-language acquisition Theory of language
Portal
v t e

Notes[edit]

^ Swadesh 1950: 161

^ List, J.-M. (2018): Towards a history of concept list compilation in historical linguistics. History and Philosophy of the Language Sciences 5.10. URL

^ ^a ^b ^c Swadesh 1952: 456–7 PDF

^ Swadesh 1955: 125

^ ^a ^b Swadesh 1971: 283

^ Concepticon. doi:10.5281/zenodo.19782

^ List, J.-M., M. Cysouw, and R. Forkel (2016): Concepticon. A resource for the linking of concept lists. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation. 2393-2400. PDF

^ "IELex :: IELex". GitHub. March 2022.

^ Marisa Lohr (2000), "New Approaches to Lexicostatistics and Glottochronology" in C. Renfrew, A. McMahon and L. Trask, ed. Time Depth in Historical Linguistics, Vol. 1, pp. 209–223

^ Sheila Embleton (1992), in W. Bright, ed., International Encyclopaedia of Linguistics, Oxford University Press, p. 131

^ Starostin, George (ed.) 2011-2019. The Global Lexicostatistical Database. Moscow: Higher School of Economics, & Santa Fe: Santa Fe Institute. Accessed on 2020-12-26.

^ Jonathan Pool (2016), Panlex Swadesh Lists PDF

^ David Kamholz, Jonathan Pool, Susan Colowick (2014), PanLex: Building a Resource for Panlingual Lexical Translation PDF

^ Palisto (2013), Swadesh Word List of Indo-European languages .

^ Concept list Yakhontov 1991 100. Concepticon. Accessed 2020-12-30.

^ Starostin 1991

^ Karen Emmorey; Harlan L. Lane (2000). The Signs of Language Revisited: An Anthology to Honor Ursula Bellugi and Edward Klima. Psychology Press. pp. 20–21. ISBN 978-0-8058-3246-4. Retrieved 26 September 2011.

References[edit]

Campbell, Lyle. (1998). Historical Linguistics: An Introduction. Edinburgh: Edinburgh University Press. ISBN 0-262-53267-0.

Embleton, Sheila (1995). Review of An Indo-European Classification: A Lexicostatistical Experiment by Isidore Dyen, J.B. Kruskal and P.Black. TAPS Monograph 82–5, Philadelphia. in Diachronica Vol. 12, no. 2, 263–68.

Gudschinsky, Sarah. (1956). "The ABCs of Lexicostatistics (Glottochronology)." Word, Vol. 12, 175–210.

Hoijer, Harry. (1956). "Lexicostatistics: A Critique." Language, Vol. 32, 49–60.

Holm, Hans J. (2007). "The New Arboretum of Indo-European 'Trees': Can New Algorithms Reveal the Phylogeny and Even Prehistory of Indo-European?" Journal of Quantitative Linguistics, Vol. 14, 167–214.

Holman, Eric W., Søren Wichmann, Cecil H. Brown, Viveka Velupillai, André Müller, Dik Bakker (2008). "Explorations in Automated Language Classification". Folia Linguistica, Vol. 42, no. 2, 331–354

Sankoff, David (1970). "On the Rate of Replacement of Word-Meaning Relationships." Language, Vol. 46, 564–569.

Starostin, Sergei (1991). Altajskaja Problema i Proisxozhdenie Japonskogo Jazyka [The Altaic Problem and the Origin of the Japanese Language]. Moscow: Nauka

Swadesh, Morris. (1950). "Salish Internal Relationships." International Journal of American Linguistics, Vol. 16, 157–167.

Swadesh, Morris. (1952). "Lexicostatistic Dating of Prehistoric Ethnic Contacts." Proceedings of the American Philosophical Society, Vol. 96, 452–463.

Swadesh, Morris. (1955). "Towards Greater Accuracy in Lexicostatistic Dating." International Journal of American Linguistics, Vol. 21, 121–137.

Swadesh, Morris. (1971). The Origin and Diversification of Language. Ed. post mortem by Joel Sherzer. Chicago: Aldine. ISBN 0-202-01001-5. Contains final 100-word list on p. 283.

Swadesh, Morris, et al. (1972). "What is Glottochronology?" in Morris Swadesh and Joel Sherzer, ed., The Origin and Diversification of Language, pp. 271–284. London: Routledge & Kegan Paul. ISBN 0-202-30841-3.

Wittmann, Henri (1973). "The Lexicostatistical Classification of the French-Based Creole Languages." Lexicostatistics in Genetic Linguistics: Proceedings of the Yale Conference, April 3–4, 1971, dir. Isidore Dyen, 89–99. La Haye: Mouton.[1]

External links[edit]

Look up Appendix:Swadesh lists in Wiktionary, the free dictionary.

v t e Long-range comparative linguistics
Concepts	Comparative method Etymological dictionary Glottochronology Lexicostatistics Linguistic reconstruction Internal reconstruction Linguistic universal Macrofamily Mass comparison Origin of language Paleolinguistics Proto-language Swadesh list Dolgopolsky list Leipzig–Jakarta list
Language families	Proto-human Borean Amerind Nostratic Elamo-Dravidian Eurasiatic Altaic Ural-Altaic Indo-Uralic Sino-Uralic Dené–Caucasian North Caucasian Austric Indo-Pacific Ainu-Minoan
Linguists	John Bengtson Václav Blažek Allan R. Bomhard Svetlana Burlak Aharon Dolgopolsky Vladimir Dybo Harold C. Fleming Joseph Greenberg Eugene Helimski Murray Gell-Mann Vladislav Illich-Svitych Frederik Kortlandt Alexis Manaster Ramer Sergei Nikolaev Sorin Paliga Holger Pedersen Ilia Peiros Martine Robbeets Merritt Ruhlen Vitaly Shevoroshkin Georgiy Starostin Sergei Starostin Alfredo Trombetti
Journals	Journal of Language Relationship Mother Tongue
Books	Etymological Dictionary of the Altaic Languages The Languages of Africa
Institutions and schools	Evolution of Human Languages Institute of Linguistics of the Russian Academy of Sciences Moscow School of Comparative Linguistics Russian State University for the Humanities Santa Fe Institute
Linguistics portal Category

Retrieved from "https://en.wikipedia.org/w/index.php?title=Swadesh_list&oldid=1219925534" Categories: ●Historical linguistics ●Comparative linguistics ●Quantitative linguistics ●Word lists ●Linguistics lists ●1950 documents Hidden categories: ●Articles with short description ●Short description is different from Wikidata ●Wikipedia articles that are too technical from November 2023 ●All articles that are too technical ●This page was last edited on 20 April 2024, at 18:35 (UTC). ●Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. ●Privacy policy ●About Wikipedia ●Disclaimers ●Contact Wikipedia ●Code of Conduct ●Developers ●Statistics ●Cookie statement ●Mobile view