Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Expanded definition  





2 In dictionaries  





3 Statistically significant collocation  





4 See also  





5 References  





6 External links  














Collocation






العربية

 / Bân-lâm-gú
Català
Čeština
Deutsch
Español
Esperanto
Euskara
فارسی
Français
Gaeilge

Հայերեն
Bahasa Indonesia
Italiano
Lietuvių
Nederlands

Polski
Português
Русский
Slovenščina
Српски / srpski
Suomi
Svenska
Türkçe
Українська


 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Incorpus linguistics, a collocation is a series of words or terms that co-occur more often than would be expected by chance. In phraseology, a collocation is a type of compositional phraseme, meaning that it can be understood from the words that make it up. This contrasts with an idiom, where the meaning of the whole cannot be inferred from its parts, and may be completely unrelated.

There are about seven main types of collocations: adjective + noun, noun + noun (such as collective nouns), noun + verb, verb + noun, adverb + adjective, verbs + prepositional phrase (phrasal verbs), and verb + adverb.

Collocation extraction is a computational technique that finds collocations in a document or corpus, using various computational linguistics elements resembling data mining.

Expanded definition[edit]

Collocations are partly or fully fixed expressions that become established through repeated context-dependent use. Such terms as crystal clear, middle management, nuclear family, and cosmetic surgery are examples of collocated pairs of words.

Collocations can be in a syntactic relation (such as verb–object: make and decision), lexical relation (such as antonymy), or they can be in no linguistically defined relation. Knowledge of collocations is vital for the competent use of a language: a grammatically correct sentence will stand out as awkward if collocational preferences are violated. This makes collocation an interesting area for language teaching.

Corpus linguists specify a key word in context (KWIC) and identify the words immediately surrounding them. This gives an idea of the way words are used.

The processing of collocations involves a number of parameters, the most important of which is the measure of association, which evaluates whether the co-occurrence is purely by chance or statistically significant. Due to the non-random nature of language, most collocations are classed as significant, and the association scores are simply used to rank the results. Commonly used measures of association include mutual information, t scores, and log-likelihood.[1][2]

Rather than select a single definition, Gledhill[3] proposes that collocation involves at least three different perspectives: co-occurrence, a statistical view, which sees collocation as the recurrent appearance in a text of a node and its collocates;[4][5][6] construction, which sees collocation either as a correlation between a lexeme and a lexical-grammatical pattern,[7] or as a relation between a base and its collocative partners;[8] and expression, a pragmatic view of collocation as a conventional unit of expression, regardless of form.[9][10] These different perspectives contrast with the usual way of presenting collocation in phraseological studies. Traditionally speaking, collocation is explained in terms of all three perspectives at once, in a continuum:

Free combination ↔ bound collocation ↔ frozen idiom

In dictionaries[edit]

In 1933, Harold Palmer's Second Interim Report on English Collocations highlighted the importance of collocation as a key to producing natural-sounding language, for anyone learning a foreign language.[11] Thus from the 1940s onwards, information about recurrent word combinations became a standard feature of monolingual learner's dictionaries. As these dictionaries became "less word-centred and more phrase-centred",[12] more attention was paid to collocation. This trend was supported, from the beginning of the 21st century, by the availability of large text corpora and intelligent corpus-querying software, making it possible to provide a more systematic account of collocation in dictionaries. Using these tools, dictionaries such as the Macmillan English Dictionary and the Longman Dictionary of Contemporary English included boxes or panels with lists of frequent collocations.[13]

There are also a number of specialized dictionaries devoted to describing the frequent collocations in a language.[14] These include (for Spanish) Redes: Diccionario combinatorio del español contemporaneo (2004), (for French) Le Robert: Dictionnaire des combinaisons de mots (2007), and (for English) the LTP Dictionary of Selected Collocations (1997) and the Macmillan Collocations Dictionary (2010).[15]

Statistically significant collocation[edit]

Student's t-test can be used to determine whether the occurrence of a collocation in a corpus is statistically significant.[16] For a bigram , let be the unconditional probability of occurrence of in a corpus with size , and let be the unconditional probability of occurrence of in the corpus. The t-score for the bigram is calculated as:

where is the sample mean of the occurrence of , is the number of occurrences of , is the probability of under the null-hypothesis that and appear independently in the text, and is the sample variance. With a large , the t-test is equivalent to a Z-test.

See also[edit]

  • Agreement (linguistics)
  • Cliché
  • Collocational restriction
  • Collostructional analysis
  • Compound noun, adjective and verb
  • Government (linguistics)
  • Idiom (language structure)
  • Irreversible binomial
  • Isocolon
  • Lexical item
  • N-gram
  • Phrasal verb
  • Phraseology
  • Phraseme
  • Sketch Engine
  • Statistically improbable phrase
  • Word sketch
  • References[edit]

    1. ^ Dunning, Ted (1993): "Accurate methods for the statistics of surprise and coincidence Archived 2012-08-05 at the Wayback Machine". Computational Linguistics 19, 1 (Mar. 1993), 61–74.
  • ^ Dunning, Ted (2008-03-21). "Surprise and Coincidence". blogspot.com. Archived from the original on 2012-01-20. Retrieved 2012-04-09.
  • ^ Gledhill C. (2000): Collocations in Science Writing Archived 2023-06-29 at the Wayback Machine, Narr, Tübingen
  • ^ Firth J.R. (1957): Papers in Linguistics 1934–1951. Oxford: Oxford University Press.
  • ^ Sinclair J. (1996): "The Search for Units of Meaning", in Textus, IX, 75–106.
  • ^ Smadja F. A & McKeown, K. R. (1990): "Automatically extracting and representing collocations for language generation Archived 2015-09-06 at the Wayback Machine", Proceedings of ACL'90, 252–259, Pittsburgh, Pennsylvania.
  • ^ Hunston S. & Francis G. (2000): Pattern Grammar — A Corpus-Driven Approach to the Lexical Grammar of English Archived 2023-06-29 at the Wayback Machine, Amsterdam, John Benjamins
  • ^ Hausmann F. J. (1989): Le dictionnaire de collocations. In Hausmann F.J., Reichmann O., Wiegand H.E., Zgusta L.(eds), Wörterbücher : ein internationales Handbuch zur Lexikographie. Dictionaries. Dictionnaires. Berlin/New-York : De Gruyter. 1010–1019.
  • ^ Moon R. (1998): Fixed Expressions and Idioms, a Corpus-Based Approach. Oxford, Oxford University Press.
  • ^ Frath P. & Gledhill C. (2005): "Free-Range Clusters or Frozen Chunks? Reference as a Defining Criterion for Linguistic Units[dead link]", in Recherches anglaises et Nord-américaines, vol. 38 :25–43
  • ^ Cowie, A.P., English Dictionaries for Foreign Learners, Oxford University Press 1999:54–56
  • ^ Bejoint, H., The Lexicography of English, Oxford University Press 2010: 318
  • ^ "MED Second Edition – Key features – Macmillan". macmillandictionaries.com. Archived from the original on 2020-09-28. Retrieved 2011-08-24.
  • ^ Herbst, T. and Klotz, M. 'Syntagmatic and Phraseological Dictionaries' in Cowie, A.P. (Ed.) The Oxford History of English Lexicography, 2009: part 2, 234–243
  • ^ "Macmillan Collocation Dictionary – How it was written - Macmillan". macmillandictionaries.com. Archived from the original on 2018-12-21. Retrieved 2011-08-24.
  • ^ Manning, Chris; Schütze, Hinrich (1999). Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press. pp. 163–166. ISBN 0262133601.
  • External links[edit]


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Collocation&oldid=1222607671"

    Categories: 
    Lexical units
    Language education
    Corpus linguistics
    Semantic relations
    Hidden categories: 
    Webarchive template wayback links
    All articles with dead external links
    Articles with dead external links from July 2022
    Articles with short description
    Short description matches Wikidata
    Articles with BNF identifiers
    Articles with BNFdata identifiers
    Articles with GND identifiers
    Articles with J9U identifiers
    Articles with LCCN identifiers
    Articles with NKC identifiers
     



    This page was last edited on 6 May 2024, at 22:11 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki