Wiktionary






From Wikipedia, the free encyclopedia
 


Wiktionary
Logo of English Wiktionary
Screenshot: Main Page of the English Wiktionary on April 2, 2021.

Type of site: Online dictionary
Available in: Multilingual (169 active)[1]
Owner: Wikimedia Foundation
Created by: Wikimedia community
URL: wiktionary.org
Commercial: No
Registration: Optional
Launched: December 12, 2002
Current status: Active

    Wiktionary (UK: /ˈwɪkʃənəri/, WIK-shə-nər-ee; US: /ˈwɪkʃənɛri/, WIK-shə-nerr-ee; rhyming with "dictionary") is a multilingual, web-based project to create a free content dictionary of terms (including words, phrases, proverbs, linguistic reconstructions, etc.) in all natural languages and in a number of artificial languages. These entries may contain definitions, images for illustration, pronunciations, etymologies, inflections, usage examples, quotations, related terms, and translations of terms into other languages, among other features. It is collaboratively edited via a wiki. Its name is a portmanteau of the words wiki and dictionary. It is available in 193 languages and in Simple English. Like its sister project Wikipedia, Wiktionary is run by the Wikimedia Foundation and is written collaboratively by volunteers, dubbed "Wiktionarians". Its wiki software, MediaWiki, allows almost anyone with access to the website to create and edit entries.

    Because Wiktionary is not limited by print space considerations, most of Wiktionary's language editions provide definitions and translations of terms from many languages, and some editions offer additional information typically found in thesauri.

    Wiktionary's data is frequently used in various natural language processing tasks.

    History and development

    Wiktionary was brought online on December 12, 2002,[2] following a proposal by Daniel Alston and an idea by Larry Sanger, co-founder of Wikipedia.[3] On March 28, 2004, the first non-English Wiktionaries were initiated in French and Polish. Wiktionaries in numerous other languages have since been started. Wiktionary was hosted on a temporary domain name (wiktionary.wikipedia.org) until May 1, 2004, when it switched to the current domain name.[a] As of July 2021, Wiktionary features over 30 million articles (and even more entries) across its editions.[4] The largest of the language editions is the English Wiktionary, with over 7.5 million entries, followed by the French Wiktionary with over 4.7 million and the Malagasy Wiktionary with over 3.5 million entries. Forty-three Wiktionary language editions contain over 100,000 entries each.[b]

    The use of bots to generate large numbers of articles is visible as "growth spurts" in this graph of article counts at the largest eight Wiktionary editions. (Data as of December 2009)

    Many of the definitions at the project's largest language editions were created by bots that found creative ways to generate entries or (rarely) automatically imported thousands of entries from previously published dictionaries. Seven of the 18 bots registered at the English Wiktionary in 2007[c] created 163,000 of the entries there.[5]

    Another of these bots, "ThirdPersBot", was responsible for the addition of a number of third-person conjugations that would not have received their own entries in standard dictionaries; for instance, it defined "smoulders" as the "third-person singular simple present form of smoulder." Of the 1,269,938 definitions the English Wiktionary provides for 996,450 English words, 478,068 are "form of" definitions of this kind.[6] This means that even without such entries, its coverage of English is significantly larger than that of major monolingual print dictionaries. Merriam-Webster's Third New International Dictionary of the English Language, Unabridged, for instance, has 475,000 entries (with many additional embedded headwords); the Oxford English Dictionary has 615,000 headwords, but includes Middle English as well, for which the English Wiktionary has an additional 34,234 gloss definitions. Detailed statistics exist to show how many entries of various kinds exist.

    The English Wiktionary does not rely on bots to the extent that some other editions do. The French and Vietnamese Wiktionaries, for example, imported large sections of the Free Vietnamese Dictionary Project (FVDP), which provides free content bilingual dictionaries to and from Vietnamese.[d] These imported entries make up virtually all of the Vietnamese edition's contents. Like the English edition, the French Wiktionary has imported approximately 20,000 entries from the Unihan database of Chinese, Japanese, Korean and Indian characters. The French Wiktionary grew rapidly in 2006, thanks in large part to bots that copied many entries from old, freely licensed dictionaries, such as the eighth edition of the Dictionnaire de l'Académie française (1935, around 35,000 words), and added words from other Wiktionary editions with French translations. The Russian edition grew by nearly 80,000 entries as "LXbot" added boilerplate entries (with headings, but without definitions) for words in English and German.[7]

    As of July 2021, the English Wiktionary has over 791,870 gloss definitions and over 1,269,938 total definitions (including different forms) for English entries alone, with a total of over 9,928,056 definitions across all languages.[8]
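
    The counts quoted in this section can be cross-checked with simple arithmetic. The sketch below uses only the figures given above (the variable names are illustrative, not taken from any Wiktionary tool): subtracting the "form of" definitions from the total yields the gloss-definition count reported for July 2021.

```python
# Figures quoted in the text for English entries in the English Wiktionary.
TOTAL_DEFINITIONS = 1_269_938    # all definitions, including "form of" ones
FORM_OF_DEFINITIONS = 478_068    # e.g. "smoulders" defined as a form of "smoulder"


def gloss_definitions(total: int, form_of: int) -> int:
    """Definitions that carry an actual gloss rather than pointing to a lemma."""
    return total - form_of


gloss = gloss_definitions(TOTAL_DEFINITIONS, FORM_OF_DEFINITIONS)
form_of_share = FORM_OF_DEFINITIONS / TOTAL_DEFINITIONS

print(f"gloss definitions: {gloss:,}")           # gloss definitions: 791,870
print(f"form-of share of all definitions: {form_of_share:.1%}")
```

    The result matches the 791,870 gloss definitions cited for July 2021, so the two statistics appear to come from the same snapshot.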

    Logos

    Wiktionary has historically lacked a uniform logo across its numerous language editions. Some editions use logos that depict a dictionary entry about the term "Wiktionary", based on the previous English Wiktionary logo, which was designed by Brooke Vibber, a MediaWiki developer.[9] Because a purely textual logo must vary considerably from language to language, a four-phase contest to adopt a uniform logo was held at the Wikimedia Meta-Wiki from September to October 2006.[e] Some communities adopted the winning entry by "Smurrayinchester", a 3×3 grid of wooden tiles, each bearing a character from a different writing system. However, the poll did not see as much participation from the Wiktionary community as some community members had hoped, and a number of the larger wikis ultimately kept their textual logos.[e]

    In April 2009, the issue was resurrected with a new contest. This time, a depiction by "AAEngelman" of an open hardbound dictionary won a head-to-head vote against the 2006 logo, but the process to refine and adopt the new logo then stalled.[10] In the following years, some wikis replaced their textual logos with one of the two newer logos. In 2012, 55 wikis that had been using the English Wiktionary logo received localized versions of the 2006 design by "Smurrayinchester".[f] In July 2016, the English Wiktionary adopted a variant of this logo.[11] As of 4 July 2016, 135 wikis, representing 61% of Wiktionary's entries, use a logo based on the 2006 design by "Smurrayinchester", 33 wikis (36%) use a textual logo, and three wikis (3%) use the 2009 design by "AAEngelman".[12]

    Multi-lingual

    As of July 2024, there are Wiktionary sites for 193 languages, of which 169 are active and 24 are closed.[1] The active sites have 39,899,898 articles, and the closed sites have 339 articles.[13] There are 7,310,748 registered users, of whom 5,933 are recently active.[13]

    The top ten Wiktionary language projects by mainspace article count:[13]

    #   Language  Wiki  Good       Total      Edits       Admins  Users      Active users  Files
    1   English   en    8,070,751  9,552,890  80,577,209  80      4,198,319  2,332         14
    2   French    fr    5,805,127  6,423,269  35,201,727  32      373,171    524           6
    3   Malagasy  mg    4,289,013  4,353,628  33,023,955  2       12,137     51            3
    4   Chinese   zh    1,718,837  2,364,116  8,541,783   10      121,082    88            1
    5   Greek     el    1,526,657  1,583,227  6,906,127   10      61,612     77            23
    6   Russian   ru    1,362,768  2,877,151  13,432,529  15      317,503    245           184
    7   German    de    1,132,492  1,314,670  10,083,992  13      239,436    188           107
    8   Kurdish   ku    1,000,808  1,096,745  6,011,924   7       12,607     30            15
    9   Swedish   sv    942,860    982,039    4,044,067   13      57,400     53            1
    10  Spanish   es    928,433    985,985    5,490,718   8       168,687    113           14

    For a complete list with totals see Wikimedia Statistics: [14]
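
    The statistics above are attributed to the MediaWiki Siteinfo API.[13] As a rough sketch of how one row of such a table might be reproduced, the code below builds a Siteinfo statistics query for a given language edition and maps the response onto the table's columns. The column-to-key mapping (for example "Good" to `articles` and "Files" to `images`), the User-Agent string, and the helper names are assumptions made for illustration; the sample payload simply reuses the English row quoted above so the example runs offline.

```python
import json
import urllib.parse
import urllib.request

# Assumed mapping: table column -> key in the Siteinfo "statistics" object.
COLUMN_TO_KEY = {
    "Good": "articles",
    "Total": "pages",
    "Edits": "edits",
    "Admins": "admins",
    "Users": "users",
    "Active users": "activeusers",
    "Files": "images",
}


def siteinfo_url(wiki: str) -> str:
    """Build the statistics query URL for one language edition, e.g. 'en'."""
    query = urllib.parse.urlencode({
        "action": "query",
        "meta": "siteinfo",
        "siprop": "statistics",
        "format": "json",
    })
    return f"https://{wiki}.wiktionary.org/w/api.php?{query}"


def table_row(wiki: str, payload: dict) -> dict:
    """Turn a decoded Siteinfo response into one row of the table."""
    stats = payload["query"]["statistics"]
    row = {"Wiki": wiki}
    for column, key in COLUMN_TO_KEY.items():
        row[column] = stats[key]
    return row


def fetch_row(wiki: str) -> dict:
    """Fetch live statistics (needs network access; not called below)."""
    request = urllib.request.Request(
        siteinfo_url(wiki),
        headers={"User-Agent": "wiktionary-stats-sketch/0.1"},  # hypothetical
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return table_row(wiki, json.load(response))


# Offline sample shaped like a Siteinfo response, filled with the English row.
SAMPLE_PAYLOAD = {"query": {"statistics": {
    "pages": 9552890, "articles": 8070751, "edits": 80577209, "images": 14,
    "users": 4198319, "activeusers": 2332, "admins": 80,
}}}

english_row = table_row("en", SAMPLE_PAYLOAD)
print(english_row)
```

    Calling `fetch_row("en")` would return current values, which will differ from the July 2024 snapshot shown in the table.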

    Critical reception

    Critical reception of Wiktionary has been mixed. In 2006, Jill Lepore wrote in the article "Noah's Ark" for The New Yorker,[g]

    There's no show of hands at Wiktionary. There's not even an editorial staff. "Be your own lexicographer!", might be Wiktionary's motto. Who needs experts? Why pay good money for a dictionary written by lexicographers when we could cobble one together ourselves?

    Wiktionary isn't so much republican or democratic as Maoist. And it's only as good as the copyright-expired books from which it pilfers.

    Keir Graff's review for Booklist was less critical:

    Is there a place for Wiktionary? Undoubtedly. The industry and enthusiasm of its many creators are proof that there's a market. And it's wonderful to have another strong source to use when searching the odd terms that pop up in today's fast-changing world and the online environment. But as with so many Web sources (including this column), it's best used by sophisticated users in conjunction with more reputable sources.[citation needed]

    References in other publications are fleeting and part of larger discussions of Wikipedia, not progressing beyond a definition, although David Brooks in The Nashua Telegraph described it as "wild and woolly".[16] One of the impediments to independent coverage of Wiktionary is the continuing confusion that it is merely an extension of Wikipedia.[h]

    A study measuring the correctness of inflections for a subset of the Polish words in the English Wiktionary found this grammatical data to be very stable: only 131 of 4,748 Polish words (about 2.8%) had had their inflection data corrected.[17]

    As of 2016, Wiktionary has seen growing use in academia.[18]

    Wiktionary data in natural language processing

    Wiktionary has semi-structured data.[19] Wiktionary lexicographic data can be converted to machine-readable format in order to be used in natural language processing tasks.[20][21][22]
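
    To illustrate what converting semi-structured entries to a machine-readable format can involve, here is a minimal sketch that reads a simplified entry in wiki markup and groups its definitions by language and part of speech. The sample entry is invented for illustration, and the layout it assumes (level-2 language headings, level-3 part-of-speech headings, definitions on lines starting with "#") is a simplification; real entries nest sections more deeply and rely heavily on templates, which the parsers discussed in this section have to handle.

```python
import re
from collections import defaultdict

# Invented, simplified entry; not copied from any Wiktionary page.
SAMPLE_WIKITEXT = """\
==English==
===Noun===
# A reference work listing words and their meanings.
# An associative array.
===Verb===
# To look up in a dictionary.
==French==
===Noun===
# A made-up sense for illustration.
"""

# Matches wiki headings such as "==English==" or "===Noun===".
HEADING = re.compile(r"^(=+)\s*(.*?)\s*(=+)\s*$")


def parse_entry(wikitext: str) -> dict:
    """Return {language: {part_of_speech: [definitions]}} for a simple entry."""
    entry = defaultdict(lambda: defaultdict(list))
    language = None
    part_of_speech = None
    for line in wikitext.splitlines():
        match = HEADING.match(line)
        if match:
            level = len(match.group(1))
            if level == 2:        # a new language section starts
                language, part_of_speech = match.group(2), None
            elif level == 3:      # a part-of-speech section inside it
                part_of_speech = match.group(2)
            continue
        # Definition lines start with "# "; skip any that precede a heading.
        if line.startswith("# ") and language and part_of_speech:
            entry[language][part_of_speech].append(line[2:].strip())
    return {lang: dict(parts) for lang, parts in entry.items()}


parsed = parse_entry(SAMPLE_WIKITEXT)
for lang, parts in parsed.items():
    for pos, definitions in parts.items():
        print(lang, pos, definitions)
```

    The nested dictionary can then be serialized (for example as JSON) or loaded into a database, which is roughly the kind of output a Wiktionary parser aims for.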

    Mining Wiktionary's data is a complex task, with the following difficulties:[23]

    There are several parsers for different Wiktionary language editions:[24]

    Examples of natural language processing tasks which have been solved with the help of Wiktionary data include:

    "Wikidata:Lexicographical data" was started in 2018 to provide structured data support to Wiktionaries. It stores word data of all languages in a machine readable data model, under a dedicated "Lexeme" namespace in Wikidata. As of October 2021, the project has amassed over 600,000 lexeme entries of various languages.[47]

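    As a sketch of what a machine-readable lexeme record might look like to a consumer, the code below summarizes a small record containing a lemma, forms, and senses. The record is hand-written for illustration: its field names are assumed to follow the JSON layout Wikidata uses for lexemes, while the identifier "L0", the gloss text, and the lexical-category item are placeholders rather than real Wikidata content.

```python
# Hand-written record, assumed to mirror the Wikidata lexeme JSON layout.
SAMPLE_LEXEME = {
    "type": "lexeme",
    "id": "L0",                      # placeholder ID, not a real lexeme
    "lemmas": {"en": {"language": "en", "value": "smoulder"}},
    "language": "Q1860",             # Wikidata item for English
    "lexicalCategory": "Q24905",     # assumed here to be the item for "verb"
    "forms": [
        {"id": "L0-F1",
         "representations": {"en": {"language": "en", "value": "smoulder"}},
         "grammaticalFeatures": []},
        {"id": "L0-F2",
         "representations": {"en": {"language": "en", "value": "smoulders"}},
         "grammaticalFeatures": []},
    ],
    "senses": [
        {"id": "L0-S1",
         "glosses": {"en": {"language": "en",
                            "value": "to burn slowly without flame"}}},
    ],
}


def summarize_lexeme(lexeme: dict, lang: str = "en") -> dict:
    """Collect the lemma, form spellings and glosses available in `lang`."""
    lemma = lexeme["lemmas"].get(lang, {}).get("value")
    forms = [
        form["representations"][lang]["value"]
        for form in lexeme.get("forms", [])
        if lang in form.get("representations", {})
    ]
    glosses = [
        sense["glosses"][lang]["value"]
        for sense in lexeme.get("senses", [])
        if lang in sense.get("glosses", {})
    ]
    return {"id": lexeme["id"], "lemma": lemma, "forms": forms, "glosses": glosses}


summary = summarize_lexeme(SAMPLE_LEXEME)
print(summary)
```

    Unlike the "form of" entries that bots added to Wiktionary, inflected forms here are stored as part of a single lexeme record rather than as separate entries.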
    See also

    Notes

    a. ^ Wiktionary's current URL is www.wiktionary.org
    b. ^ Wiktionary total article counts are here. Detailed statistics by word type are available here [1].
    c. ^ The user list at the English Wiktionary identifies accounts that have been given "bot status".
    d. ^ Hồ Ngọc Đức, Free Vietnamese Dictionary Project. Details at the Vietnamese Wiktionary.
    e. ^ a b "Wiktionary/logo", Meta-Wiki, Wikimedia Foundation.
    f. ^ [Translators-l] 56 Wiktionaries got a localised logo
    g. ^ The full article is not available on-line.[15]
    h. ^ In this citation, the author refers to Wiktionary as part of the Wikipedia site: Adapted from an article by Naomi DeTullio (2006). "Wikis for Librarians" (PDF). NETLS News #142. Northeast Texas Library System. p. 15. Archived from the original (PDF newsletter) on June 5, 2007. Retrieved April 21, 2007.
    i. ^ E.g. compare the entry structure and formatting rules in English Wiktionary and Russian Wiktionary.
    j. ^ Quotations are extracted only from Russian Wiktionary.[33]
    k. ^ If there are several IPA notations on a Wiktionary page, either for different languages or for pronunciation variants, then the first pronunciation was extracted.[39]
    l. ^ The source code and the results of POS-tagging are available at https://code.google.com/p/wikily-supervised-pos-tagger

    References

    Citations

    2. ^ "Wikipedia mailing list archive discussion announcing the opening of the Wiktionary project". December 12, 2002. Archived from the original on June 20, 2014. Retrieved May 3, 2011.
    3. ^ Wikipedia mailing list archive discussion from Larry Sanger giving the idea on Wiktionary. Archived June 20, 2014, at the Wayback Machine. Retrieved May 3, 2011.
    4. ^ "Wiktionary". www.wiktionary.org. Archived from the original on September 13, 2008. Retrieved October 28, 2021.
    5. ^ TheDaveBot Archived October 11, 2007, at the Wayback Machine, TheCheatBot Archived October 11, 2007, at the Wayback Machine, Websterbot Archived October 11, 2007, at the Wayback Machine, PastBot Archived October 11, 2007, at the Wayback Machine, NanshuBot Archived October 11, 2007, at the Wayback Machine
    6. ^ Detailed statistics Archived July 23, 2021, at the Wayback Machine as of July 21, 2021
    7. ^ "LXbot". Archived from the original on May 24, 2008.
    8. ^ "Wiktionary:Statistics". March 29, 2022. Archived from the original on March 6, 2023. Retrieved March 6, 2023 – via Wiktionary.
    9. ^ "Wiktionary talk:Wiktionary Logo", English Wiktionary, Wikimedia Foundation.
    10. ^ "Wiktionary/logo/refresh/voting", Meta-Wiki, Wikimedia Foundation.
    11. ^ phab:T139255
    12. ^ m:Wiktionary/logo#Logo use statistics.
    13. ^ a b c Wikimedia's MediaWiki API:Siteinfo. Retrieved July 2024 from Data:Wikipedia statistics/data.tab
    14. ^ "Wiktionary Statistics". Meta.Wikimedia.org. Archived from the original on September 2, 2020. Retrieved September 11, 2020.
    15. ^ Lepore 2006.
    16. ^ David Brooks, "Online, interactive encyclopedia not just for geeks anymore, because everyone seems to need it now, more than ever!" The Nashua Telegraph (August 4, 2004)
    17. ^ Kurmas 2010.
    18. ^ Sascha & Müller-Spitzer 2016, p. 348.
    19. ^ Meyer & Gurevych 2012, p. 140.
    20. ^ Zesch, Müller & Gurevych 2008, p. 4, Figure 1.
    21. ^ Meyer & Gurevych 2010, p. 40.
    22. ^ Krizhanovsky, Transformation 2010, p. 1.
    23. ^ Hellmann & Auer 2013, p. 302, p. 16 in PDF.
    24. ^ Hellmann, Brekle & Auer 2012, p. 3, Table 1.
    25. ^ "DBpedia Wiktionary". Archived from the original on May 4, 2013.
    26. ^ Hellmann, Brekle & Auer 2012, pp. 8–9.
    27. ^ Hellmann, Brekle & Auer 2012, p. 10.
    28. ^ Hellmann, Brekle & Auer 2012, p. 11.
    29. ^ "Welcome". DKPro JWKTL. Archived from the original on January 23, 2021. Retrieved June 23, 2019.
    30. ^ Zesch, Müller & Gurevych 2008.
    31. ^ "Wikokit - Machine-readable Wiktionary". December 19, 2022. Archived from the original on October 2, 2020. Retrieved November 7, 2015 – via GitHub.
    32. ^ Krizhanovsky, Transformation 2010.
    33. ^ a b Smirnov et al. 2012.
    34. ^ Krizhanovsky, Comparison 2010.
    35. ^ "Gerard de Melo's Research at ICSI, Berkeley". gerard.demelo.org. Archived from the original on March 27, 2023. Retrieved March 6, 2023.
    36. ^ Otte & Tyers 2011.
    37. ^ McFate & Forbus 2011.
    38. ^ Schlippe, Ochs & Schultz 2012.
    39. ^ Schlippe, Ochs & Schultz 2012, p. 4802.
    40. ^ Schlippe, Ochs & Schultz 2012, p. 4804.
    41. ^ Meyer & Gurevych 2012.
    42. ^ "ConceptNet 5". conceptnet5.media.mit.edu. Archived from the original on October 19, 2011. Retrieved September 23, 2023.
    43. ^ Lin & Krizhanovsky 2011.
    44. ^ Medero & Ostendorf 2009.
    45. ^ Li, Graça & Taskar 2012.
    46. ^ Chesley et al. 2006.
    47. ^ "Wikidata:Wiktionary". Archived from the original on January 3, 2023. Retrieved October 12, 2012.

    Sources

    External links


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Wiktionary&oldid=1232398877"

     



    This page was last edited on 3 July 2024, at 14:55 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.


