Pangloss Collection






From Wikipedia, the free encyclopedia
 


The Pangloss Collection is a digital library whose objective is to store and facilitate access to audio recordings in endangered languages of the world. Developed by the LACITO centre of CNRS in Paris, the collection provides free online access to documents of connected, spontaneous speech, in otherwise little-documented languages of all continents.[1]

Principles

A sound archive with synchronized transcriptions

For the science of linguistics, language is first and foremost spoken language. The medium of spoken language is sound. The Pangloss Collection gives access to original recordings simultaneously with transcriptions and translations, as a resource for further research. After being recorded in their cultural context, the texts have been transcribed in collaboration with native speakers.

A structured, open architecture

The archived data is structured in accordance with current data-processing standards: it follows an open architecture, uses an open format, and may be downloaded under a Creative Commons license. The software used to prepare and disseminate it is open-source. The Pangloss Collection is a member of the OLAC network of archival repositories and of the Digital Endangered Languages and Music Archive Network (DELAMAN).

History

The collection was initially called the LACITO Archive.[2][3] The project originated in 1996 from the collaboration of Boyd Michailovsky, linguist at LACITO, with John B. Lowe, engineer;[4]: 15 they were later joined by Michel Jacobson, engineer, who developed some tools for the project and brought it online.[1]: 124 [4]

The purpose of the archive was “to conserve, and to make available for research, recorded and transcribed oral traditions and other linguistic materials in (mainly) unwritten languages, giving simultaneous access to sound recordings and text annotation.”[4] The earliest archived corpora in the collection were languages from Nepal, from New Caledonia, from eastern Africa and French Guiana.[5]

The archive has grown steadily since the early 2000s,[6] incorporating corpora from various linguists, whether members of LACITO or not. In 2009, the archive had 200 recordings in 45 languages.[7] In 2014, the (newly renamed) Pangloss Collection had 1,400 recordings in 70 languages.[1]: 121 

As of April 2021, the Pangloss archive contains 5,038 recordings[8] in 196 languages,[9] totalling 780 hours of audio and video recordings.[6]

Languages in the Pangloss Collection include Mwotlap (Austronesian; Vanuatu),[10] Japhug (Sino-Tibetan; Southwest China),[11] Ersu (Sino-Tibetan; Southwest China),[12] Naxi (or Yongning Na: Sino-Tibetan; Southwest China),[13] and Cèmuhî (Austronesian; New Caledonia).[14]

References

  1. Michailovsky, Boyd; Mazaudon, Martine; Michaud, Alexis; Guillaume, Séverine; François, Alexandre; Adamou, Evangelia (2014). "Documenting and researching endangered languages: the Pangloss Collection". Language Documentation & Conservation. 8: 119–135.
  2. Jacobson, Michel; Michailovsky, Boyd (2002). "The LACITO Archive: its purpose and implementation". Int'l Workshop on Resources and Tools in Field Linguistics. Las Palmas, Canary Is., Spain.
  3. Screen capture of LACITO's archive homepage (27 February 2001).
  4. Jacobson, Michel; Michailovsky, Boyd; Lowe, John B. (2001). "Linguistic documents synchronizing sound and text". Speech Communication. Special issue: “Speech Annotation and Corpus Tools”. 33 (1–2): 79–96. CiteSeerX 10.1.1.467.490. doi:10.1016/S0167-6393(00)00070-4.
  5. Screen capture of LACITO's archive contents (22 April 2002).
  6. “About us” section of the Pangloss Collection (retrieved 24 April 2021).
  7. Screen capture of LACITO's archive contents (26 November 2009).
  8. Source: list of all Pangloss resources on the Cocoon homepage (retrieved 10 January 2022).
  9. Source: number of language entries in its list of corpora (retrieved 24 April 2021).
  10. Mwotlap corpus: 564 resources.
  11. Japhug corpus: 551 resources.
  12. Ersu corpus: 363 resources.
  13. Yongning Na corpus: 301 resources.
  14. Cèmuhî corpus: 230 resources.

External links


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Pangloss_Collection&oldid=1190849163"




    This page was last edited on 20 December 2023, at 05:49 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.


