Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Features  





2 Technology  





3 References  





4 External links  














Datacommons.org







Add links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Datacommons.org is an open knowledge graph hosted by Google that provides a unified view across multiple public datasets, combining economic, scientific and other open datasets into an integrated data graph.[1] The Datacommons.org site was launched in May 2018 with an initial dataset consisting of fact-checking data published in Schema.org "ClaimReview" format by several fact checkers from the International Fact-Checking Network.[2][3] Google has worked with partners including the United States Census, the World Bank, and US Bureau of Labor Statistics to populate the repository,[4] which also hosts data from Wikipedia, the National Oceanic and Atmospheric Administration and the Federal Bureau of Investigation.[5] The service expanded during 2019 to include an RDF-style Knowledge Graph populated from a number of largely statistical open datasets. The service was announced to a wider audience in 2019.[6] In 2020 the service improved its coverage of non-US datasets, while also increasing its coverage of bioinformatics and coronavirus.[7]

Features[edit]

Datacommons.org places more emphasis on statistical data than is common for Linked Data and knowledge graph initiatives. It includes geographical, demographic, weather and real estate data alongside other categories,[1] describing states, Congressional districts, and cities in the United States as well as biological specimens, power plants, and elements of the human genome via the Encyclopedia of DNA Elements (ENCODE) project.[5] It represents data as semantic triples each of which can have its own provenance.[1] It centers on the entity-oriented integration of statistical observations from a variety of public datasets. Although it supports a subset of the W3C SPARQL query language,[8] its APIs[9] also include tools — such as a Pandas dataframe interface — oriented towards data science, statistics and data visualization.

Datacommons.org is integrative, meaning that, rather than providing a hosting platform for diverse datasets, it attempts to consolidate much of the information the datasets provide into a single data graph.

Technology[edit]

Datacommons.org is built on a graph data-model. The graph can be accessed through a browser interface and several APIs,[1][5] and is expanded through loading data (typically CSV and MCF-based templates).[10] The graph can be accessed by natural language queries in Google Search.[11] The data vocabulary used to define the datacommons.org graph is based upon Schema.org.[1] In particular the Schema.org terms StatisticalPopulation[12] and Observation[13] were proposed to Schema.org to support datacommons-like usecases.[14]

Software from the project is available on GitHub under Apache 2 license.[15]

References[edit]

  1. ^ a b c d e Fensel, Dieter; Şimşek, Umutcan; Angele, Kevin; Huaman, Elwin; Kärle, Elias; Panasiuk, Oleksandra; Toma, Ioan; Umbrich, Jürgen; Wahler, Alexander (2020), "Introduction: What Is a Knowledge Graph?", Knowledge Graphs, Cham: Springer International Publishing, pp. 1–10, doi:10.1007/978-3-030-37439-6_1, ISBN 978-3-030-37438-9, S2CID 213620389, retrieved 2020-10-16
  • ^ "Fact Checks". datacommons.org. 29 March 2019. Retrieved 14 October 2020.
  • ^ Jiang, Shan; Baumgartner, Simon; Ittycheriah, Abe; Yu, Cong (2020-04-20). "Factoring Fact-Checks: Structured Information Extraction from Fact-Checking Articles". Proceedings of the Web Conference 2020. WWW '20. Taipei Taiwan: ACM. pp. 1592–1603. doi:10.1145/3366423.3380231. ISBN 978-1-4503-7023-3. S2CID 215882520.
  • ^ Raghavan, Prabhakar (2020-10-15). "How AI is powering a more helpful Google". Google. Retrieved 2020-10-16.
  • ^ a b c Sheth, Amit; Padhee, Swati; Gyrard, Amelie; Sheth, Amit (2019-07-01). "Knowledge Graphs and Knowledge Networks: The Story in Brief". IEEE Internet Computing. 23 (4): 67–75. arXiv:2003.03623. doi:10.1109/MIC.2019.2928449. ISSN 1089-7801. S2CID 204820800.
  • ^ Luong, Daphne; Chou, Charina (5 March 2019). "Doing our part to share open data responsibly". The Keyword. Retrieved 14 October 2020.
  • ^ Ramasubramanian, Sowmya (21 September 2020). "Google's open source data to study impact of COVID-19". The Hindu. Retrieved 14 October 2020.
  • ^ "Query the Data Commons Knowledge Graph using SPARQL". datacommons.org. Retrieved 14 October 2020.
  • ^ "Overview". datacommons.org. Retrieved 14 October 2020.
  • ^ "Contributing to Data Commons - Adding datasets". datacommons.org. Data Commons.
  • ^ Guha, Ramanathan V. (15 October 2020). "Data Commons, now accessible on Google Search". docs.datacommons.org. Retrieved 2020-10-16.
  • ^ "StatisticalPopulation type at Schema.org". schema.org. Retrieved 14 October 2020.
  • ^ "Observation type at Schema.org". schema.org. Retrieved 14 October 2020.
  • ^ "Proposal for representing Aggregate Statistical Data". GitHub - Schema.org repository. 25 June 2019. Retrieved 14 October 2020.
  • ^ "datacommons.org GitHub". GitHub.
  • External links[edit]


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Datacommons.org&oldid=1169022878"

    Categories: 
    Google
    Open data
    Knowledge graphs
    Hidden categories: 
    Articles with short description
    Short description is different from Wikidata
     



    This page was last edited on 6 August 2023, at 15:27 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki