81 captures
01 Sep 2018 - 03 Jul 2024
Mar APR May
13
2020 2021 2022
success
fail

About this capture

COLLECTED BY

Collection: Open Syllabus

The Open Syllabus collection contains WARC files from a mid-2021 crawl of about 50 million unique seed URLs extracted from the Open Syllabus version 2.6 dataset and their page requisites. The bulk of the seed URLs are from ".com", ".org", ".edu", and ".uk" TLDs.


Crawl Summary


Seed Summary

* NOTE: More than 13% URLs in the dataset point to Wayback Machine!


TIMESTAMPS

The Wayback Machine - http://web.archive.org/web/20210413134603/https://lab.github.com/githubtraining