COLLECTED BY
Collection: Open Syllabus
The Open Syllabus collection contains WARC files from a mid-2021 crawl of about 50 million unique seed URLs extracted from the Open Syllabus version 2.6 dataset and their page requisites. The bulk of the seed URLs are from ".com", ".org", ".edu", and ".uk" TLDs.
Crawl Summary
Seed Summary
* NOTE: More than 13% of the URLs in the dataset point to the Wayback Machine!