1,634 captures
07 Feb 2014 - 25 Mar 2026
Jul AUG Sep
12
2021 2022 2023
success
fail

About this capture

COLLECTED BY

Organization: Archive Team

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.

History is littered with hundreds of conflicts over the future of a community, group, location or business that were "resolved" when one of the parties stepped ahead and destroyed what was there. With the original point of contention destroyed, the debates would fall to the wayside. Archive Team believes that by duplicated condemned data, the conversation and debate can continue, as well as the richness and insight gained by keeping the materials. Our projects have ranged in size from a single volunteer downloading the data to a small-but-critical site, to over 100 volunteers stepping forward to acquire terabytes of user-created data to save for future generations.

The main site for Archive Team is at archiveteam.org and contains up to the date information on various projects, manifestos, plans and walkthroughs.

This collection contains the output of many Archive Team projects, both ongoing and completed. Thanks to the generous providing of disk space by the Internet Archive, multi-terabyte datasets can be made available, as well as in use by the Wayback Machine, providing a path back to lost websites and work.

Our collection has grown to the point of having sub-collections for the type of data we acquire. If you are seeking to browse the contents of these collections, the Wayback Machine is the best first stop. Otherwise, you are free to dig into the stacks to see what you may find.

The Archive Team Panic Downloads are full pulldowns of currently extant websites, meant to serve as emergency backups for needed sites that are in danger of closing, or which will be missed dearly if suddenly lost due to hard drive crashes or server failures.

Collection: ArchiveBot: The Archive Team Crowdsourced Crawler

ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).

To use ArchiveBot, drop by #archivebot on EFNet. To interact with ArchiveBot, you issue commands by typing it into the channel. Note you will need channel operator permissions in order to issue archiving jobs. The dashboard shows the sites being downloaded currently.

There is a dashboard running for the archivebot process at http://www.archivebot.com.

ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot.

TIMESTAMPS
The Wayback Machine - http://web.archive.org/web/20220812002740/https://octoverse.github.com/
  • Writing code faster
  • Creating documentation
  • Sustainable communities
  • Download PDF
  • Download PDF

    The 2021 State of the
    Octoverse

    Last year, our approach to remote work reflected a lack of familiarity. We were juggling competing needs in our personal lives and at work while trying to maintain the same levels of productivity before the pandemic.

    During 2021, we’ve begun to evolve from merely compensating while hoping for a return to the “old normal” to truly metamorphosing our processes with the awareness of remote work needs.

    In this year’s Octoverse Report, our research tells you how to improve your performance and well-being by developing code, creating documentation, and supporting communities in smarter, more sustainable ways.

    For the first time, this research combines telemetry from 4M+ repositories and surveys from more than 12,000 developers.

    This approach reveals current trends and also gives us predictive results--we can now see with more precision how to achieve successful outcomes for developers, teams, organizations, and communities.

    Explore data

    Writing and shipping code faster

    Learn more

    Creating documentation to support developers

    Learn more

    Supporting sustainable communities

    Learn more

    What makes developers and teams perform better, be more productive, and have a great developer experience?

    We created predictive models based on this year’s survey data to help us understand the impacts of different practices teams use in software development and delivery.

    Each section begins by presenting a predictive model supported by survey data, then follows with developer patterns we observe across GitHub’s repositories.

    Explore the data
    Relationships chart explanation

    Let’s look back at the code and communities built on GitHub this year...

    73M+Total developers
    on GitHub
    16M+New users
    in 2021
    84%of Fortune 100 companies
    use GitHub Enterprise
    61M+New repositories
    created in the last year
    170MPull requests
    merged

    Geographical distribution of active users

    North America
    Decrease 2.3% from last year
    31.5%
    Asia
    Increase 0.3% from last year
    31.2%
    Europe
    Increase 0.7% from last year
    27.3%
    South America
    Increase of 1.0% from last year
    5.9%
    Africa
    Increase of 0.3% from last year
    2.3%
    Oceania
    Decrease 0.1% from last year
    1.7%

    Top languages over the years

    Improving how we work

    In 2021, productivity began returning to pre-pandemic levels while solidifying the paradigm shift of remote and hybrid work.

    The workplace is shifting: Survey respondents were asked where they worked before the pandemic and where they expect to work with others after the pandemic. Only about 11% of respondents expect to go back to working collocated, a 30% drop from 41% working in an office before.

    Work before and after the pandemic

    We present one decimal for simplicity; there may be a rounding difference of 1%.

    Where respondents worked before the pandemicWhere respondents expect to work after the pandemic
    41%
    C
    10.7%
    28.1%
    H
    47.6%
    26.5%
    F
    38.8%
    4.4%
    N
    2.9%
    C

    Collocated
    In an office all the time or part-time

    H

    Hybrid
    Some team members in an office and others remote

    F

    Fully remote
    All team members working remotely

    N

    Not applicable

    Automation can enhance sustainability
    By removing friction and repetitive tasks through automation, teams perform 27% better in open source and 43% better at work, and developers report higher fulfillment.

    Code needs documentation to become a project

    Number of repositories with and without README, by repository type

    No README

    With README

    Open Source
    14.1%
    (33,544)
    85.9%
    (204,373)
    Open source at work
    12.26%
    (6,000)
    87.74%
    (42,954)
    Work
    84.33%
    (796,235)
    15.67%
    (147,975)

    A twist on improving productivity: documentation increases confidence in a project and invites collaboration

    Sharing information through READMEs, contribution guidelines, and issues are open source projects’ secret sauce: they invite new contributors and make developers 55% more productive. Enterprises can adopt these best practices to support their teams’ work and jump start inner source initiatives.

    Try a new practice:
    pull request wrangling

    The Kubernetes Docs Special Interest Group (SIG) has great documentation on their contribution process. Given the large developer community they support globally, this isn’t surprising.

    Within their well-defined [docs] roles and responsibilities, contributors who have reached “approver” status can volunteer for a week-long pull request (PR) wrangler shift.
    A PR wrangler triages and tags incoming GitHub Issues, reviews open pull requests for quality and adherence to guidelines, offers feedback, and approves merge-ready pull requests.
    This helps the Kubernetes community maintain accuracy in its current documentation, make sure that new documentation gets approved in sync with new release code, and enables localization of the docs by the community.
    Kubernetes’ PR wranglers docs cover everything from responsibilities to helpful tips, including GitHub queries for pull request labels.

    Sustainable communities

    Developing is about community. Codes of conduct, contribution guidelines, Good First Issues, and respectful language in Discussions signal a community is safe, welcoming, and trusted. Communities with these signals attract more contributors, while also creating a stronger sense of belonging and fulfillment.

    Teams with high trust are more likely to have a healthy collaborative culture:

    2XMore likely in companies3XMore likely in open source

    Where respondents contribute code

    Future of developer communities

    Geographic distribution users by region or country

    Click on the menu below to switch between total users of 2020 and 2021.

    Total users 2020
    Total users 2021

    Developer feedback helps steer GitHub Public Policy commitments

    We asked developers which issues matter to them and their communities, to inform the work of our Policy team.
    Here is what they said:

    Explore more data