77 captures
05 Jun 2015 - 02 Feb 2026
Aug SEP Oct
04
2019 2020 2021
success
fail

About this capture

COLLECTED BY

Organization: Archive Team

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.

History is littered with hundreds of conflicts over the future of a community, group, location or business that were "resolved" when one of the parties stepped ahead and destroyed what was there. With the original point of contention destroyed, the debates would fall to the wayside. Archive Team believes that by duplicated condemned data, the conversation and debate can continue, as well as the richness and insight gained by keeping the materials. Our projects have ranged in size from a single volunteer downloading the data to a small-but-critical site, to over 100 volunteers stepping forward to acquire terabytes of user-created data to save for future generations.

The main site for Archive Team is at archiveteam.org and contains up to the date information on various projects, manifestos, plans and walkthroughs.

This collection contains the output of many Archive Team projects, both ongoing and completed. Thanks to the generous providing of disk space by the Internet Archive, multi-terabyte datasets can be made available, as well as in use by the Wayback Machine, providing a path back to lost websites and work.

Our collection has grown to the point of having sub-collections for the type of data we acquire. If you are seeking to browse the contents of these collections, the Wayback Machine is the best first stop. Otherwise, you are free to dig into the stacks to see what you may find.

The Archive Team Panic Downloads are full pulldowns of currently extant websites, meant to serve as emergency backups for needed sites that are in danger of closing, or which will be missed dearly if suddenly lost due to hard drive crashes or server failures.

Collection: ArchiveBot: The Archive Team Crowdsourced Crawler

ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).

To use ArchiveBot, drop by #archivebot on EFNet. To interact with ArchiveBot, you issue commands by typing it into the channel. Note you will need channel operator permissions in order to issue archiving jobs. The dashboard shows the sites being downloaded currently.

There is a dashboard running for the archivebot process at http://www.archivebot.com.

ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot.

TIMESTAMPS
The Wayback Machine - http://web.archive.org/web/20200904200027/https://github.com/apache/storm
Skip to content
Sign in Sign up
  • Star
  • Fork 4.1k
  • Mirror of Apache Storm

    View license
    6.1k stars 4.1k forks
    Star
    Watch
    master
    40 branches 41 tags
    Go to file
    Code

    If nothing happens, download GitHub Desktop and try again.

    If nothing happens, download GitHub Desktop and try again.

    If nothing happens, download Xcode and try again.

    If nothing happens, download the GitHub extension for Visual Studio and try again.

    Latest commit

    bipinprasad [STORM-3685] Detect and prevent cycles when Topology is submitted. (#…
    8399edc Sep 4, 2020
    [STORM-3685] Detect and prevent cycles when Topology is submitted. (#…
    …3322)
    8399edc

    Git stats

    Files

    Permalink
    Failed to load latest commit information.
    Type
    Name
    Latest commit message
    Commit time
    .github
     
     
    bin
     
     
    conf
     
     
    dev-tools
     
     
    docs
     
     
    examples
     
     
    external
     
     
    flux
     
     
    integration-test
     
     
    licenses
     
     
    log4j2
     
     
    sql
     
     
    storm-buildtools
     
     
    storm-checkstyle
     
     
    storm-client
     
     
    storm-clojure-test
     
     
    storm-clojure
     
     
    storm-core
     
     
    storm-dist
     
     
    storm-multilang
     
     
    storm-server
     
     
    storm-shaded-deps
     
     
    storm-submit-tools
     
     
    storm-webapp
     
     
    .gitattributes
     
     
    .gitignore
     
     
    .travis.yml
     
     
    DEPENDENCY-LICENSES
     
     
    DEVELOPER.md
     
     
    KEYS
     
     
    LICENSE
     
     
    LICENSE-binary
     
     
    NOTICE
     
     
    NOTICE-binary
     
     
    README.markdown
     
     
    RELEASING.md
     
     
    SECURITY.md
     
     
    THIRD-PARTY.properties
     
     
    VERSION
     
     
    doap_Storm.rdf
     
     
    pom.xml
     
     

    README.markdown

    Master Branch:
    Travis CI Maven Version

    Storm is a distributed realtime computation system. Similar to how Hadoop provides a set of general primitives for doing batch processing, Storm provides a set of general primitives for doing realtime computation. Storm is simple, can be used with any programming language, is used by many companies, and is a lot of fun to use!

    The Rationale page explains what Storm is and why it was built. This presentation is also a good introduction to the project.

    Storm has a website at storm.apache.org. Follow @stormprocessor on Twitter for updates on the project.

    Documentation

    Documentation and tutorials can be found on the Storm website.

    Developers and contributors should also take a look at our Developer documentation.

    Getting help

    NOTE: The google groups account storm-user@googlegroups.com is now officially deprecated in favor of the Apache-hosted user/dev mailing lists.

    Storm Users

    Storm users should send messages and subscribe to user@storm.apache.org.

    You can subscribe to this list by sending an email to user-subscribe@storm.apache.org. Likewise, you can cancel a subscription by sending an email to user-unsubscribe@storm.apache.org.

    You can also browse the archives of the storm-user mailing list.

    Storm Developers

    Storm developers should send messages and subscribe to dev@storm.apache.org.

    You can subscribe to this list by sending an email to dev-subscribe@storm.apache.org. Likewise, you can cancel a subscription by sending an email to dev-unsubscribe@storm.apache.org.

    You can also browse the archives of the storm-dev mailing list.

    Storm developers who would want to track the JIRA issues should subscribe to issues@storm.apache.org.

    You can subscribe to this list by sending an email to issues-subscribe@storm.apache.org. Likewise, you can cancel a subscription by sending an email to issues-unsubscribe@storm.apache.org.

    You can view the archives of the mailing list here.

    Issue tracker

    In case you want to raise a bug/feature or propose an idea, please use Apache Jira

    Which list should I send/subscribe to?

    If you are using a pre-built binary distribution of Storm, then chances are you should send questions, comments, storm-related announcements, etc. to user@storm.apache.org.

    If you are building storm from source, developing new features, or otherwise hacking storm source code, then dev@storm.apache.org is more appropriate.

    If you are committers and/or PMCs, or contributors looking for following up and participating development of Storm, then you would want to also subscribe issues@storm.apache.org in addition to dev@storm.apache.org.

    What will happen with storm-user@googlegroups.com?

    All existing messages will remain archived there, and can be accessed/searched here.

    New messages sent to storm-user@googlegroups.com will either be rejected/bounced or replied to with a message to direct the email to the appropriate Apache-hosted group.

    IRC

    You can also come to the #storm-user room on freenode. You can usually find a Storm developer there to help you out.

    License

    Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

    Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

    The LICENSE and NOTICE files cover the source distributions. The LICENSE-binary and NOTICE-binary files cover the binary distributions. The DEPENDENCY-LICENSES file lists the licenses of all dependencies of Storm, including those not packaged in the source or binary distributions, such as dependencies of optional connector modules.

    Project lead

    Committers

    Acknowledgements

    YourKit is kindly supporting open source projects with its full-featured Java Profiler. YourKit, LLC is the creator of innovative and intelligent tools for profiling Java and .NET applications. Take a look at YourKit's leading software products: YourKit Java Profiler and YourKit .NET Profiler.

    About

    Mirror of Apache Storm

    Topics

    Resources

    Readme

    License

    View license

    Releases

    41 tags

    Packages

    No packages published

    Used by 3

  • @mayurkale22 @mayurkale22 / MayShashProject
  • @mjdivan @mjdivan / cincamimisConversor
  • Contributors 339

  • @HeartSaVioR
  • @ptgoetz
  • @srdo
  • @knusbaum
  • @kishorvpatil
  • @Parth-Brahmbhatt
  • @Ethanlm
  • @revans2
  • @arunmahadevan
  • @vesense
  • + 328 contributors

    Languages

  • Privacy
  • Security
  • Status
  • Help
  • You can’t perform that action at this time.