A collection of awesome web crawler,spider in different languages
-
Updated
May 29, 2021
{{ message }}
A collection of awesome web crawler,spider in different languages
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Web Scraper in Go, similar to BeautifulSoup
A list of practical knowledge-building projects.
Faster requests on Python 3
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Bulk download your favourite anime episodes from your favourite anime websites
A framework for creating semi-automatic web content extractors
Download and generate e-books from online sources.
A list of scrapers from around the web.
NBA Stats API via Basketball Reference
A simple browser/client-side web scraper.
A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments)
The image URL should be returned, if this the only part of the anchor texts.
A collection of awesome web scaper, crawler.
MetaData html scraper and parser for Node.js (supports Promises and callback style)
Instagram Bot which when given a post url will spam mentions to increase the chances of winning. Win Instagram Giveaways!
Fetch user's data across social media
PHP Library for detecting CMS
Go cascadia package command line CSS selector
Powerful web scraping framework for Crystal
Adult XXX Addons (18+) for the Kodi Media Center - Kodi is a registered trademark of the XBMC Foundation. We are not connected to or in any other way affiliated with Kodi - DMCA: legal@tvaddons.co
A command line interface for downloading Bollywood and punjabi songs
A simple python library that allows for easy access of the SEC website so that someone can parse filings, collect data, and query documents.
Web Scraping Craigslist's Engineering Jobs in NY with Scrapy
Add a description, image, and links to the web-scraper topic page so that developers can more easily learn about it.
To associate your repository with the web-scraper topic, visit your repo's landing page and select "manage topics."
Hello,
Thanks for new update in personal_info section,
I found out that the attribute 'certifications' return empty list []
Test url:
https://www.linkedin.com/in/an-nguyen-9b3248122/Results:
`{'personal_info': {'name': 'An Nguyen',
'headline': 'Data Scientist/Machine Learning Engineer',
'company': 'PERSOL PROCESS & TECHNOLOGY CO., LTD.',
'school': 'National Chiao Tung University',