-
Updated
Sep 1, 2020 - Python
#
scrapers
Here are 77 public repositories matching this topic...
source for Open States scrapers
A framework for creating semi-automatic web content extractors
python
crawler
tutorial
extractor
scraping
web-scraper
selector
css-selector
web-scraping
scrapy
scrapers
beautifulsoup
xpath-expression
lxml
selector-expression
-
Updated
Oct 12, 2019 - Python
Python scripts for building 'Short Jokes' dataset, featured on Kaggle
-
Updated
Oct 24, 2019 - Python
nodejs
json
crawler
scraper
spider
linkedin
scraping
crawling
expressjs
linkedin-profile
scrapers
scraping-websites
linkedin-bot
website-scraper
profile-data
linkedin-scraper
linkedin-crawler
puppeteer
linkedin-scraping
linkedin-profile-scraper
-
Updated
Sep 2, 2020 - TypeScript
Simple library which helps you to retrieve the source of various video streaming sites.
-
Updated
Aug 28, 2019 - TypeScript
An Unofficial Pitchfork Music API client for Node.js
-
Updated
Feb 18, 2018 - JavaScript
Using Apache Airflow to schedule web scrapers
-
Updated
Oct 3, 2018 - Python
philshem
commented
Mar 30, 2020
currently the code requires a path to a CSV file, with the first column containing a gmaps URL
Would be cool to smarten it up a bit, and let the user point to EITHER a CSV path, OR a single gmaps URL. This way you could loop over variables in a batch script, which could be fed dynamically.
Library for scraping websites or apis at any scale
-
Updated
Apr 25, 2020 - Python
ProxyCrawl Node library for scraping and crawling
-
Updated
Mar 23, 2020 - JavaScript
nike monitor supports all region, and will have stock level
-
Updated
Jul 18, 2020
ProxyCrawl PHP library for scraping and crawling websites
crawler
scraper
scraping
crawling
scrapers
scraping-websites
scraping-framework
proxycrawl
proxycrawl-api
-
Updated
Apr 28, 2020 - PHP
Board game data scraper
board-game
scraper
spider
dataset
boardgame
scrapy
scrapers
scraped-data
datasets
boardgamegeek
tabletop-games
boardgames
bgg
data-set
board-games
boardgamegeek-dataset
bgg-rating
data-sets
-
Updated
Aug 24, 2020 - Python
A basic python 3 based web scraper for extracting reviews from Amazon. Built using Selectorlib and requests
-
Updated
Jun 21, 2020 - Python
This Python code scrapes Google search results then applies sentiment analysis, generates text summaries, and ranks keywords.
python
search-engine
scraper
google
sentiment-analysis
scraping
sentiment
python3
keywords
scrape
scrapers
python-3
lsa
keyword-extraction
scraping-websites
luhn
sentiment-classification
lexrank
keyword-extractor
keywords-extraction
-
Updated
Aug 31, 2019 - Python
Web scraper/crawler of Airbnb Brazilians page
nodejs
scraper
node
yarn
cheerio
scraping
airbnb
scrapers
node-js
scraping-websites
scrapy-crawler
cheerio-js
puppeteer
cheerio-node
scraperjs
-
Updated
Sep 3, 2020 - JavaScript
Practice scraping techniques with Guzzle framework and DomCrawler
-
Updated
Sep 14, 2017 - PHP
Add-on para AniTube, famoso site que ofereceu uma gigantesca biblioteca de animes para assistir online e de graça.
python
scraper
streaming
anime
kodi
scrapers
kodi-plugin
xbmc
scraper-engine
portugues
portuguese
kodi-addon
streaming-video
kodi-addons
portuguese-language
xbmc-video-plugin
anime-scraper
xbmc-plugin
xbmc-addon
anitube
-
Updated
Aug 14, 2019 - Python
Example of web scraper on craigslist.org using Puppeteer and Node.js
nodejs
scraper
node
cheerio
scraping
craigslist
scrapers
node-js
scraping-websites
scrapy-crawler
chromium-browser
cheeriojs
cheerio-js
puppeteer
scraperjs
-
Updated
Sep 3, 2020 - JavaScript
IMDB webscraper with Request-Promise, Cheerio and Nightmare.js
nodejs
scraper
node
cheerio
request
scrapers
node-js
scraping-websites
scrapy-crawler
nightmarejs
cheeriojs
cheerio-js
request-promise
-
Updated
Sep 4, 2020 - JavaScript
Web scraper/crawler of Reddit page
scraper
reddit
mongodb
mongoose
cheerio
reddit-bot
request
scrapers
scraping-websites
scrapy-crawler
cheeriojs
cheerio-js
mongodb-atlas
request-promise
scraperjs
-
Updated
Sep 3, 2020 - JavaScript
Improve this page
Add a description, image, and links to the scrapers topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the scrapers topic, visit your repo's landing page and select "manage topics."
The idea is to have an option like 3 (Do a Google search, save the Urls found and search the emails), but search a list of phrases.
This list can be in a .txt
The option can ask for number of search results in Google