GitHub - scrapy/scrapy: Scrapy, a fast high-level web crawling & scraping fr...

Scrapy

Overview

Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

Scrapy is maintained by Zyte (formerly Scrapinghub) and many other contributors.

Check the Scrapy homepage at https://scrapy.org for more information, including a list of features.

Requirements

Python 3.6+
Works on Linux, Windows, macOS, BSD
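
As a quick pre-install check, the version requirement above can be verified from the interpreter itself. This is a minimal sketch using only the standard library; the names MIN_PYTHON and meets_requirement are illustrative and not part of Scrapy.

```python
import sys

# Minimum version taken from the Requirements list above (Python 3.6+).
# MIN_PYTHON and meets_requirement are hypothetical helper names.
MIN_PYTHON = (3, 6)


def meets_requirement(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    if meets_requirement():
        print("Python version OK for Scrapy")
    else:
        # Exit with a non-zero status and a readable message.
        sys.exit("Scrapy requires Python %d.%d or newer" % MIN_PYTHON)
```

Running the script with the interpreter you plan to install into shows whether that interpreter satisfies the stated minimum before you run pip.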

Install

The quick way:

pip install scrapy

See the install section in the documentation at https://docs.scrapy.org/en/latest/intro/install.html for more details.

Documentation

Documentation is available online at https://docs.scrapy.org/ and in the docs directory.

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct (see https://github.com/scrapy/scrapy/blob/master/CODE_OF_CONDUCT.md).

By participating in this project you agree to abide by its terms. Please report unacceptable behavior to [email protected].

Companies using Scrapy

See https://scrapy.org/companies/ for a list.

Commercial Support

See https://scrapy.org/support/ for details.
