
GitHub - wyan001/xcrawler: A fast, simple, and powerful PHP crawler framework

 6 years ago
source link: https://github.com/wyan001/xcrawler

README.md

XCrawler - A lightweight, easy-to-maintain PHP crawler framework

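Features:

  • Very easy to get started, with thorough documentation and examples
  • Supports concurrent crawling
  • Supports parsing content with XPath and CSS selectors
  • Simulates browser behavior, such as user-agent, cookies, and form submission
  • Supports resumable crawling, retry on failure, proxies, and detailed crawl logs
  • Robust underlying libraries: the HTTP client is based on guzzle, and DOM parsing is based on the symfony/dom-crawler library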
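
To give a rough idea of what XPath and CSS-selector parsing looks like at the library level, here is a minimal sketch that calls symfony/dom-crawler directly. It is not XCrawler's own API, which this README does not show. The HTML snippet and the variable names are made up for illustration, and the script assumes that symfony/dom-crawler and symfony/css-selector have been installed with Composer.

```php
<?php
// Sketch only: uses symfony/dom-crawler directly, NOT XCrawler's own API.
// Assumes: composer require symfony/dom-crawler symfony/css-selector
require __DIR__ . '/vendor/autoload.php';

use Symfony\Component\DomCrawler\Crawler;

// Hypothetical page content; a real crawler would fetch this over HTTP
// (for example with Guzzle) instead of hard-coding it.
$html = <<<'HTML'
<html><body>
  <h1>Example list</h1>
  <ul class="links">
    <li><a href="/a">First</a></li>
    <li><a href="/b">Second</a></li>
  </ul>
</body></html>
HTML;

$crawler = new Crawler($html);

// XPath: text of the first <h1> element.
$title = $crawler->filterXPath('//h1')->text();

// CSS selector: href attribute of each link inside the list
// (filter() needs the symfony/css-selector package).
$hrefs = $crawler->filter('ul.links a')->each(
    function (Crawler $node) {
        return $node->attr('href');
    }
);

echo $title, PHP_EOL;
echo implode(', ', $hrefs), PHP_EOL;
```

A framework such as XCrawler would presumably wrap steps like these behind its own configuration and callbacks; see the documentation linked below for the actual interface.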

Documentation: https://xcrawler.yanshuju.com/docs/

