26

#开源推荐# 微软开源的浏览器自动化工具-Playwright 微软开源的浏览器自动化工具-Play...

 1 year ago
source link: https://blog.csdn.net/somenzz/article/details/109396506
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

软开源了一个 Python 项目:Playwright,从此又多了一个浏览器自动化工具。之前一直用 selenium 或 splinter。

Playwright 可通过单个 API 自动执行 Chromium,Firefox 和 WebKit浏览器,支持无头浏览器(headless),Linux、macOS、Windows 下均可以使用,Playwright提供的自动化技术是绿色的,功能强大,稳定且速度快。

raUf6fn.gif!mobile

Playwright 最吸引我的地方在于它可以自己记录你对浏览器的操作,并将这些操作生成可以执行的代码,这简直就是神器,大大提升了浏览器自动化的效率。生成代码只需要执行

python -m playwright codegen

此外,它不像 selenium 需要再单独安装浏览器驱动,它在 pip install 时就会安装浏览器的驱动文件。

pip install playwrightpython -m playwright install

这将会安装 Playwright 和 Chromium,Firefox 和 WebKit 浏览器的二进制文件,非常方便,需要 Python 3.7 及以上版本。

还有几个亮眼的功能:

1、Playwright同时提供同步(阻止)API和异步API。

它们在功能方面是相同的,只是在使用API的方式上有所不同。

同步:

from playwright import sync_playwright
with sync_playwright() as p:    for browser_type in [p.chromium, p.firefox, p.webkit]:        browser = browser_type.launch()        page = browser.newPage()        page.goto('http://whatsmyuseragent.org/')        page.screenshot(path=f'example-{browser_type.name}.png')        browser.close()

异步:

import asynciofrom playwright import async_playwright
async def main():    async with async_playwright() as p:        for browser_type in [p.chromium, p.firefox, p.webkit]:            browser = await browser_type.launch()            page = await browser.newPage()            await page.goto('http://whatsmyuseragent.org/')            await page.screenshot(path=f'example-{browser_type.name}.png')            await browser.close()
asyncio.get_event_loop().run_until_complete(main())

2、集成 pytest 测试:

def test_playwright_is_visible_on_google(page):    page.goto("https://www.google.com")    page.type("input[name=q]", "Playwright GitHub")    page.click("input[type=submit]")    page.waitForSelector("text=microsoft/Playwright")

3、交互模式运行:

>>> from playwright import sync_playwright>>> playwright = sync_playwright().start()
# Use playwright.chromium, playwright.firefox or playwright.webkit# Pass headless=False to see the browser UI>>> browser = playwright.chromium.launch()>>> page = browser.newPage()>>> page.goto("http://whatsmyuseragent.org/")>>> page.screenshot(path="example.png")>>> browser.close()>>> playwright.stop(

4、执行 JS 代码:

from playwright import sync_playwright
with sync_playwright() as p:    browser = p.firefox.launch()    page = browser.newPage()    page.goto('https://www.example.com/')    dimensions = page.evaluate('''() => {      return {        width: document.documentElement.clientWidth,        height: document.documentElement.clientHeight,        deviceScaleFactor: window.devicePixelRatio      }    }''')    print(dimensions)    browser.close()

5、中断网络请求:

from playwright import sync_playwright


with sync_playwright() as p:
    browser = p.chromium.launch()
    page = browser.newPage()


    def log_and_continue_request(route, request):
      print(request.url)
      route.continue_()


    # Log and continue all network requests
    page.route('**', lambda route, request: log_and_continue_request(route, request))


    page.goto('http://todomvc.com')
    browser.close()

官方文档暂时还是 Node.js 版本,不过正在转换成 Python 版本,API的调用方式相当一致,现在看 Node.js 版本的文档来编码也是无障碍的。

官方文档:https://playwright.dev/

GitHub 仓库:https://github.com/microsoft/playwright-python

综上,感觉比 selenium 更好用啦,准备入坑。如果觉得文章对你有用,欢迎点赞、转发、关注,谢谢支持。


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK