Zhagana

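After a two-year break from blogging, I finally have time to write a short article. This one is about Playwright, an open-source cross-browser automation tool from Microsoft: think of it as Puppeteer for Chromium, Firefox, and WebKit.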

A step-by-step guide to using Playwright.

Project Setup

Create the simplest possible project by hand:

mkdir zhagana
cd zhagana
npm init

Install Playwright and TypeScript via npm. This may take a few minutes because Playwright downloads browser binaries:

npm i playwright
npm i -D typescript

TypeScript Configuration

Playwright for JavaScript and TypeScript is generally available, but TypeScript still needs some configuration. Create a tsconfig.json file with the following content:

{
  "compilerOptions": {
    "target": "es5",
    "module": "commonjs",
    "outDir": "build",
    "sourceMap": true
  }
}

VS Code Launcher and Debugger

Click the RUN button in the left menu, then create a launch.json. Select Node.js from the drop-down if you have other debugger extensions installed.


Make sure you have "preLaunchTask": "tsc: build - tsconfig.json" and "outFiles": ["${workspaceFolder}/build/**/*.js"] in launch.json.

{
  "version": "0.2.0",
  "configurations": [
    {
      "type": "node",
      "request": "launch",
      "name": "Launch Program",
      "skipFiles": [
        "<node_internals>/**"
      ],
      "program": "${workspaceFolder}/index.ts",
      "preLaunchTask": "tsc: build - tsconfig.json",
      "outFiles": ["${workspaceFolder}/build/**/*.js"]
    }
  ]
}

Coding

Screenshot

We will start by taking a screenshot of a page. This code comes from the Playwright documentation, converted to TypeScript:

import { webkit } from 'playwright';

(async () => {
  const browser = await webkit.launch();
  const page = await browser.newPage();
  await page.goto('http://whatsmyuseragent.org/');
  await page.screenshot({ path: `out/whatsmyuseragent.png` });
  await browser.close();
})();

Press F5 to run the project. It writes the screenshot to out/whatsmyuseragent.png.

Now let's do the same in 3 browsers:

import { Browser, BrowserType, chromium, firefox, webkit } from 'playwright';

async function screenshot(browserType: BrowserType<Browser>) {
  // use the `browserType` argument instead of a hardcoded browser
  const browser = await browserType.launch();
  const page = await browser.newPage();
  await page.goto('http://whatsmyuseragent.org/');
  await page.screenshot({ path: `out/ua-${browserType.name()}.png` });
  await browser.close();
}

(async () => {
  // 3 different kinds of browsers
  const BROWSER_TYPES = [
    chromium,
    firefox,
    webkit
  ];
  // take all screenshots in parallel
  await Promise.all(BROWSER_TYPES.map((browserType) => {
    return screenshot(browserType);
  }));
})();

Here the screenshot function replaces the body of the main function, and Promise.all runs the 3 browsers in parallel. After a few seconds, we get 3 screenshots:

out/ua-chromium.png with HeadlessChrome
out/ua-firefox.png with Firefox
out/ua-webkit.png with AppleWebKit ... Safari
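
Promise.all resolves to an array in the same order as its input, no matter which browser finishes first. The sketch below illustrates this with a hypothetical `fakeScreenshot` stand-in (not part of Playwright) that only waits and returns a path, so it runs without launching any browser. The delays are made-up values for the demonstration.

```typescript
// Hypothetical stand-in for the real screenshot(): resolves with an output
// path after an artificial delay, without launching a browser.
function fakeScreenshot(name: string, delayMs: number): Promise<string> {
  return new Promise<string>((resolve) => {
    setTimeout(() => resolve(`out/ua-${name}.png`), delayMs);
  });
}

interface Job {
  name: string;
  delayMs: number;
}

// Starts every job at once and waits for all of them.
// The result array follows the order of `jobs`, not the completion order.
async function screenshotAll(jobs: Job[]): Promise<string[]> {
  return Promise.all(jobs.map((job) => fakeScreenshot(job.name, job.delayMs)));
}
```

Because the order is stable, the caller can safely index into the result by the position of each browser type in the input array.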

Emulation - Mobile Device

Next, we will emulate a mobile device and navigate to Google Maps.

import { Browser, BrowserType, devices, chromium, firefox, webkit } from 'playwright';

async function screenshot(browserType: BrowserType<Browser>) {
  // use the `browserType` argument instead of a hardcoded browser
  const browser = await browserType.launch();
  // emulate a mobile device
  const iphone = devices['iPhone X'];
  const context = await browser.newContext({ ...iphone });
  // open the web page
  const page = await context.newPage();
  await page.goto('https://www.google.com/maps');
  // take a screenshot
  await page.screenshot({ path: `out/map-${browserType.name()}.png` });
  await browser.close();
}

Since Firefox does not support mobile emulation, we reduce our browsers to chromium and webkit only:

(async () => {
  // firefox does not support mobile emulation
  const BROWSER_TYPES = [ chromium, webkit ];
  // take all screenshots in parallel
  await Promise.all(BROWSER_TYPES.map((browserType) => {
    return screenshot(browserType);
  }));
})();
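
A device descriptor is a plain object of context options, which is why it can be spread into browser.newContext(). The sketch below shows how the spread behaves, using a hand-written `fakePhone` object whose field values are assumptions for the demo rather than values copied from Playwright's device registry. The `withOverrides` helper is hypothetical as well.

```typescript
// Illustrative shape of a device descriptor. The values used below are
// assumptions for this demo, not copied from Playwright's registry.
interface DeviceDescriptor {
  userAgent: string;
  viewport: { width: number; height: number };
  deviceScaleFactor: number;
  isMobile: boolean;
  hasTouch: boolean;
}

const fakePhone: DeviceDescriptor = {
  userAgent: 'Mozilla/5.0 (iPhone; demo user agent)',
  viewport: { width: 375, height: 812 },
  deviceScaleFactor: 3,
  isMobile: true,
  hasTouch: true,
};

// Spreading copies every field of `base`; keys that also appear in
// `overrides` take the override value because they are spread last.
function withOverrides<T extends object, U extends object>(base: T, overrides: U): T & U {
  return { ...base, ...overrides };
}

const merged = withOverrides(fakePhone, {
  viewport: { width: 414, height: 896 },
  locale: 'en-US',
});
```

The original descriptor is left untouched, so the same device can be reused for several contexts with different extra options.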

Press F5 again and we get 2 PNG files in the out directory, one from chromium and one from webkit.

The map shows up, but it does not seem to be completely loaded. So we add page.waitForNavigation() after page.goto():

await page.goto('https://www.google.com/maps');
await page.waitForNavigation();
await page.screenshot({ path: `out/map-${browserType.name()}.png` });

But wait, a blocker comes up: Google Maps wants us to download the app, while we just want to STAY ON WEB.


Input - Mouse Click

From DevTools we can get the selector of this promo: .ml-promotion-nonlu-blocking-promo. Use page.waitForSelector() instead of page.waitForNavigation() to wait for the promotion:


await page.goto('https://www.google.com/maps');
await page.waitForSelector('.ml-promotion-nonlu-blocking-promo');

So let's click the STAY ON WEB button on the page! From DevTools we can also get the selector of this button: button.ml-promotion-action-button.ml-promotion-no-button. Use page.click() to trigger the click event:


// click STAY ON WEB
await page.click('button.ml-promotion-action-button.ml-promotion-no-button');

As the animation that hides the promo lasts 0.3s, we need to wait more than 300ms after the button is clicked before we capture the screenshot.


// wait a bit more than 300 milliseconds for the browser to finish the animation
await page.waitForTimeout(400);
await page.screenshot({ path: `out/map-${browserType.name()}.png` });
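
A fixed delay works but is fragile if the animation timing changes. As an alternative sketch, page.waitForSelector() accepts a `state: 'hidden'` option, so we can wait until the promo is no longer visible. This assumes the promo element really becomes hidden or detached once the animation ends, which I have not verified against the live page. The `PromoPage` interface is a hypothetical minimal subset of Playwright's `Page`, declared here so the sketch compiles on its own. The real `page` object is intended to be passed in its place.

```typescript
// Hypothetical minimal subset of Playwright's Page used by this sketch.
interface PromoPage {
  click(selector: string): Promise<void>;
  waitForSelector(
    selector: string,
    options?: { state?: 'attached' | 'detached' | 'visible' | 'hidden'; timeout?: number }
  ): Promise<unknown>;
}

// Selectors taken from the article.
const PROMO = '.ml-promotion-nonlu-blocking-promo';
const STAY_ON_WEB = 'button.ml-promotion-action-button.ml-promotion-no-button';

// Clicks STAY ON WEB, then waits until the promo is hidden or removed
// instead of sleeping for a fixed number of milliseconds.
async function dismissPromo(page: PromoPage, timeoutMs: number = 5000): Promise<void> {
  await page.click(STAY_ON_WEB);
  await page.waitForSelector(PROMO, { state: 'hidden', timeout: timeoutMs });
}
```

Usage would be `await dismissPromo(page);` right before taking the screenshot.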

Emulation - Geolocation

Now the map shows our current location (probably based on the IP address), but we can also simulate a different place. We can “fly” to the town of Tewo by creating a context with the “geolocation” permission granted:

const context = await browser.newContext({
  ...iphone,
  geolocation: {
    longitude: 103.2199128,
    latitude: 34.0556586,
  },
  permissions: ['geolocation'],
});

If you don't know the longitude and latitude of your “perfect place”, just search for it in Google Maps and read them from the browser URL.
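
Google Maps URLs carry the map center after an `@` sign as `latitude,longitude,zoom`. The sketch below extracts the pair with a regular expression. The `parseLatLng` helper and the example URL are hypothetical, and note that the URL lists latitude first, while the geolocation object above happens to list longitude first.

```typescript
interface LatLng {
  latitude: number;
  longitude: number;
}

// Extracts "@<latitude>,<longitude>" from a Google Maps style URL.
// Returns null when the pattern is missing or the values are out of range.
function parseLatLng(url: string): LatLng | null {
  const match = /@(-?\d+(?:\.\d+)?),(-?\d+(?:\.\d+)?)/.exec(url);
  if (!match) {
    return null;
  }
  const latitude = Number(match[1]);
  const longitude = Number(match[2]);
  if (Math.abs(latitude) > 90 || Math.abs(longitude) > 180) {
    return null;
  }
  return { latitude, longitude };
}

// Hypothetical URL built from the coordinates used in this article.
const tewo = parseLatLng('https://www.google.com/maps/place/Example/@34.0556586,103.2199128,15z');
```

The result can be spread into the `geolocation` option of browser.newContext().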


Click the Your Location button to navigate to our emulated geolocation.


// click `your location` to navigate to the current location
await page.click('button.ml-button-my-location-fab');
// I could not find any event that signals the relocation has finished,
// so we wait a moment for Google Maps to load its resources
await page.waitForTimeout(500);
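
When no event is available, polling a condition is usually more robust than a fixed sleep. Below is a minimal, generic sketch of a hypothetical `pollUntil` helper. What to check (for example, whether some element or URL has changed) is left to the caller and is not something the article specifies. Playwright also ships page.waitForFunction() for conditions evaluated inside the page.

```typescript
// Repeatedly calls `check` until it returns true or the timeout expires.
// Resolves to true on success and false on timeout.
async function pollUntil(
  check: () => boolean | Promise<boolean>,
  timeoutMs: number = 5000,
  intervalMs: number = 100
): Promise<boolean> {
  const deadline = Date.now() + timeoutMs;
  while (true) {
    if (await check()) {
      return true;
    }
    if (Date.now() >= deadline) {
      return false;
    }
    // sleep before the next attempt
    await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
  }
}
```

The helper always runs the check at least once, so a condition that is already true returns immediately without sleeping.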

Re-run the project and we find ourselves located at Tewo Post Bureau.


Input - Text Input

After these emulations, we can start to control the page with more Playwright APIs, just as we clicked on the page a moment ago.

First, fill in the search bar with our target place, like Zhagana.


await page.click('div.ml-searchbox-button-textarea');
await page.waitForSelector('#ml-searchboxinput');
// fill in content
await page.fill('#ml-searchboxinput', 'Zhagana');

Second, press Enter to search.

// press Enter to start searching
await page.press('#ml-searchboxinput', 'Enter');

After that, we will get the target place with a red point, and there should be a Directions button at the bottom of the page.


Third, click Directions and Google will give us the navigation route.

// click Directions
const directionsSelector = 'button[jsaction="pane.placeActions.directions"]';
await page.waitForSelector(directionsSelector);
await page.click(directionsSelector);

Put it all together, returning the output path string as the result.

async function screenshot(browserType: BrowserType<Browser>): Promise<string> {
  // use the `browserType` argument instead of a hardcoded browser
  const browser = await browserType.launch();
  // emulate a mobile device
  const iphone = devices['iPhone X'];
  const context = await browser.newContext({
    ...iphone,
    geolocation: {
      longitude: 103.2199128,
      latitude: 34.0556586,
    },
    permissions: ['geolocation'],
  });
  // open web page
  const page = await context.newPage();
  await page.goto('https://www.google.com/maps');
  // await page.waitForNavigation();

  await page.waitForSelector('.ml-promotion-on-screen');
  // click STAY ON WEB
  await page.click('button.ml-promotion-action-button.ml-promotion-no-button');

  // click `your location` to navigate to the current location
  await page.click('button.ml-button-my-location-fab');

  // click to trigger input field
  await page.click('div.ml-searchbox-button-textarea');
  await page.waitForSelector('#ml-searchboxinput');
  // fill in content
  await page.fill('#ml-searchboxinput', 'Zhagana');
  // press Enter to start searching
  await page.press('#ml-searchboxinput', 'Enter');

  // click Directions
  const directionsSelector = 'button[jsaction="pane.placeActions.directions"]'
  await page.waitForSelector(directionsSelector);
  await page.click(directionsSelector);
  // wait for the result
  // I could not find any event that signals the directions have finished loading,
  // so we wait a couple of seconds for Google Maps to load its resources
  await page.waitForTimeout(2000);

  // take a screenshot and return the output path
  const outputPath = `out/map-${browserType.name()}.png`;
  await page.screenshot({ path: outputPath });
  await browser.close();
  return outputPath;
}
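
One thing to watch in a long flow like this: if any step throws (for example, a selector that never appears), browser.close() is never reached and the browser process may linger. A possible remedy, sketched here with a hypothetical `withResource` helper and a minimal `Closable` interface of my own, is to wrap the work in try/finally.

```typescript
// Anything with an async close() method, such as a Playwright Browser.
interface Closable {
  close(): Promise<void>;
}

// Opens a resource, runs `use` with it, and always closes it afterwards,
// whether `use` resolves or throws.
async function withResource<R extends Closable, T>(
  open: () => Promise<R>,
  use: (resource: R) => Promise<T>
): Promise<T> {
  const resource = await open();
  try {
    return await use(resource);
  } finally {
    await resource.close();
  }
}
```

In this article's code, `open` would be `() => browserType.launch()` and `use` would contain the steps from newContext() through page.screenshot(), returning the output path.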

Okay! Here are the two map screenshots, one from chromium and one from webkit.

Image Diff

The 2 screenshots look exactly the same, but we still want to check with a tool. Pixelmatch is a simple and fast JavaScript pixel-level image comparison library. Create a function to compare two files, A and B:

// assumes the pngjs and pixelmatch (CommonJS build) packages are installed
import * as fs from 'fs';
import { PNG } from 'pngjs';
import pixelmatch = require('pixelmatch');

async function diff(fileA: string, fileB: string) {
  // read the 2 PNG files
  const mapChromium = PNG.sync.read(fs.readFileSync(fileA));
  const mapWebkit = PNG.sync.read(fs.readFileSync(fileB));
  // init the diff image buffer (both images are assumed to have the same size)
  const { width, height } = mapChromium;
  const diffImg = new PNG({ width, height });
  // pixel diff
  pixelmatch(
    mapChromium.data,
    mapWebkit.data,
    diffImg.data,
    width,
    height,
    { threshold: 0.1 }
  );
  // write out the diff image
  fs.writeFileSync('out/map-diff.png', PNG.sync.write(diffImg));
}

Call this function after generating the two screenshots:

(async () => {
  const BROWSER_TYPES = [ chromium, webkit ];
  // take all screenshots in parallel
  const maps = await Promise.all(BROWSER_TYPES.map((browserType) => {
    return screenshot(browserType);
  }));

  await diff(maps[0], maps[1]);
})();

Bingo! Google Maps does a great job in the two browsers, with almost identical rendering. The only differences are the font weight and the stroke width of the navigation route.
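
To put a number on "almost the same", note that pixelmatch returns the count of mismatched pixels, which the diff function above discards. To show what a pixel-level count means, here is a deliberately naive sketch that compares raw RGBA buffers channel by channel. It is not pixelmatch's perceptual algorithm, and `countDifferentPixels` is a hypothetical name.

```typescript
// Counts pixels that differ by more than `tolerance` in any RGBA channel.
// `a` and `b` are raw RGBA buffers of size width * height * 4
// (the layout pngjs uses for its `data` field).
function countDifferentPixels(
  a: Uint8Array,
  b: Uint8Array,
  width: number,
  height: number,
  tolerance: number = 0
): number {
  const expected = width * height * 4;
  if (a.length !== expected || b.length !== expected) {
    throw new Error('Image data does not match the given dimensions');
  }
  let different = 0;
  for (let i = 0; i < expected; i += 4) {
    for (let c = 0; c < 4; c++) {
      if (Math.abs(a[i + c] - b[i + c]) > tolerance) {
        different += 1;
        break; // count each pixel at most once
      }
    }
  }
  return different;
}
```

Dividing the count by `width * height` gives a mismatch ratio that could be compared against a threshold in an automated check.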


Postscript

Zhagana is a wonderful place in Tiewu County, Gannan (Tibetan Autonomous Prefecture), Gansu province, China. Zhagana means “Rock Box” in Tibetan, which is fitting, as it is surrounded by large rocky spires on all sides.

