基于 Channel 实现的多路复用 RPC 组件

基础组件设计

前言

首先，我们先介绍一下 Swoole\Coroutine\Client 的一个限制，那便是同一个连接，不允许同时被两个协程绑定，我们可以进行以下测试。

<?php
run(function () {
    $client = new \Swoole\Coroutine\Client(SWOOLE_SOCK_TCP);
    $client->set([
        'open_length_check' => true,
        'package_length_type' => 'N',
        'package_length_offset' => 0,
        'package_body_offset' => 4,
        'package_max_length' => 1024 * 1024 * 2,
    ]);
    $client->connect('127.0.0.1', 9601, 0.5);
    go(function () use ($client) {
        $ret = $client->send(str_repeat('xxx', 1000));
        $client->recv();
    });
    go(function () use ($client) {
        $ret = $client->send('xxx');
        $client->recv();
    });
});

当我们执行以上代码，就会抛出以下错误

PHP Fatal error:  Uncaught Swoole\Error: Socket#9 has already been bound to another coroutine#2, reading of the same socket in coroutine#3 at the same time is not allowed in /Users/limingxin/Applications/GitHub/hyperf/repos/multiplex-socket/tests/swoole_client.php:32
Stack trace:
#0 /Users/limingxin/Applications/GitHub/hyperf/repos/multiplex-socket/tests/swoole_client.php(32): Swoole\Coroutine\Client->recv()
#1 /Users/limingxin/Applications/GitHub/hyperf/repos/multiplex-socket/vendor/hyperf/utils/src/Functions.php(271): {closure}()
#2 /Users/limingxin/Applications/GitHub/hyperf/repos/multiplex-socket/vendor/hyperf/utils/src/Coroutine.php(62): call(Object(Closure))
#3 {main}
  thrown in /Users/limingxin/Applications/GitHub/hyperf/repos/multiplex-socket/tests/swoole_client.php on line 32

但我们稍微改动一下代码，就不会再次报错，代码如下

<?php
run(function () {
    $client = new \Swoole\Coroutine\Client(SWOOLE_SOCK_TCP);
    $client->set([
        'open_length_check' => true,
        'package_length_type' => 'N',
        'package_length_offset' => 0,
        'package_body_offset' => 4,
        'package_max_length' => 1024 * 1024 * 2,
    ]);
    $client->connect('127.0.0.1', 9601, 0.5);
    $chan = new \Swoole\Coroutine\Channel(1);
    go(function () use ($client, $chan) {
        $ret = $client->send(str_repeat('xxx', 1000));
        $chan->push(true);
        $client->recv();
        $chan->pop();
    });
    go(function () use ($client, $chan) {
        $ret = $client->send('xxx');
        $chan->push(true);
        $client->recv();
        $chan->pop();
    });
});

可见，我们只需要让 recv 在一个协程里循环调用，然后再根据收包发到不同的 Channel 当中，这样我们就可以多个协程复用同一个连接。

包体设计

接下来的事情就很简单了，我们设计一个十分简单的包结构。包头为使用 pack N 打包的包体长度，包体为 pack N 打包的 Channel ID 和数据体。

因为 Swoole 中分包规则已经实现，所以我们可以简单的配置一下实现上述效果

'open_length_check' => true,
'package_length_type' => 'N',
'package_length_offset' => 0,
'package_body_offset' => 4,
'package_max_length' => 1024 * 1024 * 2,

接下来我们只需要实现包体的打包和解包功能即可，我们可以实现一个十分简单的打包器。

<?php

declare(strict_types=1);

namespace Multiplex;

use Multiplex\Constract\PackerInterface;

class Packer implements PackerInterface
{
    public function pack(Packet $packet): string
    {
        return sprintf(
            '%s%s%s',
            pack('N', strlen($packet->getBody()) + 4),
            pack('N', $packet->getId()),
            $packet->getBody()
        );
    }

    public function unpack(string $data): Packet
    {
        $unpacked = unpack('Nid', substr($data, 4, 4));
        $body = substr($data, 8);
        return new Packet((int) $unpacked['id'], $body);
    }
}

服务端

服务端的设计就尤为简单了，因为 Channel 机制主要是给客户端使用，所以服务端解包之后，原封不动的将 ChannelID 和数据返回即可。

$server->handle(function (Connection $conn) {
    while (true) {
        $ret = $conn->recv();
        if (empty($ret)) {
            break;
        }

        Coroutine::create(function () use ($ret, $conn) {
            $packet = $this->packer->unpack($ret);
            $id = $packet->getId();
            try {
                $result = $this->handler->__invoke($packet, $this->getSerializer());
            } catch (\Throwable $exception) {
                $result = $exception;
            } finally {
                $conn->send($this->packer->pack(new Packet($id, $this->getSerializer()->serialize($result))));
            }
        });
    }
});

客户端

客户端相比而言，就要麻烦一些。我们需要创建一个 Channel 存储需要发送的数据，还需要设计一个 Channel Map 存储各个 ID 返回的数据，这样方便 recv 时，直接使用 Channel::pop() 获得数据，这样一来就可以很方便的将业务客户端与实际客户端进行解耦。

下述代码中，我们创建了两个协程，循环调用 Client::send 和 Client::recv 方法。

protected function loop(): void
{
    if ($this->chan !== null && ! $this->chan->isClosing()) {
        return;
    }
    $this->chan = $this->getChannelManager()->make(65535);
    $this->client = $this->makeClient();
    Coroutine::create(function () {
        try {
            $chan = $this->chan;
            $client = $this->client;
            while (true) {
                $data = $client->recv(-1);
                if (! $client->isConnected()) {
                    break;
                }
                if ($chan->isClosing()) {
                    break;
                }

                $packet = $this->packer->unpack($data);
                if ($channel = $this->getChannelManager()->get($packet->getId())) {
                    $channel->push(
                        $this->serializer->unserialize($packet->getBody())
                    );
                }
            }
        } finally {
            $chan->close();
            $client->close();
        }
    });

    Coroutine::create(function () {
        try {
            $chan = $this->chan;
            $client = $this->client;
            while (true) {
                $data = $chan->pop();
                if ($chan->isClosing()) {
                    break;
                }
                if (! $client->isConnected()) {
                    break;
                }

                if (empty($data)) {
                    continue;
                }

                $client->send($data);
            }
        } finally {
            $chan->close();
            $client->close();
        }
    });
}

实现组件

最后，根据上述的想法，我们实现了以下两个组件

multiplex multiplex-socket

随手写了两段代码，对多路复用和连接池进行测试，我们创建 10000 个协程，同时调用服务端，当服务端接收到数据，立马返回的情况下

二者差距不大，完全结束都在 0.3-0.5 秒之间。

但当我们在返回数据前，睡眠 10 毫秒的情况下，多路复用所用的时间要低于连接池的十分之一。

不仅速度更快，多路复用的连接，从始至终只用到 1 个，但连接池却起了 100 个连接，综合来说，多路复用要比使用连接池表现的更加优秀。

示例

客户端

<?php
declare(strict_types=1);

require_once __DIR__ . '/../vendor/autoload.php';

run(function () {
    $client = new \Multiplex\Socket\Client('127.0.0.1', 9601);
    $client->request('World.');
});

服务端

<?php

declare(strict_types=1);

use Multiplex\Packet;
use Multiplex\Socket\Server;

require_once __DIR__ . '/../vendor/autoload.php';

run(function () {
    $server = new Server();
    $config = collect([]);
    $server->bind('0.0.0.0', 9601, $config)->handle(static function (Packet $packet) {
        return 'Hello ' . $packet->getBody();
    })->start();
});

RPC 组件设计

在实现 RPC 组件时，也发现了一些问题

在 rpc-client 组件中，设计了一套动态生成 客户端代理 的监听器 AddConsumerDefinitionListener 。这原本是很好的一种实现。

但我发现，其实现的客户端包含以下代码（已删除多余代码）

<?php

declare(strict_types=1);

namespace Hyperf\RpcClient;

use Hyperf\Contract\IdGeneratorInterface;
use Hyperf\Contract\NormalizerInterface;
use Hyperf\Di\MethodDefinitionCollectorInterface;
use Hyperf\RpcClient\Exception\RequestException;
use Hyperf\Utils\Arr;
use Psr\Container\ContainerInterface;

class ServiceClient extends AbstractServiceClient
{
    protected function __request(string $method, array $params, ?string $id = null)
    {
        if ($this->idGenerator instanceof IdGeneratorInterface && ! $id) {
            $id = $this->idGenerator->generate();
        }
        $response = $this->client->send($this->__generateData($method, $params, $id));
        if (! is_array($response)) {
            throw new RequestException('Invalid response.');
        }

        $response = $this->checkRequestIdAndTryAgain($response, $id);
        if (array_key_exists('result', $response)) {
            $type = $this->methodDefinitionCollector->getReturnType($this->serviceInterface, $method);
            return $this->normalizer->denormalize($response['result'], $type->getName());
        }

        if ($code = $response['error']['code'] ?? null) {
            $error = $response['error'];
            // Denormalize exception.
            $class = Arr::get($error, 'data.class');
            $attributes = Arr::get($error, 'data.attributes', []);
            if (isset($class) && class_exists($class) && $e = $this->normalizer->denormalize($attributes, $class)) {
                if ($e instanceof \Throwable) {
                    throw $e;
                }
            }

            // Throw RequestException when denormalize exception failed.
            throw new RequestException($error['message'] ?? '', $code, $error['data'] ?? []);
        }

        throw new RequestException('Invalid response.');
    }
}

我们可以返回值这里，直接写死了 result error 等参数，这就导致返回值的数据结构，必须要按照这个结构来返回。

但实际上，我们应该使用 dataFormatter 来处理这件事。于是设计了以下接口：

后续会修改这里，暂时只是设计出来，并没有使用

<?php

declare(strict_types=1);

namespace Hyperf\RpcMultiplex\Contract;

use Hyperf\RpcClient\Exception\RequestException;

interface DataFetcherInterface
{
    /**
     * @throws RequestException
     * @return mixed
     */
    public function fetch(array $data);
}

这样我们便可以方便的进行解耦

<?php

declare(strict_types=1);

namespace Hyperf\RpcMultiplex;

use Hyperf\Rpc\Context;
use Hyperf\Rpc\Contract\DataFormatterInterface;
use Hyperf\RpcClient\Exception\RequestException;
use Hyperf\RpcMultiplex\Contract\DataFetcherInterface;
use Hyperf\Utils\Codec\Json;

class DataFormatter implements DataFormatterInterface, DataFetcherInterface
{
    /**
     * @var Context
     */
    protected $context;

    public function __construct(Context $context)
    {
        $this->context = $context;
    }

    public function formatRequest($data)
    {
        [$path, $params, $id] = $data;
        return [
            Constant::ID => $id,
            Constant::PATH => $path,
            Constant::DATA => $params,
            Constant::CONTEXT => $this->context->getData(),
        ];
    }

    public function formatResponse($data)
    {
        [$id, $result] = $data;
        return [
            Constant::ID => $id,
            Constant::RESULT => $result,
            Constant::CONTEXT => $this->context->getData(),
        ];
    }

    public function formatErrorResponse($data)
    {
        [$id, $code, $message, $data] = $data;

        if (isset($data) && $data instanceof \Throwable) {
            $data = [
                'class' => get_class($data),
                'code' => $data->getCode(),
                'message' => $data->getMessage(),
            ];
        }
        return [
            Constant::ID => $id ?? null,
            Constant::ERROR => [
                Constant::CODE => $code,
                Constant::MESSAGE => $message,
                Constant::DATA => $data,
            ],
            Constant::CONTEXT => $this->context->getData(),
        ];
    }

    public function fetch(array $data)
    {
        if (array_key_exists(Constant::DATA, $data)) {
            $this->context->setData($data[Constant::CONTEXT] ?? []);

            return $data[Constant::DATA];
        }

        if (array_key_exists(Constant::ERROR, $data)) {
            throw new RequestException(
                $data[Constant::ERROR][Constant::MESSAGE] ?? 'Invalid error message',
                $data[Constant::ERROR][Constant::CODE] ?? 0,
                $data[Constant::ERROR][Constant::DATA],
            );
        }

        throw new RequestException('Unknown data ' . Json::encode($data), 0);
    }
}

不过因为 rpc-client 中已经写死了结构，所以这次并没有使用到上述代码，而是将返回值修改为满足 rpc-client 的验证规则。

HttpMessage 设计

在 RPC 设计上，为了可以统计设计中间件，又需要兼容 HTTP 和 TCP，所以我们会将数据转化为 Request 返回值转化为 Response，所以新设计了接口文件 HttpMessageBuilderInterface 。

<?php

declare(strict_types=1);

namespace Hyperf\RpcMultiplex;

use Hyperf\Contract\PackerInterface;
use Hyperf\HttpMessage\Server\Request;
use Hyperf\HttpMessage\Stream\SwooleStream;
use Hyperf\HttpMessage\Uri\Uri;
use Hyperf\RpcMultiplex\Contract\HttpMessageBuilderInterface;
use Hyperf\Utils\Codec\Json;
use Hyperf\Utils\Context;
use Psr\Http\Message\ResponseInterface;
use Psr\Http\Message\ServerRequestInterface;
use Psr\Http\Message\UriInterface;

class HttpMessageBuilder implements HttpMessageBuilderInterface
{
    /**
     * @var PackerInterface
     */
    protected $packer;

    public function __construct(PackerInterface $packer)
    {
        $this->packer = $packer;
    }

    public function buildRequest(array $data): ServerRequestInterface
    {
        $uri = $this->buildUri(
            $data[Constant::PATH] ?? '/',
            $data[Constant::HOST] ?? 'unknown',
            $data[Constant::PORT] ?? 80
        );

        $parsedData = $data[Constant::DATA] ?? [];

        $request = new Request('POST', $uri, ['Content-Type' => 'application/json'], new SwooleStream(Json::encode($parsedData)));

        return $request->withParsedBody($parsedData);
    }

    public function buildResponse(ServerRequestInterface $request, array $data): ResponseInterface
    {
        $packed = $this->packer->pack($data);

        return $this->response()->withBody(new SwooleStream($packed));
    }

    public function persistToContext(ResponseInterface $response): ResponseInterface
    {
        return Context::set(ResponseInterface::class, $response);
    }

    protected function buildUri($path, $host, $port, $scheme = 'http'): UriInterface
    {
        $uri = "{$scheme}://{$host}:{$port}/" . ltrim($path, '/');

        return new Uri($uri);
    }

    /**
     * Get response instance from context.
     */
    protected function response(): ResponseInterface
    {
        return Context::get(ResponseInterface::class);
    }
}

重写 ON_RECEIVE

在基础组件设计时，我们已经清楚的看到，如果想要让服务端吞吐量最大化，必须要实现以下逻辑

$server->handle(function (Connection $conn) {
    while (true) {
        $ret = $conn->recv();
        if (empty($ret)) {
            break;
        }

        Coroutine::create(function () use ($ret, $conn) {
            // Do something ...
        });
    }
});

而我们默认的 TcpServer 是以下逻辑

$this->server->handle(function (Coroutine\Server\Connection $connection) use ($connectHandler, $connectMethod, $receiveHandler, $receiveMethod, $closeHandler, $closeMethod) {
    if ($connectHandler && $connectMethod) {
        parallel([static function () use ($connectHandler, $connectMethod, $connection) {
            $connectHandler->{$connectMethod}($connection, $connection->exportSocket()->fd);
        }]);
    }
    while (true) {
        $data = $connection->recv();
        if (empty($data)) {
            if ($closeHandler && $closeMethod) {
                parallel([static function () use ($closeHandler, $closeMethod, $connection) {
                    $closeHandler->{$closeMethod}($connection, $connection->exportSocket()->fd);
                }]);
            }
            $connection->close();
            break;
        }
        // One coroutine at a time, consistent with other servers
        parallel([static function () use ($receiveHandler, $receiveMethod, $connection, $data) {
            $receiveHandler->{$receiveMethod}($connection, $connection->exportSocket()->fd, 0, $data);
        }]);
    }
});

所以我们继承后，仍需要重写对应代码

public function onReceive($server, int $fd, int $fromId, string $data): void
{
    Coroutine::create(function () use ($server, $fd, $fromId, $data) {
        $packet = $this->packetPacker->unpack($data);

        Context::set(Constant::CHANNEL_ID, $packet->getId());

        parent::onReceive($server, $fd, $fromId, $packet->getBody());
    });
}

组件实现

接下来我们只需要再实现对应的 packer transporter 和 path-generator ，就大功告成了。

rpc-multiplex-incubator DEMO

性能表现

测试代码只有一行 Redis::incr

32连接池

$ ab -c 32 -n 10000 -k http://127.0.0.1:9501/
This is ApacheBench, Version 2.3 <$Revision: 1879490 $>
Copyright 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Licensed to The Apache Software Foundation, http://www.apache.org/

Benchmarking 127.0.0.1 (be patient)
Completed 1000 requests
Completed 2000 requests
Completed 3000 requests
Completed 4000 requests
Completed 5000 requests
Completed 6000 requests
Completed 7000 requests
Completed 8000 requests
Completed 9000 requests
Completed 10000 requests
Finished 10000 requests


Server Software:        Hyperf
Server Hostname:        127.0.0.1
Server Port:            9501

Document Path:          /
Document Length:        5 bytes

Concurrency Level:      32
Time taken for tests:   3.177 seconds
Complete requests:      10000
Failed requests:        0
Keep-Alive requests:    10000
Total transferred:      1460000 bytes
HTML transferred:       50000 bytes
Requests per second:    3147.21 [#/sec] (mean)
Time per request:       10.168 [ms] (mean)
Time per request:       0.318 [ms] (mean, across all concurrent requests)
Transfer rate:          448.72 [Kbytes/sec] received

Connection Times (ms)
              min  mean[+/-sd] median   max
Connect:        0    0   0.0      0       1
Processing:     5   10   4.8      9      68
Waiting:        5   10   4.8      9      68
Total:          5   10   4.8      9      68

Percentage of the requests served within a certain time (ms)
  50%      9
  66%     10
  75%     10
  80%     12
  90%     15
  95%     17
  98%     19
  99%     23
 100%     68 (longest request)

多路复用 - 单连接

$ ab -c 32 -n 10000 -k http://127.0.0.1:9501/
This is ApacheBench, Version 2.3 <$Revision: 1879490 $>
Copyright 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Licensed to The Apache Software Foundation, http://www.apache.org/

Benchmarking 127.0.0.1 (be patient)
Completed 1000 requests
Completed 2000 requests
Completed 3000 requests
Completed 4000 requests
Completed 5000 requests
Completed 6000 requests
Completed 7000 requests
Completed 8000 requests
Completed 9000 requests
Completed 10000 requests
Finished 10000 requests


Server Software:        Hyperf
Server Hostname:        127.0.0.1
Server Port:            9501

Document Path:          /
Document Length:        5 bytes

Concurrency Level:      32
Time taken for tests:   2.509 seconds
Complete requests:      10000
Failed requests:        0
Keep-Alive requests:    10000
Total transferred:      1460000 bytes
HTML transferred:       50000 bytes
Requests per second:    3984.94 [#/sec] (mean)
Time per request:       8.030 [ms] (mean)
Time per request:       0.251 [ms] (mean, across all concurrent requests)
Transfer rate:          568.17 [Kbytes/sec] received

Connection Times (ms)
              min  mean[+/-sd] median   max
Connect:        0    0   0.0      0       1
Processing:     4    8   1.4      8      18
Waiting:        4    8   1.4      8      17
Total:          4    8   1.4      8      18

Percentage of the requests served within a certain time (ms)
  50%      8
  66%      8
  75%      9
  80%      9
  90%     10
  95%     10
  98%     12
  99%     12
 100%     18 (longest request)

可见，就性价比而言，多路复用比连接池要优秀的多。

写在最后

Hyperf 是基于 Swoole 4.5+ 实现的高性能、高灵活性的 PHP 协程框架，内置协程服务器及大量常用的组件，性能较传统基于 PHP-FPM 的框架有质的提升，提供超高性能的同时，也保持着极其灵活的可扩展性，标准组件均基于 PSR 标准实现，基于强大的依赖注入设计，保证了绝大部分组件或类都是可替换与可复用的。

框架组件库除了常见的协程版的 MySQL 客户端、Redis 客户端，还为您准备了协程版的 Eloquent ORM、WebSocket 服务端及客户端、JSON RPC 服务端及客户端、GRPC 服务端及客户端、Zipkin/Jaeger (OpenTracing) 客户端、Guzzle HTTP 客户端、Elasticsearch 客户端、Consul 客户端、ETCD 客户端、AMQP 组件、Apollo 配置中心、阿里云 ACM 应用配置管理、ETCD 配置中心、基于令牌桶算法的限流器、通用连接池、熔断器、Swagger 文档生成、Swoole Tracker、Blade 和 Smarty 视图引擎、Snowflake 全局ID生成器等组件，省去了自己实现对应协程版本的麻烦。

Hyperf 还提供了基于 PSR-11 的依赖注入容器、注解、AOP 面向切面编程、基于 PSR-15 的中间件、自定义进程、基于 PSR-14 的事件管理器、Redis/RabbitMQ 消息队列、自动模型缓存、基于 PSR-16 的缓存、Crontab 秒级定时任务、Translation 国际化、Validation 验证器等非常便捷的功能，满足丰富的技术场景和业务场景，开箱即用。

基础组件设计

前言

包体设计

服务端

客户端

实现组件

示例

RPC 组件设计

HttpMessage 设计

重写 ON_RECEIVE

组件实现

性能表现

写在最后

Recommend

抖音搜索月活用户突破5.5亿，张楠称将大力投入视频搜索

明年，我要用 AI 给全村写对联

如何在 Linux 中创建和管理归档文件 | Linux 中国

不用卷积，也能生成清晰图像，华人博士生首次尝试用两个Transformer构建一个GAN

焦点分析｜京东物流争气了：为京东挣钱，不靠京东养家

资产管理中的研究挑战

2021年，手机行业有哪些值得期待的「新尝试」

喝着枸杞水，大白差点把MySQL磁盘干爆

O2O的中场战事：Yelp衰落，美团突围

SpaceX计划融资8.5亿美元估值升至740亿

About Joyk