Self-hosted ML deployment platform


Get started: Install • Tutorial • Docs • Examples

Learn more: Website • Blog • Subscribe • Contact

Cortex is a machine learning model deployment platform that runs in your AWS account. You define deployments with simple declarative configuration and Cortex deploys your models as JSON APIs. It also handles autoscaling, rolling updates, log streaming, inference on CPUs or GPUs, and more.

Cortex is actively maintained by a venture-backed team of infrastructure engineers, and we're hiring.

How it works

Define your deployment using declarative configuration:

# cortex.yaml

- kind: api
  name: my-api
  model: s3://my-bucket/my-model.zip
  request_handler: handler.py
  compute:
    min_replicas: 5
    max_replicas: 20
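
The compute section bounds how many replicas the API may run. As a rough illustration only (this is not Cortex's actual autoscaling logic, and clamp_replicas is a hypothetical helper), whatever replica count the autoscaler wants would be kept inside those bounds:

```python
def clamp_replicas(desired, min_replicas=5, max_replicas=20):
    """Keep a desired replica count within the configured bounds.

    Hypothetical helper for illustration; the defaults mirror the
    min_replicas / max_replicas values in the cortex.yaml above.
    """
    if min_replicas > max_replicas:
        raise ValueError("min_replicas must not exceed max_replicas")
    return max(min_replicas, min(desired, max_replicas))
```

With the values above, a quiet period would still keep 5 replicas running, and a traffic spike would stop scaling at 20.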

Customize request handling (optional):

# handler.py

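def pre_inference(sample, metadata):
  # Python code (placeholder: pass the sample through unchanged)
  return sample


def post_inference(prediction, metadata):
  # Python code (placeholder: pass the prediction through unchanged)
  return prediction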
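
For a more concrete picture, here is a minimal sketch of what a handler might do. The field names, the label list, and the assumption that the model returns a class index are all hypothetical; the sketch also assumes each function returns the transformed value.

```python
# handler.py (illustrative sketch; field names and labels are assumptions)

FEATURE_ORDER = ["a", "b", "c"]  # hypothetical input fields
LABELS = ["abc", "def", "ghi"]   # hypothetical class labels


def pre_inference(sample, metadata):
  # Turn the JSON payload into an ordered list of floats for the model.
  return [float(sample[name]) for name in FEATURE_ORDER]


def post_inference(prediction, metadata):
  # Map the model's class index (assumed) to a readable label.
  return {"prediction": LABELS[int(prediction)]}
```

Keeping this logic in the handler means clients can send raw JSON fields and receive a readable label without knowing the model's input layout.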

Deploy to your cloud infrastructure:

$ cortex deploy

Deploying ...
https://amazonaws.com/my-api  # Your API is ready!

Serve real-time predictions via scalable JSON APIs:

$ curl -d '{"a": 1, "b": 2, "c": 3}' https://amazonaws.com/my-api

{"prediction": "def"}
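
The same request can be made from Python using only the standard library. This is a sketch, not an official client: build_request and predict are hypothetical helper names, the URL is the placeholder from the example above, and the explicit JSON Content-Type header is an assumption (the curl command above does not set one).

```python
import json
import urllib.request


def build_request(url, sample):
    """Build a JSON POST request for a deployed API endpoint."""
    body = json.dumps(sample).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},  # assumed header
        method="POST",
    )


def predict(url, sample, timeout=10):
    """Send the sample and return the decoded JSON response."""
    request = build_request(url, sample)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling predict("https://amazonaws.com/my-api", {"a": 1, "b": 2, "c": 3}) would send the same payload as the curl command, with the endpoint replaced by the URL printed by cortex deploy.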

Spinning up a Cortex cluster on your AWS account

# Download the install script
$ curl -O https://raw.githubusercontent.com/cortexlabs/cortex/master/cortex.sh && chmod +x cortex.sh

# Set your AWS credentials
$ export AWS_ACCESS_KEY_ID=***
$ export AWS_SECRET_ACCESS_KEY=***

# Provision infrastructure on AWS and install Cortex
$ ./cortex.sh install

# Install the Cortex CLI on your machine
$ ./cortex.sh install cli

Key features

Minimal declarative configuration: Deployments can be defined in a single cortex.yaml file.
Autoscaling: Cortex can automatically scale APIs to handle production workloads.
Multi-framework: Cortex supports TensorFlow, Keras, PyTorch, Scikit-learn, XGBoost, and more.
Rolling updates: Cortex updates deployed APIs without any downtime.
Log streaming: Cortex streams logs from your deployed models to your CLI.
CPU / GPU support: Cortex can run inference on CPU or GPU infrastructure.

