GitHub - go-skynet/LocalAI: Self-hosted, community-driven, local OpenAI-compatib... - JOYK Joy of Geek, Geek News, Link all geek

LocalAI

LocalAI is a drop-in replacement REST API that’s compatible with OpenAI API specifications for local inferencing. It allows you to run LLMs (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families that are compatible with the ggml format. Does not require GPU.

For a list of the supported model families, please see the model compatibility table.

In a nutshell:

Local, OpenAI drop-in alternative REST API. You own your data.
NO GPU required. NO Internet access is required either. Optional, GPU Acceleration is available in llama.cpp-compatible LLMs. See building instructions.
Supports multiple models, Audio transcription, Text generation with GPTs, Image generation with stable diffusion (experimental)
Once loaded the first time, it keep models loaded in memory for faster inference
Doesn't shell-out, but uses C++ bindings for a faster inference and better performance.

LocalAI was created by Ettore Di Giacinto and is a community-driven project, focused on making the AI accessible to anyone. Any contribution, feedback and PR is welcome!

ChatGPT OSS alternative	Image generation

See the Getting started and examples sections to learn how to use LocalAI. For a list of curated models check out the model gallery.

06-06-2023: v1.18.0: Many updates, new features, and much more , check out the Changelog!
29-05-2023: LocalAI now has a website, https://localai.io! check the news in the dedicated section!

For latest news, follow also on Twitter @LocalAI_API and @mudler_it

Contribute and help

To help the project you can:

Upvote the Reddit post about LocalAI.
Hacker news post - help us out by voting if you like this project.
If you have technological skills and want to contribute to development, have a look at the open issues. If you are new you can have a look at the good-first-issue and help-wanted labels.
If you don't have technological skills you can still help improving documentation or add examples or share your user-stories with our community, any help and contribution is welcome!

Usage

Check out the Getting started section. Here below you will find generic, quick instructions to get ready and use LocalAI.

The easiest way to run LocalAI is by using docker-compose (to build locally, see building LocalAI):

git clone https://github.com/go-skynet/LocalAI

cd LocalAI

# (optional) Checkout a specific LocalAI tag
# git checkout -b build <TAG>

# copy your models to models/
cp your-model.bin models/

# (optional) Edit the .env file to set things like context size and threads
# vim .env

# start with docker-compose
docker-compose up -d --pull always
# or you can build the images with:
# docker-compose up -d --build

# Now API is accessible at localhost:8080
curl http://localhost:8080/v1/models
# {"object":"list","data":[{"id":"your-model.bin","object":"model"}]}

curl http://localhost:8080/v1/completions -H "Content-Type: application/json" -d '{
     "model": "your-model.bin",            
     "prompt": "A long time ago in a galaxy far, far away",
     "temperature": 0.7
   }'

Example: Use GPT4ALL-J model


git clone https://github.com/go-skynet/LocalAI

 LocalAI





wget https://gpt4all.io/models/ggml-gpt4all-j.bin -O models/ggml-gpt4all-j


cp -rf prompt-templates/ggml-gpt4all-j.tmpl models/





docker-compose up -d --pull always



curl http://localhost:8080/v1/models


curl http://localhost:8080/v1/chat/completions -H  -d

Build locally

See the build section in our documentation for detailed instructions.

Run LocalAI in Kubernetes

LocalAI can be installed inside Kubernetes with helm. See installation instructions.

Supported API endpoints

See the list of the supported API endpoints and how to configure image generation and audio transcription.

Frequently asked questions

See the FAQ section for a list of common questions.

Projects already using LocalAI to run local models

Feel free to open up a PR to get your project listed!

Short-term roadmap

Mimic OpenAI API (#10)
Binary releases (#6)
Upstream our golang bindings to llama.cpp (ggerganov/llama.cpp#351) and gpt4all
Multi-model support
Have a webUI!
Allow configuration of defaults for models.
Support for embeddings
Support for audio transcription with https://github.com/ggerganov/whisper.cpp
GPU/CUDA support ( #69 )
Enable automatic downloading of models from a curated gallery, with only free-licensed models, directly from the webui.

Star history

License

LocalAI is a community-driven project created by Ettore Di Giacinto.

Author

Ettore Di Giacinto and others

Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

GitHub - go-skynet/LocalAI: Self-hosted, community-driven, local OpenAI-compatib...

Contribute and help

Usage

Example: Use GPT4ALL-J model

Build locally

Run LocalAI in Kubernetes

Supported API endpoints

Frequently asked questions

Projects already using LocalAI to run local models

Short-term roadmap

Star history

License

Author

Acknowledgements

Contributors

Recommend

Reddit insists on being “fairly paid” amid API price protest plans, layoffs

史上最大屏的 MacBook Air，被最强悍芯片 M2 Ultra 武装的 Mac Studio 和 Mac Pro |...

10 security tool categories needed to shore up software supply chain security

On AWS Shutting Down Open Source Documentation

Autonomous Waymo car runs over dog in San Francisco

Traditional malware increasingly takes advantage of ChatGPT for attacks

Vodafone says Australians are paying too much for their mobile bills

有单朋友圈 | 有氧YOYA：用创意服务生意

iMacros Automation Scripting with Test Studio

kali 使用John破解zip压缩包的密码 - 无主题博客

About Joyk