

Stability AI announces new open-source large language model
source link: https://www.theverge.com/2023/4/19/23689883/stability-ai-open-source-large-language-model-stablelm

Stability AI, the same company behind the AI image generator Stable Diffusion, is now open-sourcing its language model, StableLM.
Stability AI, the company behind the AI-powered Stable Diffusion image generator, has released a suite of open-source large language models (LLMs) collectively called StableLM. In a post shared on Wednesday, the company announced that its models are now available for developers to use and adapt on GitHub.
Like its rival ChatGPT, StableLM is designed to efficiently generate text and code. It’s trained on a larger version of the open-source dataset known as the Pile, which encompasses information from a range of sources, including Wikipedia, Stack Exchange, and PubMed. Stability AI says StableLM models are currently available in 3 billion and 7 billion parameter versions, with 15 billion to 65 billion parameter models arriving later.
While StableLM expands on the open-source language models that Stability AI has already worked on in collaboration with the nonprofit EleutherAI, it also builds on its mission to make AI tools more accessible, as it has done with Stable Diffusion. The company made its text-to-image AI available in several ways, including a public demo, a software beta, and a full download of the model, allowing developers to toy with the tool and come up with various integrations.
We might even see the same happen with StableLM, along with Meta’s open-source LLaMA language model that leaked online last month. As pointed out by my colleague James Vincent, the release of Stable Diffusion has led “to both more good stuff and more bad stuff happening,” and “we’ll likely see a similar dynamic play out once more with AI text generation: more stuff, more of the time.”
You can try out a demo of StableLM’s fine-tuned chat model hosted on Hugging Face, which gave me a very complex and somewhat nonsensical recipe when I tried asking it how to make a peanut butter sandwich. It also suggested that I add a “funny drawing” to a sympathy card. Stability AI warns that while the datasets it uses should help “steer the base language models into ‘safer’ distributions of text, not all biases and toxicity can be mitigated through fine-tuning.”