10

OpenAI opens GPT-3.5 Turbo up for custom tuning

 1 year ago
source link: https://www.theverge.com/2023/8/22/23842042/openai-gpt-3-5-turbo-fine-tuning-enterprise-business-custom-chatbot-ai-artificial-intelligence
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

OpenAI opens GPT-3.5 Turbo up for custom tuning

/

OpenAI says that by fine-tuning its chatbot to focus on specific tasks — like code completion or maintaining a consistent tone — businesses can make ChatGPT a more efficient tool.

By Wes Davis, a weekend editor who covers the latest in tech and entertainment. He has written news, reviews, and more as a tech journalist since 2020.

Aug 22, 2023, 10:25 PM UTC|

Share this story

ChatGPT logo in minty green and black colors.
Illustration: The Verge

OpenAI has announced that businesses can now fine-tune GPT-3.5 Turbo using their own data — OpenAI claims the resulting custom model can match or exceed the abilities of GPT-4 for certain tasks. Later this fall, the company says it will open up the arguably more advanced GPT-4 for the same purpose.

Fine-tuning lets businesses essentially hone ChatGPT to a more focused model that’s especially efficient for certain tasks. The supervised training would make a bot that’s unique to the client company so that it offers, say, reliable responses in a specific language or with more concise wording. Until now, business customers were limited to GPT-3 variants for this, like davinci-002 or babbage-002.

The model would come pre-trained, like GPT-4, up to September 2021 before being fed company data. OpenAI says that none of that data, nor any input or output, will be used to train models outside of the client company.

Other uses include ensuring the bot is trained to mimic brand voices so they’re more consistent — think ad copy or internal communications at least partially written by AI (not that we don’t already see plenty). Software companies could use it for routine code like API calls or to dependably format and complete snippets of code.

GPT-3.5 Turbo is a model family the company debuted earlier this year that it said was ideal for use cases that aren’t chat-specific. It can handle 4,000 tokens at a time, which OpenAI says is double what previously offered models could interpret. The company added that early testers have been able to make 90 percent shorter prompts after priming GPT-3.5 with fine-tuned instructions.

Pricing for GPT-3.5 is $0.0080 per 1,000 tokens for training, $0.0120 per 1,000 tokens for input usage, and $0.0120 per 1,000 tokens of the chatbot’s output.

Microsoft also offers refinable GPT-based models as part of its AI Builder and Power Virtual Agents services, intended to be connected to a company’s internal data to craft responses. Microsoft pitches them as a way to summarize information or generate content for email campaigns. Like OpenAI’s fine-tuning bots, Microsoft’s customizable AI bots can connect to company data to generate responses from a business’ knowledge base.

Featured Videos From The Verge

Taylor Swift vs. Ronald Reagan: The Ticketmaster story

Ticketmaster botched the sale of Taylor Swift’s The Eras Tour and so many others. It’s gotten so bad - and has angered so many Taylor Swift fans - that in 2023 Congress held a hearing on antitrust law. Since the 1980s a series of policy changes have helped the firm grow to dominate every single aspect of the live events business. And Ronald Reagan is to blame.


Recommend

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK