5

Interview with OpenAI's Greg Brockman: GPT-4 isn't perfect, but neither are you

 1 year ago
source link: https://finance.yahoo.com/news/interview-openais-greg-brockman-gpt-123004823.html
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Interview with OpenAI's Greg Brockman: GPT-4 isn't perfect, but neither are you

Kyle Wiggers
Wed, March 15, 2023, 9:30 PM GMT+9·7 min read
c9f2b5772f1440d1a584a297159b2350

OpenAI shipped GPT-4 yesterday, the much-anticipated text-generating AI model, and it's a curious piece of work.

GPT-4 improves upon its predecessor, GPT-3, in key ways, for example giving more factually true statements and allowing developers to prescribe its style and behavior more easily. It's also multimodal in the sense that it can understand images, allowing it to caption and even explain in detail the contents of a photo.

But GPT-4 has serious shortcomings. Like GPT-3, the model "hallucinates" facts and makes basic reasoning errors. In one example on OpenAI's own blog, GPT-4 describes Elvis Presley as the "son of an actor." (Neither of his parents were actors.)

To get a better handle on GPT-4's development cycle and its capabilities, as well as its limitations, TechCrunch spoke with Greg Brockman, one of the co-founders of OpenAI and its president, via a video call on Tuesday.

Asked to compare GPT-4 to GPT-3, Brockman had one word: Different.

"It's just different," he told TechCrunch. "There's still a lot of problems and mistakes that [the model] makes ... but you can really see the jump in skill in things like calculus or law, where it went from being really bad at certain domains to actually quite good relative to humans."

Test results support his case. On the AP Calculus BC exam, GPT-4 scores a 4 out of 5 while GPT-3 scores a 1. (GPT-3.5, the intermediate model between GPT-3 and GPT-4, also scores a 4.) And in a simulated bar exam, GPT-4 passes with a score around the top 10% of test takers; GPT-3.5’s score hovered around the bottom 10%.

Shifting gears, one of GPT-4's more intriguing aspects is the above-mentioned multimodality. Unlike GPT-3 and GPT-3.5, which could only accept text prompts (e.g. "Write an essay about giraffes"), GPT-4 can take a prompt of both images and text to perform some action (e.g. an image of giraffes in the Serengeti with the prompt "How many giraffes are shown here?").

Recommended Stories
  • 11fc83a93e178fabb2e7e4c22b2a181a.cf.webp
    TechCrunch

    OpenAI is testing a version of GPT-4 that can 'remember' long conversations

    OpenAI has built a version of GPT-4, its latest text-generating model, that can "remember" roughly 50 pages of content thanks to a greatly expanded context window. "The model is able to flexibly use long documents," Greg Brockman, OpenAI co-founder and president, said during a live demo this afternoon. Where it concerns text-generating AI, the context window refers to the text the model considers before generating additional text.

    1d ago
  • 292123247a4043c7c6bc6ef969ba9fc3.cf.webp
    Bloomberg

    Crypto Firms Move Cash to Asset Managers Including Fidelity as Banking Crisis Deepens

    (Bloomberg) -- A rising number of companies in the digital-asset sector are reaching out to asset managers such as Fidelity Investments to invest their cash in products like Treasuries in the aftermath of the recent collapse of several crypto friendly US banks. Most Read from Bloomberg‘Old-School’ Signature Bank Collapsed After Its Big Crypto LeapRussian Fighter Jet Collides With US Drone Over Black SeaUS Core CPI Tops Estimates, Pressuring Fed as It Weighs HikeCredit Suisse Finds ‘Material’ Con

    1d ago
  • 7b8bd23f33fedbeb819e626376d5f310.cf.webp
    TechCrunch

    Stripe now valued at $50B following $6.5B raise

    Digital payments company Stripe announced Wednesday that it raised over $6.5 billion in Series I funding to value the company at $50 billion. More recently, Stripe was publicly valued at $95 billion. New investors in the round include GIC, Goldman Sachs Asset and Wealth Management and Temasek.

    7h ago
  • b966190dcecd5c32e74e8b43242d8e39.cf.webp
    TechCrunch

    5 ways GPT-4 outsmarts ChatGPT

    OpenAI's new GPT-4 AI model has made its big debut and is already powering everything from a virtual volunteer for the visually impaired to an improved language learning bot in Duolingo. Although ChatGPT was originally described as being GPT-3.5 (and therefore a few iterations beyond GPT-3), it is not itself a version of OpenAI's large language model, but rather a chat-based interface for whatever model powers it.

    1d ago
  • a281a0a7baf7f0393630d2207bc7409c.cf.webp
    Fortune

    OpenAI releases a ‘still limited’ GPT-4

    This latest update still "hallucinates" and makes up facts.

    1d ago
  • b7cb555b988c80c0fe0d1a96be810dc1.cf.webp
    Fortune

    With GPT-4, OpenAI’s chief scientist says the company has ‘a recipe for producing magic’

    There's a lot the company isn't saying about the powerful new language A.I. model it just released

    9h ago
  • 7390dbfd7c5243a7b827d5e42e37a0fc.cf.webp
    Bloomberg

    BTS Mogul Hybe Eyes US Targets After Failed Takeover Bid for SM

    (Bloomberg) -- Hybe Co., the label behind global sensation BTS, is hunting new targets in the US after dropping a bid to acquire K-pop rival SM Entertainment Co. Most Read from BloombergCredit Suisse Reels After Top Shareholder Rules Out Raising StakeRyan Reynolds-Backed Mint Is Bought by T-Mobile for $1.35 BillionFirst Republic Bank Is Said to Weigh Options Including a SaleIn New York City, a $100,000 Salary Feels Like $36,000Traders Dash for Cover as Bank Drama Rattles Globe: Markets WrapHybe’

    20h ago
  • 898e4f7bb38bb238a0640ff6cbf2531a.cf.webp
    TechCrunch

    Google Cloud gives developers access to its foundation models

    Google Cloud today announced a slew of new AI-powered features for its productivity tools, but the company also today launched a set of new APIs and tools for developers that are just as interesting -- if not more so. In addition to making its large language models available to developers through an API, Google also today launched MakerSuite, a new browser-based tool that will make it easier for developers to build AI-powered applications on top of Google's foundation models. Google is also bringing support for generative AI to Vertex AI, its platform for building and deploying ML models, and launching its Generative AI App Builder, a new service that will help developers ship bots, chat interfaces, digital assistants and custom search engines.

    2d ago
  • cc49d0af6a26c35895a39569ddd1d6e0.cf.webp
    Fortune

    How CEOs are using ChatGPT across their businesses—and what they warn are the biggest risks so far

    The possibility that a chatbot could throw out an inaccurate, or worse, racist response isn’t worth the risk for some.

    18h ago
  • 40de6a10-c363-11ed-b5ef-918dbf3dd770.cf.webp
    Yahoo Finance

    US Copyright Office opens door to protecting AI-assisted works

    In a notice scheduled to publish in the Federal Register on Thursday, the department clarified its willingness to consider copyright protection for works containing AI-generated material.

    9h ago
  • f591e720acbe1a49e3dd00b764780ddd.cf.webp
    Investor's Business Daily

    Microsoft Improves Position In Artificial Intelligence With GPT-4

    The release of OpenAI's latest artificial intelligence software will bolster Microsoft's position as a leader in the AI market, a Wall Street analyst says.

    8h ago
  • 89e679c39886203e1dd26dc6c80d440f.cf.webp
    TheStreet.com

    Microsoft's Latest Change Seems Like a Bad Idea

    When the story of how artificial intelligence took over and conquered the human race is written (possibly by ChatGPT chatbots) this will be the chapter no one believes really happened. Just weeks before announcing that it will fuse the revolutionary AI tech from OpenAI with its search engine, Microsoft reportedly made deep cuts to the team focused on the ethics portion of the digital revolution. Among the 10,000 Microsoft employees that were laid off in January, the entire ethics and society team was also let go, according to a report from Platformer.

    15h ago
  • d07dc9b5ff1be131a105cfa85a4bcf40.cf.webp
    Benzinga

    Dish Liable For $469M Penalty For Infringing Patents, US Federal Court Jury Rules

    DISH Network Corp (NASDAQ: DISH) must pay $469 million for infringing two parental-control technology maker ClearPlay Inc patents related to filtering material from streaming video, under a U.S. federal court jury ruling last Friday. The jury found that ClearPlay's patents covered Dish's AutoHop feature for skipping commercials on its Hopper set-top boxes, Reuters reports citing court documents. Though jurors found that Dish's technology infringed ClearPlay's patent rights, they refused ClearPla

    2d ago
  • 6664a4edcf2bf3b7c92df909a3b82116.cf.webp
    Barrons.com

    It’s Microsoft vs. Google as the AI Battle Shifts to Your PC Desktop

    Google this week unveiled plans to add AI capability to Google Docs and Gmail. Microsoft is likely to do the same for Office.

    13h ago
  • fe38bf20b6859f0651fe2166233caa12.cf.webp
    South China Morning Post

    US-sanctioned Huawei denies breakthrough in chip packaging tech as speculation mounts on firm's efforts to overcome trade restrictions

    Chinese telecommunications giant Huawei Technologies Co has dismissed speculation about its development of an innovative semiconductor packaging technology, which would enable the US-sanctioned company to produce advanced chips for its smartphones and other devices, despite strict restrictions imposed by Washington. Shenzhen-based Huawei on Tuesday denied the rumours, which claimed that the new packaging tech was able to achieve 7-nanometre performance for chips, according to a report on Chinese

    19h ago
  • 4139f8761e1138ab5e594e85bdc93ceb.cf.webp
    Fortune

    GPT-4 debuts and Google beats Microsoft in race to add generative A.I. to consumer office tools

    Google is also giving business customers access to its most powerful language models

    1d ago
  • 04c9439f56cb11fb1e6d5b036b63afc9.cf.webp

    Inside the Lucrative–and Secretive–Business of iPhone Trade-Ins

    So you just traded in your old iPhone to get a deal on a new one. Where does that old phone go? Who makes money on it? WSJ’s Joanna Stern follows an iPhone through the refurbishment process to explain why the second-hand phone market is booming. Photo illustration: Kenny Wassus

    19h ago
  • 335ca7ecb1feca664aa22ecf8eaa68fd.cf.webp
    South China Morning Post

    Samsung to invest US$230 billion to build world's largest semiconductor manufacturing base in South Korea

    Samsung Electronics said on Wednesday it expects to invest 300 trillion won (US$230 billion) over the next 20 years as part of an ambitious South Korean national project to build the world's largest semiconductor manufacturing base near the capital, Seoul. The chip-making "mega cluster", which will be established in Gyeonggi province by 2042, will be anchored by five new semiconductor plants built by Samsung. It will aim to attract 150 other companies producing materials and components or design

    19h ago
  • cfc254dce3815a5dce3762ea1fd5d069.cf.webp
    Reuters

    Xiaomi's slow shift in India to premium smartphones helps Samsung steal its crown

    Xiaomi Corp is overhauling its India strategy after misjudging consumer tastes in mobile phones, a costly lapse that has allowed Samsung Electronics to pip the Chinese company to the top spot in the world's second biggest market for the devices. While Xiaomi remained focused on selling mobile phones under 10,000 rupees ($120), Indian consumers were willing to pay up for better looking models with richer features. Those moves have helped Samsung wrest leadership of India's competitive mobile phones market from Xiaomi, with data from Hong Kong-based Counterpoint Research showing it had a 20% market share for the last quarter of 2022 compared to the Chinese company's 18%.

    3h ago
  • bf781b87007a713019134a7814557345.cf.webp
    Barrons.com

    T-Mobile to Buy Ryan Reynolds-Backed Mint Mobile in $1.35 Billion Deal

    T-Mobile  said Wednesday it has agreed to acquire Mint Mobile—a prepaid wireless brand part owned by actor Ryan Reynolds—and other brands in a deal valued up to $1.35 billion. T-Mobile (ticker:TMUS) will pay up to that amount in a combination of 39% cash and 61% stock to purchase Ka’ena Corporation, the parent company of Mint, Ultra Mobile, and the wholesaler Plum. “The actual price to be paid by T-Mobile will be based upon Ka’ena’s performance during certain periods before and after the closing,” according to a news release.

    10h ago

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK