Which AI Writes the Best Code or Generates the Most Realistic Image?

1 month ago

source link: https://www.nytimes.com/2024/04/15/technology/ai-models-measurement.html
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

00roose-aimeasure-articleLarge.jpg?quality=75&auto=webp&disable=upscale

Credit...Davide Comai

The SHifT

A.I. Has a Measurement Problem

Which A.I. system writes the best computer code or generates the most realistic image? Right now, there’s no easy way to answer those questions.

By Kevin Roose

Reporting from San Francisco

There’s a problem with leading artificial intelligence tools like ChatGPT, Gemini and Claude: We don’t really know how smart they are.

That’s because, unlike companies that make cars or drugs or baby formula, A.I. companies aren’t required to submit their products for testing before releasing them to the public. There’s no Good Housekeeping seal for A.I. chatbots, and few independent groups are putting these tools through their paces in a rigorous way.

Instead, we’re left to rely on the claims of A.I. companies, which often use vague, fuzzy phrases like “improved capabilities” to describe how their models differ from one version to the next. And while there are some standard tests given to A.I. models to assess how good they are at, say, math or logical reasoning, many experts have doubts about how reliable those tests really are.

Subscribe to The Times to read as many articles as you like.

Recommend

hackernoon.com 3 years ago
Cache

Vishal Chovatiya Writes Code When It Is Helpful To Others in The Future

Vishal Chovatiya Writes Code When It Is Helpful To Others in The FutureOctober 12th 2020 4

www.youtube.com 3 years ago
Cache

C# Source Generators - Write Code that Writes Code - YouTube

Write Code that Writes CodeC# Source Generators - Write Code that Writes Code - YouTube

itnext.io 3 years ago
Cache

Direct I/O writes: the best way to improve your credit score.

Direct I/O writes: the best way to improve your credit score.I have recently written about how major changes in storage...

dotnettips.wordpress.com 2 years ago
Cache

dotNetDave Says… No One Writes Perfect Code! – dotNetTips.com

dotNetDave Says… No One Writes Perfect Code! Lets face it, none of us write perfect code all the time! Especially since there are many ways to write the code to achieve its purpose. The best way to make sure the code...

www.theverge.com 2 years ago
Cache

Microsoft Edge now automatically generates image labels for screen readers

Microsoft Edge now automatically generates image labels for screen readers Ideal for blind or low vision web users By...

www.gizchina.com 1 year ago
Cache

GitHub Copilot AI Tool Writes 40% Of Code Instead Of You

Microsoft GitHub Copilot AI Tool Writes 40% Of Code Instead Of You...

www.vice.com 1 year ago
Cache

This Furry Porn AI Generates a Sexual ‘Hindquarters’ Image Every 40 Seconds

Why Does This Horrifying Woman Keep Appearing in AI-Generated Images?“Loab” is creepypasta for the AI generation, and nobody understands where she came from.by

arstechnica.com 1 year ago
Cache

Meta announces Make-A-Video, which generates video from text

Seeing video is not believing — Meta announces Make-A-Video, which generates video from text Using a text description or an existing image, Make-A-Video can render video on...

www.vox.com 1 year ago
Cache

The AI-generated image of the Pope shows how realistic fake images could take ov...

The AI-generated image of the Pope shows how realistic fake images could take over the internet

statmodeling.stat.columbia.edu 1 year ago
Cache

ChatGPT4 writes Stan code so I don’t have to.

ChatGPT4 writes Stan code so I don’t have to. Several months ago I (Phil Price) wrote a Stan model to do some time series forecasting. It took me almost a full day to get it running and debugged. Today...

Which AI Writes the Best Code or Generates the Most Realistic Image?

A.I. Has a Measurement Problem

Recommend

About Joyk