10

Researchers detail blind spots of large language models

 2 years ago
source link: https://venturebeat.com/2021/07/08/researchers-detail-blind-spots-of-large-language-models/
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Researchers detail blind spots of large language models

Image Credit: raindrop74 / Shutterstock
ADVERTISEMENT

Transform 2021

Live now: AI/ML Automation Technology Summit

July 12-16

Watch Now

Join AI & data leaders at Transform 2021 for the AI/ML Automation Technology Summit. Watch now!


Modern AI-powered language systems like OpenAI’s GPT-3 can generate impressively fluent and grammatical text. But they aren’t perfect. While these systems rarely make syntactic errors, they’re prone to breaking semantic and narrative rules or struggling with repetition. For example, they might change the subject of a conversation without a segue or answer a question with an illogical statement.

To measure the extent to which systems suffer from these shortcomings, researchers at the Allen Institute for AI developed Scarecrow, a framework that provides a way for developers to mark problems in AI-generated text. In an analysis spanning 13,000 annotations of 1,300 paragraphs from both AI systems and humans, they found that scaling up the size of models powering the systems helps mitigate some issues but others might require more involved fixes.

Video Player is loading.
Current Time 0:00
1586154785616_478x270_thumbnail.jpg
Transform 2021 - VentureBeat

Categorizing model errors

The researchers applied their framework to OpenAI’s GPT-2 and GPT-3, as well as Grover, a fake news generator and detector from the University of Washington. As the team explains in a paper, Scarecrow divides errors into 10 categories identified by combining expert analysis with crowdsourced annotation:

  • Grammar and usage: Missing words, extra words, and incorrect or out-of-order words.
  • Redundant: Repeated words or phrases, or ideas repeated using different words.
  • Off-prompt: A phrase or sentence unrelated — or contradictory — to a prompt given to a language generation system.
  • Self-contradiction: Text that contradicts another piece of text the system had previously written.
  • Incoherent: Text that doesn’t fit into the above categories but still doesn’t make sense.
  • Technical jargon: Jargon or specific words from an esoteric field.
  • Needs Google: A fact or figure that appears to be true but requires a Google search to confirm.
  • Bad math: Problems with basic math and converting fixed units and currencies.
  • Commonsense: Text that violates our basic understanding of the world.
  • Encyclopedic: Factually wrong text disproven by textbooks, Wikipedia entries, or encyclopedias.

According to the researchers, certain errors, like Encyclopedic, Commonsense, and Incoherent errors, decrease with models trained on data from particular domains, like news, as well as models containing higher numbers of parameters. (In machine learning, parameters are the parts of models learned from historical training data, and they generally correlate with linguistic sophistication.) But the researchers say parameter scaling benefits seemingly plateau for Off-Prompt, Bad Math, and Grammar and Usage errors.

“These three error categories see a model plateau in error reduction when scaling to GPT-3. Of these error types, humans still commit fewer Off-Prompt and Grammar and Usage errors, but Bad Math appears saturated for our [study],” the researchers wrote.

Self-Contradiction and Redundant errors exhibit more complex scaling behavior, increasing for medium- and large-scale models, depending on interactions with other error types and how the errors are counted. Sampling from a larger set of words makes the models more prone to changing topics but less likely to repeat themselves, and vice versa.

“We posit the reason is that GPT-2 generations [in particular] are so incoherent and off-prompt that there is little opportunity for relevant, comprehensible points to be made and then reversed,” the researchers noted in the paper. “We [also] observe GPT-3 will seem stuck on a particular topic, elaborating on and rephrasing similar ideas more times than a human writer would.”

The researchers aim to spur explorations of natural language generations at scale, in particular ways errors in language models might be automatically fixed. “This paper focuses on open-ended generation, but a natural extension of this method would be to [assess] constrained generation tasks, such as machine translation,” they wrote. “Especially if considering a novel task setting, new error types may [also] prove useful.”

VentureBeat

VentureBeat's mission is to be a digital town square for technical decision-makers to gain knowledge about transformative technology and transact.

Our site delivers essential information on data technologies and strategies to guide you as you lead your organizations. We invite you to become a member of our community, to access:

  • up-to-date information on the subjects of interest to you
  • our newsletters
  • gated thought-leader content and discounted access to our prized events, such as Transform 2021: Learn More
  • networking features, and more
Become a member
Sponsored

How gold-backed cryptocurrency is changing the modern investment landscape

VB StaffJuly 12, 2021 05:04 AM
jingming-pan-iYsrkq5qq0Q-unsplash.jpg?fit=930%2C454&strip=all
Image Credit: Jingming Pan, Unsplash

Transform 2021

Live now: AI/ML Automation Technology Summit

July 12-16

Watch Now

Presented by Asia Broadband


The proliferation of online markets and the introduction of new cryptocurrencies has launched a brand-new type of token. Companies are now digitizing the value proposition of gold, one of the oldest, most stable currencies in the world. The result is a liquid crypto asset backed by the benefits of physical gold. Blockchain technology enables financial leverage, liquidity, and monetized physical gold holdings.

In 2021, Bitcoin and Ethereum have been the market leaders, posting record highs for investors. But then came the crash in the wake of Elon Musk’s offhanded tweets, plus China promising increased regulation. This kind of volatility attracts recreational investors who see big risks as the natural corollary of even bigger gains. But more cautious investors have stayed on the sidelines, unwilling to take those chances.

Historically, gold has been a protective hedge against inflation. Experts like Timothy Ord, president and editor of The Ord Oracle predict that gold stocks could see 10X gains in the next 3 years. In addition, Wall Street gold estimates are bullish for 2021. Gold is expected to hit $2,300-$2,400 an ounce, Goldman and Citibank say, and the metal looks poised to continue that bullish trend, which started back in September 2018.

This innovation comes at a time when investors have been searching for safe havens. They’re facing massive amounts of liquidity in the economy, created by government stimuluses that were designed to shield the economy from the impact of the pandemic.

Tether offers a more stable crypto investment, protected from the bigger crypto market swings, but even fiat currency backed cryptos are vulnerable to potential inflation, and investors are wary how that can impact the currency’s buying power over time.

To hedge against these inflation fears, investors have turned to commodity-backed cryptocurrency projects, such as those leveraging safe-haven commodities like precious metals as their store of value. For cautious investors, gold-backed tokens combine the structure of gold investments with the scarcity, flexibility, and upside potential of the crypto world. And they’re set to bridge the gaps for investors who are uncomfortable launching into the world of cryptocurrency investing, those who dislike fiat currency, and those who are looking for more secure investments.

Asia Broadband, Inc. (OTC: AABB) is entering the fray with its AABB Gold token (AABBG). It’s a gold-backed currency, but the company is going a step further with its unique proposition: the gold they’re backing their token with is directly produced by their gold mines.

“Backing currency with gold offers investors a major competitive advantage over other junior mining companies,” says Chris Torres, CEO of Asia Broadband, Inc. (OTC: AABB). “The unique mine-to-token product can become a worldwide standard of exchange, secured and trusted with gold backing.”

The company is acquiring highly prospective gold projects in Latin America, and distributing production through an extensive global sales network. In January, Asia Broadband sold its main mining operations in the Guerrero gold belt in Mexico for $82 million. The sale produced $30 million dollars in gold bullion, which provides the backing for the company’s tokens. They’ll continue to strategically target gold mineral properties to back continuing token sales, Torres says – but if token demand exceeds the company’s supply of gold, bullion is available from third-party sources, and can be purchased using the proceeds from token sales.

The minimum price of the AABBG token is maintained at the current spot price of gold, which reduces purchaser’s investment risk, Torres adds. On the flip side, the token price will rise with the price of gold. Since the pandemic, the value of gold has continued to rise, Torres says.

“We believe the price of gold will increase over $2,000 to $3,000 an ounce over the next 24 months,” he explains. “And the rising price of gold is just the minimum company-supported price of our AABBG token, which adds security to investors.

Most importantly, the token price will continue to appreciate based on its market demand and the limited supply of tokens available for purchase and exchange. The company has released only 5.4 million tokens to this point, or the equivalent value of the gold bullion in the company’s treasury.

“In 2020, AABB saw $16.8 million gross profits, and as gold mining operations move forward, we expect revenues and gross profits to be strong in 2021 — surpassing our previous year’s achievement,” he says.

The company’s primary goal is to make its token a worldwide standard of exchange, Torres says. It’s working now to aggressively expand token circulation to the primary sales markets in both North America and Europe, and also globally.

The company has been developing their own propriety exchange, which will allow AABB Wallet users to complete quick two-way exchanges of their AABB Gold tokens for major cryptocurrencies such as Bitcoin, Ethereum, and Litecoin. The proprietary exchange will also add to transaction fee revenues and allow for the price appreciation of AABBG beyond the price of gold, influenced by market demand and the limited supply of tokens released into circulation.

The exchange completion timeline is on path for testing in mid-August with the live exchange launch expected in early September.

For more information on the AABBG token, visit www.aabbgoldtoken.com


Sponsored articles are content produced by a company that is either paying for the post or has a business relationship with VentureBeat, and they’re always clearly marked. Content produced by our editorial team is never influenced by advertisers or sponsors in any way. For more information, contact [email protected].


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK