[2311.16989] ChatGPT's One-year Anniversary: Are Open-Source Large Language Mode...

1 year ago

source link: https://arxiv.org/abs/2311.16989
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Computer Science > Computation and Language

[Submitted on 28 Nov 2023 (v1), last revised 5 Dec 2023 (this version, v3)]

ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?

Download PDF

Upon its release in late 2022, ChatGPT has brought a seismic shift in the entire landscape of AI, both in research and commerce. Through instruction-tuning a large language model (LLM) with supervised fine-tuning and reinforcement learning from human feedback, it showed that a model could answer human questions and follow instructions on a broad panel of tasks. Following this success, interests in LLMs have intensified, with new LLMs flourishing at frequent interval across academia and industry, including many start-ups focused on LLMs. While closed-source LLMs (e.g., OpenAI's GPT, Anthropic's Claude) generally outperform their open-source counterparts, the progress on the latter has been rapid with claims of achieving parity or even better on certain tasks. This has crucial implications not only on research but also on business. In this work, on the first anniversary of ChatGPT, we provide an exhaustive overview of this success, surveying all tasks where an open-source LLM has claimed to be on par or better than ChatGPT.

Comments:	version v3
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2311.16989 [cs.CL]
	(or arXiv:2311.16989v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.16989

Submission history

From: Hailin Chen [view email]
[v1] Tue, 28 Nov 2023 17:44:51 UTC (660 KB)
[v2] Wed, 29 Nov 2023 16:00:05 UTC (733 KB)
[v3] Tue, 5 Dec 2023 16:58:46 UTC (736 KB)

Recommend

www.theverge.com 2 years ago
Cache

Stability AI announces new open-source large language model

Stability AI announces new open-source large language model / Stability AI, the same company behind the AI image generator Stable Diffusion, is now open-sourcing its language model, StableLM.By...

www.analyticsvidhya.com 1 year ago
Cache

Falcon AI: The New Open Source Large Language Model

Introduction Ever since the launch of GPT (Generative Pre Trained) by Open AI, the world has been taken by storm by Generative AI. From that period on, many Generative Models have come into the picture. With each relea...

blogs.sap.com 1 year ago
Cache

SAP Business Network for Supply Chain 2311 Release – My Top Two Features

Damian Edelberg December 12, 2023 1 minute read...

arxiv.org 1 year ago
Cache

[2311.14452] Refinement Proofs in Rust Using Ghost Locks

Computer Science > Logic in Computer Science [Submitted on 24 Nov 2023] Refinement Proofs in Rust Using Ghost Locks...

blog.xiaket.org 1 year ago
Cache

Pensieve: 2311

Pensieve: 2311 2023-11-26 13:51 这个月有点懒, 不太想读书. 读完了两本, 一本是记录老北京历史的府门儿·宅门儿, 本来期望能读到更多普通老百...

community.sap.com 1 year ago
Cache

Sap B1 Patch 2311 sluggish and slowing down system

Sap B1 Patch 2311 sluggish and slowing down system ...

community.sap.com 1 year ago
Cache

SAP Business one 10 FP 2311 bypassing authenticati... - SAP Community

SAP Business one 10 FP 2311 bypassing authentication while accessing DT...

arxiv.org 1 year ago
Cache

[2311.14648] Calibrated Language Models Must Hallucinate

Computer Science > Computation and Language [Submitted on 24 Nov 2023 (v1), last revised...

arxiv.org 1 year ago
Cache

[2311.18145] Sparsifying generalized linear models

Computer Science > Data Structures and Algorithms [Submitted on 29 Nov 2023] Sparsifying generalized linear models...

arxiv.org 1 year ago
Cache

[2311.09631] On the Pauli Spectrum of QAC0

[Submitted on 16 Nov 2023 (v1), last revised 3 Feb 2024 (this version, v3)] On the Pauli Spectrum of QAC0

[2311.16989] ChatGPT's One-year Anniversary: Are Open-Source Large Language Mode...

Computer Science > Computation and Language

ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?

Submission history

Recommend

About Joyk