

[2311.16989] ChatGPT's One-year Anniversary: Are Open-Source Large Language Mode...
source link: https://arxiv.org/abs/2311.16989
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Computer Science > Computation and Language
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?
Upon its release in late 2022, ChatGPT has brought a seismic shift in the entire landscape of AI, both in research and commerce. Through instruction-tuning a large language model (LLM) with supervised fine-tuning and reinforcement learning from human feedback, it showed that a model could answer human questions and follow instructions on a broad panel of tasks. Following this success, interests in LLMs have intensified, with new LLMs flourishing at frequent interval across academia and industry, including many start-ups focused on LLMs. While closed-source LLMs (e.g., OpenAI's GPT, Anthropic's Claude) generally outperform their open-source counterparts, the progress on the latter has been rapid with claims of achieving parity or even better on certain tasks. This has crucial implications not only on research but also on business. In this work, on the first anniversary of ChatGPT, we provide an exhaustive overview of this success, surveying all tasks where an open-source LLM has claimed to be on par or better than ChatGPT.
Comments: | version v3 |
Subjects: | Computation and Language (cs.CL) |
Cite as: | arXiv:2311.16989 [cs.CL] |
(or arXiv:2311.16989v3 [cs.CL] for this version) | |
https://doi.org/10.48550/arXiv.2311.16989 |
Submission history
From: Hailin Chen [view email][v1] Tue, 28 Nov 2023 17:44:51 UTC (660 KB)
[v2] Wed, 29 Nov 2023 16:00:05 UTC (733 KB)
[v3] Tue, 5 Dec 2023 16:58:46 UTC (736 KB)
Recommend
-
7
Stability AI announces new open-source large language model / Stability AI, the same company behind the AI image generator Stable Diffusion, is now open-sourcing its language model, StableLM.By...
-
3
Introduction Ever since the launch of GPT (Generative Pre Trained) by Open AI, the world has been taken by storm by Generative AI. From that period on, many Generative Models have come into the picture. With each relea...
-
7
Damian Edelberg December 12, 2023 1 minute read...
-
9
Computer Science > Logic in Computer Science [Submitted on 24 Nov 2023] Refinement Proofs in Rust Using Ghost Locks...
-
7
Pensieve: 2311 2023-11-26 13:51 这个月有点懒, 不太想读书. 读完了两本, 一本是记录老北京历史的府门儿·宅门儿, 本来期望能读到更多普通老百...
-
25
Sap B1 Patch 2311 sluggish and slowing down system ...
-
11
SAP Business one 10 FP 2311 bypassing authentication while accessing DT...
-
7
Computer Science > Computation and Language [Submitted on 24 Nov 2023 (v1), last revised...
-
6
Computer Science > Data Structures and Algorithms [Submitted on 29 Nov 2023] Sparsifying generalized linear models...
-
6
[Submitted on 16 Nov 2023 (v1), last revised 3 Feb 2024 (this version, v3)] On the Pauli Spectrum of QAC0
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK