GPT-3
2020 text-generating language model / From Wikipedia, the free encyclopedia
Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020.
Original author(s) | OpenAI[1] |
---|---|
Initial release | May 28, 2020 (publication); June 11, 2020 (OA API beta) |
Repository | |
Predecessor | GPT-2 |
Successor | GPT-3.5, GPT-4 |
Type | |
Website | openai |
Like its predecessor, GPT-2, it is a decoder-only[2] transformer, a deep neural network architecture that replaces recurrence- and convolution-based designs with a technique known as "attention".[3] This attention mechanism lets the model focus selectively on the segments of input text it predicts to be most relevant.[4] GPT-3 has 175 billion parameters stored at 16-bit precision; at 2 bytes per parameter, the weights require 350 GB of storage. It has a context window of 2048 tokens and has demonstrated strong "zero-shot" and "few-shot" learning abilities on many tasks.[2]
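The storage figure follows from simple arithmetic, which can be sketched as below. This is an illustrative calculation only: the helper name `weight_storage_gb` is hypothetical, and it assumes decimal gigabytes (1 GB = 10^9 bytes) and counts only the raw weights, not any file-format or runtime overhead.

```python
# Back-of-the-envelope estimate of the storage needed for model weights.
# Assumptions (not from the source): decimal gigabytes, raw weights only.

def weight_storage_gb(num_parameters: int, bits_per_parameter: int) -> float:
    """Return the storage needed for the raw weights, in decimal gigabytes."""
    bytes_per_parameter = bits_per_parameter / 8
    return num_parameters * bytes_per_parameter / 10**9


GPT3_PARAMETERS = 175 * 10**9  # 175 billion parameters, as stated above
GPT3_BITS = 16                 # 16-bit precision, i.e. 2 bytes per parameter

print(weight_storage_gb(GPT3_PARAMETERS, GPT3_BITS))  # 350.0
```

The same function shows how the footprint scales with precision: passing 32 bits instead of 16 doubles the estimate to 700 GB.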
On September 22, 2020, Microsoft announced that it had licensed GPT-3 exclusively. Others can still receive output through OpenAI's public API, but only Microsoft has access to the underlying model.[5]