What is GPT-3.5 and Why it Enabled ChatGPT?
Will 2023 be the year of Conversational A.I?
Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application.
text-davinci-003 was trained on a more recent dataset, containing data up to June 2021. This is what we normally refer to as GPT-3.5 and what the viral ChatGPT demo embodied for the public.
Open Source PaLM Architecture with RLHF
More recently in late December, 2022, it appears that the first open-source equivalent of ChatGPT has arrived:
It’s an implementation of RLHF (Reinforcement Learning with Human Feedback) on top of Google’s 540 billion parameter PaLM architecture. Check out the LinkedIn comments on this post.
Just weeks after the demo of ChatGPT launched there are many live examples of Chatbots that are similar.
There is also much healthy speculation on how GPT-4 may be like (Twitter thread), and how it may produce emergent A.I. and more emergent behaviors along the spectrum of for instance chain-of-thought and multi-model tasks.
On November 28th, OpenAI released a new addition to the GPT-3 model family:
davinci-003. This latest model builds on InstructGPT, using reinforcement learning with human feedback to better align language models with human instructions.
Due to the larger LLM of GPT-4, extended training period (GPT-3 was released in June, 2022 - going on 29 months) and with improved methods of RLHF, ChatGPT as a real product will produce some interesting competition for Google’s LaMDA even potentially impacting their future of dominating Search advertising and consumer search in general.
Keep reading with a 7-day free trial
Subscribe to AI Supremacy to keep reading this post and get 7 days of free access to the full post archives.