The building blocks behind ChatGPT

Episode 2, Mar 07, 2023, 06:23 AM

Join Rafael and Marcia as they welcome Matt Kidd, Senior Data Scientist (NLP) from Deeper Insights, for a discussion on InstructGPT, the predecessor of ChatGPT. The discussion centres on the paper "Training language models to follow instructions with human feedback" (2022) authored by the OpenAI team.

Join Rafael and Marcia as they welcome Matt Kidd, Senior Data Scientist (NLP) from Deeper Insights, for a discussion on InstructGPT, the predecessor of ChatGPT. The main discussion revolves around the impact of using alignment techniques, namely Reinforcement Learning from Human Feedback (RLHF), on the usefulness and widespread use of Large Language Models (LLMs). Centres around the paper "Training language models to follow instructions with human feedback" (2022) authored by the OpenAI team. They cover topics like alignment with human intentions, RLHF and the finer areas of what makes this paper a seminal paper for the generative AI communities. If you are interested in reading the paper and following along please click this link: https://arxiv.org/pdf/2203.02155.pdf

For more information on all things artificial intelligence, generative AI, machine learning, and engineering for your business please visit www.deeperinsights.com or reach out to us at thepaperclub@deeperinsights.com.

The building blocks behind ChatGPT

Subscribe

Next

The Foundations of Stable Diffusion

Top episodes

Understanding Deep Learning with Simon Prince

Exploring LoRA: Fine-Tuning Large Language Models

Machine Learning Operations (MLOps)

Sorry, your browser isn't supported by Audioboom.

Page load failed