Are long context windows the end of RAG?

The Stack Overflow Podcast

The home team is joined by Michael Foree, Stack Overflow’s director of data science and data platform, and occasional cohost Cassidy Williams, CTO at Contenda, for a conversation about long context windows, retrieval-augmented generation, and how Databricks’ new open LLM could change the game for developers. Plus: How will FTX co-founder Sam Bankman-Fried’s sentence of 25 years in prison reverberate in the blockchain and crypto spaces?

DBRX, an open, general-purpose LLM created by Databricks, reportedly outperforms GPT-3.5 and is competitive with Gemini 1.0 Pro.

Recent research found that large, complex LLMs use a simple mechanism to retrieve stored knowledge in response to a user prompt. Understanding this mechanism can help researchers reveal what a model knows and potentially even correct false information it has stored.

FTX co-founder Sam Bankman-Fried, whose downfall began in late 2022, was sentenced last week to 25 years in prison for conspiracy and fraud.

Find Michael on LinkedIn.

Find Cassidy on her website.

Stack Overflow user Bucket received a well-deserved Lifeboat badge for rescuing "How to calculate decimal(x, y) max value in SQL Server" from an ocean of ignorance.

Chapters (please note that these timestamps may not be exact):

00:00 Introduction and White Paper Discussion

02:01 Long Context Windows and Retrieval Augmented Generation

05:56 Models' Ability to Recall Relevant Information

07:18 Models' Creativity and Thinking Outside the Box

09:41 Advantages and Limitations of Models' Knowledge

15:09 Databricks' Open Language Model

22:25 Sam Bankman-Fried’s Sentence and the Effects on Crypto/Blockchain

31:28 Closing Remarks and Lifeboat Badge
