<?xml version="1.0"?>
<?xml-stylesheet type="text/css" href="https://script.spoken-tutorial.org/skins/common/feed.css?303"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>https://script.spoken-tutorial.org/index.php?action=history&amp;feed=atom&amp;title=Being-Creative-with-AI%2FC4%2FUnderstanding-the-RAG-pipeline%2FEnglish</id>
		<title>Being-Creative-with-AI/C4/Understanding-the-RAG-pipeline/English - Revision history</title>
		<link rel="self" type="application/atom+xml" href="https://script.spoken-tutorial.org/index.php?action=history&amp;feed=atom&amp;title=Being-Creative-with-AI%2FC4%2FUnderstanding-the-RAG-pipeline%2FEnglish"/>
		<link rel="alternate" type="text/html" href="https://script.spoken-tutorial.org/index.php?title=Being-Creative-with-AI/C4/Understanding-the-RAG-pipeline/English&amp;action=history"/>
		<updated>2026-05-21T21:10:03Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.23.17</generator>

	<entry>
		<id>https://script.spoken-tutorial.org/index.php?title=Being-Creative-with-AI/C4/Understanding-the-RAG-pipeline/English&amp;diff=57793&amp;oldid=prev</id>
		<title>Ketkinaina: Created page with &quot;'''Title of the Script: Understanding the RAG Pipeline.'''  '''Author: EdyPyramids Team.'''  '''Keywords: RAG, Retrieval Augmented Generation, Artificial Intelligence, Embeddi...&quot;</title>
		<link rel="alternate" type="text/html" href="https://script.spoken-tutorial.org/index.php?title=Being-Creative-with-AI/C4/Understanding-the-RAG-pipeline/English&amp;diff=57793&amp;oldid=prev"/>
				<updated>2026-05-19T20:58:31Z</updated>
		
		<summary type="html">&lt;p&gt;Created page with &amp;quot;&amp;#039;&amp;#039;&amp;#039;Title of the Script: Understanding the RAG Pipeline.&amp;#039;&amp;#039;&amp;#039;  &amp;#039;&amp;#039;&amp;#039;Author: EdyPyramids Team.&amp;#039;&amp;#039;&amp;#039;  &amp;#039;&amp;#039;&amp;#039;Keywords: RAG, Retrieval Augmented Generation, Artificial Intelligence, Embeddi...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;'''Title of the Script: Understanding the RAG Pipeline.'''&lt;br /&gt;
&lt;br /&gt;
'''Author: EdyPyramids Team.'''&lt;br /&gt;
&lt;br /&gt;
'''Keywords: RAG, Retrieval Augmented Generation, Artificial Intelligence, Embeddings, Vector Database, Vector Search, AI Chatbot, Document Chunking, AI Pipeline, Query Processing, Information Retrieval, Generative AI, EduPyramids, video tutorial.'''&lt;br /&gt;
&lt;br /&gt;
{|border=1&lt;br /&gt;
|- &lt;br /&gt;
|| '''Visual Cue'''&lt;br /&gt;
|| '''Narration'''&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 1'''&lt;br /&gt;
&lt;br /&gt;
'''Title Slide'''&lt;br /&gt;
|| Welcome to this '''Spoken''' '''Tutorial''' on '''Understanding the RAG Pipeline.'''&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 2'''&lt;br /&gt;
&lt;br /&gt;
'''Learning Objectives'''&lt;br /&gt;
|| In this tutorial, we will learn:&lt;br /&gt;
* The stages of a '''RAG pipeline.'''&lt;br /&gt;
* How a user query is processed.&lt;br /&gt;
* How retrieval improves the generated response.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 3'''&lt;br /&gt;
&lt;br /&gt;
'''Disclaimer Slide'''&lt;br /&gt;
&lt;br /&gt;
As '''AI''' tools constantly evolve, if you are unable to locate any icon or encounter difficulty at any step, you may use any conversational '''AI''' '''Chatbot''' for guidance.&lt;br /&gt;
|| As '''AI''' tools constantly evolve, if you are unable to locate any icon or encounter difficulty at any step, you may use any conversational '''AI''' '''Chatbot''' for guidance.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 4'''&lt;br /&gt;
&lt;br /&gt;
'''System Requirements'''&lt;br /&gt;
|| To record this tutorial, I am using: &lt;br /&gt;
* '''Ubuntu 24.04 LTS, '''&lt;br /&gt;
* '''Firefox '''version''' 148.0.2'''&lt;br /&gt;
&lt;br /&gt;
Learners will also need a working internet connection&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 5'''&lt;br /&gt;
&lt;br /&gt;
'''Prerequisites'''&lt;br /&gt;
&lt;br /&gt;
[https://edupyramids.org/ https://EduPyramids.org]&lt;br /&gt;
|| To follow this tutorial, &lt;br /&gt;
* Learners should know basic computer and internet usage. &lt;br /&gt;
* Basic '''AI''' knowledge is required.&lt;br /&gt;
&lt;br /&gt;
For the Prerequisite of this tutorial, visit the website shown on your screen&lt;br /&gt;
|- &lt;br /&gt;
|| &lt;br /&gt;
|| Let us get started.&lt;br /&gt;
|- &lt;br /&gt;
|| '''RAG pipeline diagram with Retrieval + Generation blocks'''&lt;br /&gt;
|| '''RAG''' is known as '''Retrieval Augmented Generation.'''&lt;br /&gt;
|- &lt;br /&gt;
|| '''Flowchart showing sequence of stages'''&lt;br /&gt;
|| It is a sequence of steps used to answer a question.&lt;br /&gt;
|- &lt;br /&gt;
|| '''AI system connected to external documents/database'''&lt;br /&gt;
|| It uses external data to generate accurate answers.&lt;br /&gt;
|- &lt;br /&gt;
|| '''AI brain with “Stored Knowledge” and “Retrieved Knowledge” labels'''&lt;br /&gt;
|| The system does not rely only on stored knowledge.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Search and retrieval animation from documents'''&lt;br /&gt;
|| It retrieves relevant information to generate answers.&lt;br /&gt;
|- &lt;br /&gt;
|| '''“Step-by-step process” illustration'''&lt;br /&gt;
|| Let us understand this step by step.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Grocery shopping chatbot interface'''&lt;br /&gt;
|| Consider a grocery app '''chatbot'''. &lt;br /&gt;
|- &lt;br /&gt;
|| '''User typing a question into chatbot'''&lt;br /&gt;
|| Ask this question: “Can I return vegetables after 2 days?”&lt;br /&gt;
|- &lt;br /&gt;
|| '''Human brain vs computer comparison graphic'''&lt;br /&gt;
|| Now the system cannot understand words directly like humans&lt;br /&gt;
|- &lt;br /&gt;
|| '''Flowchart showing processing stages'''&lt;br /&gt;
|| So it follows a series of steps to find the answer.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Text converting into vectors/numbers animation'''&lt;br /&gt;
|| First, the system converts the question into a machine-understandable form. &lt;br /&gt;
|- &lt;br /&gt;
|| '''Highlight the term “Embedding”'''&lt;br /&gt;
|| This process is called '''embedding'''.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Binary numbers or vectors beside words'''&lt;br /&gt;
|| Computers work with numbers, not words.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Words converting into numeric arrays'''&lt;br /&gt;
|| So every word is represented as a list of numbers.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Example vector representation of “vegetables”'''&lt;br /&gt;
|| For example,&lt;br /&gt;
&lt;br /&gt;
The word ‘vegetables’ may be converted into a list of numbers like:&lt;br /&gt;
&lt;br /&gt;
[0.21, 0.45, 0.78, …]&lt;br /&gt;
|- &lt;br /&gt;
|| '''Side-by-side vectors for vegetables and fruits'''&lt;br /&gt;
|| Similarly,&lt;br /&gt;
&lt;br /&gt;
The word ‘fruits’ will have a different set of numbers.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Highlight closeness between vectors'''&lt;br /&gt;
|| But the values will be close to ‘vegetables’ because both are related.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Vector comparison illustration'''&lt;br /&gt;
|| Now, instead of comparing words,&lt;br /&gt;
&lt;br /&gt;
The system compares these numbers.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Similar vectors connected visually'''&lt;br /&gt;
|| If two sentences have similar '''vectors''', their meanings are similar.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Question matched with stored documents'''&lt;br /&gt;
|| This lets the system compare the question with stored documents&lt;br /&gt;
|- &lt;br /&gt;
|| '''Similarity measurement graphic'''&lt;br /&gt;
|| It helps measure sentence similarity.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Company documents or policy PDF shown'''&lt;br /&gt;
|| Next, the system searches company documents such as the return policy. &lt;br /&gt;
|- &lt;br /&gt;
|| '''Large document splitting into blocks'''&lt;br /&gt;
|| These documents are often divided into smaller parts called '''chunks'''.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Three text boxes labeled Chunk 1, Chunk 2, Chunk 3'''&lt;br /&gt;
|| A return policy document may be split into '''chunks''' like this &lt;br /&gt;
|- &lt;br /&gt;
|| '''Three text boxes labeled Chunk 1, Chunk 2, Chunk 3'''&lt;br /&gt;
|| '''Chunk 1: '''Return rules for fruits and vegetables&lt;br /&gt;
&lt;br /&gt;
'''Chunk 2:''' Return rules for packaged items&lt;br /&gt;
&lt;br /&gt;
'''Chunk 3:''' Refund process and timelines&lt;br /&gt;
|- &lt;br /&gt;
|| '''Chunks converting into embeddings'''&lt;br /&gt;
|| Now, each '''chunk''' is converted into numbers using the same '''embedding''' process.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Vector database illustration with stored vectors'''&lt;br /&gt;
|| These number representations are stored in a '''database''' called a '''vector database'''.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Search icon over vector database'''&lt;br /&gt;
|| A '''vector database''' stores data in the form of numbers.&lt;br /&gt;
&lt;br /&gt;
It can quickly find similar meanings.&lt;br /&gt;
|- &lt;br /&gt;
|| '''User query converting into vectors'''&lt;br /&gt;
|| When you ask:&lt;br /&gt;
&lt;br /&gt;
‘Can I return vegetables after 2 days?’&lt;br /&gt;
&lt;br /&gt;
Your question is also converted into numbers.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Query vector compared with stored chunk vectors'''&lt;br /&gt;
|| Then, the system compares your question with '''chunks''' in the''' vector database'''.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Best matching chunk highlighted'''&lt;br /&gt;
|| It finds the most similar '''chunk'''.&lt;br /&gt;
&lt;br /&gt;
For example:&lt;br /&gt;
&lt;br /&gt;
‘Fresh vegetables can be returned within 24 hours only.’&lt;br /&gt;
|- &lt;br /&gt;
|| '''Retrieved chunk sent to AI model'''&lt;br /&gt;
|| This relevant '''chunk''' is then used to generate the final answer.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Multiple retrieved policy lines displayed'''&lt;br /&gt;
|| The system compares the question with '''chunks''' and retrieves relevant '''chunks'''.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Highlight retrieved policy statements'''&lt;br /&gt;
|| The system may retrieve statements such as:&lt;br /&gt;
* “Perishable items cannot be returned.” &lt;br /&gt;
* “Items can be returned within 24 hours.”&lt;br /&gt;
|- &lt;br /&gt;
|| '''Good retrieval vs bad retrieval comparison'''&lt;br /&gt;
|| Note that the quality of the final answer depends on the retrieved information.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Relevant chunk highlighted among many results'''&lt;br /&gt;
|| The system selects the most relevant information from the retrieved results.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Question and retrieved chunk combined visually'''&lt;br /&gt;
|| This information is combined with the original question.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Two blocks labeled Question and Policy Line'''&lt;br /&gt;
|| The system now has two things:&lt;br /&gt;
Your question and the relevant policy line.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Context box formed from question + retrieved text'''&lt;br /&gt;
|| Together, these form the '''context'''&lt;br /&gt;
&lt;br /&gt;
The '''AI''' uses the background information to generate an answer.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Policy statement highlighted'''&lt;br /&gt;
|| For example, from the return policy, it may find this line:&lt;br /&gt;
&lt;br /&gt;
‘Fresh vegetables can be returned within 24 hours only.’&lt;br /&gt;
|- &lt;br /&gt;
|| '''Original question shown beside retrieved statement'''&lt;br /&gt;
|| Now, this information is combined with your original question:&lt;br /&gt;
|- &lt;br /&gt;
|| '''Question displayed again prominently'''&lt;br /&gt;
|| ‘Can I return vegetables after 2 days?’&lt;br /&gt;
|- &lt;br /&gt;
|| '''Combined input entering AI model'''&lt;br /&gt;
|| So the system now has both, your question and the relevant policy information&lt;br /&gt;
|- &lt;br /&gt;
|| '''Label “Context” shown clearly'''&lt;br /&gt;
|| This combined input is called '''context'''.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Definition text animation'''&lt;br /&gt;
'''Context → AI → Response flowchart'''&lt;br /&gt;
|| In simple terms, '''context''' is the background information given to the system.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Final chatbot answer displayed on screen'''&lt;br /&gt;
|| Using this '''context''', the system can now generate a correct response, like this:&lt;br /&gt;
‘No, vegetables cannot be returned after 2 days.&lt;br /&gt;
&lt;br /&gt;
As the policy allows returns only within 24 hours.&lt;br /&gt;
|- &lt;br /&gt;
|| '''AI model reading retrieved chunk and question'''&lt;br /&gt;
|| The '''AI model''' reads the question and the retrieved information. &lt;br /&gt;
|- &lt;br /&gt;
|| '''Generated answer appearing'''&lt;br /&gt;
|| It then generates an answer based on this information.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Alternative generated answer example'''&lt;br /&gt;
|| For example: &lt;br /&gt;
&lt;br /&gt;
“Vegetables are perishable items and cannot be returned after delivery.”&lt;br /&gt;
|- &lt;br /&gt;
|| '''Incorrect guessing crossed out, retrieved answer ticked'''&lt;br /&gt;
|| The system reduces guess work and bases its answer on retrieved information.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Relevant answer highlighted'''&lt;br /&gt;
|| This leads to a more relevant response.&lt;br /&gt;
&lt;br /&gt;
Without retrieval, the answer may be generic or incorrect.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Three-step pipeline animation: Question → Retrieval → Answer'''&lt;br /&gt;
|| The process is as follows: &lt;br /&gt;
* A question is asked. &lt;br /&gt;
* Relevant information is retrieved. &lt;br /&gt;
* An answer is generated.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Wrong retrieval leading to incorrect answer illustration'''&lt;br /&gt;
|| Wrong or unrelated retrieval may lead to an incorrect answer.&lt;br /&gt;
|- &lt;br /&gt;
|| &lt;br /&gt;
|| With this we come to the end of this tutorial.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 6'''&lt;br /&gt;
&lt;br /&gt;
'''Summary'''&lt;br /&gt;
&lt;br /&gt;
'''In this tutorial, we learnt:'''&lt;br /&gt;
* '''The stages of a RAG pipeline.'''&lt;br /&gt;
* '''How a user query is processed.'''&lt;br /&gt;
* '''How retrieval improves the generated response.'''&lt;br /&gt;
|| In this tutorial, we learnt:&lt;br /&gt;
* The stages of a RAG pipeline.&lt;br /&gt;
* How a user query is processed.&lt;br /&gt;
* How retrieval improves the generated response.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 7'''&lt;br /&gt;
&lt;br /&gt;
'''Assignment '''&lt;br /&gt;
&lt;br /&gt;
'''Create a simple grocery return policy with at least three rules.'''&lt;br /&gt;
&lt;br /&gt;
'''Ask the question: “Can I return vegetables after 2 days?”'''&lt;br /&gt;
&lt;br /&gt;
'''Then ask the same question again using your policy as context.'''&lt;br /&gt;
&lt;br /&gt;
'''Compare both responses and observe how retrieval improves the final answer.'''&lt;br /&gt;
&lt;br /&gt;
'''Also identify:'''&lt;br /&gt;
* '''the user question,'''&lt;br /&gt;
* '''the retrieved information, and'''&lt;br /&gt;
* '''the final answer.'''&lt;br /&gt;
|| We encourage you to do this assignment.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 8'''&lt;br /&gt;
&lt;br /&gt;
'''Acknowledgement'''&lt;br /&gt;
&lt;br /&gt;
'''Domain Inputs: Bhavani Shankar R and Saisudha Sugavanam '''&lt;br /&gt;
&lt;br /&gt;
'''Script Writer: Ketki Naina'''&lt;br /&gt;
&lt;br /&gt;
'''Admin Reviewer: Arthi Varadarajan'''&lt;br /&gt;
&lt;br /&gt;
'''Quality Reviewer: Sakina Sidhwa'''&lt;br /&gt;
&lt;br /&gt;
'''Novice Reviewer: Misbah Samir'''&lt;br /&gt;
&lt;br /&gt;
'''AI Narration: Debosmita Mukherjee'''&lt;br /&gt;
&lt;br /&gt;
'''AI Graphics: Arvind Pillai'''&lt;br /&gt;
&lt;br /&gt;
'''Video Editor: Arvind Pillai'''&lt;br /&gt;
&lt;br /&gt;
'''Web Developer: Ankita Singhal'''&lt;br /&gt;
|| Thank you for joining.&lt;br /&gt;
|- &lt;br /&gt;
|| '''Slide 9'''&lt;br /&gt;
&lt;br /&gt;
'''Acknowledgement'''&lt;br /&gt;
&lt;br /&gt;
'''This Spoken Tutorial is brought to you by EduPyramids Educational Services Private Limited at SINE, IIT Bombay. '''&lt;br /&gt;
|| &lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Ketkinaina</name></author>	</entry>

	</feed>