Build a LangChain or LlamaIndex RAG Pipeline
Build a proof-of-concept Retrieval-Augmented Generation pipeline over a document of your choice: ingest and chunk the document, embed it into a vector store, wire up a retriever, and have an LLM answer questions grounded in the retrieved context. Submit your script or notebook and we grade it against a rubric.
4 hrs
Est. time
5
Outcomes
6
Rubric criteria
65%
Pass score
What you'll learn
Skills you'll have real reps in after shipping this.
See how it works
The two-phase RAG pipeline
RAG is two pipelines that meet at the vector store. Toggle between index time (offline, once per document) and query time (online, every question), and click any stage to see its job.
Why chunking matters
How you split the document decides what the retriever can find: chunks too big blur topics together, too small and they lose the context an answer needs.
The scenario
Your company, InnovateTech Solutions, provides technical support and product information. The team currently answers customer questions by manually searching a growing library of internal documentation, and that lookup is getting slower as the library grows. Leadership wants to use Generative AI to automate answering questions over a fixed set of documents.
You have been asked to build a proof-of-concept RAG pipeline. It should let a support agent ask a natural-language question about the contents of one internal document (a product manual, an FAQ list, or a technical spec sheet) and get back an answer that is generated by an LLM and grounded in that document.
Your role
You are an AI Engineer responsible for the foundational components of this document Q&A system. Your goal is a functional RAG pipeline built with either LangChain or LlamaIndex, integrating an LLM and a retriever over a document you provide.
Start the task to unlock the full brief
You'll get the step-by-step requirements, setup commands, the 6-criterion grading rubric, tips, and the ability to submit your solution for instant AI grading.
Free to start · submit when you're ready
Learning resources
What you'll build in this RAG pipeline task
This is a build-and-submit task, not a guided lab. You build a working Retrieval-Augmented Generation (RAG) pipeline on your own machine or notebook, then upload it to be graded against a transparent rubric. RAG is the pattern behind almost every production document Q&A system: instead of hoping a model remembers your internal docs, you retrieve the relevant passages at query time and feed them to the model as context, so the answer is grounded in your source material rather than the model's training data.
You choose the document and the framework. Load a product manual, FAQ, or spec sheet, split it into chunks with a sensible strategy, embed those chunks into an in-memory vector store (FAISS, Chroma, or your framework's built-in store), wire up a retriever, and integrate an LLM that answers questions using the retrieved context. You then demonstrate the pipeline by answering at least three questions. The deliverable is a single Python script or Jupyter notebook, the same artifact you would commit to a real proof-of-concept repository.
Grading is rubric-based and explainable. Each criterion (framework setup, ingestion and chunking, vector store and embeddings, retriever, LLM integration, and the demonstration) carries a weight, and you get a per-criterion score with evidence pulled from your own submission plus specific feedback on what to improve. The pass threshold is 65 percent. You can resubmit. The point is not the number: it is the feedback and the portfolio-ready artifact you walk away with.