RAG 组件 #

本文档中引用的文件

目录 #

简介
核心接口设计
RAGPipeline 架构
内置组件详解
三种 RAG 模式
实例分析
最佳实践
总结

简介 #

prebuilt 包提供了完整的 RAG（检索增强生成）解决方案，基于清晰的接口设计理念，支持从基础到高级的各种 RAG 应用场景。该系统采用模块化架构，允许开发者灵活组合不同的组件来构建适合自己需求的 RAG 系统。

核心接口设计 #

RAG 系统的核心由七个主要接口组成，每个接口都承担特定的功能职责：

classDiagram
class Document {
+string PageContent
+map[string]interface Metadata
}
class DocumentLoader {
<<interface>>
+Load(ctx Context) []Document
}
class TextSplitter {
<<interface>>
+SplitDocuments([]Document) []Document
}
class Embedder {
<<interface>>
+EmbedDocuments(ctx Context, []string) [][]float64
+EmbedQuery(ctx Context, string) []float64
}
class VectorStore {
<<interface>>
+AddDocuments(ctx Context, []Document, [][]float64) error
+SimilaritySearch(ctx Context, string, int) []Document
+SimilaritySearchWithScore(ctx Context, string, int) []DocumentWithScore
}
class Retriever {
<<interface>>
+GetRelevantDocuments(ctx Context, string) []Document
}
class Reranker {
<<interface>>
+Rerank(ctx Context, string, []Document) []DocumentWithScore
}
DocumentLoader --> Document : "loads"
TextSplitter --> Document : "splits"
Embedder --> VectorStore : "provides embeddings"
VectorStore --> Retriever : "stores documents"
Retriever --> Reranker : "retrieves documents"

图表来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L12-L55)

接口功能说明 #

接口	职责	主要方法
`Document`	文档表示	内容和元数据存储
`DocumentLoader`	文档加载	从各种源加载文档
`TextSplitter`	文本分割	将长文档分割为较小块
`Embedder`	嵌入生成	生成文本的向量表示
`VectorStore`	向量存储	存储和检索嵌入向量
`Retriever`	文档检索	根据查询检索相关文档
`Reranker`	重排序	对检索结果进行重新排序

节来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L12-L55)

RAGPipeline 架构 #

RAGPipeline 是整个 RAG 系统的核心控制器，它基于消息图（MessageGraph）构建，支持三种不同的执行模式：

graph TB
subgraph "RAGPipeline 架构"
Config[RAGConfig 配置]
Graph[MessageGraph 图]
State[RAGState 状态]
Config --> Graph
Graph --> State
subgraph "节点类型"
Retrieve[检索节点]
Rerank[重排序节点]
Generate[生成节点]
Fallback[回退搜索节点]
Citations[引用格式化节点]
end
Graph --> Retrieve
Graph --> Rerank
Graph --> Generate
Graph --> Fallback
Graph --> Citations
end

图表来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L108-L123)

状态管理 #

RAG 系统维护一个统一的状态结构，包含查询、文档、上下文和答案等信息：

classDiagram
class RAGState {
+string Query
+[]Document Documents
+[]Document RetrievedDocuments
+[]DocumentWithScore RankedDocuments
+string Context
+string Answer
+[]string Citations
+map[string]interface Metadata
}
class DocumentWithScore {
+Document Document
+float64 Score
}
RAGState --> DocumentWithScore : "contains"
RAGState --> Document : "manages"

图表来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L57-L67)

节来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L57-L67)

内置组件详解 #

SimpleTextSplitter #

简单文本分割器提供基本的文档分割功能，支持自定义块大小和重叠：

flowchart TD
Input[输入文档] --> CheckSize{文档长度 ≤ 块大小?}
CheckSize --> |是| Return[直接返回]
CheckSize --> |否| Split[按分隔符分割]
Split --> Overlap[处理重叠部分]
Overlap --> Output[输出分割块]
style Input fill:#e1f5fe
style Output fill:#e8f5e8

图表来源

[rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L54-L92)

InMemoryVectorStore #

内存向量存储提供高效的相似性搜索功能：

sequenceDiagram
participant Client as 客户端
participant Store as InMemoryVectorStore
participant Embedder as 嵌入器
Client->>Store : AddDocuments(documents, embeddings)
Store->>Store : 存储文档和嵌入
Client->>Store : SimilaritySearch(query, k)
Store->>Embedder : EmbedQuery(query)
Embedder-->>Store : query_embedding
Store->>Store : 计算余弦相似度
Store->>Store : 排序并返回前k个
Store-->>Client : 返回结果

图表来源

[rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L94-L204)

SimpleReranker #

简单重排序器基于关键词匹配进行文档评分：

flowchart TD
Query[查询] --> Tokenize[分词处理]
Docs[文档列表] --> Tokenize
Tokenize --> Count[统计关键词出现次数]
Count --> Normalize[归一化处理]
Normalize --> Score[计算得分]
Score --> Sort[排序]
Sort --> Results[返回结果]
style Query fill:#fff3e0
style Results fill:#e8f5e8

图表来源

[rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L206-L261)

VectorStoreRetriever #

向量存储检索器结合向量存储和检索功能：

classDiagram
class VectorStoreRetriever {
+VectorStore VectorStore
+int TopK
+GetRelevantDocuments(ctx Context, string) []Document
}
class VectorStore {
<<interface>>
+SimilaritySearch(ctx Context, string, int) []Document
}
VectorStoreRetriever --> VectorStore : "uses"

图表来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L374-L391)

节来源

[rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L10-L333)

三种 RAG 模式 #

基础 RAG 模式 #

基础 RAG 模式是最简单的实现，包含检索和生成两个阶段：

sequenceDiagram
participant User as 用户
participant Pipeline as RAGPipeline
participant Retriever as 检索器
participant LLM as 大语言模型
User->>Pipeline : 提交查询
Pipeline->>Retriever : 获取相关文档
Retriever-->>Pipeline : 返回文档列表
Pipeline->>LLM : 生成回答
LLM-->>Pipeline : 返回答案
Pipeline-->>User : 返回最终结果

图表来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L125-L146)

高级 RAG 模式 #

高级 RAG 模式增加了重排序和引用生成功能：

flowchart TD
Query[查询] --> Retrieve[文档检索]
Retrieve --> Rerank{是否启用重排序?}
Rerank --> |是| RerankProcess[重排序处理]
Rerank --> |否| Generate[直接生成]
RerankProcess --> Generate
Generate --> Citations{是否包含引用?}
Citations --> |是| Format[格式化引用]
Citations --> |否| Final[最终结果]
Format --> Final

图表来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L148-L191)

条件 RAG 模式 #

条件 RAG 模式根据相关性分数进行智能路由：

flowchart TD
Query[查询] --> Retrieve[文档检索]
Retrieve --> Rerank[重排序]
Rerank --> Check{相关性分数 ≥ 阈值?}
Check --> |是| Generate[生成回答]
Check --> |否| Fallback{是否启用回退?}
Fallback --> |是| WebSearch[网络搜索]
Fallback --> |否| Generate
WebSearch --> Generate
Generate --> Citations[格式化引用]
Citations --> Final[最终结果]

图表来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L193-L249)

节来源

[rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L125-L249)

实例分析 #

基础 RAG 示例 #

基础 RAG 示例展示了最简单的 RAG 实现流程：

sequenceDiagram
participant Main as 主程序
participant Embedder as 嵌入器
participant VectorStore as 向量存储
participant Retriever as 检索器
participant Pipeline as RAG管道
participant LLM as 大语言模型
Main->>Embedder : 创建嵌入器
Main->>VectorStore : 创建向量存储
Main->>Embedder : 生成文档嵌入
Embedder-->>Main : 返回嵌入向量
Main->>VectorStore : 添加文档
Main->>Retriever : 创建检索器
Main->>Pipeline : 配置并构建管道
Pipeline->>LLM : 设置大语言模型
Main->>Pipeline : 编译管道
Main->>Pipeline : 执行查询
Pipeline->>Retriever : 检索相关文档
Retriever-->>Pipeline : 返回文档
Pipeline->>LLM : 生成回答
LLM-->>Pipeline : 返回答案
Pipeline-->>Main : 返回最终结果

图表来源

[rag_basic/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_basic/main.go#L14-L155)

高级 RAG 示例 #

高级 RAG 示例包含了更复杂的文档处理和重排序功能：

flowchart TD
Docs[原始文档] --> Splitter[文档分割器]
Splitter --> Chunks[文档块]
Chunks --> Embedder[嵌入器]
Embedder --> Embeddings[嵌入向量]
Embeddings --> VectorStore[向量存储]
VectorStore --> Retriever[检索器]
Retriever --> Reranker[重排序器]
Reranker --> Pipeline[高级RAG管道]
Pipeline --> LLM[大语言模型]
LLM --> Answer[最终答案]
style Docs fill:#e3f2fd
style Answer fill:#e8f5e8

图表来源

[rag_advanced/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_advanced/main.go#L14-L223)

条件 RAG 示例 #

条件 RAG 示例演示了基于相关性阈值的智能路由：

flowchart TD
Query[用户查询] --> Classifier[查询分类器]
Classifier --> Retriever[文档检索器]
Retriever --> Reranker[重排序器]
Reranker --> Threshold{相关性阈值检查}
Threshold --> |高相关性| Generator[答案生成器]
Threshold --> |低相关性| Fallback[回退搜索]
Fallback --> Generator
Generator --> Formatter[格式化器]
Formatter --> Response[响应]
style Threshold fill:#fff3e0
style Response fill:#e8f5e8

图表来源

[rag_conditional/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_conditional/main.go#L14-L212)

节来源

[rag_basic/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_basic/main.go#L14-L155)
[rag_advanced/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_advanced/main.go#L14-L223)
[rag_conditional/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_conditional/main.go#L14-L212)

最佳实践 #

组件选择指南 #

场景	推荐组件	说明
快速原型	MockEmbedder + InMemoryVectorStore	开发测试友好
生产环境	LangChainEmbedder + ChromaVectorStore	性能和稳定性
多模态	VisionEmbedder + MultiModalVectorStore	支持图像和文本
海量数据	分布式向量存储 + 并行处理	可扩展性

性能优化建议 #

嵌入维度权衡：平衡质量和性能
缓存策略：缓存常用查询的嵌入
批处理：批量处理多个文档
索引优化：使用合适的相似性度量

错误处理模式 #

flowchart TD
Request[请求] --> Validate[验证输入]
Validate --> Success{验证成功?}
Success --> |否| Error[错误处理]
Success --> |是| Process[处理流程]
Process --> Retry{需要重试?}
Retry --> |是| Delay[延迟重试]
Retry --> |否| Success2[成功返回]
Delay --> Process
Error --> Log[记录日志]
Log --> Fail[失败响应]

总结 #

prebuilt 包的 RAG 组件提供了一个完整、灵活且可扩展的解决方案。通过清晰的接口设计和模块化架构，开发者可以轻松构建从基础到高级的各种 RAG 应用。系统支持多种执行模式，能够满足不同复杂度的需求，同时提供了丰富的内置组件和适配器，便于与现有系统集成。

关键优势：

模块化设计：清晰的接口分离，易于扩展和替换
多种模式：支持基础、高级和条件三种执行模式
丰富组件：提供多种内置实现和适配器
测试友好：完整的单元测试和集成测试覆盖

通过合理选择和配置这些组件，开发者可以快速构建高质量的 RAG 系统，为用户提供准确、相关性强的问答服务。