GraphRAG & Knowledge Graph - Nâng cấp hệ thống AI với đồ thị tri thức

Posted on: 4/13/2026 8:52:17 PM

Table of contents

1. Tại sao RAG truyền thống chưa đủ?
1. GraphRAG là gì?
2. Knowledge Graph - Nền tảng của GraphRAG
1. 2.1 Cấu trúc cơ bản
2. 2.2 Các loại Knowledge Graph
3. Kiến trúc GraphRAG: Từ lý thuyết đến thực tế
4. So sánh chi tiết: RAG truyền thống vs GraphRAG
1. Lưu ý quan trọng
5. Microsoft GraphRAG & LazyGraphRAG
1. 5.1 Microsoft GraphRAG (Open-source)
2. 5.2 LazyGraphRAG - Bước đột phá về chi phí
6. Triển khai GraphRAG với Neo4j + LLM
7. Production Patterns & Best Practices
8. Use Cases thực tế
9. Lộ trình phát triển GraphRAG
10. Bắt đầu với GraphRAG - Hướng dẫn thực chiến
11. Kết luận
1. Next steps cho builder

GraphRAG Knowledge Graph AI Architecture Neo4j LLM

1. Tại sao RAG truyền thống chưa đủ?

Nếu bạn đã đọc bài viết trước của mình về RAG (Retrieval-Augmented Generation), bạn biết rằng RAG giải quyết bài toán "hallucination" bằng cách bổ sung tri thức bên ngoài cho LLM. Tuy nhiên, RAG truyền thống (Naive RAG) có một điểm yếu chí mạng: nó xử lý mỗi chunk văn bản như một đơn vị độc lập, hoàn toàn mù mờ về mối quan hệ giữa các thực thể.

Hãy tưởng tượng bạn hỏi: "Nhân viên nào trong phòng R&D đã từng làm việc với khách hàng VinGroup và có kinh nghiệm về AI?" - RAG truyền thống sẽ tìm các chunk chứa từ khóa liên quan, nhưng không thể "nối các dấu chấm" (connect the dots) giữa nhân viên, phòng ban, dự án, và khách hàng. Đây chính là lúc GraphRAG tỏa sáng.

72-83%Độ bao phủ (comprehensiveness) so với RAG truyền thống

3.4xCải thiện độ chính xác trong enterprise

4.5%Tăng reasoning depth cho multi-hop questions

99%Giảm chi phí indexing với LazyGraphRAG

GraphRAG là gì?

GraphRAG là phương pháp kết hợp Knowledge Graph (đồ thị tri thức) với LLM, cho phép AI không chỉ truy xuất thông tin mà còn suy luận qua các mối quan hệ giữa entities. Thay vì tìm kiếm theo vector similarity, GraphRAG duyệt qua đồ thị để trả lời các câu hỏi phức tạp đòi hỏi multi-hop reasoning.

2. Knowledge Graph - Nền tảng của GraphRAG

2.1 Cấu trúc cơ bản

Knowledge Graph (KG) lưu trữ thông tin dưới dạng bộ ba (triple): (Subject, Predicate, Object) hay (Entity, Relationship, Entity). Ví dụ:

(Nguyễn Văn A, WORKS_AT, Phòng R&D)
(Phòng R&D, BELONGS_TO, Công ty XYZ)
(Dự án Alpha, USES_TECH, Neo4j)
(Nguyễn Văn A, LEADS, Dự án Alpha)

graph LR
    A["Nguyễn Văn A"] -->|WORKS_AT| B["Phòng R&D"]
    A -->|LEADS| C["Dự án Alpha"]
    A -->|HAS_SKILL| D["AI/ML"]
    B -->|BELONGS_TO| E["Công ty XYZ"]
    C -->|USES_TECH| F["Neo4j"]
    C -->|FOR_CLIENT| G["VinGroup"]
    C -->|USES_TECH| D
    H["Trần Thị B"] -->|WORKS_AT| B
    H -->|CONTRIBUTES_TO| C
    H -->|HAS_SKILL| I["NLP"]

Hình 1: Ví dụ Knowledge Graph đơn giản trong doanh nghiệp

Với đồ thị này, câu hỏi "Ai trong phòng R&D làm việc với VinGroup và có kinh nghiệm AI?" được trả lời bằng cách duyệt: Phòng R&D ← WORKS_AT ← Nguyễn Văn A → LEADS → Dự án Alpha → FOR_CLIENT → VinGroup, đồng thời kiểm tra Nguyễn Văn A → HAS_SKILL → AI/ML.

2.2 Các loại Knowledge Graph

Loại	Mô tả	Ví dụ	Use case
Domain KG	Đồ thị chuyên biệt cho một lĩnh vực	Medical KG, Legal KG	Healthcare AI, Legal Research
Enterprise KG	Đồ thị nội bộ doanh nghiệp	Employee-Project-Client graph	Internal search, decision support
Open KG	Đồ thị tri thức mở, công cộng	Wikidata, DBpedia, YAGO	General-purpose QA, enrichment
Temporal KG	Đồ thị có yếu tố thời gian	Event timeline graph	Trend analysis, forecasting

3. Kiến trúc GraphRAG: Từ lý thuyết đến thực tế

3.1 Pipeline tổng quan

GraphRAG pipeline gồm hai pha chính: Indexing (xây dựng đồ thị từ dữ liệu thô) và Querying (truy vấn đồ thị + LLM để trả lời).

flowchart TB
    subgraph Indexing["INDEXING PIPELINE"]
        A["Tài liệu thô"] --> B["Chunking"]
        B --> C["Entity Extraction (LLM)"]
        C --> D["Relation Extraction (LLM)"]
        D --> E["Entity Resolution"]
        E --> F["Community Detection"]
        F --> G["Community Summarization"]
        G --> H["Knowledge Graph + Summaries"]
    end

    subgraph Querying["QUERY PIPELINE"]
        I["User Query"] --> J{"Query Router"}
        J -->|"Local Search"| K["Subgraph Retrieval"]
        J -->|"Global Search"| L["Community Summaries"]
        K --> M["Context Assembly"]
        L --> M
        M --> N["LLM Generation"]
        N --> O["Answer + Sources"]
    end

    H --> K
    H --> L

Hình 2: GraphRAG Pipeline - Indexing và Querying

3.2 Indexing Pipeline chi tiết

Bước 1: Entity & Relation Extraction

Đây là bước quan trọng nhất - sử dụng LLM để trích xuất entities và relationships từ văn bản. Microsoft GraphRAG sử dụng prompt engineering tinh vi để LLM đóng vai "knowledge extractor".

// Ví dụ prompt cho Entity Extraction
const extractionPrompt = `
Bạn là chuyên gia trích xuất tri thức. Phân tích đoạn văn bản sau
và trích xuất TẤT CẢ entities và relationships.

Entities: Tên người, tổ chức, địa điểm, sản phẩm, công nghệ,
         sự kiện, khái niệm.

Relationships: Mối quan hệ giữa các entities.
              Format: (entity1, RELATIONSHIP_TYPE, entity2, description)

Văn bản: {text_chunk}

Trả về JSON:
{
  "entities": [
    {"name": "...", "type": "PERSON|ORG|TECH|...", "description": "..."}
  ],
  "relationships": [
    {"source": "...", "target": "...", "type": "...", "description": "..."}
  ]
}`;

Bước 2: Entity Resolution

Cùng một entity có thể xuất hiện dưới nhiều tên khác nhau: "Nguyễn Văn A", "anh A", "NVA", "Mr. Nguyen". Entity Resolution gộp chúng thành một node duy nhất, thường sử dụng combination of:

String similarity (Levenshtein, Jaro-Winkler)
Embedding similarity (cosine similarity giữa entity descriptions)
LLM-based resolution (hỏi LLM hai entity có phải một không)

Bước 3: Community Detection

Đây là điểm khác biệt cốt lõi của Microsoft GraphRAG so với các approach khác. Sau khi xây dựng đồ thị, thuật toán Leiden được sử dụng để phát hiện "cộng đồng" (community) - các nhóm node có liên kết chặt chẽ với nhau.

graph TB
    subgraph C1["Community 1: AI Team"]
        A1["Data Scientist A"]
        A2["ML Engineer B"]
        A3["Dự án LLM"]
        A1 --- A2
        A2 --- A3
        A1 --- A3
    end

    subgraph C2["Community 2: Infra Team"]
        B1["DevOps C"]
        B2["SRE D"]
        B3["K8s Cluster"]
        B1 --- B2
        B2 --- B3
        B1 --- B3
    end

    subgraph C3["Community 3: Client Projects"]
        D1["VinGroup"]
        D2["FPT"]
        D3["Dự án Enterprise"]
        D1 --- D3
        D2 --- D3
    end

    A3 -.->|"deploys_on"| B3
    A3 -.->|"serves"| D3

Hình 3: Community Detection - Phân nhóm entities có liên kết chặt

Mỗi community sau đó được LLM tóm tắt thành một community summary - bản mô tả ngắn gọn về nhóm đó. Đây là "chìa khóa" cho Global Search: thay vì duyệt toàn bộ đồ thị, query engine chỉ cần đọc các summaries.

3.3 Query Engine: Local vs Global Search

Tiêu chí	Local Search	Global Search
Câu hỏi phù hợp	Cụ thể, về entity xác định	Tổng quát, cần nhìn toàn cảnh
Ví dụ	"Dự án Alpha dùng công nghệ gì?"	"Xu hướng công nghệ chính của công ty là gì?"
Cơ chế	Tìm entity → duyệt subgraph xung quanh	Map-reduce trên community summaries
Độ trễ	Thấp (ms)	Cao hơn (seconds, do nhiều LLM calls)
Chi phí LLM	Thấp	Cao (map-reduce qua nhiều summaries)
Độ bao phủ	Hẹp, chính xác	Rộng, toàn diện

4. So sánh chi tiết: RAG truyền thống vs GraphRAG

flowchart LR
    subgraph Traditional["RAG Truyền thống"]
        Q1["Query"] --> E1["Embedding"]
        E1 --> V1["Vector Search"]
        V1 --> C1["Top-K Chunks"]
        C1 --> L1["LLM"]
        L1 --> A1["Answer"]
    end

    subgraph Graph["GraphRAG"]
        Q2["Query"] --> R2["Query Router"]
        R2 --> E2["Entity Recognition"]
        E2 --> G2["Graph Traversal"]
        G2 --> CS2["Community Summaries"]
        G2 --> SC2["Subgraph Context"]
        CS2 --> L2["LLM"]
        SC2 --> L2
        L2 --> A2["Answer + Reasoning Path"]
    end

Hình 4: RAG truyền thống vs GraphRAG - Luồng xử lý

Tiêu chí	RAG Truyền thống	GraphRAG
Đơn vị lưu trữ	Text chunks (vectors)	Entities + Relationships + Communities
Cơ chế tìm kiếm	Vector similarity (cosine, dot product)	Graph traversal + Community map-reduce
Multi-hop reasoning	Yếu - mỗi chunk độc lập	Mạnh - duyệt qua nhiều hop trên đồ thị
Global questions	Rất yếu - chỉ thấy top-K chunks	Mạnh - community summaries cho bird-eye view
Explainability	Trả về source chunks	Trả về reasoning path trên đồ thị
Chi phí indexing	Thấp (chỉ embedding)	Cao (nhiều LLM calls cho extraction)
Chi phí query	Thấp	Trung bình - cao (tùy local/global)
Cập nhật dữ liệu	Dễ - thêm/xóa chunks	Phức tạp - cần re-extract entities

Lưu ý quan trọng

GraphRAG không thay thế RAG truyền thống - nó bổ sung. Nhiều hệ thống production sử dụng Hybrid approach: Vector search cho câu hỏi đơn giản (fast path), Graph search cho câu hỏi phức tạp đòi hỏi reasoning (slow path). Query Router quyết định đường nào dựa trên phân tích câu hỏi.

5. Microsoft GraphRAG & LazyGraphRAG

5.1 Microsoft GraphRAG (Open-source)

Microsoft phát hành GraphRAG dưới dạng open-source vào giữa 2024, nhanh chóng trở thành chuẩn de facto cho graph-based RAG. Pipeline của họ:

Text → Chunks: Chia tài liệu thành chunks ~600 tokens
Chunks → Entities/Relations: LLM extract với multiple rounds (gleaning)
Graph Construction: Xây dựng đồ thị từ extracted data
Community Detection: Thuật toán Leiden phân nhóm
Community Summarization: LLM tóm tắt mỗi community
Query → Answer: Local/Global search pipeline

5.2 LazyGraphRAG - Bước đột phá về chi phí

Điểm yếu lớn nhất của GraphRAG gốc là chi phí indexing cực cao: mỗi chunk cần nhiều LLM calls để extract entities, detect communities, và tạo summaries. Với dataset lớn, chi phí có thể lên đến hàng nghìn USD.

LazyGraphRAG (Microsoft, 06/2025) giải quyết bằng cách trì hoãn community summarization đến query time:

Index Time (Rẻ)

Chỉ extract entities & relations, xây đồ thị, detect communities. KHÔNG tạo summaries. Giảm 99% chi phí indexing.

Query Time (On-demand)

Khi có query, chỉ summarize các communities liên quan đến query. Tăng nhẹ latency nhưng tiết kiệm rất nhiều chi phí.

Caching

Summaries được cache. Queries tương tự sau đó sẽ nhanh hơn. Phù hợp với pattern "few topics, many queries".

Khi nào dùng LazyGraphRAG?

LazyGraphRAG lý tưởng cho: (1) Dataset lớn với budget indexing hạn chế, (2) Dữ liệu thay đổi thường xuyên (re-index rẻ), (3) Query patterns tập trung vào một số chủ đề. Nếu bạn có dataset nhỏ hoặc cần latency thấp cho mọi query, GraphRAG gốc vẫn phù hợp hơn.

6. Triển khai GraphRAG với Neo4j + LLM

6.1 Tại sao Neo4j?

Neo4j là graph database phổ biến nhất thế giới, với hệ sinh thái phong phú cho AI/ML. Lý do chọn Neo4j cho GraphRAG:

Cypher query language: Ngôn ngữ truy vấn đồ thị mạnh mẽ, dễ đọc
APOC library: 400+ procedures cho data processing
GDS (Graph Data Science): Thuật toán community detection, centrality, similarity built-in
Vector Index: Native vector search từ Neo4j 5.x - kết hợp graph + vector trong một query
LangChain/LlamaIndex integration: Các framework AI phổ biến đều hỗ trợ Neo4j

6.2 Xây dựng Knowledge Graph

// Bước 1: Tạo constraints và indexes
CREATE CONSTRAINT entity_name IF NOT EXISTS
  FOR (e:Entity) REQUIRE e.name IS UNIQUE;

CREATE VECTOR INDEX entity_embedding IF NOT EXISTS
  FOR (e:Entity) ON e.embedding
  OPTIONS {indexConfig: {
    `vector.dimensions`: 1536,
    `vector.similarity_function`: 'cosine'
  }};

// Bước 2: Import entities từ LLM extraction
UNWIND $entities AS entity
MERGE (e:Entity {name: entity.name})
SET e.type = entity.type,
    e.description = entity.description,
    e.embedding = entity.embedding;

// Bước 3: Tạo relationships
UNWIND $relationships AS rel
MATCH (s:Entity {name: rel.source})
MATCH (t:Entity {name: rel.target})
CALL apoc.create.relationship(s, rel.type, {
  description: rel.description,
  weight: rel.weight
}, t) YIELD rel AS r
RETURN count(r);

6.3 Hybrid Query: Graph + Vector

Sức mạnh thực sự của Neo4j cho GraphRAG nằm ở khả năng kết hợp graph traversal và vector search trong cùng một query:

// Hybrid search: Vector similarity + Graph context
// Tìm entities liên quan → mở rộng context qua đồ thị

// Bước 1: Vector search tìm entities gần nhất
CALL db.index.vector.queryNodes(
  'entity_embedding', 5, $query_embedding
) YIELD node AS entity, score

// Bước 2: Mở rộng qua đồ thị (2-hop)
MATCH path = (entity)-[*1..2]-(related)
WHERE score > 0.7

// Bước 3: Thu thập context
WITH entity, related, score,
     relationships(path) AS rels
RETURN entity.name AS source,
       entity.description AS source_desc,
       [r IN rels | type(r)] AS rel_types,
       related.name AS target,
       related.description AS target_desc,
       score
ORDER BY score DESC
LIMIT 20;

6.4 Python Implementation

from neo4j import GraphDatabase
from openai import OpenAI
import json

class GraphRAGEngine:
    def __init__(self, neo4j_uri, neo4j_auth, llm_client):
        self.driver = GraphDatabase.driver(neo4j_uri, auth=neo4j_auth)
        self.llm = llm_client

    def extract_entities(self, text_chunk: str) -> dict:
        """Dùng LLM extract entities và relationships từ text"""
        response = self.llm.chat.completions.create(
            model="claude-sonnet-4-6",
            messages=[{
                "role": "user",
                "content": f"""Extract entities and relationships:
                Text: {text_chunk}
                Return JSON with 'entities' and 'relationships' arrays."""
            }]
        )
        return json.loads(response.choices[0].message.content)

    def local_search(self, query: str, top_k: int = 5) -> str:
        """Local search: vector → graph expansion → LLM"""
        query_embedding = self._get_embedding(query)

        with self.driver.session() as session:
            result = session.run("""
                CALL db.index.vector.queryNodes(
                    'entity_embedding', $k, $embedding
                ) YIELD node, score
                MATCH path = (node)-[*1..2]-(related)
                WHERE score > 0.7
                RETURN node.name AS entity,
                       node.description AS desc,
                       collect(DISTINCT related.name) AS related_entities,
                       score
                ORDER BY score DESC
            """, k=top_k, embedding=query_embedding)

            context = self._format_context(result.data())

        # Gửi context + query cho LLM
        answer = self.llm.chat.completions.create(
            model="claude-sonnet-4-6",
            messages=[
                {"role": "system", "content": f"Context:\n{context}"},
                {"role": "user", "content": query}
            ]
        )
        return answer.choices[0].message.content

7. Production Patterns & Best Practices

7.1 Kiến trúc Production

flowchart TB
    subgraph Ingestion["Data Ingestion Layer"]
        D1["Documents"] --> P1["Chunking Service"]
        D2["APIs"] --> P1
        D3["Databases"] --> P1
        P1 --> Q["Message Queue (Kafka)"]
    end

    subgraph Processing["Processing Layer"]
        Q --> E["Entity Extractor (LLM Workers)"]
        E --> R["Entity Resolver"]
        R --> G["Graph Builder"]
        G --> N["Neo4j Cluster"]
        E --> V["Embedding Service"]
        V --> N
    end

    subgraph Serving["Query Serving Layer"]
        U["User Query"] --> QR["Query Router"]
        QR --> LS["Local Search"]
        QR --> GS["Global Search"]
        QR --> HS["Hybrid Search"]
        LS --> N
        GS --> N
        HS --> N
        LS --> LLM["LLM API"]
        GS --> LLM
        HS --> LLM
        LLM --> U
    end

Hình 5: Kiến trúc GraphRAG Production-Ready

7.2 Các sai lầm phổ biến và cách tránh

Sai lầm #1: Extract quá nhiều entities

LLM có xu hướng extract mọi danh từ thành entity. Kết quả: đồ thị quá lớn, noisy, query chậm. Giải pháp: Định nghĩa rõ entity types cần extract, dùng few-shot examples trong prompt, filter entities có confidence thấp.

Sai lầm #2: Bỏ qua Entity Resolution

"OpenAI", "Open AI", "openai.com" → 3 nodes khác nhau cho cùng 1 entity. Đồ thị bị phân mảnh, multi-hop queries fail. Giải pháp: Luôn có bước Entity Resolution, dùng combination of string matching + embedding similarity + LLM verification.

Sai lầm #3: Không có Query Router

Dùng Global Search cho mọi query → tốn chi phí LLM, chậm. Dùng Local Search cho global questions → câu trả lời không đầy đủ. Giải pháp: Xây Query Router (có thể dùng LLM nhỏ hoặc classifier) để chọn search strategy phù hợp.

Best Practice: Incremental Updates

Không cần rebuild toàn bộ đồ thị khi có dữ liệu mới. Extract entities từ documents mới → merge vào đồ thị hiện có → chỉ re-detect communities cho affected subgraphs. Neo4j MERGE đảm bảo idempotency.

7.3 Monitoring & Observability

Những metrics quan trọng cần theo dõi trong production:

Metric	Mục tiêu	Cách đo
Graph coverage	> 85% documents có entities trong graph	docs_with_entities / total_docs
Entity density	5-15 entities per chunk (avg)	total_entities / total_chunks
Query latency (local)	< 2s P95	Tracing từ query đến response
Query latency (global)	< 10s P95	Tracing từ query đến response
Answer faithfulness	> 90%	LLM-as-judge hoặc human eval
Graph freshness	< 1h lag	newest_doc_date - newest_entity_date

8. Use Cases thực tế

8.1 Enterprise Knowledge Management

Một doanh nghiệp lớn với hàng nghìn tài liệu nội bộ (policies, SOPs, meeting notes, project docs). GraphRAG giúp:

Cross-document reasoning: "Chính sách nào ảnh hưởng đến dự án X?" - duyệt từ project → policies qua relationships
Expert finding: "Ai hiểu rõ nhất về module authentication?" - duyệt từ module → contributors → expertise
Impact analysis: "Nếu thay đổi API v2, ảnh hưởng đến những team nào?" - graph traversal từ API → dependent services → owner teams

8.2 Healthcare - Clinical Decision Support

Knowledge Graph y khoa kết nối: bệnh nhân → triệu chứng → chẩn đoán → thuốc → tương tác thuốc → nghiên cứu lâm sàng. GraphRAG cho phép bác sĩ hỏi: "Bệnh nhân dùng thuốc A và B, có triệu chứng X, những chẩn đoán nào cần xem xét và có tương tác thuốc nào cần lưu ý?"

8.3 Legal Research

Đồ thị pháp luật kết nối: luật → điều khoản → án lệ → phán quyết → bình luận chuyên gia. Luật sư có thể hỏi: "Những án lệ nào liên quan đến điều 15 Luật Doanh nghiệp trong bối cảnh M&A, và phán quyết gần nhất của Tòa án Nhân dân Tối cao?"

8.4 Source Code Intelligence

Đồ thị code: functions → classes → modules → dependencies → commits → authors. Developers hỏi: "Function X được gọi từ đâu, ai là người thay đổi gần nhất, và có test coverage không?" - GraphRAG trả lời bằng cách duyệt call graph + git history.

9. Lộ trình phát triển GraphRAG

2020 - 2023

Tiền GraphRAG: Knowledge Graphs được sử dụng chủ yếu cho structured QA (KGQA). Google Knowledge Graph, Wikidata là các KG lớn nhất. RAG truyền thống ra đời và phát triển mạnh.

Giữa 2024

Microsoft GraphRAG Open-source: Microsoft phát hành GraphRAG, kết hợp KG + LLM + Community Detection. Nhanh chóng trở thành chuẩn mực cho graph-based RAG.

Q4 2024 - Q1 2025

Adoption rộng rãi: LangChain, LlamaIndex tích hợp GraphRAG. Neo4j ra mắt native vector index. Nhiều startup xây dựng trên GraphRAG.

06/2025

LazyGraphRAG: Microsoft công bố LazyGraphRAG, giảm 99% chi phí indexing. LinearRAG được accepted tại ICLR'26. GraphRAG Benchmark ra mắt.

2026

Production-ready: GraphRAG trở thành standard cho enterprise AI. Hybrid RAG (Vector + Graph) là kiến trúc phổ biến. Agentic AI systems sử dụng GraphRAG làm memory layer.

2027+ (Dự đoán)

Autonomous KG: AI tự động xây dựng, cập nhật, và tối ưu Knowledge Graph. Healthcare và Legal bắt buộc dùng GraphRAG cho decision support. Real-time GraphRAG với streaming data.

10. Bắt đầu với GraphRAG - Hướng dẫn thực chiến

Bước 1: Đánh giá nhu cầu

Trả lời các câu hỏi sau trước khi quyết định dùng GraphRAG:

Dữ liệu của bạn có nhiều entities có mối quan hệ với nhau không? (Nếu không, RAG truyền thống đủ)
Users có hỏi câu hỏi cần multi-hop reasoning không? (VD: "A liên quan đến B như thế nào thông qua C?")
Bạn cần explainability - giải thích reasoning path? (Quan trọng trong healthcare, legal, finance)
Budget cho LLM calls có đủ cho indexing pipeline? (GraphRAG tốn token hơn RAG truyền thống đáng kể)

Bước 2: Chọn stack công nghệ

Đơn giản nhất

Microsoft GraphRAG (Python package) + Neo4j Community. Dùng cho PoC và dataset nhỏ-vừa.

Production-ready

LangChain/LlamaIndex + Neo4j Enterprise + Custom extraction pipeline. Cho hệ thống cần scale.

Managed

Neo4j AuraDB + Cloud LLM (Claude/GPT). Không cần quản lý infrastructure, nhanh nhất để go-live.

Bước 3: Iteration loop

Start small: 50-100 documents, extract entities, build graph, test queries
Evaluate: So sánh answer quality giữa RAG truyền thống vs GraphRAG trên test set
Tune extraction: Cải thiện prompts, thêm entity types, refine entity resolution
Scale: Tăng dần dataset, optimize Cypher queries, add caching
Hybrid: Kết hợp vector search + graph search cho best of both worlds

11. Kết luận

GraphRAG đại diện cho bước tiến lớn trong cách AI xử lý và suy luận trên tri thức. Bằng cách kết hợp sức mạnh của Knowledge Graph (lưu trữ mối quan hệ) với LLM (hiểu ngôn ngữ tự nhiên), GraphRAG mở ra khả năng trả lời những câu hỏi phức tạp mà RAG truyền thống không thể.

Tuy nhiên, GraphRAG không phải "silver bullet". Chi phí indexing cao, complexity vận hành lớn hơn, và không phải mọi use case đều cần graph-based reasoning. Nguyên tắc vàng: bắt đầu với RAG truyền thống, đánh giá gaps, và chỉ thêm GraphRAG khi thực sự cần multi-hop reasoning hoặc global summarization.

Với sự phát triển của LazyGraphRAG và các frameworks ngày càng mature, rào cản gia nhập đang giảm nhanh. 2026 là thời điểm tuyệt vời để bắt đầu thử nghiệm GraphRAG trong dự án của bạn.

Next steps cho builder

Cài đặt graphrag package: pip install graphrag
Setup Neo4j Community: docker run -p 7474:7474 -p 7687:7687 neo4j:5
Thử với 50 documents nội bộ của team bạn
So sánh kết quả với RAG pipeline hiện có
Đọc thêm bài viết về RAG và Agentic AI Architecture trên blog này

# GraphRAG & Knowledge Graph - Nâng cấp hệ thống AI với đồ thị tri thức

GraphRAG Knowledge Graph AI Architecture Neo4j LLM

## 1. Tại sao RAG truyền thống chưa đủ?

Nếu bạn đã đọc bài viết trước của mình về **RAG (Retrieval-Augmented Generation)**, bạn biết rằng RAG giải quyết bài toán "hallucination" bằng cách bổ sung tri thức bên ngoài cho LLM. Tuy nhiên, RAG truyền thống (Naive RAG) có một điểm yếu chí mạng: nó xử lý mỗi chunk văn bản như một đơn vị độc lập, **hoàn toàn mù mờ về mối quan hệ giữa các thực thể**.

Hãy tưởng tượng bạn hỏi: *"Nhân viên nào trong phòng R&D đã từng làm việc với khách hàng VinGroup và có kinh nghiệm về AI?"* - RAG truyền thống sẽ tìm các chunk chứa từ khóa liên quan, nhưng không thể "nối các dấu chấm" (connect the dots) giữa nhân viên, phòng ban, dự án, và khách hàng. Đây chính là lúc **GraphRAG** tỏa sáng.

72-83%Độ bao phủ (comprehensiveness) so với RAG truyền thống

3.4xCải thiện độ chính xác trong enterprise

4.5%Tăng reasoning depth cho multi-hop questions

99%Giảm chi phí indexing với LazyGraphRAG

#### GraphRAG là gì?

**GraphRAG** là phương pháp kết hợp **Knowledge Graph** (đồ thị tri thức) với LLM, cho phép AI không chỉ truy xuất thông tin mà còn **suy luận qua các mối quan hệ** giữa entities. Thay vì tìm kiếm theo vector similarity, GraphRAG duyệt qua đồ thị để trả lời các câu hỏi phức tạp đòi hỏi multi-hop reasoning.

## 2. Knowledge Graph - Nền tảng của GraphRAG

### 2.1 Cấu trúc cơ bản

Knowledge Graph (KG) lưu trữ thông tin dưới dạng **bộ ba (triple)**: `(Subject, Predicate, Object)` hay `(Entity, Relationship, Entity)`. Ví dụ:

- `(Nguyễn Văn A, WORKS_AT, Phòng R&D)`
- `(Phòng R&D, BELONGS_TO, Công ty XYZ)`
- `(Dự án Alpha, USES_TECH, Neo4j)`
- `(Nguyễn Văn A, LEADS, Dự án Alpha)`

Hình 1: Ví dụ Knowledge Graph đơn giản trong doanh nghiệp

Với đồ thị này, câu hỏi *"Ai trong phòng R&D làm việc với VinGroup và có kinh nghiệm AI?"* được trả lời bằng cách duyệt: `Phòng R&D ← WORKS_AT ← Nguyễn Văn A → LEADS → Dự án Alpha → FOR_CLIENT → VinGroup`, đồng thời kiểm tra `Nguyễn Văn A → HAS_SKILL → AI/ML`.

### 2.2 Các loại Knowledge Graph

| Loại | Mô tả | Ví dụ | Use case |
| --- | --- | --- | --- |
| **Domain KG** | Đồ thị chuyên biệt cho một lĩnh vực | Medical KG, Legal KG | Healthcare AI, Legal Research |
| **Enterprise KG** | Đồ thị nội bộ doanh nghiệp | Employee-Project-Client graph | Internal search, decision support |
| **Open KG** | Đồ thị tri thức mở, công cộng | Wikidata, DBpedia, YAGO | General-purpose QA, enrichment |
| **Temporal KG** | Đồ thị có yếu tố thời gian | Event timeline graph | Trend analysis, forecasting |

## 3. Kiến trúc GraphRAG: Từ lý thuyết đến thực tế

### 3.1 Pipeline tổng quan

GraphRAG pipeline gồm hai pha chính: **Indexing** (xây dựng đồ thị từ dữ liệu thô) và **Querying** (truy vấn đồ thị + LLM để trả lời).

```
flowchart TB
    subgraph Indexing["INDEXING PIPELINE"]
        A["Tài liệu thô"] --> B["Chunking"]
        B --> C["Entity Extraction (LLM)"]
        C --> D["Relation Extraction (LLM)"]
        D --> E["Entity Resolution"]
        E --> F["Community Detection"]
        F --> G["Community Summarization"]
        G --> H["Knowledge Graph + Summaries"]
    end

subgraph Querying["QUERY PIPELINE"]
        I["User Query"] --> J{"Query Router"}
        J -->|"Local Search"| K["Subgraph Retrieval"]
        J -->|"Global Search"| L["Community Summaries"]
        K --> M["Context Assembly"]
        L --> M
        M --> N["LLM Generation"]
        N --> O["Answer + Sources"]
    end

H --> K
    H --> L
  
```

Hình 2: GraphRAG Pipeline - Indexing và Querying

### 3.2 Indexing Pipeline chi tiết

#### Bước 1: Entity & Relation Extraction

// Ví dụ prompt cho Entity Extraction const extractionPrompt = `
Bạn là chuyên gia trích xuất tri thức. Phân tích đoạn văn bản sau
và trích xuất TẤT CẢ entities và relationships.

Entities: Tên người, tổ chức, địa điểm, sản phẩm, công nghệ,
 sự kiện, khái niệm.

Relationships: Mối quan hệ giữa các entities.
 Format: (entity1, RELATIONSHIP_TYPE, entity2, description)

Văn bản: {text_chunk}

Trả về JSON:
{ "entities": [
 {"name": "...", "type": "PERSON|ORG|TECH|...", "description": "..."}
 ], "relationships": [
 {"source": "...", "target": "...", "type": "...", "description": "..."}
 ]
}`;

#### Bước 2: Entity Resolution

- **String similarity** (Levenshtein, Jaro-Winkler)
- **Embedding similarity** (cosine similarity giữa entity descriptions)
- **LLM-based resolution** (hỏi LLM hai entity có phải một không)

#### Bước 3: Community Detection

Đây là điểm khác biệt cốt lõi của Microsoft GraphRAG so với các approach khác. Sau khi xây dựng đồ thị, thuật toán **Leiden** được sử dụng để phát hiện "cộng đồng" (community) - các nhóm node có liên kết chặt chẽ với nhau.

```
graph TB
    subgraph C1["Community 1: AI Team"]
        A1["Data Scientist A"]
        A2["ML Engineer B"]
        A3["Dự án LLM"]
        A1 --- A2
        A2 --- A3
        A1 --- A3
    end

subgraph C2["Community 2: Infra Team"]
        B1["DevOps C"]
        B2["SRE D"]
        B3["K8s Cluster"]
        B1 --- B2
        B2 --- B3
        B1 --- B3
    end

subgraph C3["Community 3: Client Projects"]
        D1["VinGroup"]
        D2["FPT"]
        D3["Dự án Enterprise"]
        D1 --- D3
        D2 --- D3
    end

A3 -.->|"deploys_on"| B3
    A3 -.->|"serves"| D3
  
```

Hình 3: Community Detection - Phân nhóm entities có liên kết chặt

Mỗi community sau đó được LLM tóm tắt thành một **community summary** - bản mô tả ngắn gọn về nhóm đó. Đây là "chìa khóa" cho Global Search: thay vì duyệt toàn bộ đồ thị, query engine chỉ cần đọc các summaries.

### 3.3 Query Engine: Local vs Global Search

| Tiêu chí | Local Search | Global Search |
| --- | --- | --- |
| **Câu hỏi phù hợp** | Cụ thể, về entity xác định | Tổng quát, cần nhìn toàn cảnh |
| **Ví dụ** | "Dự án Alpha dùng công nghệ gì?" | "Xu hướng công nghệ chính của công ty là gì?" |
| **Cơ chế** | Tìm entity → duyệt subgraph xung quanh | Map-reduce trên community summaries |
| **Độ trễ** | Thấp (ms) | Cao hơn (seconds, do nhiều LLM calls) |
| **Chi phí LLM** | Thấp | Cao (map-reduce qua nhiều summaries) |
| **Độ bao phủ** | Hẹp, chính xác | Rộng, toàn diện |

## 4. So sánh chi tiết: RAG truyền thống vs GraphRAG

```
flowchart LR
    subgraph Traditional["RAG Truyền thống"]
        Q1["Query"] --> E1["Embedding"]
        E1 --> V1["Vector Search"]
        V1 --> C1["Top-K Chunks"]
        C1 --> L1["LLM"]
        L1 --> A1["Answer"]
    end

subgraph Graph["GraphRAG"]
        Q2["Query"] --> R2["Query Router"]
        R2 --> E2["Entity Recognition"]
        E2 --> G2["Graph Traversal"]
        G2 --> CS2["Community Summaries"]
        G2 --> SC2["Subgraph Context"]
        CS2 --> L2["LLM"]
        SC2 --> L2
        L2 --> A2["Answer + Reasoning Path"]
    end
  
```

Hình 4: RAG truyền thống vs GraphRAG - Luồng xử lý

| Tiêu chí | RAG Truyền thống | GraphRAG |
| --- | --- | --- |
| **Đơn vị lưu trữ** | Text chunks (vectors) | Entities + Relationships + Communities |
| **Cơ chế tìm kiếm** | Vector similarity (cosine, dot product) | Graph traversal + Community map-reduce |
| **Multi-hop reasoning** | Yếu - mỗi chunk độc lập | Mạnh - duyệt qua nhiều hop trên đồ thị |
| **Global questions** | Rất yếu - chỉ thấy top-K chunks | Mạnh - community summaries cho bird-eye view |
| **Explainability** | Trả về source chunks | Trả về reasoning path trên đồ thị |
| **Chi phí indexing** | Thấp (chỉ embedding) | Cao (nhiều LLM calls cho extraction) |
| **Chi phí query** | Thấp | Trung bình - cao (tùy local/global) |
| **Cập nhật dữ liệu** | Dễ - thêm/xóa chunks | Phức tạp - cần re-extract entities |

#### Lưu ý quan trọng

GraphRAG **không thay thế** RAG truyền thống - nó bổ sung. Nhiều hệ thống production sử dụng **Hybrid approach**: Vector search cho câu hỏi đơn giản (fast path), Graph search cho câu hỏi phức tạp đòi hỏi reasoning (slow path). Query Router quyết định đường nào dựa trên phân tích câu hỏi.

## 5. Microsoft GraphRAG & LazyGraphRAG

### 5.1 Microsoft GraphRAG (Open-source)

Microsoft phát hành **GraphRAG** dưới dạng open-source vào giữa 2024, nhanh chóng trở thành chuẩn de facto cho graph-based RAG. Pipeline của họ:

1. **Text → Chunks**: Chia tài liệu thành chunks ~600 tokens
2. **Chunks → Entities/Relations**: LLM extract với multiple rounds (gleaning)
3. **Graph Construction**: Xây dựng đồ thị từ extracted data
4. **Community Detection**: Thuật toán Leiden phân nhóm
5. **Community Summarization**: LLM tóm tắt mỗi community
6. **Query → Answer**: Local/Global search pipeline

### 5.2 LazyGraphRAG - Bước đột phá về chi phí

Điểm yếu lớn nhất của GraphRAG gốc là **chi phí indexing cực cao**: mỗi chunk cần nhiều LLM calls để extract entities, detect communities, và tạo summaries. Với dataset lớn, chi phí có thể lên đến hàng nghìn USD.

**LazyGraphRAG** (Microsoft, 06/2025) giải quyết bằng cách **trì hoãn community summarization đến query time**:

#### Index Time (Rẻ)

Chỉ extract entities & relations, xây đồ thị, detect communities. **KHÔNG** tạo summaries. Giảm 99% chi phí indexing.

#### Query Time (On-demand)

Khi có query, chỉ summarize các communities **liên quan** đến query. Tăng nhẹ latency nhưng tiết kiệm rất nhiều chi phí.

#### Caching

Summaries được cache. Queries tương tự sau đó sẽ nhanh hơn. Phù hợp với pattern "few topics, many queries".

#### Khi nào dùng LazyGraphRAG?

## 6. Triển khai GraphRAG với Neo4j + LLM

### 6.1 Tại sao Neo4j?

**Neo4j** là graph database phổ biến nhất thế giới, với hệ sinh thái phong phú cho AI/ML. Lý do chọn Neo4j cho GraphRAG:

- **Cypher query language**: Ngôn ngữ truy vấn đồ thị mạnh mẽ, dễ đọc
- **APOC library**: 400+ procedures cho data processing
- **GDS (Graph Data Science)**: Thuật toán community detection, centrality, similarity built-in
- **Vector Index**: Native vector search từ Neo4j 5.x - kết hợp graph + vector trong một query
- **LangChain/LlamaIndex integration**: Các framework AI phổ biến đều hỗ trợ Neo4j

### 6.2 Xây dựng Knowledge Graph

// Bước 1: Tạo constraints và indexes CREATE CONSTRAINT entity_name IF NOT EXISTS FOR (e:Entity) REQUIRE e.name IS UNIQUE;

CREATE VECTOR INDEX entity_embedding IF NOT EXISTS FOR (e:Entity) ON e.embedding OPTIONS {indexConfig: {
 `vector.dimensions`: 1536,
 `vector.similarity_function`: 'cosine' }};

// Bước 2: Import entities từ LLM extraction UNWIND $entities AS entityMERGE (e:Entity {name: entity.name})SET e.type = entity.type,
 e.description = entity.description,
 e.embedding = entity.embedding;

// Bước 3: Tạo relationships UNWIND $relationships AS relMATCH (s:Entity {name: rel.source})MATCH (t:Entity {name: rel.target})CALL apoc.create.relationship(s, rel.type, {
 description: rel.description,
 weight: rel.weight
}, t) YIELD rel AS rRETURN count(r);

### 6.3 Hybrid Query: Graph + Vector

Sức mạnh thực sự của Neo4j cho GraphRAG nằm ở khả năng kết hợp **graph traversal** và **vector search** trong cùng một query:

// Hybrid search: Vector similarity + Graph context // Tìm entities liên quan → mở rộng context qua đồ thị // Bước 1: Vector search tìm entities gần nhất CALL db.index.vector.queryNodes( 'entity_embedding', 5, $query_embedding
) YIELD node AS entity, score

// Bước 2: Mở rộng qua đồ thị (2-hop) MATCH path = (entity)-[*1..2]-(related)WHERE score > 0.7

// Bước 3: Thu thập context WITH entity, related, score,
 relationships(path) AS relsRETURN entity.name AS source,
 entity.description AS source_desc,
 [r IN rels | type(r)] AS rel_types,
 related.name AS target,
 related.description AS target_desc,
 scoreORDER BY score DESC LIMIT 20;

### 6.4 Python Implementation

from neo4j import GraphDatabasefrom openai import OpenAIimport json

class GraphRAGEngine: def __init__(self, neo4j_uri, neo4j_auth, llm_client):
 self.driver = GraphDatabase.driver(neo4j_uri, auth=neo4j_auth)
 self.llm = llm_client def extract_entities(self, text_chunk: str) -> dict: """Dùng LLM extract entities và relationships từ text""" response = self.llm.chat.completions.create(
 model="claude-sonnet-4-6",
 messages=[{ "role": "user", "content": f"""Extract entities and relationships:
 Text: {text_chunk}
 Return JSON with 'entities' and 'relationships' arrays.""" }]
 ) return json.loads(response.choices[0].message.content) def local_search(self, query: str, top_k: int = 5) -> str: """Local search: vector → graph expansion → LLM""" query_embedding = self._get_embedding(query) with self.driver.session() as session:
 result = session.run("""
 CALL db.index.vector.queryNodes(
 'entity_embedding', $k, $embedding
 ) YIELD node, score
 MATCH path = (node)-[*1..2]-(related)
 WHERE score > 0.7
 RETURN node.name AS entity,
 node.description AS desc,
 collect(DISTINCT related.name) AS related_entities,
 score
 ORDER BY score DESC
 """, k=top_k, embedding=query_embedding)

context = self._format_context(result.data()) # Gửi context + query cho LLM answer = self.llm.chat.completions.create(
 model="claude-sonnet-4-6",
 messages=[
 {"role": "system", "content": f"Context:\n{context}"},
 {"role": "user", "content": query}
 ]
 ) return answer.choices[0].message.content

## 7. Production Patterns & Best Practices

### 7.1 Kiến trúc Production

```
flowchart TB
    subgraph Ingestion["Data Ingestion Layer"]
        D1["Documents"] --> P1["Chunking Service"]
        D2["APIs"] --> P1
        D3["Databases"] --> P1
        P1 --> Q["Message Queue (Kafka)"]
    end

subgraph Processing["Processing Layer"]
        Q --> E["Entity Extractor (LLM Workers)"]
        E --> R["Entity Resolver"]
        R --> G["Graph Builder"]
        G --> N["Neo4j Cluster"]
        E --> V["Embedding Service"]
        V --> N
    end

subgraph Serving["Query Serving Layer"]
        U["User Query"] --> QR["Query Router"]
        QR --> LS["Local Search"]
        QR --> GS["Global Search"]
        QR --> HS["Hybrid Search"]
        LS --> N
        GS --> N
        HS --> N
        LS --> LLM["LLM API"]
        GS --> LLM
        HS --> LLM
        LLM --> U
    end
  
```

Hình 5: Kiến trúc GraphRAG Production-Ready

### 7.2 Các sai lầm phổ biến và cách tránh

#### Sai lầm #1: Extract quá nhiều entities

LLM có xu hướng extract mọi danh từ thành entity. Kết quả: đồ thị quá lớn, noisy, query chậm. **Giải pháp**: Định nghĩa rõ entity types cần extract, dùng few-shot examples trong prompt, filter entities có confidence thấp.

#### Sai lầm #2: Bỏ qua Entity Resolution

"OpenAI", "Open AI", "openai.com" → 3 nodes khác nhau cho cùng 1 entity. Đồ thị bị phân mảnh, multi-hop queries fail. **Giải pháp**: Luôn có bước Entity Resolution, dùng combination of string matching + embedding similarity + LLM verification.

#### Sai lầm #3: Không có Query Router

Dùng Global Search cho mọi query → tốn chi phí LLM, chậm. Dùng Local Search cho global questions → câu trả lời không đầy đủ. **Giải pháp**: Xây Query Router (có thể dùng LLM nhỏ hoặc classifier) để chọn search strategy phù hợp.

#### Best Practice: Incremental Updates

### 7.3 Monitoring & Observability

Những metrics quan trọng cần theo dõi trong production:

| Metric | Mục tiêu | Cách đo |
| --- | --- | --- |
| **Graph coverage** | > 85% documents có entities trong graph | docs_with_entities / total_docs |
| **Entity density** | 5-15 entities per chunk (avg) | total_entities / total_chunks |
| **Query latency (local)** | < 2s P95 | Tracing từ query đến response |
| **Query latency (global)** | < 10s P95 | Tracing từ query đến response |
| **Answer faithfulness** | > 90% | LLM-as-judge hoặc human eval |
| **Graph freshness** | < 1h lag | newest_doc_date - newest_entity_date |

## 8. Use Cases thực tế

### 8.1 Enterprise Knowledge Management

Một doanh nghiệp lớn với hàng nghìn tài liệu nội bộ (policies, SOPs, meeting notes, project docs). GraphRAG giúp:

- **Cross-document reasoning**: "Chính sách nào ảnh hưởng đến dự án X?" - duyệt từ project → policies qua relationships
- **Expert finding**: "Ai hiểu rõ nhất về module authentication?" - duyệt từ module → contributors → expertise
- **Impact analysis**: "Nếu thay đổi API v2, ảnh hưởng đến những team nào?" - graph traversal từ API → dependent services → owner teams

### 8.2 Healthcare - Clinical Decision Support

Knowledge Graph y khoa kết nối: bệnh nhân → triệu chứng → chẩn đoán → thuốc → tương tác thuốc → nghiên cứu lâm sàng. GraphRAG cho phép bác sĩ hỏi: *"Bệnh nhân dùng thuốc A và B, có triệu chứng X, những chẩn đoán nào cần xem xét và có tương tác thuốc nào cần lưu ý?"*

### 8.3 Legal Research

Đồ thị pháp luật kết nối: luật → điều khoản → án lệ → phán quyết → bình luận chuyên gia. Luật sư có thể hỏi: *"Những án lệ nào liên quan đến điều 15 Luật Doanh nghiệp trong bối cảnh M&A, và phán quyết gần nhất của Tòa án Nhân dân Tối cao?"*

### 8.4 Source Code Intelligence

Đồ thị code: functions → classes → modules → dependencies → commits → authors. Developers hỏi: *"Function X được gọi từ đâu, ai là người thay đổi gần nhất, và có test coverage không?"* - GraphRAG trả lời bằng cách duyệt call graph + git history.

## 9. Lộ trình phát triển GraphRAG

2020 - 2023

**Tiền GraphRAG:** Knowledge Graphs được sử dụng chủ yếu cho structured QA (KGQA). Google Knowledge Graph, Wikidata là các KG lớn nhất. RAG truyền thống ra đời và phát triển mạnh.

Giữa 2024

**Microsoft GraphRAG Open-source:** Microsoft phát hành GraphRAG, kết hợp KG + LLM + Community Detection. Nhanh chóng trở thành chuẩn mực cho graph-based RAG.

Q4 2024 - Q1 2025

**Adoption rộng rãi:** LangChain, LlamaIndex tích hợp GraphRAG. Neo4j ra mắt native vector index. Nhiều startup xây dựng trên GraphRAG.

06/2025

**LazyGraphRAG:** Microsoft công bố LazyGraphRAG, giảm 99% chi phí indexing. LinearRAG được accepted tại ICLR'26. GraphRAG Benchmark ra mắt.

2026

**Production-ready:** GraphRAG trở thành standard cho enterprise AI. Hybrid RAG (Vector + Graph) là kiến trúc phổ biến. Agentic AI systems sử dụng GraphRAG làm memory layer.

2027+ (Dự đoán)

**Autonomous KG:** AI tự động xây dựng, cập nhật, và tối ưu Knowledge Graph. Healthcare và Legal bắt buộc dùng GraphRAG cho decision support. Real-time GraphRAG với streaming data.

## 10. Bắt đầu với GraphRAG - Hướng dẫn thực chiến

### Bước 1: Đánh giá nhu cầu

Trả lời các câu hỏi sau trước khi quyết định dùng GraphRAG:

- Dữ liệu của bạn có nhiều **entities có mối quan hệ với nhau** không? (Nếu không, RAG truyền thống đủ)
- Users có hỏi câu hỏi cần **multi-hop reasoning** không? (VD: "A liên quan đến B như thế nào thông qua C?")
- Bạn cần **explainability** - giải thích reasoning path? (Quan trọng trong healthcare, legal, finance)
- Budget cho LLM calls có đủ cho indexing pipeline? (GraphRAG tốn token hơn RAG truyền thống đáng kể)

### Bước 2: Chọn stack công nghệ

#### Đơn giản nhất

**Microsoft GraphRAG** (Python package) + Neo4j Community. Dùng cho PoC và dataset nhỏ-vừa.

#### Production-ready

**LangChain/LlamaIndex** + Neo4j Enterprise + Custom extraction pipeline. Cho hệ thống cần scale.

#### Managed

**Neo4j AuraDB** + Cloud LLM (Claude/GPT). Không cần quản lý infrastructure, nhanh nhất để go-live.

### Bước 3: Iteration loop

1. **Start small:** 50-100 documents, extract entities, build graph, test queries
2. **Evaluate:** So sánh answer quality giữa RAG truyền thống vs GraphRAG trên test set
3. **Tune extraction:** Cải thiện prompts, thêm entity types, refine entity resolution
4. **Scale:** Tăng dần dataset, optimize Cypher queries, add caching
5. **Hybrid:** Kết hợp vector search + graph search cho best of both worlds

## 11. Kết luận

Tuy nhiên, GraphRAG không phải "silver bullet". Chi phí indexing cao, complexity vận hành lớn hơn, và không phải mọi use case đều cần graph-based reasoning. **Nguyên tắc vàng:** bắt đầu với RAG truyền thống, đánh giá gaps, và chỉ thêm GraphRAG khi thực sự cần multi-hop reasoning hoặc global summarization.

#### Next steps cho builder

- Cài đặt `graphrag` package: `pip install graphrag`
- Setup Neo4j Community: `docker run -p 7474:7474 -p 7687:7687 neo4j:5`
- Thử với 50 documents nội bộ của team bạn
- So sánh kết quả với RAG pipeline hiện có
- Đọc thêm bài viết về [RAG](/rag-retrieval-augmented-generation-kien-truc-truy-xuat-tri-thuc-cho-ung-dung-ai-14) và [Agentic AI Architecture](/agentic-ai-architecture-thiet-ke-kien-truc-he-thong-ai-da-tac-nhan-production-2026-13) trên blog này

RAG - Retrieval-Augmented Generation: Kiến trúc truy xuất tri thức cho ứng dụng AI

A2A Protocol - Giao thức Agent-to-Agent cho Hệ thống Multi-Agent AI

Disclaimer: The opinions expressed in this blog are solely my own and do not reflect the views or opinions of my employer or any affiliated organizations. The content provided is for informational and educational purposes only and should not be taken as professional advice. While I strive to provide accurate and up-to-date information, I make no warranties or guarantees about the completeness, reliability, or accuracy of the content. Readers are encouraged to verify the information and seek independent advice as needed. I disclaim any liability for decisions or actions taken based on the content of this blog.