Redis 8 và Caching Patterns 2026 - I/O Threading, Vector Set và Chiến lược Cache hiệu năng cao

Posted on: 4/17/2026 9:14:01 AM

Table of contents

1. Redis 8 — Bản cập nhật lớn nhất trong lịch sử
2. Redis 8.6 (Tháng 3/2026) — Tối ưu sâu hơn
1. 2.1. LRM Eviction — Chính sách loại bỏ mới cho AI Caching
  1. Tại sao LRM quan trọng cho AI/Semantic Caching?
2. 2.2. Benchmark Redis 8.6
3. Caching Patterns — Chọn đúng chiến lược cho từng bài toán
4. So sánh các Caching Patterns
5. Redis vs Valkey 2026 — Bối cảnh License Fork
1. Lưu ý về License
6. Thiết kế Cache Layer cho hệ thống Production
7. Cache Anti-Patterns cần tránh
8. Kết luận
1. Nguồn tham khảo

Trong thế giới backend, Redis không còn chỉ là một key-value store đơn giản nữa. Với phiên bản Redis 8 (GA 2025, liên tục cập nhật đến 8.6 tháng 3/2026), Redis đã trở thành một nền tảng dữ liệu đa năng — tích hợp JSON, Search, TimeSeries, Vector Set và Bloom Filter trực tiếp vào core, cùng kiến trúc I/O Threading mới cho throughput tăng gấp 5 lần so với Redis 7.2. Bài viết này sẽ đi sâu vào kiến trúc Redis 8, các caching patterns hiện đại, và cách chọn chiến lược cache phù hợp cho hệ thống production.

1. Redis 8 — Bản cập nhật lớn nhất trong lịch sử

Redis 8 không phải một bản nâng cấp thông thường — đây là sự hợp nhất toàn bộ hệ sinh thái. Trước đây, các module như RediSearch, RedisJSON, RedisTimeSeries, RedisBloom phải cài đặt riêng biệt qua MODULE LOAD. Từ Redis 8, tất cả được tích hợp vào một binary duy nhất gọi là Redis Open Source.

5x Throughput cao hơn so với Redis 7.2

3.5M ops/sec với pipelining

87% Lệnh được tối ưu tốc độ

8 Kiểu dữ liệu mới trong core

1.1. Tám kiểu dữ liệu mới tích hợp sẵn

Redis 8 bổ sung 8 kiểu dữ liệu trực tiếp vào core, không cần cài module riêng:

Kiểu dữ liệu	Tiền thân (Module)	Ứng dụng chính
JSON	RedisJSON	Lưu trữ, truy vấn document JSON với JSONPath
Time Series	RedisTimeSeries	Metrics, IoT sensor data, monitoring
Vector Set (preview)	Mới hoàn toàn	Vector similarity search cho AI/ML
Bloom Filter	RedisBloom	Kiểm tra phần tử tồn tại (probabilistic)
Cuckoo Filter	RedisBloom	Tương tự Bloom nhưng hỗ trợ delete
Top-K	RedisBloom	Tracking top K phần tử phổ biến nhất
Count-Min Sketch	RedisBloom	Ước lượng tần suất xuất hiện
T-Digest	RedisBloom	Ước lượng percentile (p99, p95)

# Redis 8 - JSON là kiểu dữ liệu first-class
127.0.0.1:6379> JSON.SET user:1001 $ '{"name":"Anh Tu","role":"engineer","skills":["redis","dotnet"]}'
OK

# Truy vấn JSONPath trực tiếp
127.0.0.1:6379> JSON.GET user:1001 $.skills[0]
"[\"redis\"]"

# Vector Set cho AI/Semantic Search (preview trong 8.x)
127.0.0.1:6379> VADD products REDUCE 2 VALUES 3 0.12 0.87 0.34 ELE "laptop-gaming"
(integer) 1

# Bloom Filter kiểm tra email đã tồn tại
127.0.0.1:6379> BF.ADD emails:registered "user@example.com"
(integer) 1
127.0.0.1:6379> BF.EXISTS emails:registered "user@example.com"
(integer) 1

1.2. I/O Threading — Bước nhảy vọt về hiệu năng

Một trong những thay đổi kiến trúc quan trọng nhất của Redis 8 là I/O Threading mới. Redis vốn nổi tiếng với mô hình single-threaded — tất cả xử lý trên một core CPU duy nhất. Từ Redis 8, network I/O được phân tán ra nhiều core trong khi data manipulation vẫn giữ single-threaded để đảm bảo tính nhất quán.

graph LR
    C1["Client 1"] --> IO1["I/O Thread 1"]
    C2["Client 2"] --> IO2["I/O Thread 2"]
    C3["Client 3"] --> IO3["I/O Thread 3"]
    C4["Client N"] --> IO4["I/O Thread N"]
    IO1 --> MAIN["Main Thread
(Data Processing)"]
    IO2 --> MAIN
    IO3 --> MAIN
    IO4 --> MAIN
    MAIN --> IO1
    MAIN --> IO2
    MAIN --> IO3
    MAIN --> IO4
    style MAIN fill:#e94560,stroke:#fff,color:#fff
    style IO1 fill:#16213e,stroke:#e94560,color:#fff
    style IO2 fill:#16213e,stroke:#e94560,color:#fff
    style IO3 fill:#16213e,stroke:#e94560,color:#fff
    style IO4 fill:#16213e,stroke:#e94560,color:#fff
    style C1 fill:#0f3460,stroke:#e94560,color:#fff
    style C2 fill:#0f3460,stroke:#e94560,color:#fff
    style C3 fill:#0f3460,stroke:#e94560,color:#fff
    style C4 fill:#0f3460,stroke:#e94560,color:#fff

Hình 1: Kiến trúc I/O Threading của Redis 8 — network I/O đa luồng, data processing đơn luồng

# redis.conf — bật I/O Threading
io-threads 8          # Số I/O threads (khuyến nghị = số CPU cores)
io-threads-do-reads yes  # Cho phép I/O threads xử lý cả read

# Kết quả benchmark trên 8-core CPU:
# Redis 7.2:  ~650K ops/sec
# Redis 8.0:  ~1.38M ops/sec (+112%)
# Redis 8.6:  ~3.5M ops/sec (với pipelining)

Khi nào nên bật I/O Threading?

I/O Threading mang lại lợi ích rõ rệt khi hệ thống có nhiều kết nối đồng thời (hàng nghìn clients) và workload chủ yếu là các lệnh đơn giản (GET/SET). Với workload phức tạp (Lua script dài, SORT trên dataset lớn), bottleneck nằm ở main thread nên I/O threading ít tác dụng hơn.

1.3. Dual-Stream Replication

Redis 8 cải tiến cơ chế replication với dual-stream — hai luồng replication chạy đồng thời: một luồng cho snapshot và một luồng cho các thay đổi trong quá trình transfer. Kết quả: thời gian replication giảm 18%, peak buffer size giảm 35%.

sequenceDiagram
    participant P as Primary
    participant R as Replica
    P->>R: Stream 1: RDB Snapshot
    P->>R: Stream 2: Write Buffer (song song)
    Note over P,R: Hai luồng chạy đồng thời,
không chờ snapshot xong mới gửi buffer
    R->>R: Apply snapshot
    R->>R: Apply buffered writes
    R-->>P: Replication complete

Hình 2: Dual-Stream Replication — song song hoá quá trình đồng bộ primary-replica

2. Redis 8.6 (Tháng 3/2026) — Tối ưu sâu hơn

Bản cập nhật mới nhất Redis 8.6 tập trung vào hiệu năng thuần túy và giới thiệu eviction policy mới dành cho semantic caching:

2.1. LRM Eviction — Chính sách loại bỏ mới cho AI Caching

Redis 8.6 giới thiệu hai eviction policy mới: volatile-lrm và allkeys-lrm (Least Recently Modified). Khác với LRU (Least Recently Used) nơi mỗi lần đọc đều refresh timestamp, LRM chỉ tính thời điểm ghi cuối cùng.

Tại sao LRM quan trọng cho AI/Semantic Caching?

Trong hệ thống AI, các embedding vectors hoặc cached inference results thường được đọc rất thường xuyên nhưng hiếm khi cập nhật. Với LRU, những key này sẽ không bao giờ bị evict (vì liên tục được đọc). LRM giải quyết vấn đề này bằng cách evict dựa trên thời điểm write — key nào lâu không được cập nhật sẽ bị loại trước, bất kể tần suất đọc.

# redis.conf — sử dụng LRM cho mixed workload
maxmemory 8gb
maxmemory-policy allkeys-lrm

# Phù hợp cho scenario:
# - Short-lived cache keys (session, API response)
#   → thường xuyên write → LRM giữ lại
# - Long-lived semantic cache (embeddings, ML model output)
#   → ít write, nhiều read → LRM sẽ evict khi cần

2.2. Benchmark Redis 8.6

Metric	Redis 8.4	Redis 8.6	Cải thiện
Vector Set insertion	baseline	+43%	Vectorized quantized distance
Vector Set querying	baseline	+58%	Binary quantization (Intel/AMD AVX)
Sorted Set latency	baseline	-35%	Prefetching optimization
GET (short strings)	baseline	-15% latency	ACL verification optimization
Hash memory usage	baseline	-17%	Encoding optimization
Sorted Set memory	baseline	-31%	Compact representation

3. Caching Patterns — Chọn đúng chiến lược cho từng bài toán

Việc sử dụng Redis hiệu quả không chỉ nằm ở cấu hình server, mà còn ở việc chọn đúng caching pattern. Mỗi pattern có ưu nhược điểm riêng và phù hợp với từng dạng workload khác nhau.

3.1. Cache-Aside (Lazy Loading)

Đây là pattern phổ biến nhất. Application chủ động kiểm tra cache trước, nếu miss thì đọc từ database và populate cache.

graph LR
    APP["Application"] -->|"1. GET key"| REDIS["Redis Cache"]
    REDIS -->|"2a. Cache HIT"| APP
    REDIS -->|"2b. Cache MISS"| APP
    APP -->|"3. Query DB"| DB["Database"]
    DB -->|"4. Return data"| APP
    APP -->|"5. SET key (TTL)"| REDIS
    style REDIS fill:#e94560,stroke:#fff,color:#fff
    style DB fill:#0f3460,stroke:#e94560,color:#fff
    style APP fill:#16213e,stroke:#e94560,color:#fff

Hình 3: Cache-Aside pattern — application quản lý cả cache và database

// C# (.NET) — Cache-Aside Pattern với StackExchange.Redis
public async Task<Product?> GetProductAsync(int productId)
{
    var cacheKey = $"product:{productId}";
    var db = _redis.GetDatabase();

    // 1. Kiểm tra cache
    var cached = await db.StringGetAsync(cacheKey);
    if (cached.HasValue)
        return JsonSerializer.Deserialize<Product>(cached!);

    // 2. Cache miss → đọc database
    var product = await _dbContext.Products.FindAsync(productId);
    if (product is null) return null;

    // 3. Populate cache với TTL
    await db.StringSetAsync(
        cacheKey,
        JsonSerializer.Serialize(product),
        expiry: TimeSpan.FromMinutes(15)
    );

    return product;
}

Ưu điểm Cache-Aside

Chỉ cache dữ liệu thực sự được truy cập (demand-driven). Nếu Redis down, application vẫn hoạt động bình thường bằng cách đọc trực tiếp từ database. Pattern này phù hợp nhất cho read-heavy workload với tỷ lệ read/write > 10:1.

3.2. Write-Through

Mỗi lần ghi database, đồng thời ghi cache. Đảm bảo cache luôn đồng bộ với database nhưng đánh đổi bằng write latency cao hơn.

// Write-Through Pattern
public async Task UpdateProductAsync(Product product)
{
    // Ghi database VÀ cache đồng thời
    await _dbContext.Products.UpdateAsync(product);
    await _dbContext.SaveChangesAsync();

    // Cập nhật cache ngay lập tức
    var cacheKey = $"product:{product.Id}";
    var db = _redis.GetDatabase();
    await db.StringSetAsync(
        cacheKey,
        JsonSerializer.Serialize(product),
        expiry: TimeSpan.FromHours(1)
    );
}

3.3. Write-Behind (Write-Back)

Application chỉ ghi vào Redis, Redis bất đồng bộ flush xuống database. Throughput ghi cực cao nhưng có risk mất dữ liệu nếu Redis crash trước khi flush.

graph LR
    APP["Application"] -->|"1. Write"| REDIS["Redis Cache"]
    REDIS -->|"2. ACK ngay"| APP
    REDIS -->|"3. Async flush"| DB["Database"]
    WORKER["Background Worker"] -->|"4. Batch write"| DB
    REDIS -->|"Queue"| WORKER
    style REDIS fill:#e94560,stroke:#fff,color:#fff
    style DB fill:#0f3460,stroke:#e94560,color:#fff
    style APP fill:#16213e,stroke:#e94560,color:#fff
    style WORKER fill:#533483,stroke:#e94560,color:#fff

Hình 4: Write-Behind pattern — ghi nhanh vào cache, async xuống database

// Write-Behind Pattern sử dụng Redis Stream làm buffer
public async Task RecordPageViewAsync(string pageId, string userId)
{
    var db = _redis.GetDatabase();

    // Ghi vào Redis Stream (rất nhanh, ~0.1ms)
    await db.StreamAddAsync(
        "pageviews:buffer",
        new NameValueEntry[]
        {
            new("pageId", pageId),
            new("userId", userId),
            new("timestamp", DateTimeOffset.UtcNow.ToUnixTimeMilliseconds().ToString())
        },
        maxLength: 100_000  // Giới hạn buffer size
    );
}

// Background worker flush xuống database theo batch
public async Task FlushPageViewsAsync()
{
    var db = _redis.GetDatabase();
    var entries = await db.StreamReadAsync("pageviews:buffer", "0-0", count: 500);

    if (entries.Length == 0) return;

    // Batch insert vào database
    var records = entries.Select(e => new PageView
    {
        PageId = e["pageId"],
        UserId = e["userId"],
        Timestamp = DateTimeOffset.FromUnixTimeMilliseconds(long.Parse(e["timestamp"]))
    });

    await _dbContext.PageViews.AddRangeAsync(records);
    await _dbContext.SaveChangesAsync();

    // Xóa entries đã xử lý
    foreach (var entry in entries)
        await db.StreamDeleteAsync("pageviews:buffer", new[] { entry.Id });
}

3.4. Read-Through

Application chỉ tương tác với cache. Cache tự động đọc database khi miss. Đơn giản hoá code ở tầng application nhưng yêu cầu cache layer phải "thông minh" hơn.

3.5. Proactive Cache Refresh — Xu hướng 2026

Thay vì chờ user request để populate cache (reactive), hệ thống chủ động cập nhật cache trước khi dữ liệu sắp hết hạn. User không bao giờ thấy cache miss hoặc stale data.

// Proactive Refresh với background timer
public class CacheRefreshService : BackgroundService
{
    protected override async Task ExecuteAsync(CancellationToken ct)
    {
        while (!ct.IsCancellationRequested)
        {
            var db = _redis.GetDatabase();

            // Scan các key sắp hết hạn (TTL < 2 phút)
            var server = _redis.GetServer(_redis.GetEndPoints()[0]);
            foreach (var key in server.Keys(pattern: "product:*"))
            {
                var ttl = await db.KeyTimeToLiveAsync(key);
                if (ttl.HasValue && ttl.Value < TimeSpan.FromMinutes(2))
                {
                    // Proactive refresh trước khi expire
                    var productId = int.Parse(key.ToString().Split(':')[1]);
                    var product = await _dbContext.Products.FindAsync(productId);
                    if (product != null)
                    {
                        await db.StringSetAsync(
                            key,
                            JsonSerializer.Serialize(product),
                            expiry: TimeSpan.FromMinutes(15)
                        );
                    }
                }
            }

            await Task.Delay(TimeSpan.FromSeconds(30), ct);
        }
    }
}

4. So sánh các Caching Patterns

Pattern	Consistency	Read Latency	Write Latency	Best For
Cache-Aside	Eventual	Cache hit: rất thấp Cache miss: cao	Bình thường	Read-heavy, có thể chịu stale data
Write-Through	Strong	Luôn thấp	Cao (ghi cả DB + cache)	Cần consistency cao
Write-Behind	Eventual	Luôn thấp	Rất thấp	Write-heavy (analytics, logging)
Read-Through	Eventual	Tương tự Cache-Aside	Bình thường	Đơn giản hoá application code
Proactive Refresh	Near real-time	Luôn thấp (no miss)	Bình thường + background	Hot data, UX-critical paths

5. Redis vs Valkey 2026 — Bối cảnh License Fork

Tháng 3/2024, Redis Ltd. thay đổi license từ BSD-3-Clause sang SSPL/RSALv2. Các cloud provider lớn ngay lập tức fork Redis 7.2 thành Valkey dưới Linux Foundation, giữ license BSD-3-Clause. Đến 2026, hai dự án đã phân tách đáng kể:

Tiêu chí	Redis 8.x	Valkey 8.x
License	SSPL / RSALv2 (source-available)	BSD-3-Clause (fully open)
Tích hợp modules	JSON, Search, TimeSeries, Bloom, Vector Set trong core	Không có — focus vào core data types
I/O Threading	Có (io-threads config)	Có (cải tiến bởi AWS engineers)
Performance (SET)	~729K RPS (8.0)	~1M RPS (8.1)
Hướng phát triển	All-in-one data platform (AI, search, analytics)	Lean core, clustering, Rust integration
Cloud support	AWS ElastiCache, GCP Memorystore, Azure Cache	AWS MemoryDB, GCP Memorystore
Chọn khi	Cần JSON/Search/TimeSeries/Vector tích hợp	Cần BSD license (SaaS, managed service)

Lưu ý về License

Nếu bạn đang xây dựng SaaS hoặc managed service cung cấp Redis-as-a-service, license SSPL/RSALv2 của Redis 8 có thể gây vấn đề pháp lý. Trong trường hợp này, Valkey (BSD-3-Clause) là lựa chọn an toàn hơn. Tuy nhiên, nếu chỉ sử dụng Redis như một thành phần nội bộ trong hệ thống, license này không ảnh hưởng.

6. Thiết kế Cache Layer cho hệ thống Production

Dưới đây là kiến trúc cache đa tầng kết hợp nhiều pattern, phù hợp cho hệ thống production xử lý hàng triệu request:

graph TB
    CLIENT["Client Request"] --> LB["Load Balancer"]
    LB --> APP["Application Server"]
    APP --> L1["L1: In-Memory Cache
(IMemoryCache, 30s TTL)"]
    L1 -->|"Miss"| L2["L2: Redis Cache
(Distributed, 15min TTL)"]
    L2 -->|"Miss"| DB["Database
(SQL Server / PostgreSQL)"]
    DB -->|"Populate"| L2
    L2 -->|"Populate"| L1
    WORKER["Background Worker"] -->|"Proactive Refresh"| L2
    WORKER -->|"Invalidate"| L1
    CDC["CDC / Event Bus"] -->|"Data Changed"| WORKER
    DB -->|"Change Event"| CDC
    style L1 fill:#533483,stroke:#e94560,color:#fff
    style L2 fill:#e94560,stroke:#fff,color:#fff
    style DB fill:#0f3460,stroke:#e94560,color:#fff
    style WORKER fill:#16213e,stroke:#e94560,color:#fff
    style CDC fill:#16213e,stroke:#e94560,color:#fff
    style APP fill:#16213e,stroke:#e94560,color:#fff

Hình 5: Kiến trúc cache đa tầng với L1 in-memory, L2 Redis, proactive refresh qua CDC

// Multilevel Cache Service trong .NET
public class MultiLevelCacheService
{
    private readonly IMemoryCache _l1;
    private readonly IDatabase _l2;  // Redis

    public async Task<T?> GetAsync<T>(string key, Func<Task<T?>> factory)
    {
        // L1: In-Memory (ultra-fast, ~0.001ms)
        if (_l1.TryGetValue(key, out T? l1Value))
            return l1Value;

        // L2: Redis (fast, ~0.5ms)
        var l2Value = await _l2.StringGetAsync(key);
        if (l2Value.HasValue)
        {
            var result = JsonSerializer.Deserialize<T>(l2Value!);
            _l1.Set(key, result, TimeSpan.FromSeconds(30));
            return result;
        }

        // L3: Database (slow, ~5-50ms)
        var dbValue = await factory();
        if (dbValue is not null)
        {
            var json = JsonSerializer.Serialize(dbValue);
            await _l2.StringSetAsync(key, json, TimeSpan.FromMinutes(15));
            _l1.Set(key, dbValue, TimeSpan.FromSeconds(30));
        }

        return dbValue;
    }
}

7. Cache Anti-Patterns cần tránh

7.1. Thundering Herd (Cache Stampede)

Khi một cache key phổ biến expire, hàng nghìn request đồng thời đều miss cache và đổ dồn vào database. Giải pháp: sử dụng distributed lock hoặc probabilistic early expiration.

// Distributed Lock để chống Thundering Herd
public async Task<T?> GetWithLockAsync<T>(string key, Func<Task<T?>> factory)
{
    var db = _redis.GetDatabase();
    var cached = await db.StringGetAsync(key);
    if (cached.HasValue)
        return JsonSerializer.Deserialize<T>(cached!);

    var lockKey = $"lock:{key}";
    var lockAcquired = await db.StringSetAsync(
        lockKey, "1", TimeSpan.FromSeconds(10), When.NotExists);

    if (lockAcquired)
    {
        try
        {
            var value = await factory();
            if (value is not null)
            {
                await db.StringSetAsync(key,
                    JsonSerializer.Serialize(value),
                    TimeSpan.FromMinutes(15));
            }
            return value;
        }
        finally
        {
            await db.KeyDeleteAsync(lockKey);
        }
    }

    // Các request khác chờ và retry
    await Task.Delay(100);
    return await GetWithLockAsync<T>(key, factory);
}

7.2. Cache Penetration

Request liên tục query key không tồn tại trong cả cache và database (thường do attacker). Giải pháp: cache giá trị null hoặc sử dụng Bloom Filter (giờ đã tích hợp sẵn trong Redis 8).

// Dùng Bloom Filter (Redis 8 native) chống Cache Penetration
public async Task<Product?> GetProductSafeAsync(int productId)
{
    var db = _redis.GetDatabase();

    // Kiểm tra Bloom Filter trước — O(1), cực nhanh
    var exists = (bool)await db.ExecuteAsync("BF.EXISTS", "products:bf", productId.ToString());
    if (!exists)
        return null;  // Chắc chắn không tồn tại, skip DB query

    // Bloom Filter nói "có thể tồn tại" → kiểm tra cache + DB
    return await GetProductAsync(productId);
}

7.3. Hot Key Problem

Một key được truy cập quá thường xuyên, tạo bottleneck trên một Redis node. Giải pháp: replicate hot key sang nhiều nodes hoặc sử dụng L1 in-memory cache phía application.

8. Kết luận

Redis 8 đánh dấu bước chuyển mình từ một key-value store thành một nền tảng dữ liệu thống nhất với JSON, Search, TimeSeries, Vector Set và Bloom Filter tích hợp sẵn. Kiến trúc I/O Threading mới cho throughput tăng vượt trội, trong khi dual-stream replication và LRM eviction policy mở ra khả năng mới cho AI/semantic caching.

Về caching patterns, không có "silver bullet" — mỗi pattern phù hợp với một dạng workload khác nhau. Cache-Aside vẫn là lựa chọn mặc định an toàn cho read-heavy systems, Write-Behind cho write-heavy analytics, và Proactive Refresh cho các hot path cần UX mượt mà. Kết hợp cache đa tầng (L1 in-memory + L2 Redis) với các kỹ thuật chống stampede là công thức đã được chứng minh cho hệ thống triệu request.

Cuối cùng, sự phân tách Redis/Valkey buộc các team phải đưa ra quyết định licensing rõ ràng. Hãy đánh giá nhu cầu tích hợp module (JSON, Search, Vector) và yêu cầu license trước khi chọn hướng đi.

Nguồn tham khảo

#redis #Caching #system design #Performance #Database

# Redis 8 và Caching Patterns 2026 - I/O Threading, Vector Set và Chiến lược Cache hiệu năng cao

Trong thế giới backend, **Redis** không còn chỉ là một key-value store đơn giản nữa. Với phiên bản **Redis 8** (GA 2025, liên tục cập nhật đến 8.6 tháng 3/2026), Redis đã trở thành một **nền tảng dữ liệu đa năng** — tích hợp JSON, Search, TimeSeries, Vector Set và Bloom Filter trực tiếp vào core, cùng kiến trúc **I/O Threading** mới cho throughput tăng gấp 5 lần so với Redis 7.2. Bài viết này sẽ đi sâu vào kiến trúc Redis 8, các caching patterns hiện đại, và cách chọn chiến lược cache phù hợp cho hệ thống production.

## 1. Redis 8 — Bản cập nhật lớn nhất trong lịch sử

Redis 8 không phải một bản nâng cấp thông thường — đây là sự **hợp nhất toàn bộ hệ sinh thái**. Trước đây, các module như RediSearch, RedisJSON, RedisTimeSeries, RedisBloom phải cài đặt riêng biệt qua `MODULE LOAD`. Từ Redis 8, tất cả được tích hợp vào một binary duy nhất gọi là **Redis Open Source**.

5x Throughput cao hơn so với Redis 7.2

3.5M ops/sec với pipelining

87% Lệnh được tối ưu tốc độ

8 Kiểu dữ liệu mới trong core

### 1.1. Tám kiểu dữ liệu mới tích hợp sẵn

Redis 8 bổ sung 8 kiểu dữ liệu trực tiếp vào core, không cần cài module riêng:

| Kiểu dữ liệu | Tiền thân (Module) | Ứng dụng chính |
| --- | --- | --- |
| **JSON** | RedisJSON | Lưu trữ, truy vấn document JSON với JSONPath |
| **Time Series** | RedisTimeSeries | Metrics, IoT sensor data, monitoring |
| **Vector Set** (preview) | Mới hoàn toàn | Vector similarity search cho AI/ML |
| **Bloom Filter** | RedisBloom | Kiểm tra phần tử tồn tại (probabilistic) |
| **Cuckoo Filter** | RedisBloom | Tương tự Bloom nhưng hỗ trợ delete |
| **Top-K** | RedisBloom | Tracking top K phần tử phổ biến nhất |
| **Count-Min Sketch** | RedisBloom | Ước lượng tần suất xuất hiện |
| **T-Digest** | RedisBloom | Ước lượng percentile (p99, p95) |

```bash
# Redis 8 - JSON là kiểu dữ liệu first-class
127.0.0.1:6379> JSON.SET user:1001 $ '{"name":"Anh Tu","role":"engineer","skills":["redis","dotnet"]}'
OK

# Truy vấn JSONPath trực tiếp
127.0.0.1:6379> JSON.GET user:1001 $.skills[0]
"[\"redis\"]"

# Vector Set cho AI/Semantic Search (preview trong 8.x)
127.0.0.1:6379> VADD products REDUCE 2 VALUES 3 0.12 0.87 0.34 ELE "laptop-gaming"
(integer) 1

# Bloom Filter kiểm tra email đã tồn tại
127.0.0.1:6379> BF.ADD emails:registered "user@example.com"
(integer) 1
127.0.0.1:6379> BF.EXISTS emails:registered "user@example.com"
(integer) 1

```

### 1.2. I/O Threading — Bước nhảy vọt về hiệu năng

Một trong những thay đổi kiến trúc quan trọng nhất của Redis 8 là **I/O Threading mới**. Redis vốn nổi tiếng với mô hình single-threaded — tất cả xử lý trên một core CPU duy nhất. Từ Redis 8, network I/O được phân tán ra nhiều core trong khi **data manipulation vẫn giữ single-threaded** để đảm bảo tính nhất quán.

```
graph LR
    C1["Client 1"] --> IO1["I/O Thread 1"]
    C2["Client 2"] --> IO2["I/O Thread 2"]
    C3["Client 3"] --> IO3["I/O Thread 3"]
    C4["Client N"] --> IO4["I/O Thread N"]
    IO1 --> MAIN["Main Thread  
(Data Processing)"]
    IO2 --> MAIN
    IO3 --> MAIN
    IO4 --> MAIN
    MAIN --> IO1
    MAIN --> IO2
    MAIN --> IO3
    MAIN --> IO4
    style MAIN fill:#e94560,stroke:#fff,color:#fff
    style IO1 fill:#16213e,stroke:#e94560,color:#fff
    style IO2 fill:#16213e,stroke:#e94560,color:#fff
    style IO3 fill:#16213e,stroke:#e94560,color:#fff
    style IO4 fill:#16213e,stroke:#e94560,color:#fff
    style C1 fill:#0f3460,stroke:#e94560,color:#fff
    style C2 fill:#0f3460,stroke:#e94560,color:#fff
    style C3 fill:#0f3460,stroke:#e94560,color:#fff
    style C4 fill:#0f3460,stroke:#e94560,color:#fff

```
Hình 1: Kiến trúc I/O Threading của Redis 8 — network I/O đa luồng, data processing đơn luồng

```bash
# redis.conf — bật I/O Threading
io-threads 8          # Số I/O threads (khuyến nghị = số CPU cores)
io-threads-do-reads yes  # Cho phép I/O threads xử lý cả read

# Kết quả benchmark trên 8-core CPU:
# Redis 7.2:  ~650K ops/sec
# Redis 8.0:  ~1.38M ops/sec (+112%)
# Redis 8.6:  ~3.5M ops/sec (với pipelining)

```

#### Khi nào nên bật I/O Threading?

I/O Threading mang lại lợi ích rõ rệt khi hệ thống có **nhiều kết nối đồng thời** (hàng nghìn clients) và workload chủ yếu là các lệnh đơn giản (GET/SET). Với workload phức tạp (Lua script dài, SORT trên dataset lớn), bottleneck nằm ở main thread nên I/O threading ít tác dụng hơn.

### 1.3. Dual-Stream Replication

Redis 8 cải tiến cơ chế replication với **dual-stream** — hai luồng replication chạy đồng thời: một luồng cho snapshot và một luồng cho các thay đổi trong quá trình transfer. Kết quả: thời gian replication giảm 18%, peak buffer size giảm 35%.

```
sequenceDiagram
    participant P as Primary
    participant R as Replica
    P->>R: Stream 1: RDB Snapshot
    P->>R: Stream 2: Write Buffer (song song)
    Note over P,R: Hai luồng chạy đồng thời,  
không chờ snapshot xong mới gửi buffer
    R->>R: Apply snapshot
    R->>R: Apply buffered writes
    R-->>P: Replication complete

```
Hình 2: Dual-Stream Replication — song song hoá quá trình đồng bộ primary-replica

## 2. Redis 8.6 (Tháng 3/2026) — Tối ưu sâu hơn

Bản cập nhật mới nhất Redis 8.6 tập trung vào **hiệu năng thuần túy** và giới thiệu eviction policy mới dành cho semantic caching:

### 2.1. LRM Eviction — Chính sách loại bỏ mới cho AI Caching

Redis 8.6 giới thiệu hai eviction policy mới: `volatile-lrm` và `allkeys-lrm` (Least Recently **Modified**). Khác với LRU (Least Recently Used) nơi mỗi lần đọc đều refresh timestamp, LRM chỉ tính thời điểm **ghi cuối cùng**.

#### Tại sao LRM quan trọng cho AI/Semantic Caching?

Trong hệ thống AI, các **embedding vectors** hoặc **cached inference results** thường được đọc rất thường xuyên nhưng hiếm khi cập nhật. Với LRU, những key này sẽ không bao giờ bị evict (vì liên tục được đọc). LRM giải quyết vấn đề này bằng cách evict dựa trên thời điểm write — key nào lâu không được cập nhật sẽ bị loại trước, bất kể tần suất đọc.

```bash
# redis.conf — sử dụng LRM cho mixed workload
maxmemory 8gb
maxmemory-policy allkeys-lrm

# Phù hợp cho scenario:
# - Short-lived cache keys (session, API response)
#   → thường xuyên write → LRM giữ lại
# - Long-lived semantic cache (embeddings, ML model output)
#   → ít write, nhiều read → LRM sẽ evict khi cần

```

### 2.2. Benchmark Redis 8.6

| Metric | Redis 8.4 | Redis 8.6 | Cải thiện |
| --- | --- | --- | --- |
| Vector Set insertion | baseline | +43% | Vectorized quantized distance |
| Vector Set querying | baseline | +58% | Binary quantization (Intel/AMD AVX) |
| Sorted Set latency | baseline | -35% | Prefetching optimization |
| GET (short strings) | baseline | -15% latency | ACL verification optimization |
| Hash memory usage | baseline | -17% | Encoding optimization |
| Sorted Set memory | baseline | -31% | Compact representation |

## 3. Caching Patterns — Chọn đúng chiến lược cho từng bài toán

Việc sử dụng Redis hiệu quả không chỉ nằm ở cấu hình server, mà còn ở việc chọn đúng **caching pattern**. Mỗi pattern có ưu nhược điểm riêng và phù hợp với từng dạng workload khác nhau.

### 3.1. Cache-Aside (Lazy Loading)

Đây là pattern phổ biến nhất. Application chủ động kiểm tra cache trước, nếu miss thì đọc từ database và populate cache.

```
Hình 3: Cache-Aside pattern — application quản lý cả cache và database

```csharp
// C# (.NET) — Cache-Aside Pattern với StackExchange.Redis
public async Task<Product?> GetProductAsync(int productId)
{
    var cacheKey = $"product:{productId}";
    var db = _redis.GetDatabase();

// 1. Kiểm tra cache
    var cached = await db.StringGetAsync(cacheKey);
    if (cached.HasValue)
        return JsonSerializer.Deserialize<Product>(cached!);

// 2. Cache miss → đọc database
    var product = await _dbContext.Products.FindAsync(productId);
    if (product is null) return null;

// 3. Populate cache với TTL
    await db.StringSetAsync(
        cacheKey,
        JsonSerializer.Serialize(product),
        expiry: TimeSpan.FromMinutes(15)
    );

return product;
}

```

#### Ưu điểm Cache-Aside

Chỉ cache dữ liệu thực sự được truy cập (demand-driven). Nếu Redis down, application vẫn hoạt động bình thường bằng cách đọc trực tiếp từ database. Pattern này phù hợp nhất cho **read-heavy workload** với tỷ lệ read/write > 10:1.

### 3.2. Write-Through

Mỗi lần ghi database, đồng thời ghi cache. Đảm bảo cache luôn đồng bộ với database nhưng đánh đổi bằng write latency cao hơn.

```csharp
// Write-Through Pattern
public async Task UpdateProductAsync(Product product)
{
    // Ghi database VÀ cache đồng thời
    await _dbContext.Products.UpdateAsync(product);
    await _dbContext.SaveChangesAsync();

// Cập nhật cache ngay lập tức
    var cacheKey = $"product:{product.Id}";
    var db = _redis.GetDatabase();
    await db.StringSetAsync(
        cacheKey,
        JsonSerializer.Serialize(product),
        expiry: TimeSpan.FromHours(1)
    );
}

```

### 3.3. Write-Behind (Write-Back)

Application chỉ ghi vào Redis, Redis **bất đồng bộ** flush xuống database. Throughput ghi cực cao nhưng có risk mất dữ liệu nếu Redis crash trước khi flush.

```
Hình 4: Write-Behind pattern — ghi nhanh vào cache, async xuống database

```csharp
// Write-Behind Pattern sử dụng Redis Stream làm buffer
public async Task RecordPageViewAsync(string pageId, string userId)
{
    var db = _redis.GetDatabase();

// Ghi vào Redis Stream (rất nhanh, ~0.1ms)
    await db.StreamAddAsync(
        "pageviews:buffer",
        new NameValueEntry[]
        {
            new("pageId", pageId),
            new("userId", userId),
            new("timestamp", DateTimeOffset.UtcNow.ToUnixTimeMilliseconds().ToString())
        },
        maxLength: 100_000  // Giới hạn buffer size
    );
}

// Background worker flush xuống database theo batch
public async Task FlushPageViewsAsync()
{
    var db = _redis.GetDatabase();
    var entries = await db.StreamReadAsync("pageviews:buffer", "0-0", count: 500);

if (entries.Length == 0) return;

// Batch insert vào database
    var records = entries.Select(e => new PageView
    {
        PageId = e["pageId"],
        UserId = e["userId"],
        Timestamp = DateTimeOffset.FromUnixTimeMilliseconds(long.Parse(e["timestamp"]))
    });

await _dbContext.PageViews.AddRangeAsync(records);
    await _dbContext.SaveChangesAsync();

// Xóa entries đã xử lý
    foreach (var entry in entries)
        await db.StreamDeleteAsync("pageviews:buffer", new[] { entry.Id });
}

```

### 3.4. Read-Through

Application chỉ tương tác với cache. Cache **tự động** đọc database khi miss. Đơn giản hoá code ở tầng application nhưng yêu cầu cache layer phải "thông minh" hơn.

### 3.5. Proactive Cache Refresh — Xu hướng 2026

Thay vì chờ user request để populate cache (reactive), hệ thống **chủ động cập nhật cache trước** khi dữ liệu sắp hết hạn. User không bao giờ thấy cache miss hoặc stale data.

```csharp
// Proactive Refresh với background timer
public class CacheRefreshService : BackgroundService
{
    protected override async Task ExecuteAsync(CancellationToken ct)
    {
        while (!ct.IsCancellationRequested)
        {
            var db = _redis.GetDatabase();

// Scan các key sắp hết hạn (TTL < 2 phút)
            var server = _redis.GetServer(_redis.GetEndPoints()[0]);
            foreach (var key in server.Keys(pattern: "product:*"))
            {
                var ttl = await db.KeyTimeToLiveAsync(key);
                if (ttl.HasValue && ttl.Value &lt; TimeSpan.FromMinutes(2))
                {
                    // Proactive refresh trước khi expire
                    var productId = int.Parse(key.ToString().Split(':')[1]);
                    var product = await _dbContext.Products.FindAsync(productId);
                    if (product != null)
                    {
                        await db.StringSetAsync(
                            key,
                            JsonSerializer.Serialize(product),
                            expiry: TimeSpan.FromMinutes(15)
                        );
                    }
                }
            }

await Task.Delay(TimeSpan.FromSeconds(30), ct);
        }
    }
}

```

## 4. So sánh các Caching Patterns

| Pattern | Consistency | Read Latency | Write Latency | Best For |
| --- | --- | --- | --- | --- |
| **Cache-Aside** | Eventual | Cache hit: rất thấp   Cache miss: cao | Bình thường | Read-heavy, có thể chịu stale data |
| **Write-Through** | Strong | Luôn thấp | Cao (ghi cả DB + cache) | Cần consistency cao |
| **Write-Behind** | Eventual | Luôn thấp | Rất thấp | Write-heavy (analytics, logging) |
| **Read-Through** | Eventual | Tương tự Cache-Aside | Bình thường | Đơn giản hoá application code |
| **Proactive Refresh** | Near real-time | Luôn thấp (no miss) | Bình thường + background | Hot data, UX-critical paths |

## 5. Redis vs Valkey 2026 — Bối cảnh License Fork

Tháng 3/2024, Redis Ltd. thay đổi license từ BSD-3-Clause sang SSPL/RSALv2. Các cloud provider lớn ngay lập tức fork Redis 7.2 thành **Valkey** dưới Linux Foundation, giữ license BSD-3-Clause. Đến 2026, hai dự án đã phân tách đáng kể:

| Tiêu chí | Redis 8.x | Valkey 8.x |
| --- | --- | --- |
| **License** | SSPL / RSALv2 (source-available) | BSD-3-Clause (fully open) |
| **Tích hợp modules** | JSON, Search, TimeSeries, Bloom, Vector Set trong core | Không có — focus vào core data types |
| **I/O Threading** | Có (io-threads config) | Có (cải tiến bởi AWS engineers) |
| **Performance (SET)** | ~729K RPS (8.0) | ~1M RPS (8.1) |
| **Hướng phát triển** | All-in-one data platform (AI, search, analytics) | Lean core, clustering, Rust integration |
| **Cloud support** | AWS ElastiCache, GCP Memorystore, Azure Cache | AWS MemoryDB, GCP Memorystore |
| **Chọn khi** | Cần JSON/Search/TimeSeries/Vector tích hợp | Cần BSD license (SaaS, managed service) |

#### Lưu ý về License

## 6. Thiết kế Cache Layer cho hệ thống Production

Dưới đây là kiến trúc cache đa tầng kết hợp nhiều pattern, phù hợp cho hệ thống production xử lý hàng triệu request:

```
graph TB
    CLIENT["Client Request"] --> LB["Load Balancer"]
    LB --> APP["Application Server"]
    APP --> L1["L1: In-Memory Cache  
(IMemoryCache, 30s TTL)"]
    L1 -->|"Miss"| L2["L2: Redis Cache  
(Distributed, 15min TTL)"]
    L2 -->|"Miss"| DB["Database  
(SQL Server / PostgreSQL)"]
    DB -->|"Populate"| L2
    L2 -->|"Populate"| L1
    WORKER["Background Worker"] -->|"Proactive Refresh"| L2
    WORKER -->|"Invalidate"| L1
    CDC["CDC / Event Bus"] -->|"Data Changed"| WORKER
    DB -->|"Change Event"| CDC
    style L1 fill:#533483,stroke:#e94560,color:#fff
    style L2 fill:#e94560,stroke:#fff,color:#fff
    style DB fill:#0f3460,stroke:#e94560,color:#fff
    style WORKER fill:#16213e,stroke:#e94560,color:#fff
    style CDC fill:#16213e,stroke:#e94560,color:#fff
    style APP fill:#16213e,stroke:#e94560,color:#fff

```
Hình 5: Kiến trúc cache đa tầng với L1 in-memory, L2 Redis, proactive refresh qua CDC

```csharp
// Multilevel Cache Service trong .NET
public class MultiLevelCacheService
{
    private readonly IMemoryCache _l1;
    private readonly IDatabase _l2;  // Redis

public async Task<T?> GetAsync<T>(string key, Func<Task<T?>> factory)
    {
        // L1: In-Memory (ultra-fast, ~0.001ms)
        if (_l1.TryGetValue(key, out T? l1Value))
            return l1Value;

// L2: Redis (fast, ~0.5ms)
        var l2Value = await _l2.StringGetAsync(key);
        if (l2Value.HasValue)
        {
            var result = JsonSerializer.Deserialize<T>(l2Value!);
            _l1.Set(key, result, TimeSpan.FromSeconds(30));
            return result;
        }

// L3: Database (slow, ~5-50ms)
        var dbValue = await factory();
        if (dbValue is not null)
        {
            var json = JsonSerializer.Serialize(dbValue);
            await _l2.StringSetAsync(key, json, TimeSpan.FromMinutes(15));
            _l1.Set(key, dbValue, TimeSpan.FromSeconds(30));
        }

return dbValue;
    }
}

```

## 7. Cache Anti-Patterns cần tránh

### 7.1. Thundering Herd (Cache Stampede)

Khi một cache key phổ biến expire, hàng nghìn request đồng thời đều miss cache và đổ dồn vào database. Giải pháp: sử dụng **distributed lock** hoặc **probabilistic early expiration**.

```csharp
// Distributed Lock để chống Thundering Herd
public async Task<T?> GetWithLockAsync<T>(string key, Func<Task<T?>> factory)
{
    var db = _redis.GetDatabase();
    var cached = await db.StringGetAsync(key);
    if (cached.HasValue)
        return JsonSerializer.Deserialize<T>(cached!);

var lockKey = $"lock:{key}";
    var lockAcquired = await db.StringSetAsync(
        lockKey, "1", TimeSpan.FromSeconds(10), When.NotExists);

if (lockAcquired)
    {
        try
        {
            var value = await factory();
            if (value is not null)
            {
                await db.StringSetAsync(key,
                    JsonSerializer.Serialize(value),
                    TimeSpan.FromMinutes(15));
            }
            return value;
        }
        finally
        {
            await db.KeyDeleteAsync(lockKey);
        }
    }

// Các request khác chờ và retry
    await Task.Delay(100);
    return await GetWithLockAsync<T>(key, factory);
}

```

### 7.2. Cache Penetration

Request liên tục query key không tồn tại trong cả cache và database (thường do attacker). Giải pháp: cache giá trị null hoặc sử dụng **Bloom Filter** (giờ đã tích hợp sẵn trong Redis 8).

```csharp
// Dùng Bloom Filter (Redis 8 native) chống Cache Penetration
public async Task<Product?> GetProductSafeAsync(int productId)
{
    var db = _redis.GetDatabase();

// Kiểm tra Bloom Filter trước — O(1), cực nhanh
    var exists = (bool)await db.ExecuteAsync("BF.EXISTS", "products:bf", productId.ToString());
    if (!exists)
        return null;  // Chắc chắn không tồn tại, skip DB query

// Bloom Filter nói "có thể tồn tại" → kiểm tra cache + DB
    return await GetProductAsync(productId);
}

```

### 7.3. Hot Key Problem

## 8. Kết luận

Redis 8 đánh dấu bước chuyển mình từ một key-value store thành một **nền tảng dữ liệu thống nhất** với JSON, Search, TimeSeries, Vector Set và Bloom Filter tích hợp sẵn. Kiến trúc I/O Threading mới cho throughput tăng vượt trội, trong khi dual-stream replication và LRM eviction policy mở ra khả năng mới cho AI/semantic caching.

Về caching patterns, không có "silver bullet" — mỗi pattern phù hợp với một dạng workload khác nhau. **Cache-Aside** vẫn là lựa chọn mặc định an toàn cho read-heavy systems, **Write-Behind** cho write-heavy analytics, và **Proactive Refresh** cho các hot path cần UX mượt mà. Kết hợp cache đa tầng (L1 in-memory + L2 Redis) với các kỹ thuật chống stampede là công thức đã được chứng minh cho hệ thống triệu request.

### Nguồn tham khảo

- [Redis 8 GA Announcement — redis.io](https://redis.io/blog/redis-8-ga/)
- [Redis 8.6 Performance Improvements — redis.io](https://redis.io/blog/announcing-redis-86-performance-improvements-streams/)
- [What's New in Redis 8.0 — redis.io](https://redis.io/docs/latest/develop/whats-new/8-0/)
- [Database Caching Strategies Using Redis — AWS](https://docs.aws.amazon.com/whitepapers/latest/database-caching-strategies-using-redis/caching-patterns.html)
- [Redis Caching Solutions — redis.io](https://redis.io/solutions/caching/)
- [Redis vs Valkey in 2026 — DEV Community](https://dev.to/synsun/redis-vs-valkey-in-2026-what-the-license-fork-actually-changed-1kni)
- [Choosing the Right Key-Value Store: Redis vs Valkey — Percona](https://www.percona.com/blog/choosing-the-right-key-value-store-redis-vs-valkey/)
- [Redis 8.6 Throughput Improvements — Linuxiac](https://linuxiac.com/redis-8-6-improves-throughput-by-more-than-five-times-over-redis-7-2/)

Passkeys & WebAuthn 2026 - Thay thế Password với FIDO2, Platform Authenticator và Phishing-resistant Auth trên .NET 10 và Vue

AWS Lambda Serverless 2026: Kiến trúc, SnapStart, Event-Driven Patterns và Free Tier cho Production

Disclaimer: The opinions expressed in this blog are solely my own and do not reflect the views or opinions of my employer or any affiliated organizations. The content provided is for informational and educational purposes only and should not be taken as professional advice. While I strive to provide accurate and up-to-date information, I make no warranties or guarantees about the completeness, reliability, or accuracy of the content. Readers are encouraged to verify the information and seek independent advice as needed. I disclaim any liability for decisions or actions taken based on the content of this blog.