What percentage of the AI-200 exam covers Develop AI Solutions by Using Azure Data Management Services?

Domain 2 (Develop AI Solutions by Using Azure Data Management Services) accounts for 25–30% of the AI-200 exam. Within it, the "Enhance AI Solutions with Azure Managed Redis" area covers topics such as Azure Managed Redis and RediSearch.

Is Azure Managed Redis on the AI-200 exam?

Yes. "Enhance AI Solutions with Azure Managed Redis" is part of Domain 2 in the official AI-200 skill outline, which is weighted at 25–30%. The key topics tested are Azure Managed Redis, RediSearch, and vector indexes.

How do I practice Azure Managed Redis hands-on?

Create a free Azure account and follow the code examples in this module step-by-step. The official Microsoft Learn sandbox for Course AI-200T00-A also provides free lab environments for Azure Managed Redis and related services.

Module 6: Azure Managed Redis — AI-200 Study Notes

Cache Content with Azure Managed Redis

🎬 Unit 1

Introduction

3 min

Azure Managed Redis is a fully managed Redis service. Redis is an in-memory key-value store that typically answers in well under a millisecond, often orders of magnitude faster than a query against a disk-backed database. AI applications use it to cache model inference results, prompt/response pairs, embedding vectors, rate-limit counters, and session state, all to reduce cost and latency.

💡 Exam Tip

Exam pillars: 1) Tiers + features 2) Cache-aside pattern implementation 3) Data types and commands 4) TTL strategy 5) Cache invalidation patterns 6) Redis port 10000 with TLS (not 6379).

📘 Unit 2

Explore Azure Managed Redis

7 min

Cache-Aside Pattern: App checks Redis → HIT returns instantly → MISS fetches DB then caches result

1. Azure Managed Redis Tiers

#	Tier	Memory	Persistence	Cluster	Use For
1	Memory Optimized	Up to 1.5 TB	AOF	✅	Large AI caches, embedding stores
2	Balanced	Up to 120 GB	AOF	✅	General production AI workloads
3	Compute Optimized	Up to 120 GB	AOF	✅	High throughput, CPU-intensive operations
4	Flash Optimized	Up to 13 TB	AOF	✅	Very large datasets, warm data on flash

Also: Azure Cache for Redis with Basic/Standard/Premium tiers — Basic is single-node dev/test; Standard adds replication; Premium adds geo-replication and VNet.

2. Connection: Port 10000 (Not 6379)

Azure Managed Redis uses port 10000 with mandatory TLS — not the default Redis port 6379. This is a common exam trap.

rediss://myinstance.redis.azure.com:10000 (TLS required; the rediss:// scheme, with a double s, selects TLS)

⚠️ Common Gotcha

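Port 6379 = open-source Redis default (plaintext). Azure Managed Redis = port 10000 with TLS. The older Azure Cache for Redis tiers (Basic/Standard/Premium) use port 6380 for TLS instead, so check which service the question names; for Azure Managed Redis the answer is port 10000.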
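The connection rule can be captured in a small helper. This is a minimal sketch: `managed_redis_url` is a hypothetical helper name and the host is a placeholder. It relies on the convention that `rediss://` (double s) requests TLS while `redis://` is plaintext.

```python
from urllib.parse import urlparse

AZURE_MANAGED_REDIS_PORT = 10000  # not the open-source default 6379


def managed_redis_url(host: str, port: int = AZURE_MANAGED_REDIS_PORT) -> str:
    """Build a TLS connection URL; the 'rediss' scheme selects TLS."""
    return f"rediss://{host}:{port}"


url = managed_redis_url("myinstance.redis.azure.com")  # placeholder host
parsed = urlparse(url)
print(parsed.scheme, parsed.hostname, parsed.port)
```

A URL in this form can be handed to `redis.Redis.from_url(...)` in redis-py instead of passing `host`, `port`, and `ssl=True` separately.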

3. Caching Strategies — Which to Use When

Cache-aside (Lazy loading) — app checks cache first, on MISS fetches from DB and stores in cache. Most common for AI inference results. Requires cache invalidation on source data change.
Write-through — writes go to cache AND DB simultaneously. Cache always consistent. Higher write latency. Good for user profiles that must always be current.
Write-behind (Write-back) — writes go to cache, DB updated asynchronously. Low write latency. Risk of data loss on crash. Advanced scenario.
Read-through — cache fetches from DB on MISS automatically. Cache acts as transparent proxy. Simplifies application code.

💡 Exam Tip

Cache-aside = application manages cache. Cache hit: return from cache. Cache miss: fetch DB, populate cache, set TTL, return. The exam always has a "implement caching" scenario where cache-aside is the right answer for AI inference results.
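The hit/miss flow, plus invalidation on source change, can be sketched without a server. Everything here is illustrative: `InMemoryCache` is a hypothetical stand-in that mimics only `get`, `set(ex=...)`, and `delete`, and `fake_model` stands in for the slow model call. With a real client the same `get_inference` function would receive the `redis.Redis` object instead.

```python
import json
import time


class InMemoryCache:
    """Hypothetical stand-in for a Redis client: get / set(ex=) / delete only."""

    def __init__(self):
        self._data = {}

    def get(self, key):
        entry = self._data.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if expires_at is not None and time.monotonic() >= expires_at:
            del self._data[key]  # expired, behave like a miss
            return None
        return value

    def set(self, key, value, ex=None):
        expires_at = time.monotonic() + ex if ex is not None else None
        self._data[key] = (value, expires_at)

    def delete(self, key):
        self._data.pop(key, None)


def get_inference(cache, prompt_id, run_model, ttl_seconds=300):
    """Cache-aside: check the cache, on a miss call the source and populate the cache."""
    key = f"inference:result:{prompt_id}"
    cached = cache.get(key)
    if cached is not None:           # HIT
        return json.loads(cached), True
    result = run_model(prompt_id)    # MISS: go to the slow source
    cache.set(key, json.dumps(result), ex=ttl_seconds)
    return result, False


def invalidate_inference(cache, prompt_id):
    """Call when the source data or model changes so stale results are dropped."""
    cache.delete(f"inference:result:{prompt_id}")


calls = []


def fake_model(prompt_id):  # hypothetical slow model call
    calls.append(prompt_id)
    return {"answer": "42", "confidence": 0.98}


cache = InMemoryCache()
first, first_hit = get_inference(cache, "user-123", fake_model)
second, second_hit = get_inference(cache, "user-123", fake_model)
invalidate_inference(cache, "user-123")
third, third_hit = get_inference(cache, "user-123", fake_model)
```

The second call is served from the cache; after invalidation the third call goes back to the model, so the model runs twice in total.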

📘 Unit 3

Client Libraries and Configuration

5 min

1. Python: redis-py with Managed Identity

import redis

# Access key authentication
r = redis.Redis(
    host='your-instance.redis.azure.com',
    port=10000,
    ssl=True,
    decode_responses=True,
    password='your-access-key'
)

# Entra ID (Managed Identity) authentication
from redis_entraid.cred_provider import create_from_default_azure_credential
credential_provider = create_from_default_azure_credential(
    ("https://redis.azure.com/.default",)
)
r = redis.Redis(host='...', port=10000, ssl=True,
    decode_responses=True, credential_provider=credential_provider)

💡 Exam Tip

Production: use create_from_default_azure_credential from redis-entraid package. Token refreshes automatically. Access key = shared long-lived secret — avoid in production.

2. decode_responses=True

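Set this in clients that only read and write text. Without it, redis-py returns bytes objects; with it, replies are decoded to Python str. The exception is any client that handles binary data such as packed embedding vectors: it must keep decode_responses=False so the raw bytes come back intact.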
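What the setting changes can be shown in plain Python, no server needed. This is only an illustration of bytes versus str: the sample values are made up, and the second half shows that packed float32 data is generally not valid UTF-8, which is why a client that reads binary embeddings cannot decode every reply.

```python
import struct

text_reply = b'{"answer": "42"}'      # raw bytes, as returned when decoding is off
as_str = text_reply.decode("utf-8")   # the str you get when decoding is on

vector_bytes = struct.pack("<2f", 1.0, -1.0)  # two float32 values, 8 bytes
try:
    vector_bytes.decode("utf-8")
    vector_is_text = True
except UnicodeDecodeError:
    vector_is_text = False  # packed floats are not valid UTF-8 text
```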

📘 Unit 4

Implement Redis Data Operations

12 min

1. Strings — Most Common Type (Key→Value)

r.set("inference:result:user-123", '{"answer": "42", "confidence": 0.98}', ex=300)
result = r.get("inference:result:user-123")  # None if expired

r.setex("rate:user-123", 60, 1)  # Set with TTL of 60 seconds
r.incr("rate:user-123")           # Atomic increment — thread-safe counter
remaining = r.ttl("rate:user-123")  # Time to live in seconds
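The counter commands combine into a fixed-window rate limiter. A minimal sketch, assuming one counter key per user per window: `CounterStore` is a hypothetical stand-in exposing `incr` and `expire` with the same call shape as redis-py, and `allow_request` is an illustrative name. The `now` parameter exists only to make the example deterministic.

```python
import time


class CounterStore:
    """Hypothetical stand-in exposing incr/expire like a Redis client."""

    def __init__(self):
        self._counts = {}
        self._expiry = {}

    def incr(self, key):
        self._counts[key] = self._counts.get(key, 0) + 1
        return self._counts[key]

    def expire(self, key, seconds):
        self._expiry[key] = seconds  # recorded only; a real server deletes the key


def allow_request(store, user_id, limit=10, window=60, now=None):
    """Fixed-window limiter: one counter per user per window."""
    now = time.time() if now is None else now
    key = f"rate:{user_id}:{int(now) // window}"
    count = store.incr(key)          # atomic on a real Redis server
    if count == 1:
        store.expire(key, window)    # start the TTL when the window opens
    return count <= limit


store = CounterStore()
results = [allow_request(store, "user-123", limit=3, now=1_000) for _ in range(5)]
next_window = allow_request(store, "user-123", limit=3, now=1_060)
```

Putting the window number in the key means a new minute starts from a fresh counter, and the TTL only cleans up old windows.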

2. Hashes — Object with Multiple Fields

Use for session data, user profiles — avoids serializing/deserializing a full JSON blob when you only need one field.

r.hset("user:1001", mapping={"name": "Alice", "tier": "premium", "credits": "500"})
name = r.hget("user:1001", "name")        # Get one field
profile = r.hgetall("user:1001")          # Get all fields as dict
r.hincrby("user:1001", "credits", -10)   # Atomically decrement credits

3. Lists — Ordered Queue / Recent History

r.lpush("chat:session-abc", "Hello")   # Push to the head (left)
r.rpush("chat:session-abc", "World")   # Push to the tail (right)
history = r.lrange("chat:session-abc", 0, -1)  # All items, head to tail
r.ltrim("chat:session-abc", 0, 9)      # Keep indexes 0-9; with LPUSH-only writes these are the 10 newest
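Why LPUSH followed by LTRIM 0 9 keeps the ten newest messages is easier to see on a plain list. The two functions below are a simplified model of the commands (non-negative indexes only), not the Redis implementation.

```python
def lpush(items, value):
    """Model of LPUSH: insert at the head (index 0)."""
    items.insert(0, value)


def ltrim(items, start, stop):
    """Model of LTRIM with non-negative, inclusive indexes."""
    items[:] = items[start:stop + 1]


history = []
for n in range(1, 13):        # 12 messages: msg-1 ... msg-12
    lpush(history, f"msg-{n}")
    ltrim(history, 0, 9)      # keep indexes 0..9, i.e. the 10 newest
```

Because new items always enter at index 0, the oldest message is the one that falls off the tail once the list exceeds ten entries.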

4. Sets — Unique Members

r.sadd("active-sessions", "sess-001", "sess-002")
r.sismember("active-sessions", "sess-001")  # True/False
r.smembers("active-sessions")               # All members

5. Sorted Sets — Leaderboards / Priority Queues

r.zadd("model-latency", {"gpt-4o": 250.5, "gpt-4o-mini": 85.2})
fastest = r.zrange("model-latency", 0, -1, withscores=True)  # Ascending (fastest first)

6. TTL Strategy by Data Type

#	Data	TTL	Reason
1	Rate limit counters	60–300 seconds	Reset per time window
2	Inference result cache	5–60 minutes	Reasonably fresh, invalidate on model update
3	User session data	15–60 minutes	Expire inactive sessions
4	Product catalog	1–24 hours	Changes infrequently
5	Static config	24+ hours	Very stable data
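One way to keep these choices consistent is a single lookup table in code. A sketch only: the category names and `ttl_for` are hypothetical, and each value is just one point picked from the ranges in the table.

```python
TTL_SECONDS = {
    "rate_limit": 60,          # 60-300 s window
    "inference_result": 300,   # 5-60 min
    "session": 1800,           # 15-60 min
    "product_catalog": 3600,   # 1-24 h
    "static_config": 86400,    # 24 h or more
}


def ttl_for(category: str) -> int:
    """Return the TTL for a data category; unknown categories get the shortest TTL."""
    return TTL_SECONDS.get(category, min(TTL_SECONDS.values()))
```

Usage would look like `r.set(key, value, ex=ttl_for("inference_result"))`. Falling back to the shortest TTL is a conservative design choice so that unclassified data does not linger.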

⚠️ Common Gotcha

Never store sensitive data (API keys, PII) in Redis without encryption — Redis stores data in memory and may persist it to disk. Expire sensitive data with short TTLs or use Key Vault instead.

⚡ Redis Master Cheatsheet

Azure Redis port: 10000 (TLS, NOT 6379)

Cache-aside key commands: GET → miss → DB → SET with TTL

Set with TTL (seconds): r.set(key, val, ex=300)

Atomic counter: r.incr(key) (thread-safe)

Object storage: Hash, r.hset(key, mapping=dict)

Ordered queue: List, lpush/rpush + lrange

Unique members: Set, r.sadd

Ranked data: Sorted Set, r.zadd

Decode bytes to str: decode_responses=True

Prod auth: redis-entraid + DefaultAzureCredential

🧪 Unit 5

Exercise — Cache AI Inference Results

30 min

Create Azure Managed Redis instance and connect on port 10000 with TLS
Implement cache-aside for model inference: GET → MISS → model call → SET with 5-min TTL
Implement rate limiting: INCR per user per minute, block at limit
Store session history in a Redis List, trim to last 10 messages
Measure latency difference: Redis HIT vs database query

✅ Unit 6

Knowledge Check

5 min

Q: Azure Managed Redis connection port? A: 10000 with TLS (not 6379)
Q: Cache frequently-read inference results, update on source change. Which pattern? A: Cache-aside (lazy loading)
Q: Track API calls per user per minute atomically. Which command? A: INCR, followed by EXPIRE to set the window TTL (SETEX would overwrite the counter)
Q: Store user profile fields individually without full JSON serialization. Which data type? A: Redis Hash
Q: Production authentication for Redis without stored keys? A: Entra ID with redis-entraid + DefaultAzureCredential

🏁 Unit 7

Summary

2 min

Azure Managed Redis = in-memory cache with sub-millisecond latency. Connect on port 10000 with TLS. Implement cache-aside for AI inference results. Choose data types by pattern: String (simple cache), Hash (objects), List (queues/history), Set (unique members), Sorted Set (leaderboards). Set TTLs based on data volatility. Use Entra ID for production auth.

🧠 Memory Tricks

Cache-aside flow: "Get → Miss → DB → Set" — always in that order

Data type mnemonic: "SHLSS" — String (cache), Hash (objects), List (queue), Set (unique), Sorted Set (ranked)

Azure Redis port: 10000 = "ten thousand ms slower than RAM... but still fast" (just remember: 10000, not 6379)

⚡

Module Cheatsheet

Azure Managed Redis


🔑 Key Facts

Azure Redis port — 10000 with TLS — NOT the open-source default 6379
Cache-aside flow — GET → MISS → DB → SET with TTL → return
Atomic counter — r.incr(key) — thread-safe, use for rate limiting
Hash (object) — r.hset(key, mapping=dict) — avoids full JSON serialize/deserialize
List (queue) — lpush/rpush + lrange + ltrim — chat history, task queues
Set (unique) — r.sadd/sismember — active sessions, deduplicated sets
Sorted Set (ranked) — r.zadd — leaderboards, priority queues, TTL ordering
Prod auth — redis-entraid + DefaultAzureCredential (no stored key)

💻 Commands & Patterns

import time
import redis

r = redis.Redis(host="myinst.redis.azure.com",
  port=10000, ssl=True, decode_responses=True,
  password="your-access-key")

# Cache-aside: return the cached value, or load it and cache it with a TTL
def get_result(key):
  hit = r.get(key)
  if hit is not None: return hit
  val = db_query()  # placeholder for the slow source lookup
  r.set(key, val, ex=300)  # 5-min TTL
  return val

# Atomic rate limit: one counter per user per minute
key = f"rate:{user_id}:{int(time.time())//60}"
count = r.incr(key); r.expire(key, 60)
if count > 10: raise RateLimitError()  # RateLimitError: app-defined exception

# Hash for user profile
r.hset("user:1001", mapping={"name":"Alice","tier":"premium"})
r.hincrby("user:1001", "credits", -10)

# List: keep the 10 newest messages
r.lpush("chat:abc", "Hello"); r.ltrim("chat:abc", 0, 9)

Implement Vector Indexing and Semantic Caching with Azure Managed Redis

🎬 Unit 1

Introduction to Redis Vector Search

3 min

Azure Managed Redis (via the RediSearch module) supports vector similarity search — store embeddings in Redis hashes and query with FT.SEARCH KNN. The primary AI use case is semantic caching: return cached LLM responses for semantically similar prompts instead of calling the API every time.

💡 Exam Tip

Redis vector exam pillars: 1) FT.CREATE with VECTOR HNSW field 2) FLOAT32 / DIM / DISTANCE_METRIC COSINE 3) FT.SEARCH KNN syntax with DIALECT 2 4) Cosine distance 0=identical — similarity = 1 - distance 5) Cache hit threshold ~0.95.

📘 Unit 2

Create a Vector Index with FT.CREATE

8 min

Create RediSearch Vector Index

import redis, struct

r = redis.Redis(
    host="myredis.redis.cache.windows.net",
    port=10000, password="key", ssl=True,
    decode_responses=False  # MUST be False for binary vectors
)

r.execute_command(
    "FT.CREATE", "idx:docs", "ON", "HASH",
    "PREFIX", "1", "doc:",
    "SCHEMA",
    "title", "TEXT",
    "embedding", "VECTOR", "HNSW", "6",
        "TYPE", "FLOAT32",
        "DIM", "1536",
        "DISTANCE_METRIC", "COSINE"
)

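# Store document with embedding
# `oai` is assumed to be an already-configured OpenAI / Azure OpenAI client
def embed(text):
    vec = oai.embeddings.create(input=text,
        model="text-embedding-3-small").data[0].embedding
    # Little-endian FLOAT32, 4 bytes per dimension
    return struct.pack(f"<{len(vec)}f", *vec)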

r.hset("doc:1", mapping={
    "title": "Azure Key Vault",
    "embedding": embed("Azure Key Vault secures secrets")
})

💡 Exam Tip

decode_responses=False required for binary vector data. HNSW = approximate index type. DIM 1536 must match your embedding model dimensions.
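The DIM rule can be enforced before anything is written to Redis. A minimal sketch: `to_float32_blob` and `from_float32_blob` are hypothetical helper names, and little-endian byte order is an assumption made explicit with `<` in the format string.

```python
import struct

DIM = 1536  # must equal the DIM declared in FT.CREATE


def to_float32_blob(vec, dim=DIM):
    """Pack floats as little-endian FLOAT32; reject wrong-sized vectors early."""
    if len(vec) != dim:
        raise ValueError(f"expected {dim} dimensions, got {len(vec)}")
    return struct.pack(f"<{dim}f", *vec)


def from_float32_blob(blob):
    """Inverse of to_float32_blob: 4 bytes per dimension."""
    return list(struct.unpack(f"<{len(blob) // 4}f", blob))


blob = to_float32_blob([0.5] * DIM)
restored = from_float32_blob(blob)
```

Each FLOAT32 takes 4 bytes, so a 1536-dimension embedding is stored as a 6144-byte field value.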

📘 Unit 3

KNN Vector Search

7 min

FT.SEARCH with KNN

def vector_search(query_text, top_k=5):
    q_emb = embed(query_text)
    results = r.execute_command(
        "FT.SEARCH", "idx:docs",
        f"*=>[KNN {top_k} @embedding $blob AS score]",
        "PARAMS", "2", "blob", q_emb,
        "RETURN", "2", "title", "score",  # count must match the number of fields listed
        "SORTBY", "score",
        "DIALECT", "2"
    )
    return results

💡 Exam Tip

$blob passes binary embedding. AS score names the distance field. SORTBY score = nearest first. DIALECT 2 required for vector syntax.
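The raw reply from `execute_command` is a flat list, so a small parser helps. This sketch assumes the RESP2 reply shape (total count, then alternating key and field list) with bytes values, and that `title` and `score` were requested in RETURN. `parse_search_reply` is a hypothetical helper and the sample reply is invented for illustration; RESP3 connections return a different structure.

```python
def parse_search_reply(reply):
    """Turn [total, key, [field, value, ...], key, [...], ...] into dicts."""
    total, rows = reply[0], reply[1:]
    docs = []
    for key, fields in zip(rows[::2], rows[1::2]):
        doc = {"id": key.decode()}
        for name, value in zip(fields[::2], fields[1::2]):
            doc[name.decode()] = value.decode()
        doc["score"] = float(doc["score"])  # cosine distance, lower is closer
        docs.append(doc)
    return total, docs


# Hypothetical sample reply, shaped like a RESP2 FT.SEARCH result with bytes values
sample = [
    2,
    b"doc:1", [b"score", b"0.031", b"title", b"Azure Key Vault"],
    b"doc:7", [b"score", b"0.212", b"title", b"Managed identities"],
]
total, docs = parse_search_reply(sample)
```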

📘 Unit 4

Semantic Caching Pattern

8 min

Cache-Aside with Similarity Threshold

import hashlib

THRESHOLD = 0.95  # cosine similarity for cache hit

def semantic_cache_get(prompt):
    q_emb = embed(prompt)
    res = r.execute_command(
        "FT.SEARCH", "idx:cache",
        "*=>[KNN 1 @embedding $blob AS score]",
        "PARAMS", "2", "blob", q_emb,
        "RETURN", "2", "response", "score",
        "DIALECT", "2"
    )
    if res[0] > 0:
        distance = float(dict(zip(res[2][::2],
            res[2][1::2]))[b"score"])
        if (1 - distance) >= THRESHOLD:
            return res[2][res[2].index(b"response")+1]
    return None

def semantic_cache_set(prompt, response, ttl=3600):
    key = f"cache:{hashlib.md5(prompt.encode()).hexdigest()}"
    r.hset(key, mapping={
        "prompt": prompt,
        "response": response,
        "embedding": embed(prompt)
    })
    r.expire(key, ttl)
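The functions above query an index named idx:cache over keys prefixed cache:, which has to exist before the first lookup. A sketch of the matching FT.CREATE arguments, mirroring the idx:docs definition from Unit 2: `cache_index_args` is a hypothetical helper, and indexing `response` as TEXT is an assumption.

```python
def cache_index_args(index="idx:cache", prefix="cache:", dim=1536):
    """Build the FT.CREATE argument list for the semantic-cache index."""
    vector_params = ["TYPE", "FLOAT32", "DIM", str(dim), "DISTANCE_METRIC", "COSINE"]
    return [
        "FT.CREATE", index, "ON", "HASH",
        "PREFIX", "1", prefix,
        "SCHEMA",
        "response", "TEXT",
        # The number after HNSW is the count of vector parameter tokens that follow
        "embedding", "VECTOR", "HNSW", str(len(vector_params)), *vector_params,
    ]


args = cache_index_args()
# With a live connection: r.execute_command(*args)
```

Deriving the count from the list avoids the mismatch that occurs when a parameter is added but the hard-coded number is not updated.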

⚠️ Common Gotcha

Redis returns cosine distance (0=identical). Similarity = 1 - distance. Always set TTL on cached responses — LLM answers become stale. Never cache user-specific or sensitive data.
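The distance-to-similarity conversion is worth checking by hand. A pure-Python sketch with made-up vectors; `cosine_distance` and `is_cache_hit` are illustrative names, and the 0.95 default matches the threshold used in this unit.

```python
import math


def cosine_distance(a, b):
    """1 - cosine similarity; 0 means the vectors point the same way."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def is_cache_hit(distance, threshold=0.95):
    """Convert the distance Redis reports into similarity and compare."""
    return (1.0 - distance) >= threshold


same = cosine_distance([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])  # parallel vectors
orthogonal = cosine_distance([1.0, 0.0], [0.0, 1.0])       # unrelated vectors
```

Parallel vectors give a distance of about 0 (similarity about 1), while orthogonal vectors give a distance of 1 (similarity 0). A reported distance of 0.03 is a hit at the 0.95 threshold; 0.2 is a miss.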

🧪 Unit 5

Exercise

20 min

Create an Azure Managed Redis instance with the RediSearch module enabled
Create an FT.CREATE index with HNSW FLOAT32 DIM=1536 COSINE
Store 10 Q&A pairs with embeddings using r.hset()
Implement semantic_cache_get() with 0.95 similarity threshold
Test with rephrased questions — verify cache hits vs misses

🏁 Unit 6

Summary

2 min

Redis vector search: FT.CREATE with HNSW index → HSET with binary embedding → FT.SEARCH KNN with DIALECT 2. Cosine similarity = 1 - distance. Semantic cache returns cached LLM responses for similar prompts (≥0.95 threshold). Always TTL cache entries. Use decode_responses=False for binary vector operations.
