What percentage of the AI-200 exam covers Develop AI Solutions by Using Azure Data Management Services?

Domain 2 (Develop AI Solutions by Using Azure Data Management Services) accounts for 25–30% of the AI-200 exam. Develop AI Solutions with Azure Database for PostgreSQL topics like Azure Database for PostgreSQL and pgvector are actively tested.

Is PostgreSQL & pgvector on the AI-200 exam?

Yes. Develop AI Solutions with Azure Database for PostgreSQL is part of Domain 2 in the official AI-200 skill outline, weighted at 25–30%. The key services tested are Azure Database for PostgreSQL, pgvector, HNSW, IVFFlat.

How do I practice PostgreSQL & pgvector hands-on?

Create a free Azure account and follow the code examples in this module step-by-step. The official Microsoft Learn sandbox for Course AI-200T00-A also provides free lab environments for Azure Database for PostgreSQL and related services.

Module 5: PostgreSQL & pgvector — AI-200 Study Notes

Module

Build and Query with Azure Database for PostgreSQL

units

🎬 Unit 1

Introduction

3 min

Azure Database for PostgreSQL is a fully managed PostgreSQL service — Microsoft handles patching, backups, HA, and connection pooling. You get full PostgreSQL compatibility including JSONB, extensions (pgvector for vector search), and transactional guarantees. Perfect for AI agents storing conversation history, task state, and structured context alongside vector embeddings.

💡 Exam Tip

Exam pillars: 1) Compute tiers + PgBouncer availability 2) Entra auth token flow 3) TLS modes (verify-full) 4) JSONB and AI-relevant data types 5) Transactional DDL (ALTER TABLE in BEGIN/COMMIT).

📘 Unit 2

Explore Azure Database for PostgreSQL

7 min

PostgreSQL Connection Architecture: App → Entra token → PgBouncer (port 6432) → PostgreSQL

1. Compute Tiers

#	Tier	VM Series	CPU Burst	PgBouncer	Use For
1	Burstable	B-series	Yes	❌ No	Dev/test, small apps, proof-of-concept
2	General Purpose	D-series	No	✅ Yes	Production APIs, steady workloads
3	Memory Optimized	E-series	No	✅ Yes	Complex queries, large AI working sets, caching

⚠️ Common Gotcha

PgBouncer is only available on General Purpose and Memory Optimized — NOT Burstable. This is a classic exam trap. If the question mentions connection pooling with PostgreSQL → answer cannot be Burstable.

2. Managed Capabilities

Automatic backups — 7–35 day retention, point-in-time restore to any second within the window. AES-256 encrypted at rest.
High Availability — zone-redundant HA with automatic failover. Standby replica in a different AZ.
Extensions — enable with CREATE EXTENSION:
1. pgvector — vector similarity search (cosine, L2, inner product)
2. pg_trgm — fuzzy text matching for AI document search
3. uuid-ossp — UUID generation
4. hstore — key-value pairs in a column

📘 Unit 3

Connect to PostgreSQL

10 min

1. Connection Parameters

Endpoint: servername.postgres.database.azure.com
Direct port: 5432
PgBouncer port: 6432
Username format (Entra): username@servername

2. Entra ID Authentication (Recommended)

Short-lived OAuth2 tokens instead of passwords. Works with managed identities. No stored credentials.

from azure.identity import DefaultAzureCredential

credential = DefaultAzureCredential()
token = credential.get_token("https://ossrdbms-aad.database.windows.net/.default")
# Use token.token as the password in your psycopg2 connection string

💡 Exam Tip

Entra auth token resource = https://ossrdbms-aad.database.windows.net/.default. Memorize this URL — the exam tests it. Tokens auto-refresh; no manual rotation needed.

3. TLS / SSL Modes

disable — no encryption. Azure rejects this for managed PostgreSQL.
require — encrypts connection but does NOT validate server certificate.
verify-ca — validates the certificate authority.
verify-full — validates CA AND hostname. Recommended for production.

⚠️ Common Gotcha

Azure Database for PostgreSQL requires TLS. The strongest mode that validates both CA and hostname is verify-full. The exam asks "which mode ensures the client is connecting to the correct server?" — answer: verify-full.

4. PgBouncer — Connection Pooling

AI applications make many short-lived DB calls (one per inference request). Each new TCP connection is expensive. PgBouncer pools connections — the app creates a connection to PgBouncer, which reuses existing DB connections.

az postgres flexible-server parameter set \\
  --resource-group rg --server-name myserver \\
  --name pgbouncer.enabled --value true

# Connect via PgBouncer on port 6432
postgresql://[email protected]:6432/mydb?sslmode=require

📘 Unit 4

Create and Manage Schemas

10 min

1. Hierarchy: Server → Database → Schema → Table

Default schema is public. Use separate databases for full isolation; separate schemas for logical grouping with cross-schema JOINs possible.

2. AI-Relevant Data Types

JSONB — binary JSON with GIN indexing. Use for flexible metadata, model parameters, nested structures.
TIMESTAMPTZ — always use over TIMESTAMP for global apps. Stores UTC, displays per session timezone.
BIGSERIAL — auto-incrementing 64-bit integer PK. Use over SERIAL (32-bit overflows on high-volume tables).
TEXT — unbounded string. Same performance as VARCHAR in PostgreSQL. No reason to use VARCHAR unless enforcing max length.
UUID — universally unique ID. Use DEFAULT gen_random_uuid(). Good for distributed, globally-unique identifiers.

3. Conversation History Table Example

CREATE TABLE conversations (
    id          BIGSERIAL PRIMARY KEY,
    session_id  UUID NOT NULL DEFAULT gen_random_uuid(),
    user_id     VARCHAR(255) NOT NULL,
    started_at  TIMESTAMPTZ DEFAULT CURRENT_TIMESTAMP,
    metadata    JSONB DEFAULT '{}'::jsonb
);

CREATE TABLE messages (
    id              BIGSERIAL PRIMARY KEY,
    conversation_id BIGINT NOT NULL REFERENCES conversations(id) ON DELETE CASCADE,
    role            VARCHAR(50) NOT NULL CHECK (role IN ('user', 'assistant', 'system')),
    content         TEXT NOT NULL,
    created_at      TIMESTAMPTZ DEFAULT CURRENT_TIMESTAMP
);

4. Indexes

CREATE INDEX idx_messages_conversation_id ON messages(conversation_id);
CREATE INDEX idx_messages_conv_created ON messages(conversation_id, created_at);

Indexes speed reads but slow writes. Add indexes when query profiling reveals slow queries — not preemptively on every column.

5. Transactional DDL (PostgreSQL Superpower)

BEGIN;
ALTER TABLE conversations ADD COLUMN category VARCHAR(100);
CREATE INDEX idx_conversations_category ON conversations(category);
COMMIT;   -- or ROLLBACK on failure

💡 Exam Tip

PostgreSQL DDL is transactional (unlike MySQL). Wrap related schema changes in BEGIN/COMMIT — if one fails, all roll back. This is a key differentiator the exam tests.

📘 Unit 5

Query Data

10 min

1. JSONB Queries

-- Access JSONB field
SELECT metadata->>'model' AS model_name FROM conversations;

-- Filter on JSONB (containment operator @>)
SELECT * FROM conversations WHERE metadata @> '{"tier": "premium"}'::jsonb;

-- Check JSONB key existence
SELECT * FROM conversations WHERE metadata ? 'user_preferences';

2. Upserts with ON CONFLICT

INSERT INTO conversations (session_id, user_id) VALUES ($1, $2)
ON CONFLICT (session_id)
DO UPDATE SET user_id = EXCLUDED.user_id, started_at = CURRENT_TIMESTAMP;

3. Keyset Pagination (Avoid OFFSET)

OFFSET gets slower as the offset grows. Keyset pagination is O(log n) with the right index:

-- Get next 50 messages after last seen id
SELECT * FROM messages
WHERE conversation_id = $1 AND id > $last_id
ORDER BY id ASC LIMIT 50;

4. INSERT RETURNING (Avoid Extra SELECT)

INSERT INTO messages (conversation_id, role, content)
VALUES ($1, 'user', $2)
RETURNING id, created_at;   -- Get generated values without extra round-trip

⚡ PostgreSQL Master Cheatsheet

PgBouncer port6432 (vs 5432 direct)

Entra token resourcehttps://ossrdbms-aad.database.windows.net/.default

Best SSL modeverify-full (validates CA + hostname)

Flexible metadata typeJSONB (with GIN index support)

Auto-increment PKBIGSERIAL PRIMARY KEY

TimestampsTIMESTAMPTZ (always, not TIMESTAMP)

Tier for PgBouncerGeneral Purpose or Memory Optimized (NOT Burstable)

Transactional DDLWrap ALTER TABLE in BEGIN...COMMIT

Vector search extensionpgvector (CREATE EXTENSION vector)

Pagination (large sets)Keyset (id > last_id) not OFFSET

🧪 Unit 6

Exercise — Conversation History Store

30 min

Create a PostgreSQL server (General Purpose tier)
Enable PgBouncer and verify port 6432
Obtain Entra token and connect with psycopg2
Create conversations and messages tables with JSONB metadata
Insert messages, query with JSONB filter, implement keyset pagination
Wrap an ALTER TABLE + CREATE INDEX in a transaction

✅ Unit 7

Knowledge Check

5 min

Q: Which tier supports PgBouncer? A: General Purpose and Memory Optimized (not Burstable)
Q: SSL mode that validates both CA and hostname? A: verify-full
Q: AI app needs flexible metadata alongside relational data. Data type? A: JSONB
Q: Why keyset pagination instead of OFFSET? A: OFFSET performance degrades as offset grows; keyset is O(log n) with the right index
Q: Entra token resource URL for PostgreSQL? A: https://ossrdbms-aad.database.windows.net/.default

🏁 Unit 8

Summary

2 min

Choose General Purpose or higher for production (PgBouncer access). Use Entra authentication with token-based access. Always enforce verify-full TLS. Design schemas using TIMESTAMPTZ and BIGSERIAL. Use JSONB for flexible AI metadata. Enable PgBouncer on port 6432 for high-concurrency AI services. Wrap schema changes in transactions.

🧠 Memory Tricks

"BuGM" — Burstable (no PgBouncer), General Purpose (PgBouncer ✅), Memory Optimized (PgBouncer ✅). Want Bouncer? Go General or above.

TLS modes order: disable (blocked) → require (encrypt only) → verify-ca (+ CA check) → verify-full (+ hostname check = best)

🏁 Unit 9

Exam Summary Card

2 min

Scenario	Answer
Connection pooling needed	General Purpose or Memory Optimized tier (port 6432)
Secure Entra auth token resource	https://ossrdbms-aad.database.windows.net/.default
Validates CA AND hostname	sslmode=verify-full
Flexible metadata type	JSONB
Vector similarity search	pgvector extension
Large dataset pagination	Keyset (WHERE id > last_id LIMIT N)
Schema change that auto-rollbacks on failure	ALTER TABLE inside BEGIN/COMMIT transaction
UUID primary key	DEFAULT gen_random_uuid()

🐘

Module Cheatsheet

Azure Database for PostgreSQL

25–30% PDF

🔑 Key Facts

PgBouncer tier — General Purpose or Memory Optimized ONLY (not Burstable)
PgBouncer port — 6432 (vs 5432 direct)
Entra token resource — https://ossrdbms-aad.database.windows.net/.default
Best TLS mode — verify-full — validates CA AND hostname
JSONB — Binary JSON with GIN indexing — use for flexible AI metadata
BIGSERIAL — Auto-increment 64-bit PK (use over SERIAL which overflows)
TIMESTAMPTZ — Always over TIMESTAMP — stores UTC, timezone-aware
Transactional DDL — Wrap ALTER TABLE in BEGIN...COMMIT — rolls back on failure

💻 Commands & Patterns

-- Create AI messages table
CREATE TABLE messages (
  id         BIGSERIAL PRIMARY KEY,
  session_id UUID NOT NULL DEFAULT gen_random_uuid(),
  role       VARCHAR(50) CHECK (role IN ('user','assistant')),
  content    TEXT NOT NULL,
  metadata   JSONB DEFAULT '&#123;&#125;'::jsonb,
  created_at TIMESTAMPTZ DEFAULT CURRENT_TIMESTAMP
);
-- Keyset pagination (not OFFSET)
SELECT * FROM messages
WHERE session_id=$1 AND id > $last_id
ORDER BY id ASC LIMIT 50;
-- Transactional DDL
BEGIN;
ALTER TABLE messages ADD COLUMN tokens INT;
CREATE INDEX idx ON messages(tokens);
COMMIT;

Module

Vector Search with pgvector and RAG Patterns

units

Build AI Copilot with PostgreSQL — Microsoft Learn

🎬 Unit 1

Introduction to pgvector

3 min

pgvector is a PostgreSQL extension that adds a vector data type and vector similarity operators. Azure Database for PostgreSQL Flexible Server ships with pgvector pre-installed — enabling RAG (Retrieval-Augmented Generation) directly in your existing Postgres database.

💡 Exam Tip

pgvector exam pillars: 1) CREATE EXTENSION vector 2) vector(1536) column type 3) <=> cosine / <-> L2 / <#> inner product operators 4) HNSW vs IVFFlat indexes 5) HNSW = better recall, IVFFlat = lower memory.

📘 Unit 2

Setup and Store Embeddings

8 min

Enable pgvector and Create Table

-- Enable extension (once per database)
CREATE EXTENSION IF NOT EXISTS vector;

-- Create table with embedding column
CREATE TABLE documents (
    id          SERIAL PRIMARY KEY,
    title       TEXT NOT NULL,
    content     TEXT NOT NULL,
    embedding   vector(1536),  -- matches text-embedding-3-small
    category    TEXT,
    created_at  TIMESTAMPTZ DEFAULT NOW()
);

-- HNSW index for fast approximate search
CREATE INDEX ON documents
    USING hnsw (embedding vector_cosine_ops)
    WITH (m=16, ef_construction=64);

-- Insert with Python
import psycopg2
cur.execute(
    "INSERT INTO documents (title, content, embedding) VALUES (%s, %s, %s)",
    (title, content, embedding)  # embedding = list of floats
)

📘 Unit 3

Similarity Search and RAG

10 min

Vector Search + RAG Pipeline

import psycopg2
import openai

def rag_answer(user_question):
    # 1. Embed the question
    q_emb = oai.embeddings.create(
        input=user_question,
        model="text-embedding-3-small"
    ).data[0].embedding

    # 2. Find top-5 similar docs (cosine distance)
    cur.execute("""
        SELECT content, title,
               1 - (embedding <=> %s::vector) AS score
        FROM documents
        ORDER BY embedding <=> %s::vector
        LIMIT 5
    """, [str(q_emb), str(q_emb)])

    chunks = cur.fetchall()
    context = "\n\n".join(
        f"[{title}]: {content}"
        for content, title, score in chunks
    )

    # 3. Generate answer with context
    response = oai.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content":
                f"Answer using context:\n{context}"},
            {"role": "user", "content": user_question}
        ]
    )
    return response.choices[0].message.content

💡 Exam Tip

<=> = cosine distance (use for text). <-> = L2/Euclidean. 1 - distance = cosine similarity (0–1). Always cast to ::vector when passing from Python.

📘 Unit 4

HNSW vs IVFFlat Indexes

6 min

Index	Recall	Build Speed	Memory	Use When
HNSW	Higher	Slower	Higher	Production, few million rows
IVFFlat	Lower	Faster	Lower	Large datasets, memory constrained

⚠️ Common Gotcha

HNSW can be built on empty table. IVFFlat requires data first (needs to cluster). Exam default: HNSW with vector_cosine_ops for text embeddings.

🏁 Unit 5

Summary

2 min

pgvector: CREATE EXTENSION vector → vector(1536) column → HNSW index with vector_cosine_ops → <=> cosine distance in queries → RAG pipeline: embed question, find top-K, pass context to LLM. Azure PostgreSQL Flexible Server includes pgvector pre-installed. HNSW = better recall; IVFFlat = lower memory. Always use 1 - distance for similarity score.

Develop AI Solutions with Azure Database for PostgreSQL

Build and Query with Azure Database for PostgreSQL

Introduction

Explore Azure Database for PostgreSQL

PostgreSQL Connection Architecture: App → Entra token → PgBouncer (port 6432) → PostgreSQL

Connect to PostgreSQL

Create and Manage Schemas

Query Data

⚡ PostgreSQL Master Cheatsheet

Exercise — Conversation History Store

Knowledge Check

Summary

Exam Summary Card

Azure Database for PostgreSQL

Vector Search with pgvector and RAG Patterns

Introduction to pgvector

Setup and Store Embeddings

Similarity Search and RAG

HNSW vs IVFFlat Indexes

Summary

Quick Quiz

Related Modules — Develop AI Solutions by Using Azure Data Management Services

Frequently Asked Questions