Does the DPDP Act require all AI processing to happen inside India?

Not by default. The DPDP Act 2023 (Section 16) allows cross-border transfer except where the Central Government restricts transfer to specific countries. Sectoral regulators — RBI for payments data, SEBI for certain trading data, IRDAI for insurance records — impose additional localisation on specific data classes. Treat Indian-region hosting as the safe default for personal data and go cross-border only with a documented legal basis and risk assessment.

What's the difference between encryption at rest, in transit, and customer-managed keys?

Encryption in transit protects data moving between systems (TLS 1.2+); at rest protects data on disk. Customer-Managed Encryption Keys (CMEK) means you hold the key material, not the cloud provider — so the provider cannot technically decrypt your data even under subpoena. For AI workloads with sensitive data, CMEK plus HSM-backed keys is increasingly standard for BFSI and healthcare.

Can we use OpenAI or Anthropic direct APIs for Indian personal data?

With controls. Direct APIs typically route to US data centers by default. For DPDP-sensitive workloads, prefer enterprise hosting via Azure OpenAI (India regions), AWS Bedrock Claude (ap-south-1), or Google Vertex AI Gemini (asia-south1). Where direct API is the only option, apply PII redaction and tokenisation before the call, sign a DPA with the provider, and document the transfer basis. SDFs should default to the Indian-region cloud route.

Is there a sovereign cloud option in India for AI workloads?

Yes. Domestic providers (e.g., Yotta, Tata Communications, CtrlS, Nxtra by Airtel) offer AI-capable sovereign cloud. Hyperscalers with MeitY-empanelled and CERT-In-aligned offerings in Indian regions are also used by regulated firms. For the most sensitive workloads (defence-adjacent, critical-information-infrastructure), government-grade sovereign cloud (e.g., MeghRaj/NIC) is the path.

What should a PII-redaction layer actually do?

A production-grade redaction layer performs: (1) entity-recognition for names, addresses, Aadhaar, PAN, mobile numbers, email, medical IDs; (2) tokenisation with a reversible mapping stored separately under strict access; (3) output scanning to block re-emergence of PII in responses; (4) structured logging of every redaction decision for audit. It sits between your application and the LLM, never bypassed.

AI Security & Data Residency India 2026 (DPDP) — India Guide 2026

AI Security & Data Residency India 2026 (DPDP)

DPDP Act 2023 + data localisation for AI workloads, sovereign cloud, encryption, CISO matrix

AI workloads amplify every existing data-security worry and add a few new ones. A single chat completion may pull from a vector database, hit an LLM hosted thousands of kilometres away, and return content that lands in a user's email. Each hop is a data-residency question, an encryption question, an access-control question, and an auditability question.

This guide lays out a practical security and data-residency stack for AI workloads in India in 2026. It anchors on the DPDP Act 2023, layers in sectoral localisation rules, and produces a controls matrix that is consumable by a CISO, a DPO, and an auditor from the same document.

What You'll Learn

DPDP Act 2023 sections that apply specifically to AI
India cloud-region options for AI workloads
Sovereign cloud paths and when to use them
Encryption patterns — at rest, in transit, CMEK
PII-redaction and tokenisation architectures
Controls matrix mapping DPDP → NIST AI RMF → ISO/IEC 42001

DPDP Act 2023 Sections That Matter for AI

The DPDP Act 2023 (No. 22 of 2023) is sector-agnostic but has specific implications when the processing is done by AI systems.

| DPDP section / theme | Implication for AI systems | |---|---| | Definitions (Section 2) — Data Fiduciary, Data Principal, Personal Data | Every AI system processing personal data makes you a Data Fiduciary; the LLM provider may be a Data Processor | | Consent (Section 6) | Consent notice must state AI processing specifically — generic ToS language is insufficient | | Notice (Section 5) | Itemise the categories of personal data, purpose, and retention — includes prompt and output logs | | Grounds beyond consent (Section 7) — legitimate uses | Narrow list; cannot be stretched to include "we wanted to build an AI" | | Data Fiduciary obligations (Section 8) | Accuracy, retention minimisation, security safeguards, breach notification — all must extend to AI logs | | Significant Data Fiduciary (Section 10) | DPO based in India, independent data auditor, DPIA for new AI/ML models | | Rights — correction and erasure (Sections 11–12) | Drives preference for RAG over fine-tuning; prompt logs must be erasable | | Cross-border transfer (Section 16) | Permitted except where Central Government restricts — document the transfer basis | | Penalties (Chapter VIII) | Up to ₹250 crore per specified breach category; AI is not exempt |

For the HIPAA, PCI-DSS, and SOC2 overlay sitting alongside DPDP, see the enterprise AI compliance guide.

India Cloud-Region Options for AI Workloads

All three hyperscalers now offer Indian regions with leading AI models available locally. Availability changes quarterly — verify at procurement time.

| Provider | India regions | Notes for AI workloads | |---|---|---| | AWS | ap-south-1 Mumbai, ap-south-2 Hyderabad | Bedrock with Claude, Llama, Titan; Knowledge Bases in region | | Google Cloud | asia-south1 Mumbai, asia-south2 Delhi | Vertex AI with Gemini; Model Garden; BigQuery-native RAG | | Azure | Central India (Pune), South India (Chennai), West India (Mumbai) | Azure OpenAI with GPT and o-series; Azure AI Foundry |

For platform comparison at the feature and pricing level, see VertexAI vs Bedrock vs Azure. For hands-on platform walkthroughs, see the AWS Bedrock setup guide and Google Vertex AI setup guide.

When Indian-Region Hosting Is Not Enough

Use cases falling under sectoral localisation rules (RBI payments data, certain SEBI categories) — verify storage and processing both inside India.
Sensitive-personal-data categories where the Central Government may issue future transfer restrictions — design now for Indian-only processing.
Critical Information Infrastructure as notified under the IT Act — sovereign cloud path.

Sovereign and Domestic Cloud Paths

Sovereign cloud is the answer when a workload cannot tolerate any foreign jurisdictional exposure.

Domestic cloud providers — Yotta, Tata Communications, CtrlS, Nxtra by Airtel, Sify, ESDS, and others operate MeitY-empanelled cloud infrastructure with AI-capable offerings. Foundation-model access is typically through partnerships and deployed copies, not live Frontier-model APIs.
Hyperscaler sovereign offerings — AWS, Google, and Azure each provide sovereign-cloud variants with controls aligned to Indian data-protection and cybersecurity requirements; adoption in BFSI is increasing.
Government cloud (MeghRaj) — for government departments and some CII workloads, the NIC-operated cloud provides the highest sovereignty.

When choosing a sovereign path, document the trade-offs: slower model refresh cycles, potentially older foundation models, and smaller tooling ecosystems compared to global regions of the hyperscalers.

Encryption Patterns for AI Workloads

In Transit

TLS 1.2 minimum; TLS 1.3 preferred. mTLS for service-to-service calls where feasible. Certificate pinning for critical clients calling LLM APIs.

At Rest

AES-256 at the storage layer. Apply to:

Raw prompt and output logs
Vector store embeddings (even embeddings can leak information)
Fine-tuning datasets
Model artefacts

Customer-Managed Encryption Keys (CMEK) and HYOK

CMEK means the customer (you) manages the key-encryption-key used by the cloud provider to encrypt data-encryption-keys. For very sensitive workloads, Hold-Your-Own-Key (HYOK) keeps key material in an HSM entirely under your control.

For BFSI and healthcare AI workloads, CMEK with HSM backing has moved from "advanced" to "expected" in 2026.

Secrets and API Keys

LLM API keys are credentials with material blast radius. Store in a secrets manager (AWS Secrets Manager, Azure Key Vault, Google Secret Manager, or HashiCorp Vault). Rotate on a schedule. Never embed in prompt templates.

PII-Redaction and Tokenisation Architecture

A reference architecture for AI processing of personal data:

User / App Request
      │
      ▼
[1] Input Scanner  — detect PII / PHI / financial identifiers
      │
      ▼
[2] Tokeniser       — replace with stable tokens; store reversal map separately
      │
      ▼
[3] Prompt Builder  — apply system prompt version, guardrails
      │
      ▼
[4] LLM API         — in India region where personal data is involved
      │
      ▼
[5] Output Scanner  — block re-emergence of PII, hallucinated identifiers
      │
      ▼
[6] Detokeniser     — re-map tokens for the authorised response
      │
      ▼
Audit Log — every step recorded with hashes, verdict, user context

Build this as a shared enterprise service, not per application. One well-maintained redaction layer serves many use cases; per-app redaction drifts.

For detailed hardening, see the enterprise AI security guardrails guide and secure AI prompting in regulated industries.

When to Keep Data Fully On-Premise

Some workloads will never be cross-border-safe regardless of controls:

Clinical decision support on full patient records without de-identification.
Defence-adjacent or export-controlled technology documentation.
Boards and senior-leadership confidential material where even tokenised exposure is risk-unacceptable.

For these, on-premise model inference is the answer. Open-weight models (Ollama, Llama, Mistral, Qwen) running on your infrastructure keep data entirely inside your control boundary. Accept lower capability than frontier models for the trade-off of full sovereignty.

For RAG patterns where proprietary knowledge stays in-house while the LLM remains hosted, see RAG for beginners.

SIG, PPA and Sub-Processor Hygiene

Whatever your architecture, vendor artefacts matter.

Standard Information Gathering (SIG) questionnaire — a widely used vendor assessment instrument; AI vendors should respond for their service.
Privacy Protection Agreement (PPA) — your data-processing agreement for AI vendors, with DPDP Act clauses.
Sub-processor list — every layer from application-layer vendor to foundation-model provider to cloud region must be named.
Training-data exclusion — contractual commitment that your prompts and outputs are not used to train the provider's models.
Breach notification clock — hours, not days.
Audit right — right to audit in person or through a contracted auditor.

Controls Matrix — DPDP × NIST AI RMF × ISO/IEC 42001

A single matrix serves three audiences.

| Control area | DPDP Act anchor | NIST AI RMF anchor | ISO/IEC 42001 anchor | |---|---|---|---| | Named AI policy and owner | Section 8 safeguards | Govern | Clause 5 leadership | | Data inventory incl. AI | Section 8 retention | Map | Clause 4 context | | Consent & lawful basis | Sections 5–7 | Map | Clause 6 risk & impact | | DPIA for new AI | Section 10 SDF duty | Map | Annex A impact assessment | | Encryption at rest & transit | Section 8 safeguards | Manage | Annex A technical controls | | Access control & SoD | Section 8 safeguards | Govern | Annex A operational controls | | PII redaction & tokenisation | Section 8 safeguards | Measure + Manage | Annex A data quality | | Prompt and audit logging | Section 8 accountability | Measure | Annex A monitoring | | Incident response | Section 8 breach notification | Manage | Clause 10 improvement | | Right to correction / erasure | Sections 11–12 | Govern | Annex A data principal rights | | Cross-border transfer controls | Section 16 | Govern | Annex A third-party | | Vendor & sub-processor management | Section 8 safeguards | Govern | Annex A provider obligations |

One row per control; map each to the underlying evidence artefact (policy, runbook, ticket, log query).

30-Day CISO Playbook

Days 1–5 — discovery: every LLM API key, every vector store, every fine-tuning dataset across the firm.
Days 6–10 — classify data flowing to AI; block high-risk flows at egress.
Days 11–15 — stand up secrets manager policy and rotate all AI API keys.
Days 16–20 — deploy centralised PII redaction layer as a shared service.
Days 21–25 — move personal-data AI workloads to Indian cloud regions where not already.
Days 26–30 — publish the controls matrix and schedule the first independent audit.

Pair this playbook with the AI governance framework on the governance side and AI compliance for RBI / SEBI / IRDAI for sectoral overlays.

Key Takeaways

DPDP Act 2023 is the floor; sectoral rules are the walls; AI workloads touch both.
Default personal-data AI workloads to Indian cloud regions; go cross-border only with documented basis.
Sovereign and domestic cloud is a real option for the most sensitive workloads.
CMEK with HSM backing has moved from advanced to expected in BFSI and healthcare.
A centralised PII-redaction service is the single most leveraged security control for AI.
Open-weight on-premise models solve the workloads that must never leave your infrastructure.
One controls matrix mapping DPDP → NIST AI RMF → ISO/IEC 42001 serves three audiences from one evidence base.

Official Resources

DPDP Act 2023 (MeitY PDF) — full statutory text
MeitY Data Protection Framework page — rules and notifications
RBI IT Governance Master Directions — BFSI technology governance
SEBI CSCRF circular — cyber resilience requirements for SEBI REs
IRDAI Guidelines page — insurance-sector technology expectations
NIST AI RMF 1.0 (PDF)
ISO/IEC 42001:2023

Next Steps

Build the governance layer with the AI governance framework guide
Apply sectoral controls using the RBI/SEBI/IRDAI compliance playbook
Harden the runtime with enterprise AI security guardrails
Select vendors with the AI vendor selection playbook for CIOs
Explore on-premise options with the Ollama local LLM guide

Last updated: April 19, 2026

Community Questions

No questions yet. Be the first to ask!

Share this guide

r/developersIndia r/india r/ChatGPT

AI Security & Data Residency India 2026 (DPDP)

DPDP Act 2023 + data localisation for AI workloads, sovereign cloud, encryption, CISO matrix

What You'll Learn

DPDP Act 2023 sections that apply specifically to AI
India cloud-region options for AI workloads
Sovereign cloud paths and when to use them
Encryption patterns — at rest, in transit, CMEK
PII-redaction and tokenisation architectures
Controls matrix mapping DPDP → NIST AI RMF → ISO/IEC 42001

DPDP Act 2023 Sections That Matter for AI

The DPDP Act 2023 (No. 22 of 2023) is sector-agnostic but has specific implications when the processing is done by AI systems.

For the HIPAA, PCI-DSS, and SOC2 overlay sitting alongside DPDP, see the enterprise AI compliance guide.

India Cloud-Region Options for AI Workloads

All three hyperscalers now offer Indian regions with leading AI models available locally. Availability changes quarterly — verify at procurement time.

For platform comparison at the feature and pricing level, see VertexAI vs Bedrock vs Azure. For hands-on platform walkthroughs, see the AWS Bedrock setup guide and Google Vertex AI setup guide.

When Indian-Region Hosting Is Not Enough

Use cases falling under sectoral localisation rules (RBI payments data, certain SEBI categories) — verify storage and processing both inside India.
Sensitive-personal-data categories where the Central Government may issue future transfer restrictions — design now for Indian-only processing.
Critical Information Infrastructure as notified under the IT Act — sovereign cloud path.

Sovereign and Domestic Cloud Paths

Sovereign cloud is the answer when a workload cannot tolerate any foreign jurisdictional exposure.

Domestic cloud providers — Yotta, Tata Communications, CtrlS, Nxtra by Airtel, Sify, ESDS, and others operate MeitY-empanelled cloud infrastructure with AI-capable offerings. Foundation-model access is typically through partnerships and deployed copies, not live Frontier-model APIs.
Hyperscaler sovereign offerings — AWS, Google, and Azure each provide sovereign-cloud variants with controls aligned to Indian data-protection and cybersecurity requirements; adoption in BFSI is increasing.
Government cloud (MeghRaj) — for government departments and some CII workloads, the NIC-operated cloud provides the highest sovereignty.

Encryption Patterns for AI Workloads

In Transit

TLS 1.2 minimum; TLS 1.3 preferred. mTLS for service-to-service calls where feasible. Certificate pinning for critical clients calling LLM APIs.

At Rest

AES-256 at the storage layer. Apply to:

Raw prompt and output logs
Vector store embeddings (even embeddings can leak information)
Fine-tuning datasets
Model artefacts

Customer-Managed Encryption Keys (CMEK) and HYOK

For BFSI and healthcare AI workloads, CMEK with HSM backing has moved from "advanced" to "expected" in 2026.

Secrets and API Keys

PII-Redaction and Tokenisation Architecture

A reference architecture for AI processing of personal data:

User / App Request
      │
      ▼
[1] Input Scanner  — detect PII / PHI / financial identifiers
      │
      ▼
[2] Tokeniser       — replace with stable tokens; store reversal map separately
      │
      ▼
[3] Prompt Builder  — apply system prompt version, guardrails
      │
      ▼
[4] LLM API         — in India region where personal data is involved
      │
      ▼
[5] Output Scanner  — block re-emergence of PII, hallucinated identifiers
      │
      ▼
[6] Detokeniser     — re-map tokens for the authorised response
      │
      ▼
Audit Log — every step recorded with hashes, verdict, user context

Build this as a shared enterprise service, not per application. One well-maintained redaction layer serves many use cases; per-app redaction drifts.

For detailed hardening, see the enterprise AI security guardrails guide and secure AI prompting in regulated industries.

When to Keep Data Fully On-Premise

Some workloads will never be cross-border-safe regardless of controls:

Clinical decision support on full patient records without de-identification.
Defence-adjacent or export-controlled technology documentation.
Boards and senior-leadership confidential material where even tokenised exposure is risk-unacceptable.

For RAG patterns where proprietary knowledge stays in-house while the LLM remains hosted, see RAG for beginners.

SIG, PPA and Sub-Processor Hygiene

Whatever your architecture, vendor artefacts matter.

Standard Information Gathering (SIG) questionnaire — a widely used vendor assessment instrument; AI vendors should respond for their service.
Privacy Protection Agreement (PPA) — your data-processing agreement for AI vendors, with DPDP Act clauses.
Sub-processor list — every layer from application-layer vendor to foundation-model provider to cloud region must be named.
Training-data exclusion — contractual commitment that your prompts and outputs are not used to train the provider's models.
Breach notification clock — hours, not days.
Audit right — right to audit in person or through a contracted auditor.

Controls Matrix — DPDP × NIST AI RMF × ISO/IEC 42001

A single matrix serves three audiences.

One row per control; map each to the underlying evidence artefact (policy, runbook, ticket, log query).

30-Day CISO Playbook

Days 1–5 — discovery: every LLM API key, every vector store, every fine-tuning dataset across the firm.
Days 6–10 — classify data flowing to AI; block high-risk flows at egress.
Days 11–15 — stand up secrets manager policy and rotate all AI API keys.
Days 16–20 — deploy centralised PII redaction layer as a shared service.
Days 21–25 — move personal-data AI workloads to Indian cloud regions where not already.
Days 26–30 — publish the controls matrix and schedule the first independent audit.

Pair this playbook with the AI governance framework on the governance side and AI compliance for RBI / SEBI / IRDAI for sectoral overlays.

Key Takeaways

DPDP Act 2023 is the floor; sectoral rules are the walls; AI workloads touch both.
Default personal-data AI workloads to Indian cloud regions; go cross-border only with documented basis.
Sovereign and domestic cloud is a real option for the most sensitive workloads.
CMEK with HSM backing has moved from advanced to expected in BFSI and healthcare.
A centralised PII-redaction service is the single most leveraged security control for AI.
Open-weight on-premise models solve the workloads that must never leave your infrastructure.
One controls matrix mapping DPDP → NIST AI RMF → ISO/IEC 42001 serves three audiences from one evidence base.

Official Resources

DPDP Act 2023 (MeitY PDF) — full statutory text
MeitY Data Protection Framework page — rules and notifications
RBI IT Governance Master Directions — BFSI technology governance
SEBI CSCRF circular — cyber resilience requirements for SEBI REs
IRDAI Guidelines page — insurance-sector technology expectations
NIST AI RMF 1.0 (PDF)
ISO/IEC 42001:2023

Next Steps

Build the governance layer with the AI governance framework guide
Apply sectoral controls using the RBI/SEBI/IRDAI compliance playbook
Harden the runtime with enterprise AI security guardrails
Select vendors with the AI vendor selection playbook for CIOs
Explore on-premise options with the Ollama local LLM guide

Last updated: April 19, 2026

Community Questions

No questions yet. Be the first to ask!

Share this guide

r/developersIndia r/india r/ChatGPT

What You'll Learn

DPDP Act 2023 Sections That Matter for AI

India Cloud-Region Options for AI Workloads

When Indian-Region Hosting Is Not Enough

Sovereign and Domestic Cloud Paths

Encryption Patterns for AI Workloads

In Transit

At Rest

Customer-Managed Encryption Keys (CMEK) and HYOK

Secrets and API Keys

PII-Redaction and Tokenisation Architecture

When to Keep Data Fully On-Premise

SIG, PPA and Sub-Processor Hygiene

Controls Matrix — DPDP × NIST AI RMF × ISO/IEC 42001

30-Day CISO Playbook

Key Takeaways

Official Resources

Next Steps

Community Questions

Share this guide

More guides in Enterprise AI

V.A.U.L.T. — AI Transformation Framework

AI Compliance — HIPAA, PCI-DSS & SOC2

VertexAI vs Bedrock vs Azure AI — Comparison

You Might Also Like

AI for Security Engineers 2026: SAST, Threat Modeling, IaC Scanning

AI for DevOps — CI/CD, Infra & Monitoring

The Ultimate AI Coding Workflow 2026

What You'll Learn

DPDP Act 2023 Sections That Matter for AI

India Cloud-Region Options for AI Workloads

When Indian-Region Hosting Is Not Enough

Sovereign and Domestic Cloud Paths

Encryption Patterns for AI Workloads

In Transit

At Rest

Customer-Managed Encryption Keys (CMEK) and HYOK

Secrets and API Keys

PII-Redaction and Tokenisation Architecture

When to Keep Data Fully On-Premise

SIG, PPA and Sub-Processor Hygiene

Controls Matrix — DPDP × NIST AI RMF × ISO/IEC 42001

30-Day CISO Playbook

Key Takeaways

Official Resources

Next Steps

Community Questions

Share this guide

More guides in Enterprise AI

V.A.U.L.T. — AI Transformation Framework

AI Compliance — HIPAA, PCI-DSS & SOC2

VertexAI vs Bedrock vs Azure AI — Comparison

You Might Also Like

AI for Security Engineers 2026: SAST, Threat Modeling, IaC Scanning

AI for DevOps — CI/CD, Infra & Monitoring

The Ultimate AI Coding Workflow 2026