Manish Gaud

— CASE STUDY / AI PLATFORM

A vertical AI suite for Indian businesses, designed before built.

AI Core Platform is the vertical AI suite I'm architecting for Indian businesses — one core, four modules (Docusense, DataSense, ComplianceSense, TaxSense), 13 platform capabilities, 9 architectural principles, 15 ADRs. Phase 0: every load-bearing decision written before any production code is committed.

Founder & lead architect · 2026–present · Phase 0 · Architecture-stage

— Section B

Context

Generic AI loses to ChatGPT itself. The buyer that wants a tool to understand Indian contracts, Indian tenders, MCA filings, GST notices — in Hindi or Gujarati or Marathi alongside English — does not get well served by a US horizontal that defaults to English and stores data in Virginia. Mid-market Indian businesses with document-heavy operations (law firms, consulting firms, real-estate developers, procurement teams, compliance teams) need vertical AI built around the local surface: Indian data residency, INR billing with GST, named-user pricing, audit logs, and integrations with the Indian document stack.

AI Core Platform is the bet that the 80% of every AI SaaS is the same — auth, billing, file ingest, vector retrieval, structured extraction, AI orchestration, prompt management, chat UI, usage metering — and building that 80% once is the only way a solo founder ships two products in six months and six in two years. On top of the core sit four modules: Docusense for documents, DataSense for spreadsheets, ComplianceSense for regulatory monitoring, TaxSense for Indian tax forms. One workspace, many modules. The Microsoft 365 playbook, scaled down for a vertical.

This case study covers architecture-stage work. Phase 0: nine architectural principles, thirteen platform capabilities, fifteen ADRs, four module briefs, a six-month roadmap, and a Phase 0 setup checklist (Razorpay KYC, Supabase Pro in ap-south-1, Lightsail Mumbai, Cloudflare Pages, Sentry, Document AI). No production code is committed yet. The work is the design.

Most software fails on decisions made in the first month, not the last. The job of Phase 0 is to make those decisions before the first month.

— Section C

My role

I’m the founder, sole architect, and product designer. I wrote every ADR, every principle, every capability contract, every module brief. The architecture decisions, the vendor choices, the rollout sequence, the India-first positioning: all mine. Phase 1 implementation begins after Phase 0 setup closes — accelerated with modern AI coding tools (Cursor, Claude) under tight direction. Every line that ships will be reviewed and owned by me.

— Section D

The hard parts

Five architectural decisions worth surfacing. Each one is the kind of choice people regret not making earlier — and the entire premise of this case study is that on this product, those decisions were made first.

D1

Workspace is the tenant, user is a member

Enterprise buyers buy organizations, not seats. Self-serve individuals are still organizations of one. The temptation in early SaaS is to make user_id the primary key on every domain table — and then to spend a year migrating to a proper tenant primitive when the first enterprise contract lands. AI Core Platform refuses that path.

Every domain object — document, chunk, embedding, run, prompt version, invoice, audit log — belongs to a workspace_id. A user is a member of one or more workspaces. Row-level security and application-level scoping both use workspace_id, never user_id, as the primary tenant key. Personal workspaces are modeled as a workspace of one with a single owner. Self-serve and enterprise share the same code path; the same RLS policy serves a freelancer and a Fortune-500 procurement team.

D2

One platform, four modules, thirteen shared capabilities

Four AI products built independently is four codebases, four billing surfaces, four chat UIs, four ingest pipelines, four prompt frameworks, four observability stacks. Four times the maintenance, four times the deploy risk, and a velocity curve that goes flat the day you start the third product.

AI Core Platform builds the platform once. Thirteen platform capabilities — identity and workspace, billing and entitlements, file ingest (structured and unstructured), structured extraction, vector retrieval, structured store with execution sandbox, scheduled external ingestion, AI orchestration, prompt framework, workflow engine, chat UI shell, notifications and usage and audit — are implemented in the core. Each capability must be used by at least two modules to earn its place in the core. The four modules (Docusense, DataSense, ComplianceSense, TaxSense) are plugins on top, each contributing module-specific domain knowledge: Indian contract law, structured tax forms, regulatory scrapers, spreadsheet semantics. The platform pays for itself the day the second module ships.

D3

Module contracts, no DB reach-across

A modular monolith without discipline becomes a tangled monolith in three months. Cross-module imports of model classes, cross-module SQL joins, cross-module direct service calls — each one is a load-bearing wire that prevents extraction later, and the cumulative effect is a monolith that cannot be split even when the team is ready to.

AI Core Platform’s third principle is structural: a module = entitlement + surfaces + contracts, never DB reach-across. Modules may not read or write another module’s tables directly. Cross-module needs go through the core (both Docusense and ComplianceSense use Vector Retrieval — the core service, with module="docusense" in queries). Module tables live in module-prefixed schemas. A static check on pull requests rejects cross-module imports of model classes. The result: any module can be lifted into its own service in year two if scale demands it, without rewriting the rest.

D4

Ports-and-adapters everywhere

Every vendor choice an early-stage AI platform makes is a guess. The right embedding provider in 2026 may be the wrong one in 2027. The right vector store, the right LLM router, the right OCR engine, the right workflow queue — these are all guesses that benefit from being reversible.

AI Core Platform implements every core capability as a typed Python interface (port) with one or more adapters: local for development, hosted for production, mock for tests. Picked pgvector for Phase 1? Swap to Qdrant in year two by writing one adapter, not editing 200 callsites. Picked OpenAI for the launch? Add an Anthropic adapter when the price/latency curve shifts. Fifteen ADRs document the choice and the reasoning at the time it was made; the architecture survives the choice changing.

D5

India-first, multi-region capable

Indian data residency is not a feature toggle. It is a default. If the architecture assumes US-region storage and then adds India later, every workspace ID, every signed URL, every cross-region query has to be retrofitted. AI Core Platform inverts the default: ap-south-1 (Mumbai) is the deployment region, data residency is a workspace plan attribute, and multi-region routing is deferred but not architecturally impossible.

Workspace IDs are designed to carry region prefixes when multi-region lands. Storage buckets are per-region from day one. Billing is INR-first with GST invoices and Razorpay; USD is a secondary path for non-Indian customers. The Hindi, Gujarati, Marathi, and Tamil language paths are first-class in the ingest, embedding, and chat flows — not English with a translation layer bolted on. The wedge against US horizontals is structural, not marketing.

— Section E

Architecture

CORE PLATFORM  13 SHARED CAPABILITIES Identity & Workspace � Billing & Entitlements � File Ingest � Vector Retrieval � Structured Extraction AI Orchestration � Prompt Framework � Workflow Engine � Chat UI Shell � Usage & Audit each capability used by e 2 modules to earn its place in the core Docusense document intelligence launch module DataSense spreadsheet intelligence deferred to post-MVP ComplianceSense regulatory monitoring launch module TaxSense Indian tax forms deferred to post-MVP Modules communicate with the core via typed contracts  no direct DB reach-across between modules
One platform core delivering thirteen capabilities; four vertical modules consuming them via contracts. Docusense and ComplianceSense are the launch pair.
  • Docusense

    first candidate for extraction

  • DataSense

  • ComplianceSense

    first candidate for extraction

  • TaxSense

Four modules planned. Launch pair is Docusense and ComplianceSense; DataSense and TaxSense follow.

— Section F

Stack

Backend
FastAPI · Python 3.12 · uv workspaces · Postgres · pgvector · Redis-backed queues · modular monolith (one process, multiple modules)
Frontend
React · TypeScript · Vite · TanStack Router · shadcn/ui + Tailwind · auto-generated TS SDK from FastAPI OpenAPI
Marketing
Astro (suite umbrella + per-module marketing sites)
Infrastructure
AWS Lightsail Mumbai (backend) · Cloudflare Pages (frontend) · Supabase ap-south-1 (auth + Postgres) · Cloudflare R2 (media) · Sentry + Better Stack (observability) · Razorpay (INR + GST) · Google Document AI (OCR)

— Section G

What shipped

Phase 0 work shipped — the design itself — with production implementation beginning after Phase 0 setup closes. The case study earns its place by what was written, not what was deployed.

  • — Vision document (v0.3): the bet, the wedge, the suite, four module briefs, 13 platform capabilities, out-of-scope list.
  • — Architecture overview (v0.2): nine load-bearing principles, the tenancy model, module contracts, service topology, isolation rules.
  • — Fifteen ADRs locked: repo layout, backend framework, frontend framework, identity provider, vector store, structured store, workflow queue, prompt management, SSO strategy, billing, observability, file ingest pipeline, LLM provider strategy, deployment platform, OCR engine.
  • — Six-month roadmap published; Phase 0 setup checklist defined.
  • — Launch pair locked: Docusense + ComplianceSense. DataSense and TaxSense designed but deferred to post-MVP.
  • — Phase 0 setup in progress: Razorpay KYC, Supabase Pro ap-south-1, Lightsail Mumbai, Cloudflare DNS + Pages, Sentry, Better Stack, Google Cloud + Document AI, deploy pipeline.
  • — Monorepo scaffolded (pnpm workspaces + uv workspaces), Caddyfile for the deploy edge, docker-compose for local development.

Not yet shipped: any production code surface. Phase 1 begins when Phase 0 setup is signed off. The architecture is locked; the code is next.

Designing an AI platform with this shape — multi-module, vertical, India-first, with every load-bearing decision written down before the first deploy?

Let’s talk.

Book a call →