Portfolio - The Cloud Codex

Selected AI engineering work — production AI platform, customer-facing pursuit microsite pattern, token-efficient Claude Code corpus design, and supporting primitives.

Sanitized for public publication. Customer names, internal product names, employee names, and specific deal amounts removed. Patterns and methods preserved.

Public résumé: /michael-tuszynski/ · Blog: The Cloud Codex

What I build

Production AI systems for engineering and revenue teams — focusing on versioned, auditable platforms with guardrails, evaluations, and real adoption rather than demos. Current platform work codifies a Fortune-1000 professional services org's pursuit lifecycle as an internal AI platform: a Claude Code plugin marketplace orchestrating Salesforce, Microsoft 365, OneDrive, Anthropic Claude, and AWS Bedrock.

AI engineering platform (in production, 9+ months)

An AI-native operating system for enterprise presales pursuits covering discovery, qualification, statement-of-work lifecycle, risk review, and engagement handoff on a single context store.

Scale at last cut:

11 plugins, 60+ slash commands, 562 passing tests
~51K LOC TypeScript
Multi-provider model routing (Anthropic Claude primary, AWS Bedrock fallback) with env-var toggle
Prompt-contract framework with versioned input/output schemas
Evidence-citation guardrails: High/Medium/Low confidence + dated source URLs required on every claim
Session-start hooks hot-loading operational knowledge from synced channels
Prompt caching for shared guardrails and knowledge base
Per-command rule contracts; provider switching via environment variable

Outcomes (anonymized):

Compressed manual artifact production hours into command-driven workflows
Enabled $0.5M+ and $1M+ enterprise engagements to close on platform-authored statements of work with iterative revisions
Partner-funded engagement program added co-funding to a pursuit
Productized AI Well-Architected workshop at small-five-figure list price; first paying customer in flight

The pattern: start shipping custom for one customer, then extract patterns into a framework for ten field engineers.

Customer-facing pursuit microsite pattern

Built a customer-facing pursuit microsite in days for an enterprise AI agent engagement in K-12 EdTech using Claude-on-platform. The microsite anchored follow-up conversations with the client's CEO and hyperscaler co-sell partner; the pursuit moved from ideation workshop to verbal Phase-1 commitment in three weeks.

Pattern observations:

AI-listener-agent architecture on AWS Bedrock + ECS Fargate behind WebSocket API Gateway, with OpenSearch Serverless for RAG and Transcribe/Polly for voice
POC built live in ideation workshop (single-file Flask + Anthropic, ~3,700 lines, eight views) served as conversation piece pulling stakeholders into architecture discussions faster than slides
Microsite + POC + workshop collapsed what used to be three serial deliverables into parallel motion

The artifact pattern is portable: AI pursuit microsites are cheap to build, land differently than decks, and serve as persistent reference points through Phase-1/Phase-2 sequencing.

Token-efficient Claude Code corpus design

A reproducible methodology for keeping a multi-plugin Claude Code corpus lean. Took a production corpus from 221K tokens → 179K tokens (19% reduction) without removing procedural steps or contract terms.

The numbers:

Pool	Before	After	Saved	%
Commands (per invocation)	146,105	111,140	−34,965	−23.9%
Rules (auto-load every session)	18,375	14,478	−3,897	−21.2%
Skills (per invocation)	56,620	53,815	−2,805	−5.0%
Total	221,100	179,433	−41,667	−18.8%

Token counts via tiktoken / cl100k_base — close proxy for Anthropic tokenization.

Why it matters:

Context-window headroom on every command
Faster first-token latency
Cleaner prompts drive better instruction adherence
Linear scaling — doubled users means doubled savings

Seven patterns that drove the savings:

Rules dedup is the cheapest 4K tokens you'll find. Multiple plugins carried byte-identical context files. Collapsing into a shared core/rules/ directory saved ~3.9K tokens loading in every session.
The top decile of commands held 42% of the command corpus. Token distribution is heavily right-skewed. Sort by tokens descending and trim the top decile.
Pipeline-banner repetition is invisible but expensive. Step counters like > **Step 3 of 5** repeated at the top AND inside every subsection. Once at the top is enough. Saved ~600–1,200 tokens per affected command.
Auto-resolve fallback boilerplate copy-pastes faster than it earns. Six-bullet "if exactly one... if zero, stop and ask... if multiple, stop and ask..." blocks collapsed to one sentence. Saved ~400 tokens per command.
Multi-format output templates with 90% overlap should be one base + deltas. If "JSON output," "markdown output," and "console output" list the same 12 fields, write one structured template and note format-specific variants as deltas.
Persona reload reminders inside commands are redundant. Personas load once at session start. Reminders inside individual commands are dead weight.
Skills are different — be careful what you trim. Framework specs with load-bearing prose and code (helper-class definitions, PPTX coordinates with EMU values) shouldn't be touched. Skills accounted for only 5% of savings.

Best practices for new commands:

Default to terse. Most commands need 1.5K–2.5K tokens, not 5K+.
Don't restate guardrails. They live in core/rules/. Reference them.
Don't restate path resolution. The contract is in core/rules/.
Auto-resolve fallback is one sentence.
Multi-format outputs go in one template with conditional sections.
Banner repetitions: once at the top, never repeat.
Procedural specifics, code, and contract terms stay verbatim.

Methodology (reproducible):

Baseline the corpus. A tools/token-baseline.py walks plugins/*/{commands,rules,skills}/*.md, counts tokens with tiktoken, emits CSV + XLSX with per-file detail, totals by type and plugin, top-20 sheet.
Identify the top offenders. Sort by tokens descending. Top decile concentrates the savings.
Audit rules for byte-identical duplication. An md5 across plugin rule files surfaces collapse candidates.
Trim each targeted file. Cut redundant pre-flight prose, repeated path-resolution boilerplate, restated guardrails, banner repetitions, verbose multi-format templates. Keep procedural specifics, code, contract terms.
Re-baseline after each batch. Label snapshots (baseline, after-dedup, after-top7, final). Every snapshot is auditable.

"Lean prompts compound. Cluttered prompts compound differently."

Run quarterly, or any time the corpus grows by >10% without functional additions.

Selected AI engineering primitives

Cross-platform OneDrive path resolver — disambiguates personal vs. organizational mount paths across macOS and Windows; survives the OneDrive rename/move dance that breaks most tooling.
Prompt-contract framework — versioned input/output schemas per command, with contract violations surfacing in the developer console rather than degrading silently.
Evidence-citation + confidence-disclosure guardrails — every claim in generated output must cite a dated source URL and a High/Medium/Low confidence rating; outputs refuse to generate if they can't satisfy the contract.
Session-start hook that hot-loads operational knowledge — institutional memory becomes ambient without requiring a plugin release; the ops team can update guidance in a synced channel and the next session picks it up.
Multi-provider AI infrastructure — Anthropic Claude (primary) with AWS Bedrock fallback; prompt caching for shared guardrails and knowledge base; per-command rule contracts; provider switching via env-var toggle.
:council multi-agent advisory pattern — pressure-tests recommendations through a 5-advisor council before reaching the user; surfaces dissent that single-agent flows hide.
Document generation pipeline — branded Microsoft Word output via AI-augmented markdown rendering; preserves enterprise template fidelity while generating from structured content.

Background

25 years in engineering leadership and platform work. Former CTO of a streaming platform (20+ person team, $4MM+ budget, 99.98% uptime, 100% YoY user growth, monolith-to-microservices migration). Six years as Senior Solutions Architect at AWS — co-authored theECS Workshop (canonical hands-on container workshop), featured speaker on theAWS re:Think Podcaston cost optimization.

UC Berkeley Executive Education — Professional Certificate in Machine Learning and Artificial Intelligence (2025). Harvard Data Science Review — Agentic AI Intensive (December 2025). AWS Certified Solutions Architect Professional + DevOps Engineer Professional.

Personal AI lab — NEXUS

Multi-machine personal R&D workspace prototyping patterns for client work. NAS + Mac-mini + laptop topology with ~7 production services (TypeScript/Node), 18+ LaunchAgents, a custom Slack bot with auto-discovered slash commands, semantic search over conversation history (Ollama embeddings), self-hosted dashboards behind Cloudflare Tunnels, and integrations with Anthropic Claude, Plaid, Microsoft 365, Synology DSM, GitHub, and Ghost CMS. Test bed for prompt-contract patterns, agent orchestration, hot-loaded knowledge, and evaluation harnesses deployed to enterprise engagements.

Public writing

The Cloud Codex — daily-ish writing on AI engineering, platform work, agent orchestration, prompt caching, evaluation harnesses.
What I Learned Building an Agentic AI Operating System for My Own Job — practitioner voice on shipping internal AI tooling.
Agentic Application Modernization Reality — patterns applied to real modernization engagements.
ECS Workshop — co-authored hands-on container training (AWS open source).

Inquiries: mike@mpt.solutions ·415-793-4717 ·Full résumé (PDF)