How I Built Two Production AI Systems in Two Weeks (While Working Full-Time)

If you’ve read my earlier posts, you know the backstory: my Polish parents visited me in Egypt, nobody shared a common language with my Egyptian wife, and I spent two weeks as a human relay for every conversation. It was exhausting. So I built two things to fix it: a WhatsApp translation bot and a live speech translator.

What I haven’t talked about yet is how I built them so fast, and why I think the approach matters beyond my family’s language problems.

Both systems went from idea to production in under two weeks of evening work. I was working full time, learning two instruments, and seeing friends. The WhatsApp bot took a weekend. The live translator took seven evenings. Both are fully tested, have failover logic, and run in production today.

Quality numbers across both systems:

689 automated tests
98.5% passing rate
100% coverage on critical paths (unit, integration, E2E)

Compare that to traditional development: months of work, rigid scripts, breaks when anything changes.

Traditional development vs AI-powered development

A quick reality check before we go further

This approach works best for greenfield projects: small applications, automations, internal tools, things you build from scratch to solve a specific problem. For those, a single engineer working with AI can produce results that would have taken a small team weeks or months. The output is real, the time savings are real, and the quality holds up.

But this is not a magic wand. For large existing systems, legacy codebases with years of technical debt, or technologies that aren’t widely used, AI is not mature enough to replace developers. It speeds things up and opens new approaches, but it does not substitute the team. You still need engineers who can assess the code, review what AI produces, and make judgment calls. AI makes their work faster, not unnecessary.

So no, the takeaway here is not “fire your development team.” For enterprise-scale systems, AI enhances what developers do. It might improve output and let teams tackle problems differently, but the humans are still essential.

Where the game actually changes is for small and medium businesses. If you need a custom application, an internal automation, a bot, a dashboard, something scoped and specific, that’s where this approach completely flips the economics. That’s what this post is about.

A quick reality check before we go further
Why this matters for your business (and you)
What makes this different: the agentic architecture
My agent team: the actual configurations
How this applies to your business problems
What you can build right now
The real cost (and what I actually paid for)
It’s David and Goliath time
The single most important realization
What happens next

Why this matters for your business and you

If you work in marketing, operations, customer service, or sales, custom applications and workflows are now available for a fraction of what they cost six months ago. I’m confident that most business repeatable problems can be solved cheap and fast with this approach.

And quality? I’m a software engineer and I probably wouldn’t write these applications better by hand.

Think about the people in your organization who:

Manually reformat campaign data across platforms
Copy information between CRM, email, and spreadsheets
Handle repetitive questions that follow predictable patterns
Update multiple systems after every client interaction

These aren’t complex technical problems that need a full development team. They’re scoped workflow problems, and AI agents are very good at those.

The old way for this kind of work: hire developers, wait months, spend thousands, get something rigid that breaks when requirements change.

The new way: describe what you need, get one technically savvy person who knows how to work with AI, and you’ll have it for a fraction of the cost. I think we’re seeing the birth of a new role in organizations, something like an AI automation expert. This person will build you a custom application, a webpage, automate your marketing processes, or connect systems that previously seemed impossible to integrate. For scoped, greenfield problems, this role can replace what used to be a small team. For bigger systems, this person works alongside your existing developers to speed things up.

What makes this different: the agentic architecture

Imagine a system with five specialized AI instances, each prompted to do one thing well.

One acts as a Project Manager, one as an architect, one as a software developer, one as a QA engineer, one as DevOps. If that got too technical too fast, don’t worry. I’ll explain it in non-IT terms too.

Where’s the difference from normal AI prompting?

Your Project Manager can orchestrate between different flows. It knows that a new feature request triggers a different workflow than a bug report, and it coordinates AI agents working in parallel to complete your tasks.

For marketing and blog posts? Imagine creating three agents: one to help with writing, one specializing in hooks to keep readers engaged, and a critic that checks your company guidelines and general best practices. Your job is to write a story that doesn’t need to be grammatically perfect. Your agents fix that and deploy once they polish your article. You still need to provide the content since nobody wants to read AI-generated fluff, but you’d be surprised how easy and effective this is.

Agentic AI uses specialized agents that understand context, adapt to changes, and coordinate without human intervention. Each agent gets its own context window (how much AI can hold in a working session), so you don’t run into the problem of opening multiple chats for one task.

I didn’t write those agents myself, by the way. I asked AI to help me with that too.

My agent team: the actual configurations

These are the exact agent configurations I use for both application development and content creation. They’re not hypothetical. They’re the agents that built the systems described in this post. I left out my system information since you wouldn’t benefit from that.

For application development (7 specialized agents)

Project Manager

Role: Orchestrates planning and execution from concept to deployment

Tools: Read, Write, Bash, Grep, Glob, Task (delegates to other agents)

Responsibilities:

Clarifies requirements and coordinates team

Creates plans and tracks progress

Ensures code reviewed, tests passing, deployed, docs updated

Delegates to specialists: BA, Architect, Developer, QA, DevOps, Writer

Success criteria: Task complete when user accepts deliverable + all tests pass + code deployed + docs updated

Business Analyst

Role: Defines requirements, user stories, and acceptance criteria

Tools: Read, Write, Grep, Glob

Responsibilities:

Analyzes user goals and identifies target users

Writes user stories with prioritization (MoSCoW method)

Creates test cases covering happy path, errors, edge cases, security, performance

Validates feasibility with architect

Deliverables: User stories + prioritized test case list + requirements document + user flows + edge cases + success metrics

Software Architect

Role: Reviews architecture and designs technical solutions

Tools: Read, Write, Grep, Glob, Task

Responsibilities:

Provides architecture overview and design deliverables

Reviews code for patterns, security, scalability, testing coverage

Enforces standards: async/await, service layer, sequential migrations, proper error handling

Creates architecture diagrams, API specs, database migrations, technology justifications

Focus areas: Security (no SQL injection, XSS, CSRF), scale (caching, connection pooling), patterns (router-service, provider abstraction)

Full-Stack Developer

Role: Implements features with clean, tested code

Tools: Read, Write, Edit, Bash, Grep, Glob, Task

Responsibilities:

Follows TDD workflow (Red, Green, Refactor)

Implements backend (FastAPI + Pydantic + async/await + type hints)

Implements frontend (React hooks + PropTypes + i18n for all text)

Validates input, parameterizes queries, sanitizes output, checks auth

Requests architect review before QA handoff

Testing targets: Critical 100%, Unit 95%+, Integration 90%+

Automation QA Engineer

Role: Writes and runs automated tests at all levels

Tools: Read, Write, Bash, Glob, Edit

Responsibilities:

Writes unit tests (fast, no I/O), integration tests (Redis/DB), E2E tests (user journeys)

Runs all tests and reports failures with clear diagnostics

Debugs and fixes test issues while preserving intent

Ensures all tests pass before completion (98.5%+ pass rate)

Test organization: Backend (unit tests in tests/unit/, integration in api/tests/), Frontend (component tests with Vitest + React Testing Library)

DevOps Engineer

Role: Handles deployment, infrastructure, and verification

Tools: Read, Write, Edit, Bash, Grep, Glob

Responsibilities:

Rebuilds containers (always —no-cache for frontend)

Restarts services and verifies deployment thoroughly

Runs database migrations via psql

Performs mandatory pre-handoff verification: files exist + current timestamps + code changes confirmed + services responding + logs checked

Critical: Always verifies deployment to catch cache issues early

Technical Writer

Role: Maintains project and product documentation

Tools: Read, Write, Grep, Glob

Responsibilities:

Updates core documentation at task completion only

Maintains API endpoints, database schema, architecture docs, changelog

Keeps documentation minimal and maintainable

Deletes temporary tracking files (no PHASE_*.md, *_SUMMARY.md, *_REPORT.md)

Philosophy: Focus on “what does it do?” not “how did we build it?”

For blog content (3 specialized agents)

Blog Writer

Role: Generates effective, brand-consistent blog posts for business audiences

Tools: Read, Write, Edit, Glob, Grep, WebSearch, WebFetch

Voice: Clear, direct, neutral, plain English (multilingual-friendly)

Content guidelines:

Begins with strong hook (question, fact, or bold statement)

Opens with problem/desire + solution preview in first 150 words

Organizes for easy scanning: short paragraphs, clear subheadings, bullets, numbered lists

Educates without overstatement, backs claims with logic or examples

Provides actionable steps and real-world examples

Maintains helpful, informative tone without hype or fluff

Structure templates: Problem then Solution then How-To, numbered lists (“X Ways to…”)

Brutal Blog Critic

Role: Rigorously analyzes and critiques blog content

Tools: Read, Glob, Grep, WebSearch, WebFetch, Task

Framework-driven approach:

Applies known structures (Problem then Solution then How-To, What then Why then How, AIDA)

Evaluates hook and headline: Does it grab attention? State problem? Promise solution?

Checks clarity and structure: Concise paragraphs? Descriptive subheadings? Short sentences?

Validates examples and value: Are claims supported? Tips actionable? Evidence present?

Assesses tone: Factual and helpful without hype?

Feedback style: Brutally honest, detail-oriented, blunt but constructive with specific fixes

Hook Expert

Role: Generates 3-5 compelling opening hooks for blog posts

Tools: Read, Grep, WebSearch, WebFetch

Hook strategies: Question, surprising statistic, mini-story, bold statement, contrarian angle, light-hearted quip

Core principles:

Specificity over generality: Use concrete numbers, names, scenarios, vivid details

Credibility first: Ground every hook in reality with real statistics or plausible scenarios

Logical flow: Each hook must naturally lead into the blog topic

Tone consistency: Professional, efficient, human voice

Evaluation checklist: Relevant? Clear? Credible? Original? On-brand?

How these agents work together

For application development, the Project Manager orchestrates the workflow:

Planning: PM asks BA for requirements, then Architect for design review
Implementation: PM sends Developer to code, then Architect reviews
Quality: PM hands off to QA for tests, then DevOps deploys and verifies
Documentation: PM sends Technical Writer to update docs

For blog content, the workflow is simpler:

I write a rough draft with ideas
Blog Writer polishes structure, tone, and clarity
Hook Expert generates 3-5 opening hooks
Brutal Blog Critic reviews everything against frameworks
Blog Writer refines based on critique
Repeat steps 4-5 until the critic approves

The split is clear: agents handle execution (code, writing, testing, deployment). I provide domain expertise (business requirements, editorial judgment, strategic decisions, ideas). The bottleneck is not technology. It’s clearly articulating what I need.

How this applies to your business problems

Example: customer inquiry workflow

Traditional approach:

Customer emails inquiry
Support reads email
Support checks CRM for customer history
Support searches knowledge base
Support drafts response
Manager approves response
Support sends email
Support logs interaction in CRM

Custom application/workflow that can be created in days

Input: Customer inquiry arrives

Processing:

Monitor agent detects new email, extracts key details (product, issue type, urgency)
Research agent pulls customer history from CRM, searches knowledge base for relevant articles
Response agent drafts personalized reply using customer data and knowledge base content
Routing agent decides: standard inquiry (auto-send) or complex issue (flag for human review)

Output: Response sent + CRM updated + ticket created if needed

Result: 80% of routine inquiries handled in under 2 minutes. Human agents focus on complex cases that actually need judgment.

I don’t expect this specific example to save your business. I’m trying to open your mind to possibilities. What was available yesterday only to big companies with money is available today to you and me.

What you can build right now

These are just ideas to get you thinking. Even if you can’t build them yourself, there are people who can help.

Marketing

Content repurposing: Blog post to social posts to newsletter to LinkedIn article
Campaign performance reporting: Pull data from Google Ads, Meta, email platform, generate weekly summary
A/B test analysis: Read results, calculate statistical significance, draft recommendation email

Operations

Onboarding coordinator: New hire confirmed, send welcome email, create accounts, schedule orientation, notify team
Vendor communication: Purchase order approved, generate PO document, email vendor, log in system, set follow-up reminder
Compliance checker: Scan contracts for required clauses, flag missing items, generate remediation list

Customer service

Inquiry router: Read customer email, check account history, categorize issue type, route to correct team, send acknowledgment
Knowledge base updater: Identify repeated questions, draft new help article, submit for review
Follow-up scheduler: Resolved ticket, wait 3 days, send satisfaction survey, log response

Sales

Meeting notes processor: Call recording to transcript to extracted action items to CRM update to scheduled follow-ups
Proposal generator: Sales call notes, pull relevant case studies, generate customized proposal, format PDF
Pipeline cleaner: Stale opportunities, send re-engagement email, move to “Nurture” if no response

All of these are possible today. For this kind of scoped automation, you don’t need a full development team. One person with the right AI tools can get it done.

The real cost (and what I actually paid for)

Claude Code license: $100

LiveTranslator:

Development: 7 evenings (about 20 hours total)
API costs: ~$1/month (speech-to-text, translation). Pay as you go, so the more you use it the more you pay.
Hosting: $6/month (server + database)
Total first month: about $7

WhatsApp bot:

Development: 1 weekend (about 8 hours)
API costs: ~$1/month (translation only)
Hosting: Runs on the same server
Total first month: about $1
Ongoing: about $1/month

Combined monthly cost: around $108 and about 30 hours of work for two production systems.

Traditional development estimate:

Developer rate: $100-200/hour
Project timeline: 2-3 months
Backend developer: 120 hours @ $150 = $18,000
Frontend developer: 80 hours @ $150 = $12,000
Testing/deployment: 40 hours @ $150 = $6,000
Total: $36,000 minimum
Timeline: 3-6 months (if everything goes smoothly)

I hope that starts to crack something open.

The savings aren’t just money. It’s time to market, ability to iterate quickly, and solving problems as they come up instead of waiting months for a scoped project.

It’s David and Goliath time

David vs Goliath: Small businesses with AI agents facing enterprise competitors AI powered David v Industry

Services that used to need large budgets are now accessible to small and medium businesses. You can get professional applications, websites, and automated internal processes at a fraction of traditional costs, often delivered faster.

This is your moment. The gap between solo operators and well-funded competitors is shrinking for a certain class of problems. You can now:

Build small production applications and automations without a full team
Maintain consistent branding across all channels
Automate repetitive business processes
Create professional content at scale

You still need ideas, expertise, and judgment. But the technical barriers that used to limit small businesses are disappearing.

The tools will keep getting better, easier, and more accessible. The question is whether your business will lead or follow.

And a bonus for enthusiasts and geeks: these tools made technology fun for me again. If it weren’t for AI I wouldn’t have even considered solving my day-to-day problems with code. Now I have a million ideas for what to build next. It’s genuinely more fun than anything I’ve done in years.

The single most important realization

You don’t need to understand how the AI works. You need to understand your problem clearly enough to explain it, either to an AI directly or to a developer who works with AI.

That domain knowledge matters more than people think. It doesn’t replace coding skills, but it’s what makes the difference between a useful tool and a toy.

When I built LiveTranslator, I didn’t write the speech recognition code. I didn’t implement the translation algorithms. I didn’t configure WebSocket streaming. I described what needed to happen and AI agents built it. But I still had to review what came out, catch bad decisions, and steer the architecture. The technology is still young. It can make stupid mistakes, forget about features, or pick the wrong approach. You need someone technical in the loop, whether that’s you or an AI automation engineer working with you.

For scoped problems, the bottleneck is shifting. It’s less about writing code and more about clearly articulating what you need.

What happens next

My parents’ next visit is in two months. Dad will tell stories, my mother-in-law will share family news, and I’ll actually participate in conversations instead of translating every sentence.

Our WhatsApp group flows naturally across languages now. Birthday planning, garden updates, late-night jokes, all happen without me as the middleman.

These aren’t tech demos. They’re production systems solving real problems.

You have similar friction points in your business. Manual processes eating hours every week. Information stuck in silos. Repetitive tasks that never reach the top of your team’s priority list.

Pick one. Describe it clearly. Build your first agent system this week.

Start with something small and annoying. The marketing report that takes 45 minutes every Monday. The customer follow-up emails you keep forgetting. The data export you manually reformat every Friday.

Two weeks from now, that task could be automated.

The technology is ready. The tools are available. Which problem will you solve first?

How I Built Two Production AI Systems in Two Weeks (While Working Full-Time)

A quick reality check before we go further

Table of contents

Why this matters for your business and you

What makes this different: the agentic architecture

My agent team: the actual configurations

For application development (7 specialized agents)

For blog content (3 specialized agents)

How these agents work together

How this applies to your business problems

Example: customer inquiry workflow

What you can build right now

Marketing

Operations

Customer service

Sales

The real cost (and what I actually paid for)

It’s David and Goliath time

The single most important realization

What happens next

More posts from my AI build log.

Hi, my name is Paweł

AI solving a real problem in my life — a family chat translator

A live translator so my dad stops calling me to interpret

How I Built Two Production AI Systems in Two Weeks (While Working Full-Time)

A quick reality check before we go further

Table of contents

Why this matters for your business and you

What makes this different: the agentic architecture

My agent team: the actual configurations

For application development (7 specialized agents)

For blog content (3 specialized agents)

How these agents work together

How this applies to your business problems

Example: customer inquiry workflow

What you can build right now

Marketing

Operations

Customer service

Sales

The real cost (and what I actually paid for)

It’s David and Goliath time

The single most important realization

What happens next

More posts from my AI build log.

Hi, my name is Paweł

AI solving a real problem in my life — a family chat translator

A live translator so my dad stops calling me to interpret

Cookie Preferences

Cookie Preferences

Essential Cookies

Analytics Cookies