Nailing Your AI Web App Demo
Building the AI app is only half the battle. Presenting it effectively to stakeholders, investors, or interviewers, while managing unpredictable LLM latency and hallucination risk, is what separates junior engineers from senior ones.
The Storytelling Arc
Don't just open your laptop and start clicking buttons. Start with the Problem. Who suffers without your app? Next, introduce the Concept. Then, run the Live Demo to prove it works. Finally, peel back the curtain and explain the Architecture (e.g., Next.js, Vercel, OpenAI).
The "Demo God" Fallback Strategy
Live APIs will fail you when you need them most: rate limits hit and internet connections drop. Your code must degrade gracefully. Keep a hard-coded JSON response ready, and if the fetch takes longer than about five seconds, swap in the mock data. The audience won't know the difference, and your pitch survives.
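The timeout-then-mock swap can be sketched as a race between the real call and a timer. This is a minimal illustration, not a library API: `fetchWithFallback` and `MOCK_RESPONSE` are hypothetical names, and the five-second cutoff is the one suggested above.

```typescript
// Illustrative mock payload, shaped like whatever your real API returns.
const MOCK_RESPONSE = { answer: "Here is a concise summary of your notes..." };

// Race the real call against a timer that resolves to mock data.
// A hung or rejected API call quietly becomes the canned response.
async function fetchWithFallback<T>(
  realCall: () => Promise<T>,
  mock: T,
  timeoutMs = 5000
): Promise<T> {
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<T>((resolve) => {
    timer = setTimeout(() => resolve(mock), timeoutMs);
  });
  try {
    // Whichever settles first wins.
    return await Promise.race([realCall(), timeout]);
  } catch {
    // An outright rejection (network down, 429) also falls back.
    return mock;
  } finally {
    clearTimeout(timer);
  }
}
```

Because the fallback is silent, the demo never stalls on a spinner; the trade-off is that you must remember you may be showing canned output.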
Explaining the Magic (Tech Stack)
When asked how it works, don't read code line-by-line. Use a diagram or a clear verbal flow: "The user input goes from our React client to a secure Next.js API Route, where we attach our secret keys. We then construct a tailored prompt and ping OpenAI. We stream that data back to the client using Server-Sent Events to keep perceived latency low."
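The last hop of that flow, reading the Server-Sent Events stream on the client, can be sketched with a tiny parser. The wire format assumed here (one `data: <token>` line per chunk, `[DONE]` as the terminator) is modeled on OpenAI-style streams and is an assumption, not part of the SSE spec itself.

```typescript
// Extract tokens from one chunk of an SSE response body.
// Assumes the server emits "data: <token>" lines and a "[DONE]" sentinel.
function parseSseChunk(chunk: string): string[] {
  const tokens: string[] = [];
  for (const line of chunk.split("\n")) {
    if (!line.startsWith("data: ")) continue; // skip blank lines and comments
    const payload = line.slice("data: ".length);
    if (payload === "[DONE]") break;          // server signals end of stream
    tokens.push(payload);
  }
  return tokens;
}
```

In a real client you would feed each fetch `ReadableStream` chunk through a parser like this and append the tokens to the UI as they arrive.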
❓ Presenting AI: Expert FAQs
How do I prevent my AI demo from failing during a presentation?
1. Use Mock Fallbacks: Have hardcoded responses ready if the API call fails or times out.
2. Control the Inputs: Don't ask the audience for random prompts. Use pre-tested "happy path" prompts that you know yield great AI results.
3. Handle Latency Visually: Use skeleton loaders or UI streaming so the app doesn't look "frozen" while the AI generates.
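Point 3 above can be sketched as a small streaming loop: render something the instant the user submits, then append tokens as they arrive. `fakeModelStream` stands in for a real LLM stream, and `renderToken` is a placeholder for whatever your framework uses to paint text (a React state setter, a DOM write); both names are illustrative.

```typescript
// Simulated model stream: yields tokens with a small delay each,
// standing in for a real LLM's token-by-token output.
async function* fakeModelStream(tokens: string[], delayMs = 5) {
  for (const t of tokens) {
    await new Promise((r) => setTimeout(r, delayMs));
    yield t;
  }
}

// Stream tokens to the UI instead of waiting for the full completion.
async function streamToUi(
  stream: AsyncIterable<string>,
  renderToken: (text: string) => void
): Promise<string> {
  let full = "";
  renderToken("▍"); // immediate feedback: show a cursor before any token lands
  for await (const token of stream) {
    full += token;
    renderToken(full + "▍"); // partial text plus cursor
  }
  renderToken(full); // final render without the cursor
  return full;
}
```

The first `renderToken` call is the important line: the screen changes the millisecond the user clicks submit, even though no model output exists yet.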
Why is perceived latency important in AI applications?
LLMs can take 3 to 10 seconds to generate a full response. If the screen is completely static during this time, users assume the app is broken. Perceived latency is managed by providing immediate visual feedback (like a loading indicator or streaming text token-by-token) the millisecond the user clicks submit.
What is graceful degradation in the context of an AI Web App?
Graceful degradation means that if the core AI feature fails (e.g., OpenAI API is down), the application doesn't crash entirely. Instead, it catches the error and provides a fallback, such as a cached response, a simpler non-AI feature, or a polite, formatted error message telling the user to try again later.
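That fallback chain (live model, then cached response, then polite message) can be sketched as follows. `callModel` and the in-memory `cache` are assumptions for illustration; in production the cache might be Redis or a database.

```typescript
// Remember good answers so an outage can serve stale-but-useful content.
const cache = new Map<string, string>();

// Try the live model first; on failure, degrade instead of crashing.
async function answerWithDegradation(
  prompt: string,
  callModel: (p: string) => Promise<string>
): Promise<string> {
  try {
    const answer = await callModel(prompt);
    cache.set(prompt, answer);
    return answer;
  } catch {
    // API down or rate-limited: cached answer, else a polite message.
    return (
      cache.get(prompt) ??
      "Our AI assistant is briefly unavailable. Please try again in a minute."
    );
  }
}
```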
