Section-by-section evaluation with immediate feedback
Version 2.0
How This Works
Choose Your Path: Start with any section or complete them in order
Flexible Navigation: Jump between sections anytime
Immediate Feedback: Get detailed explanations after each section
Personalized Learning: Receive targeted recommendations based on performance
Time: ~5-7 minutes per section (35-50 minutes total)
🎉 What's New in V2.0
✨ Brand New Question Bank: Tougher questions that test real-world AI PM skills, plus a new section on AI Use Cases Across Industries
📊 Smarter Scoring System: 100-point scale with difficulty-based weighting (Beginner: 1 pt, Intermediate: 2 pts, Advanced: 3 pts)
📚 Enhanced Learning Paths: Curated resources with direct links to courses, articles, and hands-on exercises tailored to your performance
📧 Email Your Results: Complete breakdown of your scores by section, with personalized learning recommendations sent to your inbox
Select a Section
Choose any section to begin. You can complete them in any order.
Section 1: AI Fundamentals
Understanding how AI systems work at a conceptual level
Multiple Choice - 1pt
1. What does GPT stand for in models like GPT-4?
Multiple Choice - 2pts
2. Your AI customer service bot is hallucinating incorrect policy information. Which approach MOST effectively reduces hallucinations?
Scenario - 3pts
You're building a legal research assistant that answers questions about case law. Your database contains 50,000 court documents that are updated weekly with new rulings.
3. What's the MOST appropriate technical architecture?
Multiple Choice - 2pts
4. What's the PRIMARY advantage of few-shot prompting compared to fine-tuning a model?
Multiple Choice - 3pts
5. Your AI feature makes 1,000 API calls per day. Each call uses 2,000 input tokens and generates 500 output tokens. Your API charges $0.01 per 1,000 input tokens and $0.03 per 1,000 output tokens. What's your daily cost?
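One way to work through the arithmetic in question 5 is to price a single call first, then scale by daily volume. A minimal Python sketch using the figures from the question (the function name is illustrative, not part of the assessment):

```python
def daily_api_cost(calls_per_day, input_tokens, output_tokens,
                   input_price_per_1k, output_price_per_1k):
    """Return the daily cost in dollars for a token-priced API."""
    # Cost of one call: tokens are billed per 1,000, input and output separately.
    input_cost = input_tokens / 1000 * input_price_per_1k
    output_cost = output_tokens / 1000 * output_price_per_1k
    return calls_per_day * (input_cost + output_cost)


cost = daily_api_cost(
    calls_per_day=1000,
    input_tokens=2000,
    output_tokens=500,
    input_price_per_1k=0.01,
    output_price_per_1k=0.03,
)
print(f"${cost:.2f} per day")  # prints "$35.00 per day"
```

Output tokens are a quarter of the input volume here but cost three times as much per token, so they account for a large share of the bill.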
True/False - 1pt
6. Chain-of-thought prompting improves LLM performance on complex reasoning tasks by instructing the model to show its intermediate reasoning steps.
Multiple Choice - 2pts
7. You're deciding between using an LLM API versus fine-tuning your own model. Which factor MOST favors using a third-party API?
Section 2: AI Strategy
Making strategic decisions about when and how to use AI
Scenario - 3pts
Your B2B SaaS product has 500 enterprise customers. The sales team wants an AI sales forecasting feature using machine learning, but engineering says the model needs 50,000 historical deals to train effectively.
8. What's your BEST response as PM?
Multiple Choice - 2pts
9. Your competitor launches "AI-powered insights" generating massive PR buzz. Your analytics show your existing rule-based insights have 95% user satisfaction. What's your move?
Multiple Choice - 2pts
10. You're prioritizing AI investments across four validated opportunities with similar technical feasibility. Which is MOST strategically valuable?
Scenario - 3pts
Your AI recommendation engine drives 30% of revenue. Data scientists report model performance degrading from 85% to 78% accuracy over 6 months. Engineering wants 8 weeks to retrain and redeploy.
11. What's your FIRST priority as PM?
True/False - 1pt
12. If your AI feature achieves 90% success rate across all users, it's ready to ship even if it fails for the remaining 10%.
Multiple Choice - 3pts
You have 50,000 posts/day that must be moderated. Your AI content moderation tool catches 60% of policy violations (recall=60%) with 98% precision. Your community team can manually review 1,000 items daily.
13. What's your strategic approach to ensure effective content moderation?
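The scenario for question 13 gives recall, precision, and review capacity, but not how many posts actually violate policy. The sketch below shows how those numbers interact for a few assumed violation rates. The rates are hypothetical and chosen only for illustration, and the function name is not part of the assessment.

```python
def moderation_load(posts_per_day, violation_rate, recall, precision, review_capacity):
    """Estimate daily moderation volumes for an assumed violation rate."""
    violations = posts_per_day * violation_rate
    caught = violations * recall       # true positives flagged by the AI
    flagged = caught / precision       # all flags, including false positives
    missed = violations - caught       # violations the AI never flags
    return {
        "violations": violations,
        "flagged": flagged,
        "missed": missed,
        "spare_reviews": review_capacity - flagged,
    }


# Hypothetical violation rates; the question does not state one.
for rate in (0.005, 0.02, 0.05):
    load = moderation_load(50_000, rate, recall=0.60, precision=0.98,
                           review_capacity=1_000)
    print(f"rate={rate:.1%}: flagged={load['flagged']:.0f}, "
          f"missed={load['missed']:.0f}, spare reviews={load['spare_reviews']:.0f}")
```

With 98% precision almost every flag is a real violation, so the interesting quantity is the missed column: 40% of violations never reach a reviewer unless the team samples unflagged posts with whatever review capacity remains.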
Section 3: Hands-On Building
Practical experience with AI tools and prototyping
Multiple Choice - 2pts
14. You're testing a new RAG system that answers questions about your company's product documentation. During testing, you notice it sometimes cites information from Page 47 when the correct answer is actually on Page 12. What's the MOST likely cause?
Scenario - 3pts
Your AI-powered email drafting tool uses GPT-4. In production, you're seeing high costs ($8,000/month) and slow response times (3-4 seconds). Analysis shows 70% of requests are simple formatting fixes like "make this more professional" or "add a greeting," while 30% require complex rewrites.
15. What's your BEST optimization strategy?
Multiple Choice - 2pts
16. You're writing a prompt for an AI assistant that summarizes customer support tickets. Which prompt structure is MOST effective?
True/False - 1pt
17. When using few-shot examples in your prompts, you should always provide at least 10 examples to ensure the model learns the pattern effectively.
Multiple Choice - 3pts
18. You're building a content moderation system that flags policy violations in user posts. During evaluation, you find the system has 85% accuracy with this distribution: 90% of safe posts correctly marked safe, but only 60% of violating posts correctly flagged. What should you prioritize?
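The three percentages in question 18 pin down the share of posts that violate policy, which makes the cost of the 60% catch rate easier to see. A minimal sketch, assuming overall accuracy is the weighted average of the two per-class rates (variable names are illustrative):

```python
def implied_violation_rate(accuracy, safe_correct_rate, violation_catch_rate):
    """Solve accuracy = safe_rate * (1 - p) + catch_rate * p for p."""
    return (safe_correct_rate - accuracy) / (safe_correct_rate - violation_catch_rate)


p = implied_violation_rate(accuracy=0.85, safe_correct_rate=0.90,
                           violation_catch_rate=0.60)

posts = 1000                       # illustrative batch size
violating = posts * p
safe = posts - violating
caught = violating * 0.60          # violations correctly flagged
missed = violating - caught        # violations that slip through
false_flags = safe * (1 - 0.90)    # safe posts wrongly flagged
flag_precision = caught / (caught + false_flags)

print(f"violating share: {p:.1%}")
print(f"per {posts} posts: missed={missed:.0f}, false flags={false_flags:.0f}")
print(f"precision of flags: {flag_precision:.1%}")
```

Under this assumption roughly one post in six violates policy, so an 85% headline accuracy coexists with a large number of missed violations and a flag precision only a little above 50%.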
Multiple Choice - 3pts
19. Your production RAG system suddenly starts giving poor answers after you upgraded from GPT-4 to GPT-4 Turbo. The retrieval is working correctly and returning relevant documents, but answers are often off-topic or outdated. What's the MOST likely root cause?
Section 4: Data & Privacy
Managing data responsibly and understanding privacy implications
Multiple Choice - 1pt
20. Your AI chatbot logs all user conversations for quality improvement. Under most data privacy regulations, what is the minimum you must do?
Scenario - 3pts
Your AI-powered HR tool uses employee performance reviews, salary data, and manager feedback to predict promotion readiness. A data scientist proposes including employee demographic data (age, gender, ethnicity) to "improve model accuracy" by accounting for historical patterns.
21. What's your BEST response as PM?
Multiple Choice - 2pts
22. Your customer support chatbot uses OpenAI's API to answer user questions. Users sometimes share sensitive information like order numbers, email addresses, and account details in their messages. What's the MOST important consideration?
True/False - 2pts
23. If you anonymize personal data by removing names and email addresses before training an AI model, the data is no longer considered "personal data" under regulations like GDPR.
Scenario - 3pts
Your medical diagnosis AI app stores patient symptom data and AI-generated health recommendations. You're using AWS to host the application and OpenAI's API for the AI functionality. An EU resident patient requests deletion of all their data under GDPR's "right to be forgotten."
24. What must you do to fully comply?
Multiple Choice - 3pts
25. Your US-based AI product processes customer data. You have 50 customers in California, 30 in New York, and 20 in the EU. Your legal team says you need to conduct a Data Protection Impact Assessment (DPIA). What triggers this requirement?
Section 5: Product Development
Managing AI product development effectively
Multiple Choice - 1pt
26. You're planning an MVP for an AI-powered feature. Which approach BEST represents an AI product MVP strategy?
Scenario - 3pts
You're building an AI feature that recommends personalized workout plans. Your data science team says they need 6 months and 50,000 user workout histories to train an effective machine learning model. You have 500 beta users and pressure to launch in 2 months.
27. What's your BEST product strategy?
Multiple Choice - 2pts
28. Your AI-powered customer service bot is performing well in testing but users complain it feels "robotic" and "unhelpful" in production. What's the MOST likely issue?
True/False - 2pts
29. When building an AI product feature, you should always prioritize achieving the highest possible model accuracy before launch, even if it delays release by several months.
Scenario - 3pts
Your AI writing assistant has two potential features: (1) Grammar correction with 95% accuracy requiring 2 months of work, or (2) Tone adjustment (professional/casual/friendly) with 75% accuracy also requiring 2 months. User research shows equal interest in both features.
30. How should you prioritize?
Multiple Choice - 3pts
31. You're launching an AI feature that personalizes content recommendations. After launch, engagement increases 15% but you notice certain user segments (older users, non-English speakers) have 40% lower engagement than others. What's your BEST approach?
Section 6: Economics
Understanding financial implications of AI products
Multiple Choice - 1pt
32. Your AI feature costs $0.02 per API call. If 10,000 users each make an average of 5 calls per month, what are your monthly API costs?
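The calculation in question 32 is a straight multiplication of users, calls per user, and price per call. A short sketch with the question's figures (the function name is illustrative):

```python
def monthly_api_cost(users, calls_per_user, price_per_call):
    """Return monthly API spend in dollars for a flat per-call price."""
    return users * calls_per_user * price_per_call


cost = monthly_api_cost(users=10_000, calls_per_user=5, price_per_call=0.02)
print(f"${cost:,.2f} per month")  # prints "$1,000.00 per month"
```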
Scenario - 3pts
Your SaaS product charges $50/user/month. You're considering adding an AI feature that costs $0.10 per user per day in API costs. Marketing estimates it will increase conversion rate from 2% to 3% and reduce churn from 5% to 3% monthly.
33. What's the MOST important factor in determining if this AI feature is economically viable?
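To compare the API cost with the churn improvement in this scenario, it helps to put both on a per-user basis. The sketch below rests on assumptions that are not in the scenario: a 30-day month, every user incurring the API cost, constant monthly churn (so expected lifetime is 1 / churn), and no costs other than the API. It also ignores the conversion-rate change.

```python
DAYS_PER_MONTH = 30  # assumption; the scenario quotes a per-day cost only


def simple_ltv(monthly_margin, monthly_churn):
    """Lifetime value under constant churn: margin times expected months (1 / churn)."""
    return monthly_margin / monthly_churn


price = 50.0
ai_cost_per_month = 0.10 * DAYS_PER_MONTH          # API cost per user per month

ltv_before = simple_ltv(price, 0.05)                       # no AI feature
ltv_after = simple_ltv(price - ai_cost_per_month, 0.03)    # AI cost, lower churn

print(f"AI cost per user per month: ${ai_cost_per_month:.2f}")
print(f"LTV before: ${ltv_before:,.2f}")
print(f"LTV after:  ${ltv_after:,.2f}")
```

The result depends heavily on whether marketing's churn estimate holds, which is why the estimate itself deserves as much scrutiny as the API price.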
Multiple Choice - 2pts
34. Your AI product has two cost components: $5,000/month for model hosting infrastructure and $0.05 per prediction. Which statement is TRUE?
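The cost structure in question 34 combines a fixed monthly charge with a per-prediction charge, so the average cost per prediction changes with volume. A minimal sketch; the volumes are hypothetical and chosen only to show the shape of the curve:

```python
FIXED_MONTHLY = 5_000.0   # model hosting infrastructure, from the question
PER_PREDICTION = 0.05     # variable cost per prediction, from the question


def monthly_cost(predictions):
    """Total monthly cost: fixed hosting plus variable per-prediction charges."""
    return FIXED_MONTHLY + PER_PREDICTION * predictions


def average_cost(predictions):
    """Average cost per prediction at a given monthly volume."""
    return monthly_cost(predictions) / predictions


# Hypothetical volumes for illustration.
for volume in (10_000, 100_000, 1_000_000):
    print(f"{volume:>9,} predictions: total=${monthly_cost(volume):>9,.2f}, "
          f"average=${average_cost(volume):.3f}")
```

As volume grows, the fixed component is spread over more predictions and the average cost approaches the $0.05 variable rate without ever dropping below it.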
True/False - 2pts
35. Switching from GPT-4 to GPT-3.5 will always reduce your AI costs by approximately 10x because GPT-3.5 is 10x cheaper per token.
Scenario - 3pts
You're deciding between two AI approaches for document analysis: (1) Using Claude API at $3/1M input tokens with 98% accuracy, or (2) Fine-tuning an open-source model for $15,000 upfront + $500/month hosting with 95% accuracy. You process 50M tokens monthly.
36. How should you evaluate this decision?
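A cost comparison for this scenario can be sketched as follows. The assumptions are not stated in the scenario: volume stays flat at 50M tokens per month, all tokens are billed at the input rate (no output-token price is given), and the three-point accuracy gap is not costed at all.

```python
API_PRICE_PER_M = 3.0        # dollars per 1M input tokens, from the scenario
TOKENS_M_PER_MONTH = 50      # millions of tokens per month, from the scenario
UPFRONT = 15_000.0           # fine-tuning cost, from the scenario
HOSTING_PER_MONTH = 500.0    # hosting cost, from the scenario


def api_cost(months):
    """Cumulative API spend after a number of months."""
    return API_PRICE_PER_M * TOKENS_M_PER_MONTH * months


def fine_tuned_cost(months):
    """Cumulative spend for the fine-tuned model, including the upfront cost."""
    return UPFRONT + HOSTING_PER_MONTH * months


# Monthly volume at which API spend would equal hosting spend alone.
breakeven_tokens_m = HOSTING_PER_MONTH / API_PRICE_PER_M

for months in (12, 24, 36):
    print(f"{months} months: API=${api_cost(months):,.0f}, "
          f"fine-tuned=${fine_tuned_cost(months):,.0f}")
print(f"hosting alone matches API spend at ~{breakeven_tokens_m:.0f}M tokens/month")
```

At the stated volume the API's monthly bill is below the hosting fee, so under these assumptions the fine-tuned option never recovers its upfront cost unless volume grows well beyond 50M tokens per month.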
Multiple Choice - 3pts
37. Your AI-powered search feature costs $20,000/month but you're unsure if it drives revenue. Which approach BEST measures the economic value of this feature?
Section 7: AI Use Cases Across Industries
Applying AI across different domains and use cases
Scenario - 2pts
Your fintech company's AI fraud detection system flags 3% of transactions for manual review and catches 92% of fraud. The fraud team requests you increase the flagging rate to catch more fraud. Engineering says they can adjust the model threshold to flag 8% of transactions and catch 97% of fraud.
38. What's the key trade-off you need to evaluate before making this change?
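The scenario gives flagging and catch rates but not the underlying fraud rate, so any numeric comparison needs an assumed one. The sketch below uses a hypothetical fraud rate of 0.5% and a batch of 100,000 transactions, both chosen only for illustration, to compare the extra review workload with the extra fraud caught.

```python
def review_tradeoff(transactions, fraud_rate, flag_rate_old, catch_old,
                    flag_rate_new, catch_new):
    """Compare added manual reviews with added fraud caught after a threshold change."""
    fraud_cases = transactions * fraud_rate
    extra_reviews = transactions * (flag_rate_new - flag_rate_old)
    extra_caught = fraud_cases * (catch_new - catch_old)
    return extra_reviews, extra_caught, extra_reviews / extra_caught


# fraud_rate is a hypothetical value; the scenario does not provide one.
extra_reviews, extra_caught, reviews_per_catch = review_tradeoff(
    transactions=100_000, fraud_rate=0.005,
    flag_rate_old=0.03, catch_old=0.92,
    flag_rate_new=0.08, catch_new=0.97,
)
print(f"extra manual reviews: {extra_reviews:.0f}")
print(f"extra fraud cases caught: {extra_caught:.0f}")
print(f"reviews per additional catch: {reviews_per_catch:.0f}")
```

Whether that ratio is acceptable depends on the average loss per fraud case, the cost of a manual review, and the friction imposed on legitimate customers whose transactions are held.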
Multiple Choice - 3pts
39. Your healthcare AI suggests treatment options based on patient symptoms and medical history. Doctors report it works well for common conditions but performs poorly for patients with multiple chronic conditions. What does this indicate about your AI strategy?
Scenario - 2pts
Your legal tech AI reviews M&A contracts and flags risky clauses. It achieves 94% accuracy on standard contracts but only 78% on contracts involving international jurisdictions. Your sales team wants to market it as "M&A contract review AI" without restrictions.
40. What's your responsibility as PM?
Multiple Choice - 3pts
41. Your retail AI predicts demand for 50,000 SKUs across 800 stores. Finance calculated it could reduce excess inventory costs by $12M annually. Six months post-launch, you've only achieved $2M in savings. What's the most likely strategic oversight?
True/False - 1pt
42. When implementing AI for customer support, you should expect to fully automate 80-90% of support volume within the first 6 months if the AI is well-designed.
Scenario - 3pts
Your e-commerce AI recommends products based on browsing/purchase history. After 6 months, you notice it's highly effective for fashion (35% click-through) but barely works for electronics (8% click-through). Both categories have similar data volume.
43. What does this performance gap likely indicate about your product strategy?
Multiple Choice - 2pts
44. Your marketing team's AI generates social media content that performs 20% better in engagement than human-written posts. However, it occasionally produces content with subtle brand voice inconsistencies that your brand manager catches in review. What's the right product approach?
Scenario - 2pts
Your HR AI screens resumes and ranks candidates. You've ensured diverse training data and regular bias audits. However, hiring managers report they disagree with AI rankings 40% of the time and end up interviewing candidates the AI ranked lower.
45. What does this indicate about your AI strategy?
Multiple Choice - 1pt
46. You're considering AI for three projects: (A) Generating financial compliance reports from transaction data, (B) Personalizing learning content for students, (C) Routing customer support tickets to appropriate teams. Which generally requires the MOST human oversight?
🎉 Assessment Complete!
Your Overall AI PM Fluency
🎯 Your Personalized Development Roadmap
💼 Connect with the Builder
This assessment was built to prototype a solution to a genuine workforce upskilling problem. If you are building products at the intersection of AI, Learning, and Workforce Strategy, I'm seeking my next full-time challenge in this space. Let's connect and discuss how to solve your next big problem.
Thanks for taking this assessment! Your results and learning paths will be emailed to you. I am building tools to help the tech workforce - please share your experience!