Lesson ProgressPhase 1 of 6
Phase 1Hook
Hook: Outliers and Data Quality

Review previous learning and connect to today's application activities

The Problem Sarah Can't Ignore

Sarah has analyzed the café's weekend data and calculated that the average (mean) transaction is $12.50. She's ready to use this number to help the café predict revenue and plan inventory.

But there's a problem. When she looks more closely at individual transactions, something doesn't add up.

⚠️ The Data Doesn't Look Right

Sarah calculates: Average = $12.50

But then she notices these transactions:

  • Coffee: $4.25
  • Muffin: $2.75
  • Latte: $5.25
  • Lunch combo: $12.95
  • Catering Order: $127.50
  • Data Entry: $0.05

The Friction Point

If most transactions are $3-15, how can the average be $12.50? And more importantly - what should Sarah do with those unusual values? Are they errors? Legitimate business events? Something in between?

This is where outlier detection becomes essential for every data analyst.

In This Lesson, You'll Learn To:

🔍 Detect Outliers

Use z-scores to objectively measure how unusual each data point is

⚖️ Make Quality Decisions

Decide whether to keep, flag, or remove outliers based on business context

💼 Explain Your Reasoning

Defend data cleaning decisions to café management with evidence

📊 Improve Analysis Accuracy

Ensure forecasts and recommendations are based on reliable data

Data Detective Readiness Check
Test your understanding of why outlier detection matters

1. Sarah notices a transaction for $127.50 in the café data when most transactions are between $3-15. Before calculating anything, what is the FIRST question she should ask about this unusual value?

2. The café manager says Saturday's average (mean) transaction was $18.50. However, when Sarah looks at individual transactions, she sees mostly $4-12 values with just a few very high ones. Why is this difference important?

3. Sarah finds two unusual values in the data: $0.05 (likely a data entry error) and $127.50 (could be a catering order). What makes these different from regular transactions beyond just being unusual numbers?

4. If the café's typical Saturday has 300 transactions with an average of $12, what approximate total revenue would you expect?

0 of 4 questions answered