Capstone: Visual Weld-Defect Inspection
The capstone runs the length of the program. In Agile sprints, your team builds an image-analysis model that flags weld defects from photographs — applying each module's skills to a real manufacturing-inspection problem. Tuesdays transition from labs into project work; Wednesdays are facilitated project days.
Sprint plan
- Weeks 1–2
Sprint 1 — Frame the problem
Modules 1–2Stand up an Agile team and board. Write a problem charter: what defect are we detecting, why it matters, and how we measure success. Assess it honestly as a data science problem and note ethical/safety concerns.
Deliverable: Problem charter + success metric - Weeks 3–5
Sprint 2 — Data foundation
Modules 3–4Set up the dev environment and git workflow. Ingest weld/inspection images, organize them into labeled folders, and clean/validate the dataset for training.
Deliverable: Versioned dataset + data card - Weeks 6–7
Sprint 3 — Explore & engineer
Modules 5–6Explore the image set: class balance, lighting, resolution. Build augmentation and preprocessing pipelines and prepare train/validation/test splits without leakage.
Deliverable: EDA notebook + preprocessing pipeline - Weeks 8–9
Sprint 4 — Train
Module 7Establish a simple baseline, then train a CNN (including transfer learning). Tune hyperparameters and use GPUs/batching to train efficiently.
Deliverable: Trained model + training report - Week 10
Sprint 5 — Evaluate & explain
Module 8Measure precision, recall, and F1 (accuracy alone misleads on rare defects). Do error analysis and use Grad-CAM to confirm the model looks at the right regions.
Deliverable: Evaluation + interpretability report - Weeks 11–12
Sprint 6 — Deploy & present
Module 9Package the model behind a simple inference path, write a monitoring and retraining plan, and present the end-to-end solution to stakeholders.
Deliverable: Deployment plan + final demo
Deliverables
- Problem charter with a measurable success metric
- Labeled image dataset with a data card
- Reproducible training pipeline in git
- Trained model with precision/recall/F1 on a held-out set
- Interpretability report (Grad-CAM) and error analysis
- Deployment and monitoring plan
- Final stakeholder presentation