Changelog

Development history of ScamShield AI, organized by phase.

Phase 1: Core System (Feb 11, 2026)

The initial build: from zero to a working honeypot in a single day.

Architecture design — Firebase Cloud Functions + Gemini Flash + Firestore
GUVI webhook handler — POST endpoint with API key authentication
Pydantic models — Request/response models matching GUVI spec (camelCase aliases)
Scam classification — Gemini-powered classifier with 6 initial scam types
3 personas — Sharma Uncle, Lakshmi Aunty, Vikram Professional
Evidence extraction — Regex patterns for UPI IDs, bank accounts, phone numbers
Keyword detection — 11 categories with weighted scoring
Session management — In-memory store (later migrated to Firestore)
CI/CD pipeline — GitHub Actions with Firebase CLI deploy
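
As a rough illustration of the regex-based evidence extraction listed above, the sketch below pulls UPI IDs and Indian mobile numbers out of a message. The patterns and the `extract_evidence` helper are assumptions made for this example; the project's actual regexes are not reproduced in this changelog.

```python
import re

# Hypothetical patterns, not the project's real ones.
# UPI ID: handle@provider (note that email addresses can partially match too).
UPI_RE = re.compile(r"\b[a-zA-Z0-9._-]{2,}@[a-zA-Z]{2,}\b")
# Indian mobile number: optional +91 prefix, then 10 digits starting with 6-9.
PHONE_RE = re.compile(r"(?<!\d)(?:\+91[\s-]?)?[6-9]\d{9}(?!\d)")


def extract_evidence(text: str) -> dict:
    """Return every UPI-like ID and phone-like number found in the text."""
    return {
        "upi_ids": UPI_RE.findall(text),
        "phone_numbers": PHONE_RE.findall(text),
    }


if __name__ == "__main__":
    print(extract_evidence("Pay to fraudster@okaxis or call +91 9876543210"))
```

A real extractor would likely need extra filtering, for example to separate UPI handles from email addresses.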

Production hardening and operational visibility.

Moving from in-memory to persistent storage.

Firestore sessions — honeypot_sessions collection with batch updates
Evidence index — Cross-session evidence linking (evidence_index collection)
Rate limiter — Per-session rate limiting with Firestore counters
Cloud Tasks — Delayed callback scheduling (10s inactivity)

Broader scam coverage and better evidence capture.

6 new scam types — Investment, Insurance, Romance, Loan, Customs Duty, Crypto
IFSC code extraction — Bank branch identification
Aadhaar detection — 12-digit with Verhoeff checksum validation
PAN detection — ABCDE1234F format
Amount extraction — ₹/Rs patterns with Indian numbering
Phone number improvements — +91 prefix preservation, helpline filtering
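
The identifier checks in this list can be sketched as follows. The Verhoeff tables are the standard published ones; the regexes and function names are assumptions for illustration (12 digits with a leading digit of 2-9 for Aadhaar, the ABCDE1234F shape for PAN, and four letters, a zero, then six alphanumerics for IFSC), not the project's actual code.

```python
import re

# Standard Verhoeff tables: dihedral-group multiplication (D) and permutation (P).
D = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 2, 3, 4, 0, 6, 7, 8, 9, 5],
    [2, 3, 4, 0, 1, 7, 8, 9, 5, 6],
    [3, 4, 0, 1, 2, 8, 9, 5, 6, 7],
    [4, 0, 1, 2, 3, 9, 5, 6, 7, 8],
    [5, 9, 8, 7, 6, 0, 4, 3, 2, 1],
    [6, 5, 9, 8, 7, 1, 0, 4, 3, 2],
    [7, 6, 5, 9, 8, 2, 1, 0, 4, 3],
    [8, 7, 6, 5, 9, 3, 2, 1, 0, 4],
    [9, 8, 7, 6, 5, 4, 3, 2, 1, 0],
]
P = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 5, 7, 6, 2, 8, 3, 0, 9, 4],
    [5, 8, 0, 3, 7, 9, 6, 1, 4, 2],
    [8, 9, 1, 6, 0, 4, 3, 5, 2, 7],
    [9, 4, 5, 3, 1, 2, 6, 8, 7, 0],
    [4, 2, 8, 6, 5, 7, 3, 9, 0, 1],
    [2, 7, 9, 3, 8, 0, 6, 4, 1, 5],
    [7, 0, 4, 6, 9, 1, 3, 2, 5, 8],
]

# Hypothetical patterns; the formats are assumptions stated in the lead-in.
AADHAAR_RE = re.compile(r"(?<!\d)[2-9]\d{3}[\s-]?\d{4}[\s-]?\d{4}(?!\d)")
PAN_RE = re.compile(r"\b[A-Z]{5}[0-9]{4}[A-Z]\b")
IFSC_RE = re.compile(r"\b[A-Z]{4}0[A-Z0-9]{6}\b")


def verhoeff_valid(digits: str) -> bool:
    """Return True if the digit string (check digit last) passes Verhoeff."""
    checksum = 0
    for position, char in enumerate(reversed(digits)):
        checksum = D[checksum][P[position % 8][int(char)]]
    return checksum == 0


def find_aadhaar_numbers(text: str) -> list[str]:
    """Return 12-digit candidates that also pass the Verhoeff checksum."""
    found = []
    for match in AADHAAR_RE.finditer(text):
        digits = re.sub(r"\D", "", match.group())
        if verhoeff_valid(digits):
            found.append(digits)
    return found
```

The checksum step is what separates a plausible Aadhaar number from an arbitrary 12-digit string such as an order ID or a transaction reference.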

Maximizing evaluation scores through systematic improvements.

Per-turn callbacks — Send intelligence on every response, not just at conversation end
Scam detection from turn 1 — Always report scamDetected: true with initial classification
Response format enrichment — extractedIntelligence, engagementMetrics, agentNotes on every response
Strategy state machine — BUILDING_TRUST → EXTRACTING → DIRECT_PROBE → PIVOTING
Pipeline context — Dynamic prompt assembly with language detection and edge-case analysis
Bug fix — scamDetected duplication in callback payloads
Regex improvements — Higher extraction accuracy for UPI and phone patterns
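
The strategy state machine listed above could be modelled roughly as below. Only the four state names and their order come from this changelog. The transition trigger (a count of turns without new evidence), the threshold, and the choice to treat PIVOTING as a final state are assumptions made for this sketch.

```python
from enum import Enum


class Strategy(Enum):
    BUILDING_TRUST = "BUILDING_TRUST"
    EXTRACTING = "EXTRACTING"
    DIRECT_PROBE = "DIRECT_PROBE"
    PIVOTING = "PIVOTING"


# Enum members iterate in definition order, which matches the changelog order.
ORDER = list(Strategy)


def next_strategy(current: Strategy) -> Strategy:
    """Move one step along the sequence; PIVOTING stays put (an assumption)."""
    index = ORDER.index(current)
    return ORDER[min(index + 1, len(ORDER) - 1)]


def step(current: Strategy, stalled_turns: int, threshold: int = 2) -> Strategy:
    """Hypothetical trigger: escalate after `threshold` turns with no new evidence."""
    return next_strategy(current) if stalled_turns >= threshold else current


if __name__ == "__main__":
    state = Strategy.BUILDING_TRUST
    for stalled in (0, 2, 1, 3, 2):
        state = step(state, stalled)
        print(stalled, state.name)
```

Keeping the transition logic in one small function makes it easy to swap in a different trigger, such as a turn count or a classifier confidence score.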

Transforming from private hackathon project to public GitHub showcase.

PII sanitization — All personal info and infrastructure references replaced with placeholders
MkDocs documentation site — Architecture reference, educational chapters, deployment guides
"Building ScamShield" series — 10-chapter educational walkthrough
Contributing guides — Tutorials for adding personas and extractors
MIT License