#data-engineering
6 posts tagged with "data-engineering".
April 20, 2026
~6 min read
An optical defect and the machine event that caused it happen minutes apart and live in separate databases. Time-windowing SQL joins close the loop — here is how.
April 13, 2026
~7 min read
SMT machines log generic errors milliseconds before they log the component that caused them. A forward-fill CTE using SQL window functions bridges that gap — here is exactly how it works.
March 30, 2026
Flying Probe Tester XML files are massive, nested, and namespace-unstable. Here is the three-table relational schema and defensive parsing strategy that tames them.
March 23, 2026
AOI machines generate defect reports with part numbers that almost match your library. Here is the dynamic parsing and fuzzy matching system that bridges that gap.
March 16, 2026
Customers send spreadsheets that look nothing alike. Hardcoded column indices fail on day two. Here is the four-step heuristic pipeline that handles anything.
March 9, 2026
A 10kΩ resistor has at least eight names in the wild. Here is how a regex-driven normalization engine turns manufacturing chaos into a canonical data layer.