Show HN: DataCompose – Composable data transformation primitives for PySpark
1 tccole 1 8/25/2025, 5:12:23 PM datacompose.io ↗
This is NOT a traditional library - it gives you production-ready data transformation primitives that you can modify to fit your exact needs. You own the code, with no external dependencies to manage or worry about breaking changes.
Inspired by shadcn's component approach: instead of importing a black box, you get battle-tested PySpark functions that live in YOUR repository. Handle email validation, address parsing, phone formatting.
https://www.datacompose.io/blog/introducing-datacompose