basix

data, software and the universe

The Annoying Truth About ‘Serverless’ Data

Serverless mostly means ‘someone else runs the servers’. You still pay.

3 min read January 16, 2026

Lambda Architecture Without the Trauma

Hybrid batch+streaming can work—if you pick a single source of truth and stop duplicating business logic.

3 min read December 30, 2025

Vector Search Pipelines: Embeddings Are Data Engineering Too

Embeddings drift; treat them like any other dataset.

3 min read December 16, 2025

Feature Stores: Centralize Reuse, Decentralize Blame

A feature store is a contract system with extra steps.

3 min read December 4, 2025

Observability: Trace IDs for Data Pipelines (Yes, It Works)

Correlate events across ingest → transform → serve. Debugging gets boring.

3 min read November 24, 2025

Serving Layers: Materialized Views, Caches, and the Myth of ‘Realtime’

Realtime is a budget decision.

3 min read November 5, 2025

Metadata-Driven Pipelines: Dynamic Doesn’t Mean Uncontrolled

Drive config from metadata, but validate like a paranoid adult.

3 min read October 22, 2025

Bronze Table Quality Gates: Yes, Even Bronze

If you ingest garbage, you’ll analyze garbage. That’s not ‘agile’.

3 min read October 9, 2025

Kubernetes for Data Jobs: The Part Where YAML Becomes a Lifestyle

It’s great until you run 5000 pods and discover quotas.

3 min read September 25, 2025

Change Data Capture on Azure: Event Hubs, Debezium, and Reality

Azure can do CDC fine—if you respect throughput units and partition keys.

3 min read September 12, 2025

© 2026 basix.