Analytics, engineering, warehousing, governance, ML ops, visualization. The layer that turns raw events into decisions — and the one regulators care most about. Every domain in the tree produces data; this branch is where you make sense of it.
Six sub-categories under data/. Analytics is live with Apache Superset as the first leaf. The rest are mapped and landing as content ships.
BI tools, dashboards, data exploration. Apache Superset is the first live leaf and the 2nth default for new builds. Metabase, Evidence, Druid, and Grafana mapped as stubs.
ETL/ELT pipelines, dbt transformations, orchestration with Airflow and Dagster. The plumbing that gets data from source systems into warehouses in a shape humans and models can use.
PostgreSQL, ClickHouse, DuckDB, BigQuery, Snowflake. The storage layer analytics tools read from — OLTP to OLAP, star schemas to wide tables, and the tradeoffs between them.
POPIA compliance, data quality frameworks, lineage tracking, cataloging, and access control. The regulatory and operational guardrails that keep data trustworthy and lawful.
Model training, deployment, monitoring, and feature stores. The engineering discipline that turns a notebook experiment into a production prediction service.
Charting libraries, mapping, reporting tools beyond BI dashboards. D3, Observable, deck.gl, and the patterns for building data stories that don't need a full BI platform.
Apache Superset is live — the 2nth default BI tool for new builds. More leaves landing as the data branch grows.
Data is produced by every domain in the tree. These are the branches it pulls on most — either because they generate the raw material (Business, Technology) or because they consume the output (Finance, Healthcare).