About

Data infrastructure
is our craft.

DataRazi is a data engineering and AI consulting practice founded by Ray Tarazi, helping organisations design, build, and optimise modern data platforms and AI systems — from Spark pipelines at petabyte scale to autonomous multi-agent trading systems.


Background

With deep experience across the data engineering stack — Databricks, Apache Spark, Delta Lake, streaming systems, and cloud infrastructure — DataRazi was founded to bring production-grade engineering to organisations that need their data infrastructure to actually perform.

We've helped teams optimise Spark jobs yielding 10-100X performance improvements, designed lakehouse architectures processing 50TB+ daily, and built quantitative trading systems that operate on live markets with real P&L.

Every engagement starts with understanding your specific data architecture, constraints, and objectives — then applying proven engineering principles rather than cookie-cutter templates.


What We Believe

Production First

The best architecture is the one that stays reliable under load. We design for production from day one — monitoring, alerting, failover, and cost governance are features, not afterthoughts.

Measure Everything

If it isn't measured, it isn't managed. We instrument every layer — from Spark query execution plans to broker P&L — and use real data to drive optimisation decisions.

Systems Thinking

Data pipelines, AI agents, and trading systems don't exist in isolation. We understand the full chain — from data ingestion to business outcomes — and optimise at the system level, not just individual components.


Contact

Interested in working together? Get in touch — we're always open to a conversation about how we can help your team.