Data Engineer
Posted 3 days 20 hours ago by DOJO AI
About DOJO
We're building a new product category - the AI Marketing Operating System. One and a half years in, we're well-funded, shipping fast, and used by 100+ world-class brands. We were recently named one of Wired's 100 Hottest Startups and included on Sifted's Startups To Watch.
Under the hood, we're building a next-generation AI and data platform applied to autonomous marketing - multi-agent systems, a real-time data fabric synthesizing hundreds of millions of signals, graph-based knowledge representations, and proprietary evaluation infrastructure. All in production, all evolving fast. Our technical surface spans agentic reasoning at scale, data quality across thousands of heterogeneous sources, and real-time intelligence from noisy unstructured data - in a domain where results are immediately measurable. Our engineers come from teams like Feedzai, OutSystems, Talka, and Unbabel, where shipping production AI and data systems at scale is the baseline.
We're a product company first. We don't build tools for consultants to configure - we build a product customers love, one that works flawlessly, with great design, supported by engineering excellence that makes it possible. We make the simple easy and the complex possible. And we build our business around this ethos.
About This Role
DOJO's data layer is our biggest competitive advantage. It's a strategic platform that powers product features, feeds our AI agents, drives evaluation systems, and gives customers insights no other tool can. It sits at the center of everything we build.
You'll own the data infrastructure that makes this work: ingestion from thousands of heterogeneous sources, transformation, quality, and delivery at scale. Synthesizing noisy, unstructured marketing data into structured intelligence that grounds our AI agents - your work directly shapes the quality of our product and the trust our customers place in us.
What You'll Do
- Own the data platform end-to-end - ingestion from diverse sources, transformation, quality assurance, and delivery at scale, making deliberate infrastructure choices that compound over time
- Build and maintain the graph-based semantic layer that bridges unstructured marketing signals with structured, queryable data that is continuously enriched and updated
- Establish and enforce data quality standards - schema validation, anomaly detection, and monitoring that catches problems before customers do
- Work directly with the founders, AI engineers, and product teams to ensure the data layer enables fast iteration on new features and agent capabilities
- Close the AI-data loop - keep data fresh and accurate so our agents learn from the best signals, and use AI at the core of the data infrastructure itself
- Help shape our engineering culture and raise the bar as the team grows - through code review, architectural decisions, and how we work together
You May Be a Good Fit If You Have
- Deep experience building and operating production data infrastructure - pipelines, storage, quality systems, and the tooling around them
- Experience with modern data engineering patterns - data mesh, data vault, data quality, and data governance - not classic ETL and BI warehouse architectures
- Strong proficiency in Python and modern data tools (e.g. Dagster, Polars, DuckDB, Apache Iceberg, Apache Arrow, ClickHouse, Kafka)
- An eye for data quality and scale as a first-class engineering problem, not an afterthought
- Strong foundations in CS or Engineering, though exceptional candidates with alternative backgrounds are welcome
- Experience with data infrastructure for AI systems - evaluation pipelines, data flywheels, or MLOps - is a strong plus
- Experience with streaming architectures and real-time data pipelines is a plus
The Way of the DOJO
We love what we build and who we build it with. Low ego, high trust, and a winning spirit - we believe in each other, push each other, and have a lot of fun doing it together. These are the values we live by:
- Ownership - It's your dojo. The company is yours too. We hire people, not job descriptions - and we expect you to go where you're needed. We own our mistakes, we relish feedback, and we build and ship rather than plan and talk.
- Drive - Do the work. We choose the harder path when it's the right one and do the unglamorous work that actually moves the needle, rather than focusing on flash and vanity outcomes. Persistence over shortcuts, dedication over comfort.
- Honesty - No spin. We say the hard thing early. We're straight with our teammates, our customers, and ourselves - and we always strive to find the hard truth behind the easy answers.
- Excellence - Raise the bar. The best work lives at the intersection of deep technology and deep human understanding. We care about the whole, not just our corner - from the smallest detail to the biggest challenge.
- Simplicity - Make the simple easy and the complex possible. We reduce complexity to increase impact. We build a product that doesn't need training or consulting - it just works. When in doubt, take it out.
- First Principles - From the ground up. We challenge the status quo. We strip things to essentials and redesign from the ground up. We choose principles and systems that compound over quick fixes and band-aids.
What We Offer
Competitive salary and meaningful equity in a company at an inflection point. Comprehensive health coverage. Hybrid work model based in Lisbon with flexible hours and top-of-the-line equipment.