About DataGym.io
Learning data work should feel like doing data work.
DataGym.io is a learning ecosystem for the modern data workflow - a growing set of interactive labs, visual explainers, and practical tools, all running in your browser. No setup, no accounts, no paywall.
Why DataGym.io exists
Most data education stops at the slides. You watch someone explain a concept, nod along, and then face a blank editor with no idea where to start. The gap between understanding a concept and applying it is where most learning quietly fails.
DataGym.io closes that gap. Every lab is something you operate, not something you watch. You write the SQL. You build the dbt model. You watch the DAG rebuild. The concepts stick because you put your hands on them - the same way they stuck the first time for anyone who does this for a living.
Why browser-native learning matters
The fastest way to lose a learner is to ask them to install something. Local Python environments, warehouse credentials, dbt project scaffolding - every one of those is a place to give up before the learning even begins.
So DataGym.io runs entirely in the browser. The SQL engine, the dbt simulation, the animations - all of it executes on the page in front of you. Open a tab and you are already in the environment. That constraint is not a limitation; it is the point. It keeps the focus on the ideas.
Who is building it
DataGym.io is built by Bruno Lima - Lead Data Engineer and dbt Tech Lead at phData, where he designs scalable data modeling solutions on large-scale projects.
Bruno is the only practitioner from Latin America to receive the dbt Community Award - twice, in 2023 and 2024 - one of the field's most selective recognitions, given each year to only a handful of people worldwide. He teaches dbt as an instructor in Zach Wilson's Data Engineering course, has spoken at Coalesce, dbt Labs' flagship conference, and organizes in-person dbt meetups in São Paulo and Florianópolis. His dbt cheat sheet went quietly viral in the data community - and is now one of the tools inside DataGym.io.
The throughline is production judgment. DataGym.io is not theory written by someone who read about the job - it is the workflow, the trade-offs, and the common mistakes from someone who does it.
What's coming
DataGym.io is early, and openly so. The flagship lab - Analytics Engineering Quest - and the dbt Cheat Sheet are live today. Next come more visual explainers for the dbt concepts that are hardest to picture, a scenario-based data modeling lab, and a guided path that ties it all together. All of it is free to use, with no account needed.