The Data Pipeline is an approach developed by the School of Data network to work with data from beginning to end. Aside from being a flexible guide for doing data-driven projects, it is also a wonderful tool for teaching how to work with data to beginners and experienced data practitioners alike as it divides the process into understandable and manageable steps. It is simple enough for beginners to grasp yet open enough for experienced practitioners to play around with.
The Data Pipeline is an ever-improving, dynamic tool that has been utilized, extended, and improved upon by countless data practioners over the years. Its current steps are: