How can you master handling massive datasets and transform raw data into insightful, actionable information?
Join our workshops to dive deep into advanced data management and analysis techniques designed for graduate students. Discover the secrets of efficient database management, unravel the complexities of ETL (Extract, Transform, Load) processes, and get hands-on experience with cutting-edge big data technologies.
Are you ready to elevate your data engineering skills and stand out in the rapidly evolving field of data science?
Are you curious about how to kickstart your journey in data engineering with user-friendly tools before diving deep into the core complexities of the field?
We begin our workshop series with an accessible introduction to Streamlit and Gradio, crafting interactive web applications to visualize and manipulate data effortlessly. However, this is just the beginning. As the weeks progress, we will seamlessly transition into the heart of data engineering, unraveling the intricacies of ETL (Extract, Transform, Load) processes. This gradual progression ensures a solid foundation, paving the way for you to master advanced data engineering techniques with confidence. Are you ready to evolve from creating engaging data-driven applications to mastering the art of data extraction, transformation, and loading?
RESOURCES AND NOTES:
- REGISTER to join in person or via Zoom.
- Navigating the World of Data Engineering wiki
- We meet on Mondays at 2 PM in Weaver Science and Engineering Library Rm 212.
- Zoom: https://arizona.zoom.us/j/86423223879
- There will be no workshops during Spring Break.
- Content schedule and content are subject to change.
- Youtube Playlist
Date | Topic | Resources |
---|---|---|
01/29/24 | Building Python web apps with Streamlit and Gradio | Streamlit - Notebook Gradio - Notebook Presentation Slides Youtube Video |
02/05/24 | Deploying ML models with Streamlit and Gradio | Streamlit - Notebook Gradio - Notebook Presentation Slides Youtube Video |
02/12/24 | Introduction to SQL Part-1 | SQL and duckDB Notebook Presentation Slides Youtube Video |
02/19/24 | Introduction to SQL Part-2 | SQL and duckDB Notebook Presentation Slides Youtube Video |
02/26/24 | Introduction to noSQL Part-1 | mongoDB-Pymongo Notebook Presentation Slides Youtube Video |
03/04/24 | Spring Break | - |
03/11/24 | Introduction to noSQL Part-2 | Cassandra Notebook Copy the link above and open using Jupyter's Open from URL function on Cyverse Presentation Slides Youtube Video |
03/18/24 | Introduction to Hadoop and Hive | Hadoop & Hive Notebook Presentation Slides Youtube Video |
03/25/24 | Introduction to Spark and PySpark | Spark-PySpark Notebook Presentation Slides Youtube Video |