Week 7: CST383 Introduction to Data Science
This course is coming to an end, and with the final exam only a few days away! This week’s highlight was the coding homework on predicting customer churn. Unlike previous assignments, this project required working with categorical variables, careful preprocessing, and comparing two different classification models. It felt more complete and realistic like working on an actual data science problem rather than just running algorithms. One thing that stood out to me was how easy it would have been to preprocess everything at once... Splitting the data first, then encoding and scaling properly, made me feel much more confident in the validity of my work. Overall, this week felt like a step forward in thinking more like a data scientist instead of just writing code. Structuring the notebook carefully was not easy at all!! I’m excited to keep learning and dig deeper into data science.