One of my favorite parts of the SURE Program was experimenting with different machine learning models. Going in, I thought the whole goal was to find the model that performs best. But what I discovered is that each model looks at the problem differently, and sometimes the real value comes from comparing their perspectives. The... Continue Reading →
From Bar Charts to Battle Teams: A Data Journey Through the Pokédex
Source: https://x.com/Pokemon/status/1106326121228857344/photo/1 This project started with a simple question:"Is there a relationship between a Pokémon’s type and its base stats?" That’s how most data projects begin: with curiosity and a CSV. I loaded the full Pokédex dataset expecting to run some basic distributions: maybe compare average Attack across types, or see if Dragon-types really are... Continue Reading →
How I Spent My Winter Break: Pandas, Machine Learning Models, and Continuous Growth
When the semester ended and winter break began, I saw it as the perfect opportunity to focus on skill-building. I spent time diving deep into data manipulation with Pandas and experimenting with machine learning models. It was a challenging yet rewarding experience that left me feeling more confident and equipped for future projects. Here’s what... Continue Reading →
Data Trasformation
When working with machine learning models, raw data often isn't enough. Features in your dataset may need transformation to make them understandable for algorithms, especially when dealing with categorical data. That’s where encoding techniques like One-Hot Encoding and Ordinal Encoding come into play. What Are One-Hot Encoding and Ordinal Encoding? One-Hot Encoding One-hot encoding is... Continue Reading →
Getting Started in Data Science: A Beginner’s Guide to Essential Libraries
Embarking on the journey of Data Science can be both exciting and overwhelming. There’s so much to learn—statistics, machine learning algorithms, data manipulation, and more. However, a few essential Python libraries can simplify the process and get you up to speed. In this post, I’ll briefly walk you through two fundamental libraries—pandas, NumPy, Scikit-learn, and... Continue Reading →