30 Rock Analysis

TLDR: Some basic exploratory analysis with 30 Rock data.

Introduction

In this post we will explore some 30 Rock data. The data includes IMDb ratings, viewership numbers, and the writers of each episode. We will take a step-by-step approach to creating plots for ratings and viewership over time.

Libraries

For this analysis we will use the following Python libraries:

pandas for working with data frames
matplotlib.pyplot for creating plots and adjusting their features
matplotlib.dates for working with date formatting

import pandas as pd
import matplotlib.pyplot as plt
import matplotlib.dates as mdates

Create Data Frames

The dataset from Kaggle came in two CSV files. The first step is to read the CSV files into pandas data frames using the read_csv function. It is important to know what type of data the data frames contain. The pandas .dtypes property lists each column name along with the data type of its values. Comparing the results for the two data frames makes it clear that there is some overlap in the data: both have columns that hold values for season, episode, title, and air date. This overlap matters when the data frames are merged. The data types also show that the column original_air_data is stored as an object; it would be more helpful for it to be a datetime data type. ...
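The steps described in this excerpt (reading two CSV files, inspecting dtypes, converting the air-date column, and merging on the shared columns) can be sketched roughly as follows. This is a minimal illustration, not the post's actual code: the in-memory CSV text, the column names (`air_date`, `viewers`, `rating`), and every value are placeholders made up for the example, standing in for the real Kaggle files.

```python
import io

import pandas as pd

# Placeholder CSV text standing in for the two Kaggle files.
# Column names and values are hypothetical, not the real dataset.
episodes_csv = io.StringIO(
    "season,episode,title,air_date,viewers\n"
    "1,1,Episode A,2000-01-01,1.0\n"
    "1,2,Episode B,2000-01-08,2.0\n"
)
ratings_csv = io.StringIO(
    "season,episode,title,air_date,rating\n"
    "1,1,Episode A,2000-01-01,5.0\n"
    "1,2,Episode B,2000-01-08,6.0\n"
)

episodes = pd.read_csv(episodes_csv)
ratings = pd.read_csv(ratings_csv)

# Inspect the column types; without parsing, the date column is read as text.
print(episodes.dtypes)
print(ratings.dtypes)

# Convert the date column to a datetime type in both frames so the
# merge keys have matching types.
for frame in (episodes, ratings):
    frame["air_date"] = pd.to_datetime(frame["air_date"])

# Merge on the columns the two frames share.
shared_columns = ["season", "episode", "title", "air_date"]
merged = episodes.merge(ratings, on=shared_columns, how="inner")
print(merged.dtypes)
```

Merging on every shared column avoids duplicated `_x`/`_y` columns in the result; whether an inner join is the right choice depends on how well the two real files line up.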

April 17, 2024 · 9 min

Rose Bowl Win Probability

TLDR: Create a plot that shows the win probability throughout the 2024 Rose Bowl Game using data from College Football Database API. You can click here to go to the full code.

Introduction

What follows is the step-by-step approach that I took to plot the win probability for both teams in the 2024 Rose Bowl Game. The task is pretty straightforward: the API provides the exact data needed for the plot, and one could make this plot with a single API call. However, I wanted to add a few things that I believe improve the plot, such as using the team colors for the lines, adding the final score with logos, and using quarter endings as the x tick marks. Achieving this required additional data and a little extra work. ...
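As a rough sketch of two of the enhancements mentioned in this excerpt (per-team line colors and quarter endings as x tick marks), the snippet below uses made-up placeholder values rather than data from the API. The helper `quarter_end_ticks`, the variable names, and the colors are all hypothetical choices for illustration, not the post's actual code.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt


def quarter_end_ticks(periods):
    """Return the index of the last play in each period, in period order."""
    last_index = {}
    for i, period in enumerate(periods):
        last_index[period] = i  # later plays overwrite earlier ones
    return [last_index[p] for p in sorted(last_index)]


# Placeholder play-by-play values (hypothetical, not real game data).
periods = [1, 1, 2, 2, 3, 3, 4, 4]
home_wp = [0.50, 0.55, 0.48, 0.60, 0.52, 0.45, 0.58, 1.00]
away_wp = [1 - p for p in home_wp]

ticks = quarter_end_ticks(periods)

fig, ax = plt.subplots()
# Placeholder colors; real team colors would be substituted here.
ax.plot(home_wp, color="tab:blue", label="Home")
ax.plot(away_wp, color="tab:red", label="Away")
ax.set_xticks(ticks)
ax.set_xticklabels([f"End Q{p}" for p in sorted(set(periods))])
ax.set_ylim(0, 1)
ax.set_ylabel("Win probability")
ax.legend()
```

Calling `fig.savefig(...)` or switching to an interactive backend and calling `plt.show()` would display the result. Because the helper keys on the period number, it would also place a tick at the end of an overtime period if one appears in the data.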

April 3, 2024 · 15 min