STEP 3 focuses on the routes connecting Melbourne and Sydney, the two largest cities in Australia, because these routes have, by far, the largest number of scheduled flights.
The current code for STEP 2 does not show this fact:
df = pd.read_csv("otp_time_series_web.csv")
df.head()
Suggested update to the code for STEP 2:
df = pd.read_csv("otp_time_series_web.csv")
# Sort by month and by scheduled sectors, both descending, so the busiest routes appear first
df.sort_values(by=["Month", "Sectors_Scheduled"], ascending=[False, False]).head()
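Another way to surface the same fact might be to total the scheduled sectors per route instead of looking at a single month. The sketch below is only an illustration: it runs on a small made-up DataFrame rather than the real CSV, and the "Route" column name and every value in it are assumptions, not taken from the dataset.

```python
import pandas as pd

# Hypothetical stand-in for otp_time_series_web.csv.
# The "Route" column name and all numbers are invented for illustration.
sample = pd.DataFrame({
    "Route": ["Melbourne-Sydney", "Sydney-Melbourne", "Adelaide-Perth",
              "Melbourne-Sydney", "Sydney-Melbourne", "Adelaide-Perth"],
    "Month": ["2019-01", "2019-01", "2019-01",
              "2019-02", "2019-02", "2019-02"],
    "Sectors_Scheduled": [900, 910, 120, 850, 860, 110],
})

# Total scheduled sectors per route across all months, busiest first.
busiest = (
    sample.groupby("Route")["Sectors_Scheduled"]
    .sum()
    .sort_values(ascending=False)
)
print(busiest.head())
```

With the real file, replacing `sample` with `df` should give the same kind of ranking, provided the route column is actually named as assumed here.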