β Week 02 Lab Solution
Collecting data from APIs (Weather data)
In this page you will find tips and solutions to the π» W02 Lab
1. Using VS Code
In the video below I (silently) demonstrate how to open a folder in VS Code, create files with a mix of shell commands and the drop-down menus, and how to interact with files in the editor. You can add comments and questions directly to the video if anything is unclear to you.
3. Solving the π Challenge
The challenge task asks:
Overall, how did the hourly temperature change from last year to this year? Compare each hour of todayβs forecast and last yearβs records and then take an average.
Assume I extracted the 24-hour forecast for today and the 24-hour historical data for the same day last year and saved them in the variables forecast_temp
and historical_temp
, respectively. These lists have the same length (24 elements) and the same order of hours (position 0 corresponds to midnight, position 1 to 1am, and so on). Watch the video above to see how to extract this data from the JSON files.
Before you jump to a solution, I want you to take πΌ baby steps 1. Think explicitly of what you know and what you donβt know and take it from there.
Hereβs the type of reasoning process I would like you to follow:
3.1. βWhat do I know?β
- Lists are things that hold elements
- I can access elements in a list by their position (also called index). For example,
forecast_temp[0]
gives me a number that is stored in the first position of the listforecast_temp
. - The two lists are comparable because they have the same length and all the elements are numbers.
- I can compare two numbers using the comparison operators (
<
,>
,==
,!=
,<=
,>=
) just like I do in real life. For example,5 > 3
isTrue
and5 < 3
isFalse
.
3.2 βLet me compare the elements manually firstβ
Doing manual comparisons is a good way to test ideas before writing more complex code. If I want compare the first element of the two lists, I can do it like this:
0] > historical_temp[0] forecast_temp[
Then, Iβd swap the positions of the two lists to see if the comparison changes:
1] > historical_temp[1] forecast_temp[
But what I need to do is store the difference, not simply compare them. Instead of >
I should use -
(subtraction) to calculate the difference:
# Difference between what's in
# the forecast today vs
# the historical temperature at the same hour last year
0] - historical_temp[0] forecast_temp[
3.3 βI need a way to store a bunch of differencesβ
Storing multiple elements is a job for a list. I can create an empty list and append the differences to it:
= []
differences
0] - historical_temp[0])
differences.append(forecast_temp[1] - historical_temp[1])
differences.append(forecast_temp[ ...
Great! If I keep repeating that until I reach the end of the lists, I will have all the differences stored in the list differences
.
βJon says itβs not a good idea to repeat code too much. I should use something else to avoid repeating the same code over and over.β
3.4 βA paradox: I need to repeat the same code without repeating the same codeβ
Ideally, you would revisit the material (Python pre-sessionals or the Control flow section of π§βπ« W02 Lecture Notes) until you come across the notion of for
loops.
Loops are part of the building blocks of programming. You write the code once and βasksβ your computer to do that repeatedly for you. How do you specify the number of times? In Python, we use the range()
function for this.
Type this on a Python shell or a Jupyter Notebook cell:
list(range(24))
The range function creates a sequence (a list!) of numbers from 0 to 23. This is perfect for our case because we have 24 hours in a day.
= []
differences
for i in range(24):
- historical_temp[i]) differences.append(forecast_temp[i]
The code above is equivalent to you writing the code 24 times, but in a much elegant and concise way. The first time the loop runs, i
receives the value 0
, the second time 1
, and so on until 23
.
If you find the above too cluttered, you can split the code into two lines:
= []
differences
for i in range(24):
# Create a variable to store the difference
= forecast_temp[i] - historical_temp[i]
diff
# Now it's clearer to me that
# I'm just adding a number to the list
differences.append(diff)
Either way works. The second version is more verbose but might be easier to understand.
π Think about it:How could I replace the number 24 in there such that in the future I can reuse this same code with a list of any arbitrary length?
3.5 βI have all the differences. Now what?β
You have all the differences stored in the list differences
. Now you need to calculate the average.
The average is calculated by adding all the numbers and dividing by the total number of elements. In Python, you can use the sum()
function to add all the elements of a list and the len()
function to get the total number of elements.
The new version of the code would look like this:
= []
differences
for i in range(24):
= forecast_temp[i] - historical_temp[i]
diff
differences.append(diff)
# Calculate the average
= sum(differences) / len(differences)
average
print(f"The average temperature difference is {average:.2f}Β°C")
And this is a solution to the challenge task. There are still things you can do to improve this code, but this is a good starting point.
Footnotes
If you are a beginner in coding, rushing to ChatGPT just to get a solution is the worst thing you can do for your learning. A Generative AI tools will surely give you a working solution but you wonβt be in control of your learning and as a consequence, you will struggle massively when we introduce more complex elements. Be in control!β©οΈ