Lecture 6

Welcome!

  • In previous weeks, you were introduced to the fundamental building blocks of programming.
  • You learned about programming in a lower-level programming language called C.
  • Today, we are going to work with a higher-level programming language called Python.
  • As you learn this new language, you’re going to find that you are going to be more able to teach yourself new programming languages.

Python

  • Humans, over the decades, have seen how previous design decisions could be improved upon.
  • Python is a programming language that builds upon what you have already learned in C.
  • Unlike in C, Python is an interpreted language, where you need not separately compile your program. Instead, you run your program in the Python Interpreter.

Hello

  • Up until this point, the code has looked like this:

    // A program that says hello to the world
    
    #include <stdio.h>
    
    int main(void)
    {
        printf("hello, world\n");
    }
    
  • Today, you’ll find that the process of writing and compiling code has been simplified.
  • For example, the above code will be rendered in Python as:

    # A program that says hello to the world
    
    print("hello, world")
    

    Notice that the semicolon is gone and that no library is needed.

  • Python notably can implement what was quite complicated in C with relative simplicity.

Speller

  • To illustrate this simplicity, let’s type ‘code dictionary.py’ in the terminal window and write code as follows:

    # Words in dictionary
    words = set()
    
    
    def check(word):
        """Return true if word is in dictionary else false"""
        return word.lower() in words
    
    
    def load(dictionary):
        """Load dictionary into memory, returning true if successful else false"""
        with open(dictionary) as file:
            words.update(file.read().splitlines())
        return True
    
    
    def size():
        """Returns number of words in dictionary if loaded else 0 if not yet loaded"""
        return len(words)
    
    
    def unload():
        """Unloads dictionary from memory, returning true if successful else false"""
        return True
    

    Notice that there are four functions above. In the check function, if a word is in words, it returns True. So much easier than an implementation in C! Similarly, in the load function the dictionary file is opened. For each line in that file, we add that line to words. Using rstrip, the trailing new line is removed from the added word. size simply returns the len or length of words. unload only needs to return True because Python handles memory management on its own.

  • The above code illustrates why higher-level languages exist: To simplify and allow you to write code more easily.
  • However, speed is a tradeoff. Because C allows you, the programmer, to make decisions about memory management, it may run faster than Python – depending on your code. While C only runs your lines of code, Python runs all the code that comes under the hood with it when you call Python’s built-in functions.
  • You can learn more about functions in the Python documentation

Filter

  • To further illustrate this simplicity, create a new file by typing code blur.py in your terminal window and write code as follows:

    # Blurs an image
    
    from PIL import Image, ImageFilter
    
    # Blur image
    before = Image.open("bridge.bmp")
    after = before.filter(ImageFilter.BoxBlur(1))
    after.save("out.bmp")
    

    Notice that this program imports modules Image and ImageFilter from a library called PIL. This takes an input file and creates and output file.

  • Further, you can create a new file called edges.py as follows:

    # Finds edges in an image
    
    from PIL import Image, ImageFilter
    
    # Find edges
    before = Image.open("bridge.bmp")
    after = before.filter(ImageFilter.FIND_EDGES)
    after.save("out.bmp")
    

    Notice that this code is a small adjustment to your blur code, but produces a dramatically different result.

  • Finally, you can even do face detection as follows:

    # Find faces in picture
    # https://github.com/ageitgey/face_recognition/blob/master/examples/find_faces_in_picture.py
    
    from PIL import Image
    import face_recognition
    
    # Load the jpg file into a numpy array
    image = face_recognition.load_image_file("office.jpg")
    
    # Find all the faces in the image using the default HOG-based model.
    # This method is fairly accurate, but not as accurate as the CNN model and not GPU accelerated.
    # See also: find_faces_in_picture_cnn.py
    face_locations = face_recognition.face_locations(image)
    
    for face_location in face_locations:
    
        # Print the location of each face in this image
        top, right, bottom, left = face_location
    
        # You can access the actual face itself like this:
        face_image = image[top:bottom, left:right]
        pil_image = Image.fromarray(face_image)
        pil_image.show()
    

    Notice how this file uses a third-party library called face_recognition. This is enabled by running pip install face_recognition in one’s terminal window.

  • Python allows you to abstract away programming that would be much more complicated within C and other lower-level programming languages.

CS50 Library

  • As with C, the CS50 library can be utilized within Python.
  • The following functions will be of particular use:

      get_float
      get_int
      get_string
    
  • You also have the option of importing only specific functions from the CS50 library as follows:

    from CS50 import get_float, get_int, get_string
    

Strings

  • In C, you might remember this code:

    // get_string and printf with %s
    
    #include <cs50.h>
    #include <stdio.h>
    
    int main(void)
    {
        string answer = get_string("What's your name? ");
        printf("hello, %s\n", answer);
    }
    
  • This code is transformed in Python to:

    # get_string and print, with concatenation
    
    from cs50 import get_string
    
    answer = get_string("What's your name? ")
    print("hello, " + answer)
    

    You can write this code by executing code hello.py in the terminal window. Then, you can execute this code by running python hello.py. Notice how the + sign concatenates "hello, " and answer.

  • Similarly, you could implement the above code as:

    # get_string and print, with format strings
    
    from cs50 import get_string
    
    answer = get_string("What's your name? ")
    print(f"hello, {answer}")
    

    Notice how the curly braces allow for the print function to interpolate the answer such that answer appears within. The f is required to include the answer properly formatting.

Variables

  • Variable declaration is simplified too. In C, you might have int counter = 0;. In Python, this same line would read counter = 0. You need not declare the type of the variable.
  • Python favors counter += 1 to increment by one, losing the ability found in C to type counter++.

Types

  • Data types in Python do not need to be explicitly declared. For example, you saw how answer above is a string, but we did not have to tell the interpreter this was the case: It knew on its own.
  • In Python, commonly used types include:

      bool
      float
      int
      str
    

    Notice that long and double are missing. Python will handle what data type should be used for larger and smaller numbers.

  • Some other data types in Python include:

      range
      list
      tuple
      dict
      set
    
  • Each of these data types can be implemented in C, but in Python they can be implemented more simply.

Calculator

  • You might recall calculator.c from earlier in the course:

    // Addition with int
    
    #include <cs50.h>
    #include <stdio.h>
    
    int main(void)
    {
        // Prompt user for x
        int x = get_int("x: ");
    
        // Prompt user for y
        int y = get_int("y: ");
    
        // Perform addition
        printf("%i\n", x + y);
    }
    
  • We can implement a simple calculator just as we did within C. Type code calculator.py into the terminal window and write code as follows:

    # Addition with int [using get_int]
    
    from cs50 import get_int
    
    # Prompt user for x
    x = get_int("x: ")
    
    # Prompt user for y
    y = get_int("y: ")
    
    # Perform addition
    print(x + y)
    

    Notice how the CS50 library is imported. Then, x and y are gathered from the user. Finally, the result is printed. Notice that the main function that would have been seen in a C program is gone entirely! While one could utilize a main function, it is not required.

  • It’s possible for one to remove the training wheels of the CS50 library. Modify your code as follows:

    # Addition with int [using input]
    
    # Prompt user for x
    x = input("x: ")
    
    # Prompt user for y
    y = input("y: ")
    
    # Perform addition
    print(x + y)
    

    Notice how executing the above code results in strange program behavior. Why might this be so?

  • You may have guessed that the interpreter understood x and y to be strings. You can fix your code by employing the int function as follows:

    # Addition with int [using input]
    
    # Prompt user for x
    x = int(input("x: "))
    
    # Prompt user for y
    y = int(input("y: "))
    
    # Perform addition
    print(x + y)
    

    Notice how the input for x and y is passed to the int function which converts it to an integer. Without converting x and y to be integers, the characters will concatenate.

Conditionals

  • In C, you might remember a program like this:

    // Conditionals, Boolean expressions, relational operators
    
    #include <cs50.h>
    #include <stdio.h>
    
    int main(void)
    {
        // Prompt user for integers
        int x = get_int("What's x? ");
        int y = get_int("What's y? ");
    
        // Compare integers
        if (x < y)
        {
            printf("x is less than y\n");
        }
        else if (x > y)
        {
            printf("x is greater than y\n");
        }
        else
        {
            printf("x is equal to y\n");
        }
    }
    
  • In Python, it would appear as follows:

    # Conditionals, Boolean expressions, relational operators
    
    from cs50 import get_int
    
    # Prompt user for integers
    x = get_int("What's x? ")
    y = get_int("What's y? ")
    
    # Compare integers
    if x < y:
        print("x is less than y")
    elif x > y:
        print("x is greater than y")
    else:
        print("x is equal to y")
    

    Notice that there are no more curly braces. Instead, indentations are utilized. Second, a colon is utilized in the if statement. Further, elif replaces else if. Parentheses are also no longer required in the if and elif statements.

  • In C, we faced challenges when we wanted to compare two values. Consider the following code:

    // Conditionals, Boolean expressions, relational operators
    
    #include <cs50.h>
    #include <stdio.h>
    
    int main(void)
    {
        // Prompt user for integers
        int x = get_int("What's x? ");
        int y = get_int("What's y? ");
    
        // Compare integers
        if (x < y)
        {
            printf("x is less than y\n");
        }
        else if (x > y)
        {
            printf("x is greater than y\n");
        }
        else
        {
            printf("x is equal to y\n");
        }
    }
    
  • In Python, we can execute the above as follows:

    # Conditionals, Boolean expressions, relational operators
    
    from cs50 import get_int
    
    # Prompt user for integers
    x = get_int("What's x? ")
    y = get_int("What's y? ")
    
    # Compare integers
    if x < y:
        print("x is less than y")
    elif x > y:
        print("x is greater than y")
    else:
        print("x is equal to y")
    

    Notice that the CS50 library is imported. Further, minor changes exist in the if statement.

  • Further looking at comparisons, consider the following code in C:

    // Logical operators
    
    #include <cs50.h>
    #include <stdio.h>
    
    int main(void)
    {
        // Prompt user to agree
        char c = get_char("Do you agree? ");
    
        // Check whether agreed
        if (c == 'Y' || c == 'y')
        {
            printf("Agreed.\n");
        }
        else if (c == 'N' || c == 'n')
        {
            printf("Not agreed.\n");
        }
    }
    
  • The above can be implemented as follows:

    # Logical operators
    
    from cs50 import get_string
    
    # Prompt user to agree
    s = get_string("Do you agree? ")
    
    # Check whether agreed
    if s == "Y" or s == "y":
        print("Agreed.")
    elif s == "N" or s == "n":
        print("Not agreed.")
    

    Notice that the two vertical bars utilized in C is replaced with or. Indeed, people often enjoy Python because it is more readable by humans. Also, notice that char does not exist in Python. Instead, strs are utilized.

  • Another approach to this same code could be as follows using lists:

    # Logical operators, using lists
    
    from cs50 import get_string
    
    # Prompt user to agree
    s = get_string("Do you agree? ")
    
    # Check whether agreed
    if s in ["y", "yes"]:
        print("Agreed.")
    elif s in ["n", "no"]:
        print("Not agreed.")
    

    Notice how we are able to express multiple keywords like y and yes in a list.

Object-Oriented Programming

  • Up until this point, our programs in this course have been linear: sequential.
  • It’s possible to have certain types of values not only have properties or attributes inside of them but have functions as well. In Python, these values are known as objects
  • In C, we could create a struct where you could associate multiple variables inside a single self-created data type. In Python, we can do this and also include functions in a self-created data type. When a function belongs to a specific object, it is known as a method.
  • For example, strs in Python have a built-in methods. Therefore, you could modify your code as follows:

    # Logical operators, using lists
    
    from cs50 import get_string
    
    # Prompt user to agree
    s = get_string("Do you agree? ").lower()
    
    # Check whether agreed
    if s.lower() in ["y", "yes"]:
        print("Agreed.")
    elif s.lower() in ["n", "no"]:
        print("Not agreed.")
    

    Notice how the old value of s is overwritten with the result of s.lower(), a built-in method of strs.

  • In this class, we will only scratch the surface of Python. Therefore, the Python documentation will be of particular importance as you continue.
  • You can learn more about string methods in the Python documentation

Loops

  • Loops in Python are very similar to C. You may recall the following code in C:

    // Demonstrates while loop
    
    #include <stdio.h>
    
    int main(void)
    {
        int i = 0;
        while (i < 3)
        {
            printf("meow\n");
            i++;
        }
    }
    
  • In Python, this code appears as:

    # Demonstrates while loop
    
    i = 0
    while i < 3:
        print("meow")
        i += 1
    
  • for loops can be implemented in Python as follows:

    # Better design
    
    for i in range(3):
        print("meow")
    

    Notice that i is never explicitly used. However, Python will increment the value of i.

  • Similarly, one could express the above code as:

    # Abstraction with parameterization
    
    def main():
        meow(3)
    
    
    # Meow some number of times
    def meow(n):
        for i in range(n):
            print("meow")
    
    
    main()
    

    Notice that a function is utilized to abstract away the meowing.

  • Finally, a while loop could be implemented as follows:

    # Demonstrates while loop
    
    i = 0
    while i < 3:
        print("meow")
        i += 1
    
  • To further our understanding of loops and iteration in Python, let’s create a new file called uppercase.py as follows:

    # Uppercases string one character at a time
    
    before = input("Before: ")
    print("After:  ", end="")
    for c in before:
        print(c.upper(), end="")
    print()
    

    Notice how end= is used to pass a parameter to the print function that continues the line without a line ending. This code passes one string at a time.

  • Reading the documentation, we discover that Python has methods that can be implemented upon the entire string as follows:

    # Uppercases string all at once
    
    before = input("Before: ")
    after = before.upper()
    print(f"After:  {after}")
    

    Notice how .upper is applied to the entire string.

Abstraction

  • As we hinted at earlier today, you can further improve upon our code using functions and abstracting away various code into functions. Modify your earlier-created meow.py code as follows:

    # Abstraction
    
    def main():
        for i in range(3):
            meow()
    
    # Meow once
    def meow():
        print("meow")
    
    
    main()
    

    Notice that the meow function abstracts away the print statement. Further, notice that the main function appears at the top of the file. At the bottom of the file, the main function is called. By convention, it’s expected that you create a main function in Python.

  • Indeed, we can pass variables between our functions as follows:

    # Abstraction with parameterization
    
    def main():
        meow(3)
    
    
    # Meow some number of times
    def meow(n):
        for i in range(n):
            print("meow")
    
    
    main()
    

    Notice how meow now takes a variable n. In the main function, you can call meow and pass a value like 3 to it. Then, meow utilizes the value of n in the for loop.

  • Reading the above code, notice how you, as a C programmer, are able to quite easily make sense of the above code. While some conventions are different, the building blocks you previously learned are very apparent in this new programming language.

Truncation and Floating Point Imprecision

  • Recall that in C, we experienced truncation where one integer being divided by another could result in an inprecise result.
  • You can see how Python handles such division as follows by modifying your code for calculator.py:

    # Division with integers, demonstration lack of truncation
    
    # Prompt user for x
    x = int(input("x: "))
    
    # Prompt user for y
    y = int(input("y: "))
    
    # Divide x by y
    z = x / y
    print(z)
    

    Notice that executing this code results in a value, but that if you were to see more digits after .333333 you’d see that we are faced with floating-point imprecision. Truncation does not occur.

  • We can reveal this imprecision by modifying our codes slightly:

    # Floating-point imprecision
    
    # Prompt user for x
    x = int(input("x: "))
    
    # Prompt user for y
    y = int(input("y: "))
    
    # Divide x by y
    z = x / y
    print(f"{z:.50f}")
    

    Notice that this code reveals the imprecision. Python still faces this issue, just as C does.

Exceptions

  • Let’s explore more about exceptions that can occur when we run Python code.
  • Modify calculator.py as follows:

    # Implements get_int
    
    def get_int(prompt):
        return int(input(prompt))
    
    
    def main():
    
        # Prompt user for x
        x = get_int("x: ")
    
        # Prompt user for y
        y = get_int("y: ")
    
        # Perform addition
        print(x + y)
    
    
    main()
    

    Notice that inputting the wrong data could result in an error.

  • We can try to handle and catch potential exceptions by modifying our code as follows:

    # Implements get_int with a loop
    
    def get_int(prompt):
        while True:
            try:
                return int(input(prompt))
            except ValueError:
                print("Not an integer")
    
    
    def main():
    
        # Prompt user for x
        x = get_int("x: ")
    
        # Prompt user for y
        y = get_int("y: ")
    
        # Perform addition
        print(x + y)
    
    
    main()
    

    Notice that the above code repeatedly tries to get the correct type of data, providing additional prompts when needed.

Mario

  • Recall a few weeks ago our challenge of building three blocks on top of one another, like in Mario.

    three vertical blocks

  • In Python, we can implement something akin to this as follows:

    # Prints a column of 3 bricks with a loop
    
    for i in range(3):
        print("#")
    
  • In C, we had the advantage of a do-while loop. However, in Python it is convention to utilize a while loop, as Python does not have a do while loop. You can write code as follows in a file called mario.py:

    # Prints a column of n bricks with a loop
    
    from cs50 import get_int
    
    while True:
        n = get_int("Height: ")
        if n > 0:
            break
    
    for i in range(n):
        print("#")
    

    Notice how the while loop is used to obtain the height. Once a height greater than zero is inputted, the loop breaks.

  • Consider the following image:

    four horizontal question blocks

  • In Python, we could implement by modifying your code as follows:

    # Prints a row of 4 question marks with a loop
    
    for i in range(4):
        print("?", end="")
    print()
    

    Notice that you can override the behavior of the print function to stay on the same line as the previous print.

  • Similar in spirit to previous iterations, we can further simplify this program:

    # Prints a row of 4 question marks without a loop
    
    print("?" * 4)
    

    Notice that we can utilize * to multiply the print statement to repeat 4 times.

  • What about a large block of bricks?

    three by three block of mario blocks

  • To implement the above, you can modify your code as follows:

    # Prints a 3-by-3 grid of bricks with loops
    
    for i in range(3):
        for j in range(3):
            print("#", end="")
        print()
    

    Notice how one for loop exists inside another. The print statement adds a new line at the end of each row of bricks.

  • You can learn more about the print function in the Python documentation

Lists

  • lists are a data structure within Python.
  • lists have built in methods or functions within them.
  • For example, consider the following code:

    # Averages three numbers using a list
    
    # Scores
    scores = [72, 73, 33]
    
    # Print average
    average = sum(scores) / len(scores)
    print(f"Average: {average}")
    

    Notice that you can use the built-in sum method to calculate the average.

  • You can even utilize the following syntax to get values from the user:

    # Averages three numbers using a list and a loop
    
    from cs50 import get_int
    
    # Get scores
    scores = []
    for i in range(3):
        score = get_int("Score: ")
        scores.append(score)
    
    # Print average
    average = sum(scores) / len(scores)
    print(f"Average: {average}")
    

    Notice that this code utilizes the built-in append method for lists.

  • You can learn more about lists in the Python documentation
  • You can also learn more about len in the Python documentation

Searching and Dictionaries

  • We can also search within a data structure.
  • Consider a program called phonebook.py as follows:

    # Implements linear search for names using loop
    
    # A list of names
    names = ["Carter", "David", "John"]
    
    # Ask for name
    name = input("Name: ")
    
    # Search for name
    for n in names:
        if name == n:
            print("Found")
            break
    else:
        print("Not found")
    

    Notice how this implements linear search for each name.

  • However, we don’t need to iterate through a list. In Python, we can execute linear search as follows:

    # Implements linear search for names using `in`
    
    # A list of names
    names = ["Carter", "David", "John"]
    
    # Ask for name
    name = input("Name: ")
    
    # Search for name
    if name in names:
        print("Found")
    else:
        print("Not found")
    

    Notice how in is used to implement linear search.

  • Still, this code could be improved.
  • Recall that a dictionary or dict is a collection of key and value pairs.
  • You can implement a dictionary in Python as follows:

    # Implements a phone book as a list of dictionaries, without a variable
    
    from cs50 import get_string
    
    people = [
        {"name": "Carter", "number": "+1-617-495-1000"},
        {"name": "David", "number": "+1-617-495-1000"},
        {"name": "John", "number": "+1-949-468-2750"},
    ]
    
    # Search for name
    name = get_string("Name: ")
    for person in people:
        if person["name"] == name:
            print(f"Found {person['number']}")
            break
    else:
        print("Not found")
    

    Notice that the dictionary is implemented having both name and number for each entry.

  • Even better, strictly speaking, we don’t need both a name and a number. We can simplify this code as follows:

    # Implements a phone book using a dictionary
    
    from cs50 import get_string
    
    people = {
        "Carter": "+1-617-495-1000",
        "David": "+1-617-495-1000",
        "John": "+1-949-468-2750",
    }
    
    # Search for name
    name = get_string("Name: ")
    if name in people:
        print(f"Number: {people[name]}")
    else:
        print("Not found")
    

    Notice that the dictionary is implemented using curly braces. Then, the statement if name in people searches to see if the name is in the people dictionary. Further, notice how, in the print statement, we can index into the people dictionary using the value of name. Very useful!

  • Python has done their best to get to constant time using their built-in searches.
  • You can learn more about dictionaries in the Python documentation

Command-Line Arguments

  • As with C, you can also utilize command-line arguments. Consider the following code:

    # Prints a command-line argument
    
    from sys import argv
    
    if len(argv) == 2:
        print(f"hello, {argv[1]}")
    else:
        print("hello, world")
    

    Notice that argv[1] is printed using a formatted string, noted by the f present in the print statement.

  • You can print all the arguments in argv as follows:

    # Printing command-line arguments, indexing into argv
    
    from sys import argv
    
    for i in range(len(argv)):
        print(argv[i])
    

    Notice that the above will not present the word python if executed, and the first argument will be the name of the file you are running. You can think of the word python as being analogous to ./ when we were running programs in C.

  • You can slice pieces of lists away. Consider the following code:

    # Printing command-line arguments
    
    from sys import argv
    
    for arg in argv:
        print(arg)
    

    Notice that executing this code will result in the name of the file you are running being sliced away.

  • You can learn more about the sys library in the Python documentation

Exit Status

  • The sys library also has built-in methods. We can use sys.exit(i) to exit the program with a specific exit code:

    # Exits with explicit value, importing sys
    
    import sys
    
    if len(sys.argv) != 2:
        print("Missing command-line argument")
        sys.exit(1)
    
    print(f"hello, {sys.argv[1]}")
    sys.exit(0)
    

    Notice that dot-notation is used to utilize the built-in functions of sys.

Third-Party Libraries

  • One of the advantages of Python is its massive user-base and similarly large number of third-party libraries.
  • For example, David demoed the use of cowsay and qrcode libraries.

Summing Up

In this lesson, you learned how the building blocks of programming from prior lessons can be implemented within Python. Further, you learned about how Python allowed for more simplified code. Also, you learned how to utilize various Python libraries. In the end, you learned that your skills as a programmer are not limited to a single programming language. Already, you are seeing how you are discovering a new way of learning through this course that could serve you in any programming language – and, perhaps, in nearly any avenue of learning! Specifically, we discussed…

  • Python
  • Variables
  • Conditionals
  • Loops
  • Types
  • Object-Oriented programming
  • Truncation and floating point imprecision
  • Exceptions
  • Dictionaries
  • Command-line arguments
  • Third-Party libraries

See you next time!