Lecture 1
- C
- hello, world
- Compilers
- String
- Scratch blocks in C
- Types, formats, operators
- More examples
- Memory, imprecision, and overflow
C
- Today weâll learn a new language, C: a programming language that has all the features of Scratch and more, but perhaps a little less friendly since itâs purely in text:
    #include <stdio.h> int main(void) { printf("hello, world\n"); }- Though the words are new, the ideas are exactly as same as the âwhen green flag clickedâ and âsay (hello, world)â blocks in Scratch:
  
 
- Though the words are new, the ideas are exactly as same as the âwhen green flag clickedâ and âsay (hello, world)â blocks in Scratch:
- Though cryptic, donât forget that 2/3 of CS50 students have never taken CS before, so donât be daunted! And though at first, to borrow a phrase from MIT, trying to absorb all these new concepts may feel like drinking from a fire hose, be assured that by the end of the semester weâll be empowered by and experienced at learning and applying these concepts.
- We can compare a lot of the constructs in C, to blocks weâve already seen and used in Scratch. The syntax is far less important than the principles, which weâve already been introduced to.
hello, world
- The âwhen green flag clickedâ block in Scratch starts the main program; clicking the green flag causes the right set of blocks underneath to start. In C, the first line for the same is int main(void), which weâll learn more about over the coming weeks, followed by an open curly brace{, and a closed curly brace}, wrapping everything that should be in our program.int main(void) { }
- The âsay (hello, world)â block is a function, and maps to printf("hello, world");. In C, the function to print something to the screen isprintf, wherefstands for âformatâ, meaning we can format the printed string in different ways. Then, we use parentheses to pass in what we want to print. We have to use double quotes to surround our text so itâs understood as text, and finally, we add a semicolon;to end this line of code.
- To make our program work, we also need another line at the top, a header line #include <stdio.h>that defines theprintffunction that we want to use. Somewhere there is a file on our computer,stdio.h, that includes the code that allows us to access theprintffunction, and the#includeline tells the computer to include that file with our program.
- To write our first program in Scratch, we opened Scratchâs website. Similarly, weâll use the CS50 Sandbox to start writing and running code the same way. The CS50 Sandbox is a virtual, cloud-based environment with the libraries and tools already installed for writing programs in various languages. At the top, there is a simple code editor, where we can type text. Below, we have a terminal window, into which we can type commands:
  
- Weâll type our code from earlier into the top, after using the +sign to create a new file calledhello.c:
  
- We end our programâs file with .cby convention, to indicate that itâs intended as a C program. Notice that our code is colorized, so that certain things are more visible.
Compilers
- Once we save the code that we wrote, which is called source code, we need to convert it to machine code, binary instructions that the computer understands directly.
- We use a program called a compiler to compile our source code into machine code.
- To do this, we use the Terminal panel, which has a command prompt. The $at the left is a prompt, after which we can type commands.- We type clang hello.c(whereclangstands for âC languagesâ, a compiler written by a group of people). But before we press enter, we click the folder icon on the top left of CS50 Sandbox. We see our file,hello.c. So we press enter in the terminal window, and see that we have another file now, calleda.out(short for âassembly outputâ). Inside that file is the code for our program, in binary. Now, we can type./a.outin the terminal prompt to run the programa.outin our current folder. We just wrote, compiled, and ran our first program!
 
- We type 
String
- But after we run our program, we see hello, world$, with the new prompt on the same line as our output. It turns out that we need to specify precisely that we need a new line after our program, so we can update our code to include a special newline character,\n:#include <stdio.h> int main(void) { printf("hello, world\n"); }- Now we need to remember to recompile our program with clang hello.cbefore we can run this new version.
 
- Now we need to remember to recompile our program with 
- Line 2 of our program is intentionally blank since we want to start a new section of code, much like starting new paragraphs in essays. Itâs not strictly necessary for our program to run correctly, but it helps humans read longer programs more easily.
- We can change the name of our program from a.outto something else, too. We can pass command-line arguments, or additional options, to programs in the terminal, depending on what the program is written to understand. For example, we can typeclang -o hello hello.c, and-o hellois telling the programclangto save the compiled output as justhello. Then, we can just run./hello.
- In our command prompt, we can run other commands, like ls(list), which shows the files in our current folder:$ ls a.out* hello* hello.c- The asterisk, *, indicates that those files are executable, or that they can be run by our computer.
 
- The asterisk, 
- We can use the rm(remove) command to delete a file:$ rm a.out rm: remove regular file 'a.out'?- We can type yoryesto confirm, and uselsagain to see that itâs indeed gone forever.
 
- We can type 
- Now, letâs try to get input from the user, as we did in Scratch when we wanted to say âhello, Davidâ:
   string answer = get_string("What's your name?\n"); printf("hello, %s\n", answer);- First, we need a string, or piece of text (specifically, zero or more characters in a sequence in double quotes, like "","ba", or âbananasâ), that we can ask the user for, with the functionget_string. We pass the prompt, or what we want to ask the user, to the function with"What is your name?\n"inside the parentheses. On the left, we want to create a variable,answer, the value of which will be what the user enters. (The equals sign=is setting the value from right to left.) Finally, the type of variable that we want isstring, so we specify that to the left ofanswer.
- Next, inside the printffunction, we want the value ofanswerin what we print back out. We use a placeholder for our string variable,%s, inside the phrase we want to print, like"hello, %s\n", and then we giveprintfanother argument, or option, to tell it that we want the variableanswerto be substituted.
 
- First, we need a string, or piece of text (specifically, zero or more characters in a sequence in double quotes, like 
- If we made a mistake, like writing printf("hello, world"\n);with the\noutside of the double quotes for our string, weâll see an errors from our compiler:$ clang -o hello hello.c hello.c:5:26: error: expected ')' printf("hello, world"\n); ^ hello.c:5:11: note: to match this '(' printf("hello, world"\n); ^ 1 error generated.- The first line of the error tells us to look at hello.c, line 5, column 26, where the compiler expected a closing parentheses, instead of a backslash.
 
- The first line of the error tells us to look at 
- To simplify things (at least for the beginning), weâll include a library, or set of code, from CS50. The library provides us with the stringvariable type, theget_stringfunction, and more. We just have to write a line at the top toincludethe filecs50.h:#include <cs50.h> #include <stdio.h> int main(void) { string name = get_string("What's your name?\n"); printf("hello, name\n"); }
- So letâs make a new file, string.c, with this code:#include <stdio.h> int main(void) { string name = get_string("What's your name?\n"); printf("hello, %s\n", name); }
- Now, if we try to compile that code, we get a lot of lines of errors. Sometimes, one mistake means that the compiler then starts interpreting correct code incorrectly, generating more errors than there actually are. So we start with our first error:
    $ clang -o string string.c string.c:5:5: error: use of undeclared identifier 'string'; did you mean 'stdin'? string name = get_string("What's your name?\n"); ^~~~~~ stdin /usr/include/stdio.h:135:25: note: 'stdin' declared here extern struct _IO_FILE *stdin; /* Standard input stream. */- We didnât mean stdin(âstandard inâ) instead ofstring, so that error message wasnât helpful. In fact, we need to import another file that defines the typestring(actually a training wheel from CS50, as weâll find out in the coming weeks).
 
- We didnât mean 
- So we can include another file, cs50.h, which also includes the functionget_string, among others.#include <cs50.h> #include <stdio.h> int main(void) { string name = get_string("What's your name?\n"); printf("hello, %s\n", name); }
- Now, when we try to compile our program, we have just one error:
    $ clang -o string string.c /tmp/string-aca94d.o: In function `main': string.c:(.text+0x19): undefined reference to `get_string' clang-7: error: linker command failed with exit code 1 (use -v to see invocation)- It turns out that we also have to tell our compiler to add our special CS50 library file, with clang -o string string.c -lcs50, with-lfor âlinkâ.
 
- It turns out that we also have to tell our compiler to add our special CS50 library file, with 
- We can even abstract this away and just type make string. We see that, by default in the CS50 Sandbox,makeusesclangto compile our code fromstring.cintostring, with all the necessary arguments, or flags, passed in.
Scratch blocks in C
- The âset [counter] to (0)â block is creating a variable, and in C we would write int counter = 0;, whereintspecifies that the type of our variable is an integer:
  
- âchange [counter] by (1)â is counter = counter + 1;in C. (In C, the=isnât like an equals sign in a equation, where we are sayingcounteris the same ascounter + 1. Instead,=is an assignment operator that means, âcopy the value on the right, into the value on the leftâ.) And notice we donât need to sayintanymore, since we presume that we already specified previously thatcounteris anint, with some existing value. We can also saycounter += 1;orcounter++;both of which are âsyntactic sugarâ, or shortcuts that have the same effect with fewer characters to type.
  
- A condition would map to:
   if (x < y) { printf("x is less than y\n"); }- Notice that in C, we use {and}(as well as indentation) to indicate how lines of code should be nested.
 
- Notice that in C, we use 
- We can also have if-else conditions:
   if (x < y) { printf("x is less than y\n"); } else { printf("x is not less than y\n"); }- Notice that lines of code that themselves are not some action (if..., and the braces) donât end in a semicolon.
 
- Notice that lines of code that themselves are not some action (
- And even else if:<
   if (x < y) { printf("x is less than y\n"); } else if (x > y) { printf("x is greater than y\n"); } else if (x == y) { printf("x is equal to y\n"); }- Notice that, to compare two values in C, we use ==, two equals signs.
- And, logically, we donât need the if (x == y)in the final condition, since thatâs the only case remaining, and we can just sayelse.
 
- Notice that, to compare two values in C, we use 
- Loops can be written like the following:
   while (true) { printf("hello, world\n"); }- The whilekeyword also requires a condition, so we usetrueas the Boolean expression to ensure that our loop will run forever. Our program will check whether the expression evaluates totrue(which it always will in this case), and then run the lines inside the curly braces. Then it will repeat that until the expression isnât true anymore (which wonât change in this case).
 
- The 
- We could do something a certain number of times with while:
   int i = 0; while (i < 50) { printf("hello, world\n"); i++; }- We create a variable, i, and set it to 0. Then, whilei < 50, we run some lines of code, and we add 1 toiafter each run.
- The curly braces around the two lines inside the whileloop indicate that those lines will repeat, and we can add additional lines to our program after if we wanted to.
 
- We create a variable, 
- To do the same repetition, more commonly we can use the forkeyword:for (int i = 0; i < 50; i++) { printf("hello, world\n"); }- Again, first we create a variable named iand set it to 0. Then, we check thati < 50every time we reach the top of the loop, before we run any of the code inside. If that expression is true, then we run the code inside. Finally, after we run the code inside, we usei++to add one toi, and the loop repeats.
 
- Again, first we create a variable named 
Types, formats, operators
- There are other types we can use for our variables
    - bool, a Boolean expression of either- trueor- false
- char, a single character like- aor- 2
- double, a floating-point value with even more digits
- float, a floating-point value, or real number with a decimal value
- int, integers up to a certain size, or number of bits
- long, integers with more bits, so they can count higher
- string, a string of characters
 
- And the CS50 library has corresponding functions to get input of various types:
    - get_char
- get_double
- get_float
- get_int
- get_long
- get_string
 
- For printf, too, there are different placeholders for each type:- %cfor chars
- %ffor floats, doubles
- %ifor ints
- %lifor longs
- %sfor strings
 
- And there are some mathematical operators we can use:
    - +for addition
- -for subtraction
- *for multiplication
- /for division
- %for remainder
 
More examples
- For each of these examples, you can click on the sandbox links to run and edit your own copies of them.
- In int.c, we get and print an integer:#include <cs50.h> #include <stdio.h> int main(void) { int age = get_int("What's your age?\n"); int days = age * 365; printf("You are at least %i days old.\n", days); }- Notice that we use %ito print an integer.
- We can now run make intand run our program with./int.
- We can combine lines and remove the daysvariable with:int age = get_int("What's your age?\n"); printf("You are at least %i days old.\n", age * 365);
- Or even combine everything in one line:
        printf("You are at least %i days old.\n", get_int("What's your age?\n") * 365);
- Though, once a line is too long or complicated, it may be better to keep two or even three lines for readability.
 
- Notice that we use 
- In float.c, we can get decimal numbers (called floating-point values in computers, because the decimal point can âfloatâ between the digits, depending on the number):#include <cs50.h> #include <stdio.h> int main(void) { float price = get_float("What's the price?\n"); printf("Your total is %f.\n", price * 1.0625); }- Now, if we compile and run our program, weâll see a price printed out with tax.
- We can specify the number of digits printed after the decimal with a placeholder like %.2ffor two digits after the decimal point.
 
- With parity.c, we can check if a number is even or odd:#include <cs50.h> #include <stdio.h> int main(void) { int n = get_int("n: "); if (n % 2 == 0) { printf("even\n"); } else { printf("odd\n"); } }- With the %(modulo) operator, we can get the remainder ofnafter itâs divided by 2. If the remainder is 0, we know thatnis even. Otherwise, we know n is odd.
- And functions like get_intfrom the CS50 library do error-checking, where only inputs from the user that matches the type we want is accepted.
 
- With the 
- In conditions.c, we turn the condition snippets from before into a program:// Conditions and relational operators #include <cs50.h> #include <stdio.h> int main(void) { // Prompt user for x int x = get_int("x: "); // Prompt user for y int y = get_int("y: "); // Compare x and y if (x < y) { printf("x is less than y\n"); } else if (x > y) { printf("x is greater than y\n"); } else { printf("x is equal to y\n"); } }- Lines that start with //are comments, or note for humans that the compiler will ignore.
- For David to compile and run this program in his sandbox, he first needed to run cd src1in the terminal. This changes the directory, or folder, to the one in which he saved all of the lectureâs source files. Then, he could runmake conditionsand./conditions. Withpwd, he can see that heâs in asrc1folder (inside other folders). Andcdby itself, with no arguments, will take us back to our default folder in the sandbox.
 
- Lines that start with 
- In agree.c, we can ask the user to confirm or deny something:// Logical operators #include <cs50.h> #include <stdio.h> int main(void) { // Prompt user to agree char c = get_char("Do you agree?\n"); // Check whether agreed if (c == 'Y' || c == 'y') { printf("Agreed.\n"); } else if (c == 'N' || c == 'n') { printf("Not agreed.\n"); } }- We use two vertical bars, ||, to indicate a logical âorâ, whether either expression can be true for the condition to be followed.
- And if none of the expressions are true, nothing will happen since our program doesnât have a loop.
 
- We use two vertical bars, 
- Letâs implement the coughing program from week 0:
    #include <stdio.h> int main(void) { printf("cough\n"); printf("cough\n"); printf("cough\n"); }
- We could use a forloop:#include <stdio.h> int main(void) { for (int i = 0; i < 3; i++) { printf("cough\n"); } }- By convention, programmers tend to start counting at 0, and so iwill have the values of0,1, and2before stopping, for a total of three iterations. We could also writefor (int i = 1, i <= 3, i++)for the same final effect.
 
- By convention, programmers tend to start counting at 0, and so 
- We can move the printfline to its own function:#include <stdio.h> void cough(void); int main(void) { for (int i = 0; i < 3; i++) { cough(); } } void cough(void) { printf("cough\n"); }- We declared a new function with void cough(void);, before ourmainfunction calls it. The C compiler reads our code from top to bottom, so we need to tell it that thecoughfunction exists, before we use it. Then, after ourmainfunction, we can implement thecoughfunction. This way, the compiler knows the function exists, and we can keep ourmainfunction close to the top.
- And our coughfunction doesnât take any inputs, so we havecough(void).
 
- We declared a new function with 
- We can abstract coughfurther:#include <stdio.h> void cough(int n); int main(void) { cough(3); } void cough(int n) { for (int i = 0; i < n; i++) { printf("cough\n"); } }- Now, when we want to print âcoughâ any number of times, we can just call the same function. Notice that, with void cough(int n), we indicate that thecoughfunction takes as input anint, which we refer to asn. And insidecough, we usenin ourforloop to print âcoughâ the right number of times.
 
- Now, when we want to print âcoughâ any number of times, we can just call the same function. Notice that, with 
- Letâs look at positive.c:#include <cs50.h> #include <stdio.h> int get_positive_int(void); int main(void) { int i = get_positive_int(); printf("%i\n", i); } // Prompt user for positive integer int get_positive_int(void) { int n; do { n = get_int("%s", "Positive Integer: "); } while (n < 1); return n; }- The CS50 library doesnât have a get_positive_intfunction, but we can write one ourselves. Our functionint get_positive_int(void)will prompt the user for anintand return thatint, which our main function stores asi. Inget_positive_int, we initialize a variable,int n, without assigning a value to it yet. Then, we have a new construct,do ... while, which does something first, then checks a condition, and repeats until the condition is no longer true.
- Once the loop ends because we have an nthat is not< 1, we can return it with thereturnkeyword. And back in ourmainfunction, we can setint ito that value.
 
- The CS50 library doesnât have a 
Screens
- We might want a program that prints part of a screen from a video game like Super Mario Bros. In mario0.c, we have:// Prints a row of 4 question marks #include <stdio.h> int main(void) { printf("????\n"); }
- We can ask the user for a number of question marks, and then print them, with mario2.c:#include <cs50.h> #include <stdio.h> int main(void) { int n; do { n = get_int("Width: "); } while (n < 1); for (int i = 0; i < n; i++) { printf("?"); } printf("\n"); }
- And we can print a two-dimensional set of blocks with mario8.c:// Prints an n-by-n grid of bricks with a loop #include <cs50.h> #include <stdio.h> int main(void) { int n; do { n = get_int("Size: "); } while (n < 1); for (int i = 0; i < n; i++) { for (int j = 0; j < n; j++) { printf("#"); } printf("\n"); } }- Notice we have two nested loops, where the outer loop uses ito do everything insidentimes, and the inner loop usesj, a different variable, to do somethingntimes for each of those times. In other words, the outer loop printsnârowsâ, or lines, and the inner loop printsnâcolumnsâ, or#characters, in each line.
 
- Notice we have two nested loops, where the outer loop uses 
- Other examples not covered in lecture are available under âSource Codeâ for Week 1.
Memory, imprecision, and overflow
- Our computer has memory, in hardware chips called RAM, random-access memory. Our programs use that RAM to store data as they run, but that memory is finite. So with a finite number of bits, we canât represent all possible numbers (of which there are an infinite number of). So our computer has a certain number of bits for each float and int, and has to round to the nearest decimal value at a certain point.
- With floats.c, we can see what happens when we use floats:#include <cs50.h> #include <stdio.h> int main(void) { // Prompt user for x float x = get_float("x: "); // Prompt user for y float y = get_float("y: "); // Perform division printf("x / y = %.50f\n", x / y); }- With %50f, we can specify the number of decimal places displayed.
- Hmm, now we get âŠ
        x: 1 y: 10 x / y = 0.10000000149011611938476562500000000000000000000000
- It turns out that this is called floating-point imprecision, where we donât have enough bits to store all possible values, so the computer has to store the closest value it can to 1 divided by 10.
 
- With 
- We can see a similar problem in overflow.c:#include <stdio.h> #include <unistd.h> int main(void) { for (int i = 1; ; i *= 2) { printf("%i\n", i); sleep(1); } }- In our forloop, we setito1, and double it with*= 2. (And weâll keep doing this forever, so thereâs no condition we check.)
- We also use the sleepfunction fromunistd.hto let our program pause each time.
- Now, when we run this program, we see the number getting bigger and bigger, until:
        1073741824 overflow.c:6:25: runtime error: signed integer overflow: 1073741824 * 2 cannot be represented in type 'int' -2147483648 0 0 ...
- It turns out, our program recognized that a signed integer (an integer with a positive or negative sign) couldnât store that next value, and printed an error. Then, since it tried to double it anyways, ibecame a negative number, and then 0.
- This problem is called integer overflow, where an integer can only be so big before it runs out of bits and ârolls overâ. We can picture adding 1 to 999 in decimal. The last digit becomes 0, we carry the 1 so the next digit becomes 0, and we get 1000. But if we only had three digits, we would end up with 000 since thereâs no place to put the final 1!
 
- In our 
- The Y2K problem arose because many programs stored the calendar year with just two digits, like 98 for 1998, and 99 for 1999. But when the year 2000 approached, the programs would have stored 00, leading to confusion between the years 1900 and 2000.
- A Boeing 787 airplane also had a bug where a counter in the generator overflows after a certain number of days of continuous operation, since the number of seconds it has been running could no longer be stored in that counter.
- So, weâve seen a few problems that can happen, but now understand why, and how to prevent them.
- With this weekâs problem set, weâll use the CS50 Lab, built on top of the CS50 Sandbox, to write some programs with walkthroughs to guide us.