Regular, um, Expressions

It’s not uncommon, in English, at least, to say “um” when trying to, um, think of a word. The more you do it, though, the more noticeable it tends to be!

In a file called um.py, implement a function called count that expects a line of text as input as a str and returns, as an int, the number of times that “um” appears in that text, case-insensitively, as a word unto itself, not as a substring of some other word. For instance, given text like hello, um, world, the function should return 1. Given text like yummy, though, the function should return 0.

Structure um.py as follows, wherein you’re welcome to modify main and/or implement other functions as you see fit, but you may not import any other libraries. You’re welcome, but not required, to use re and/or sys.

import re
import sys

def main():
    print(count(input("Text: ")))

def count(s):
    ...

...

if __name__ == "__main__":
    main()

Either before or after you implement count in um.py, additionally implement, in a file called test_um.py, three or more functions that collectively test your implementation of count thoroughly, each of whose names should begin with test_ so that you can execute your tests with:

pytest test_um.py

Hints

Recall that the re module comes with quite a few functions, per docs.python.org/3/library/re.html, including findall.
Recall that regular expressions support quite a few special characters, per docs.python.org/3/library/re.html#regular-expression-syntax.
Because backslashes in regular expressions could be mistaken for escape sequences (like \n), best to use Python’s raw string notation for regular expression patterns. Just as format strings are prefixed with f, so are raw strings prefixed with r. For instance, instead of "harvard\.edu", use r"harvard\.edu".
Note that \b is “defined as the boundary between a \w and a \W character (or vice versa), or between \w at the beginning/end of the string,” per docs.python.org/3/library/re.html#regular-expression-syntax.
You might find regex101.com or regexr.com helpful for testing regular expressions (and visualizing matches).
See thefreedictionary.com/words-containing-um for some words that contain “um”.

Demo

How to Test

How to Test `um.py`

Here’s how to test um.py manually:

Run your program with python um.py. Ensure your program prompts you for an input. Type um, followed by Enter. Your count function should return 1.
Run your program with python um.py. Type um?, followed by Enter. Your count function should return 1.
Run your program with python um.py. Type Um, thanks for the album., followed by Enter. Your count function should return 1.
Run your program with python um.py. Type Um, thanks, um..., followed by Enter. Your count function should return 2.

How to Test `test_um.py`

To test your tests, run pytest test_um.py. Try to use correct and incorrect versions of um.py to determine how well your tests spot errors:

Ensure you have a correct version of um.py. Run your tests by executing pytest test_um.py. pytest should show that all of your tests have passed.
Modify the count function in the correct version of um.py. count might, for example, mistakently also count any “um” that is part of a word. Run your tests by executing pytest test_um.py. pytest should show that at least one of your tests has failed.
Again modify the count function in the correct version of um.py. count might, for example, mistakenly only match an “um” that is surrounded on either side by a space. Run your tests by executing pytest test_um.py. pytest should show that at least one of your tests has failed.

Regular, um, Expressions

Demo

How to Test

How to Test um.py

How to Test test_um.py

How to Test `um.py`

How to Test `test_um.py`