Activity or Lab
This archive contains two activities (assignments) suitable for a CS0 or introductory course that uses Python.
In the first activity, students are asked to "fill in the code" in a series of short programs that compute a similarity metric (cosine similarity) for text documents. This involves string tokenization and frequency counting using Python string methods and built-in data types.
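As a rough illustration of the kind of computation the first activity builds toward, the sketch below tokenizes two strings, counts word frequencies, and computes the cosine similarity of the resulting count vectors. It is not the activity's own code: the function names `tokenize` and `cosine_similarity`, and the choice to lowercase and strip punctuation, are assumptions made for this example.

```python
import math
import string
from collections import Counter


def tokenize(text):
    """Lowercase the text, split on whitespace, and strip surrounding punctuation.

    This tokenization rule is an assumption for illustration only.
    """
    words = []
    for raw in text.lower().split():
        word = raw.strip(string.punctuation)
        if word:  # skip tokens that were only punctuation
            words.append(word)
    return words


def cosine_similarity(doc_a, doc_b):
    """Return the cosine of the angle between the word-count vectors of two texts."""
    counts_a = Counter(tokenize(doc_a))
    counts_b = Counter(tokenize(doc_b))

    # Dot product: only words present in both documents contribute.
    dot = sum(counts_a[word] * counts_b[word] for word in counts_a)

    # Euclidean length of each count vector.
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))

    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    print(cosine_similarity("The cat sat.", "the cat sat"))
    print(cosine_similarity("the cat", "the dog"))
```

Identical texts score close to 1, texts with no words in common score 0, and partially overlapping texts fall in between.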
In the second activity (taken directly from Think Python 2e), students use a pronunciation dictionary to solve a riddle involving homophones.
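The core idea behind the second activity can be sketched as a dictionary lookup: two words are homophones if a pronunciation dictionary maps them to the same phoneme sequence. The example below is only a sketch. The dictionary `toy_pronunciations` and its phoneme strings are made-up stand-ins for a real pronunciation dictionary, and `are_homophones` is a hypothetical helper name, not one taken from the activity.

```python
# A tiny, hand-written stand-in for a pronunciation dictionary.
# Keys are words; values are phoneme strings (illustrative, not authoritative).
toy_pronunciations = {
    "knight": "N AY1 T",
    "night": "N AY1 T",
    "wrap": "R AE1 P",
    "rap": "R AE1 P",
    "ring": "R IH1 NG",
}


def are_homophones(word_a, word_b, pronunciations):
    """Return True if both words are in the dictionary with the same pronunciation."""
    if word_a not in pronunciations or word_b not in pronunciations:
        return False  # unknown words cannot be compared
    return pronunciations[word_a] == pronunciations[word_b]


if __name__ == "__main__":
    print(are_homophones("knight", "night", toy_pronunciations))
    print(are_homophones("wrap", "ring", toy_pronunciations))
```

With a full pronunciation dictionary loaded in place of the toy one, the same check can be applied to every candidate word the riddle describes.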
This OER material was produced as a result of the CS04ALL CUNY OER project.
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 4.0 License.
Johnson, Hunter R., "CS04ALL: Natural Language Processing Project" (2019). CUNY Academic Works.