Michael B. Gousie

Comp 198 - 21st Century Applied Text Computing (Spring 2026)

Introductory Information

Examples/Handouts

Linux

Sample data files

  • hof5.jpg - Hof, Germany. How random is that?
  • mobyDick.txt.
  • randomNumbers - some random numbers. Note that there is no file extension! How can this be?
  • week2.tar.gz - practice files for Week 2.
  • baseballDB.txt - practice database containing baseball statistics for week 3.
  • actors.csv - comma-separated values file containing a list of actors and their median "rank".
  • us-statesJun8Totals.csv - CSV file containing a list of states and COVID data from June 8, 2020. The FIPS column refers to the Federal Information Processing Standard code for each state and county. Cases refers to the total number of reported cases as of that day, and deaths refers to the total number of deaths as of that day.
  • myAmsterdam10-area.tmg - METAL TMG file for comparison purposes.
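
As a rough illustration of working with a CSV file like the state totals described above, the sketch below sums one named column with awk. It is not taken from the course files: the function name sum_csv_column, the header names cases and deaths, and the sample rows are all assumptions made up for the example, and the simple comma split does not handle quoted fields.

```bash
#!/bin/bash
# Hypothetical helper (not from the course materials): print the total of
# one named column in a CSV file whose first line is a header row.
sum_csv_column() {
    local file="$1" column="$2"
    awk -F',' -v col="$column" '
        # Header row: remember which field number matches the requested name.
        NR == 1 { for (i = 1; i <= NF; i++) if ($i == col) idx = i; next }
        # Data rows: add up that field (skipped if the column was not found).
        idx     { total += $idx }
        END     { print total + 0 }
    ' "$file"
}

# Demo on invented rows; these are NOT real COVID figures.
demo=$(mktemp)
printf '%s\n' 'state,fips,cases,deaths' 'StateA,01,10,1' 'StateB,02,20,2' > "$demo"
sum_csv_column "$demo" cases
sum_csv_column "$demo" deaths
rm -f "$demo"
```

Looking the column up by its header name, rather than by a fixed position, keeps the sketch from depending on a column order that the description does not state.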

Bash scripting

  • Bash scripting cheatsheet
  • bash1.sh - sample bash script.
  • bash2.sh - more input, files.
  • bash3.sh - floating point math.
  • bash4.sh - command line arguments.
  • bash5.sh - FOR loops and IF statements.
  • bash6.sh - basic one-dimensional array and how to generate random numbers.
  • bash7.sh - display contents of directory in reverse order.
  • bash7a.sh - a much easier way to do the same thing!
  • bash8.sh - same as bash7.sh but using a while loop.
  • bash9.sh - count the number of .sh files in directory.
  • bash10.sh - reading and writing files.
  • averageFile.sh - display sum and average of integers in a file.
  • sums - answer to Quiz 2 problem 2. Note how you don't need to read the entire file to find the median. Instead, use Linux commands to get to the middle value.
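
The idea in the last note, jumping straight to the middle value with Linux commands instead of looping over the file in the script, could look something like the sketch below. This is not the quiz answer itself: the function name median_of_file is made up, and the sketch assumes one integer per line, takes the lower of the two middle values when the count is even, and sorts first because it does not assume the file is already in order.

```bash
#!/bin/bash
# Hypothetical helper (not the course's sums script): print the median of a
# file that holds one integer per line.
median_of_file() {
    local file="$1"
    local count middle
    count=$(wc -l < "$file")          # how many values there are
    middle=$(( (count + 1) / 2 ))     # position of the middle value (lower middle if even)
    # Sort numerically, keep the first $middle lines, then take the last of those.
    sort -n "$file" | head -n "$middle" | tail -n 1
}

# Demo: the values 9 3 7 1 5 sort to 1 3 5 7 9, so this prints 5.
demo=$(mktemp)
printf '%s\n' 9 3 7 1 5 > "$demo"
median_of_file "$demo"
rm -f "$demo"
```

If the file is known to be sorted already, the sort step can be dropped and head and tail alone reach the middle line.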

Projects/Homeworks

  • HW 1 - written homework, due February 3.
  • awk tutorial.
  • HW 2 - electronic homework, due February 16.
  • HW 3 - electronic homework, due March 5.