From: parson@kutztown.edu Sent: Wednesday, August 30, 2023 10:52 AMSubject: Python regular expression practice, office hours today CSC523 Students, I sent the email message below in response to a student's questions. Please read it. You are welcome to attend today's Zoom office hours as noted. I will post a video recording of that session at the bottom of the course page as noted. Do not worry about assignment 1 at this point. The goal is to practice with Pythex & regular expressions from the example handout code. Forget acad for now. Just download this zip file that is linked on the course page and unzip it on your computer: There is a copy of this solution zip file linked here. https://pythex.org/ is a very valuable interactive tool Here are the files of interest in that zip file. You can inspect them with notepad++ on your local PC or any text editor with which you are familiar: scanhtml.py code with regular expressions htmlSpring2021rawdata.txt Input data FileLink.csv.ref Output from scanhtml.py for file: href lines HttpLink.csv.ref Output from scanhtml.py for https?: href lines Pick a regular expression in scanhtml.py such as: httpPatternString = r'^(\d+)\s+.*href.*(https?):([^"]+)"' Paste the part between the single quotes into Pythex's regular expression panel: ^(\d+)\s+.*href.*(https?):([^"]+)" Paste a test string from the Input data file into Pythex's string panel. scanhtml.py already has an example line for each regular expression, so you don't even need to look at htmlSpring2021rawdata.txt Input data until you work through each pattern. Here is the spring from scanhtml.py: # 61 href="https://learningtechnologysupport.kutztown.edu/support/solutions/folders/9000185752">Zoom Paste this part into Pythex's string and understand how it works. Do that for all regular expressions in scanhtml.py: 61 href="https://learningtechnologysupport.kutztown.edu/support/solutions/folders/9000185752">Zoom Set all this up and practice on your home machine. I have Zoom office hours today 3-5 PM: https://kutztown.zoom.us/j/94322223872 CSC523 students are welcome to attend, and I plan to record any discussion around this assignment. I will post this video on Friday at the bottom of the course page. We will work through one example regular expression. After that, I can take questions about a specific problem via email, but I cannot debug massive problems via email. That is what the September 11 in-class work session is for. Read Assignment 1's handout before September 11. If you find it within reach you can work on it, but if not, wait until we go over it in class that night. The second half of class will be work time.