CPSC 558 - Data Mining & Predictive Analytics II, Fall 2024, Thursday 6:00-8:50 PM, Old Main 158 .

Use Firefox or try other non-Chrome browser for these links. Chrome has problems
 
    Pitfalls of trying to use off-the-shelf AIs to do your work.

    Link to the Spring 2023 course.

Dr. Dale E. Parson Class will be live face-to-face or on-line at class time via Zoom.
Mon 6-8:50 PM, Zoom classes & recordings, https://faculty.kutztown.edu/parson
Class-time Zoom link for CSC558: See D2L Course CSC558 -> Content -> Overview for the link.
Student instructions for using Zoom.
IF you don’t want to be recorded or are a minor, use PRIVATE ZOOM CHAT to me for questions.
Please fill out & email Dr. Parson this permission to record slip. I will use it to take attendance in week 1.

Dr. Dale E. Parson, parson@kutztown.edu, Office hours: https://kutztown.zoom.us/j/94322223872
Office Hours
Mon 11 AM-1 PM, Wed 12-2 PM, Th 4-5 PM, or by appt . All available via Zoom.

KU offers a 4-course Graduate Certificate in Data Analytics. Talk with me if you want to sign up.

Our department is adding a Scripting Certificate, a Data Science major, and a Data Science minor in fall 2024.
    Instructions to Change, Add, Remove an UNDERGRADUATE Major, Minor or Certificate Program
 

First day handout (syllabus that is specific to this semester).

You may need to use the acad Linux server in another CPSC course. You will have to come in
via a VPN starting this fall. Here are the instructions for that. Download the VPN from here.
Non-Kutztown wireless devices now have to come in through the Golden Bears Wireless LAN.
 

RESOURCES & HANDOUTS


Open source Weka is the primary machine-learning library that we will use to analyze data relationships.
Here is our optional textbook's web page.
We will be using the Weka tool set, which you can download to your machine from here. (Download & install Weka 3.8.6).
    If your campus PC login comes up with a mount of the S: networked drive, then double click
        S:\ComputerScience\WEKA\WekaWith2GBcampus.bat from the Windows File explorer.
    You can also copy weka.jar to a thumb drive and run it from there using java -jar weka.jar.

        Here is a current weka.jar for Windows. I will add one for Mac soon.
    The PDF Appendix to our textbook is here. It is a 128-page tutorial on using Weka. Here is the Weka Wiki.
    I will draw some material from this textbook as well.
    You will turn in assignment solutions using D2L as instructed in assignment handouts.

Compilation of Weka slides on Instance Based Learning and Clustering.
A graph on informational entropy, relates to building rules & decision trees.
A page describing Bayes theorem and related matters.
A Bayes computer for a 52-card deck is on acad at ~parson/DataMine/BayesCards.py
Weka slides on evaluating numeric prediction.
A summary of the Kappa Statistic.
A subset of Weka Chapter 5 on Evaluation and 7 on Data Transformations.
Chapter 12 on Ensemble Learning.


ASSIGNMENTS

There is a 10% per late late penalty for projects that come in after the due date.    

Assignment 1 is due
via D2L Assignment 1 drop box by 11:59 PM Saturday September 21.
    All students are submitting via the D2L page for the regular course section .501.


ZOOM VIDEO ARCHIVE. Use Firefox or try other non-Chrome browser for these links. Chrome has problems.

August 29 Course intro, classification, regression, entropy, Bayes, instance-based models, kappa statistics, some demos.