Please log into acad mcgonagall and run the following commands:
$ python -V Python 3.7.7 $ ipython -V 7.14.0
If you see earlier version numbers, edit a file called .bash_profile in your login directory and add the following 2 lines at the top:
alias python="/usr/local/bin/python3.7"
alias ipython="/usr/local/bin/ipython3"Log out, log back in, and check the version numbers again. Let me know if you run into problems.
After that, ssh
mcgonagall from acad and check
the versions. They should be the same. CSC523 makes heavy use
of mcgonagall in future assignments.
*****
I am teaching CSC223
for the first time this semester for anyone wanting to
review Python tutorials & Zoom videos.
8/28/2023 A student recommended from
experience this free terminal emulator for remote work on acad
/ mcgonagall:
MobaXterm
free Xserver and tabbed SSH client for Windows (mobatek.net)
The free version includes terminal emulator, SFTP-based
upload/download, and SSH tunneling (great for connecting to
mcgonnegall from acad) but just limits the user to 4 saved
connections.
Please log into acad and run the following commands:
$ python -V Python 3.7.7 $ ipython -V 7.14.0
If you see earlier version numbers, edit a file called .bash_profile in your login directory and add the following 2 lines at the top:
alias python="/usr/local/bin/python3.7"
alias ipython="/usr/local/bin/ipython3"Log out, log back in, and check the version numbers again. Let me know if you run into problems.
After that, ssh mcgonagall from acad and check the versions. They should be the same. CSC523 makes heavy use of mcgonagall in future assignments.
INSTANCE-BASED (LAZY) LEARNING
Compilation
of Weka slides on Instance Based Learning and Clustering
Weka Chapter 4, instance-based
learning at slide 90, clustering at slide 102.
Added 12/2/2023
~parson/DataMine/sciKitClusterCSC523Fall2023Assn3.02Dec2023.zip
https://scikit-learn.org/stable/modules/clustering.html
https://scikit-learn.org/stable/modules/classes.html#module-sklearn.cluster
https://scikit-learn.org/stable/modules/clustering.html#k-means
https://scikit-learn.org/stable/modules/generated/sklearn.cluster.AgglomerativeClustering.html#sklearn-cluster-agglomerativeclustering
https://scikit-learn.org/stable/modules/generated/sklearn.cluster.DBSCAN.html#sklearn.cluster.DBSCAN
Wissam Malke's thesis "Machine
Listening with Very Small Training Datasets"
Slides for his
thesis
Follow-up white
paper "Mapping
Data Visualization to Timbral Sonification and Machine
Listening"
Instance-Based
Learning Algorithms, a paper from 1991.
K*:
An Instance-based Learner Using an Entropic Distance Measure,
a paper from 1995.
Locally
Weighted Naive Bayes, a paper from 2012.
sklearn.neighbors.KNeighborsClassifier
and sklearn.neighbors.KNeighborsRegressor
A student and I discovered during office hours that:
Therefore: MAKE SURE
TO DO ALL make test
RUNS on mcgonagall.
ADDED 10/6:
To test STUDENT 1 thru 4, comment out the inside of this
table.
configTable = [ # list of 12-tuples per
CSC523f23Regressassn2_main.py:
# [modelType, dataName,
regressorName, regressor,
# nontargetDATAtrain,
targetDATAtrain,
# nontargetDATAtest,
targetDATAtest,
# nontargetNames,
targetATTR, classifierLabels, accuracyList]
# COMMENT OR TEMPORARILY REMOVE CONTENTS TO BE RESTORED
LATER.
]
Once STUDENT 1 thru 4 are done, the first diff should pass,
no errors.
+ diff --ignore-trailing-space --strip-trailing-cr
parson_CCs.txt parson_CCs.txt.ref
ADDED October 9:
The 3rd entry in the configTable has a mistake.
(Thanks to the student who caught this.):
['regressor',
'minsmooth', 'LinearRegression', linearRegression,
minsmoothTrainNontargetData, minsmoothTrainTargetData,
minsmoothTestNontargetData, minsmoothTestTargetData,
SmoothHeader[0:-1], RawHeader[-1], None, None],
SHOULD BE:
['regressor',
'minsmooth', 'LinearRegression', linearRegression,
minsmoothTrainNontargetData, minsmoothTrainTargetData,
minsmoothTestNontargetData, minsmoothTestTargetData,
SmoothHeader[0:-1], SmoothHeader[-1], None, None],
RawHeader goes with raw and SmoothHeader goes with smooth in
these tables.
After you make this fix there will be diffs in correct
solutions with your LOGINID and raptor species instead of BW:
$ cat LOGINID_CSC523f23Regressassn2.txt.dif
9c9
< BW_All_smooth =
---
> BW_All =
$ cat LOGINID_CSC523Fall2023TimeRegressOut.txt.dif
11c11
< ATTRIBUTES FOR DATA 3 ['WindSpd_mean_smooth',
'HMtempC_mean_smooth', 'wnd_WNW_NW_smooth'] ->
BW_All_smooth
---
> ATTRIBUTES FOR DATA 3 ['WindSpd_mean_smooth',
'HMtempC_mean_smooth', 'wnd_WNW_NW_smooth'] -> BW_All
After you fix that third entry in configTable, do the
following:
$ make clobber getfiles
That pulls down the .ref files that I updated this morning.