Plotting timecourse of coefficients from EEG classification model using scipy.interpolate and matplotlib.animation

4 minute read

Published: October 29, 2020

This post outlines a Python script I wrote that takes coefficients from a series of EEG classification models and projects them back onto the scalp over time using scipy.interpolate and matplotlib.animation.

I’ve wanted an excuse to play around with matplotlib animations and scipy’s interpolation functionality. In the past, I’ve just outputted each frame as a .png and then uploaded them all to a gif-making website… Not ideal!

First, I’ll briefly describe the data I’m working with. These are coefficients from an ordinal logistic regression model that classifies EEG data. EEG is electrical activity recorded from an array of electrodes on the scalp. Without going into too much detail, this model classifies the number of items an individual is holding in their visual working memory. A separate model is trained at each timepoint, with the array of electrodes as the predictors.

After training the models (I’ll make a more in-depth blog post about this process at a later date), I extract the coefficients for each subject, each electrode, and each timepoint. That leaves us with this:

print(coefs.shape)
(30, 30, 145, 30)

That is a numpy array of shape: (n_subjects, n_cross_val_iters, n_timepoints, n_electrodes). I also need the x and y coordinates of where the electrodes are placed on the scalp. This allows me to project the 1-D array back into 2-D space. I’ll pick a random timepoint and project the coefficients on the “scalp”. Darker blue means that electrode was more heavily weighted in the model.

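import numpy as np
import matplotlib.pyplot as plt

plt.figure(figsize=(5,6.7))
# average over cross-val iterations, then subjects, and grab timepoint 48;
# stored under a new name so the full coefs array is still available later
coefs_t = abs(np.mean(np.mean(coefs,1),0)[48])
plt.scatter(chan_locs_y,chan_locs_x,s=100,c=coefs_t, cmap='Blues',edgecolors='k') #plotting coefficients of electrodes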
plt.scatter(0,0,s=70000,marker='o',facecolors='none', edgecolors='k') #plotting "head"
plt.scatter(0,1.3,marker='^',s=750, facecolors='none',edgecolors='k') # plotting "nose"
plt.ylim(-1.5,1.75)
plt.axis('off')

For this plot, imagine you are looking down at the top of someone’s head (the triangle is my attempt at a nose). These points are where the electrodes are placed. You can see how sparse the electrode array is. I will use scipy.interpolate.griddata to interpolate the data between the electrodes for better visualization.

from scipy import interpolate
# create grid for interpolation
grid_x, grid_y = np.mgrid[-1:1:1000j, -1:1:2000j]
# calculate average across subjects and cross-val iterations, and grab single timepoint
coefs_avg = abs(np.mean(np.mean(coefs,1),0)[i_timepoint])
# interpolate data across scalp
interp = interpolate.griddata((chan_locs_y, chan_locs_x),coefs_avg,(grid_x,grid_y),method='cubic')

The above code performs the actual interpolation. I will stick this in a function along with some other basic plotting settings (e.g. removing axes, adding a colorbar, etc.).
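The post does not show the body of that function, so here is a minimal sketch of what it could look like. The signature matches the call used in this post; everything inside it (the coarser 200x200 grid, the imshow rendering, the colorbar label, the title in ms, and electrode coordinates lying roughly within [-1, 1]) is an assumption for illustration, not the original implementation.

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy import interpolate


def create_frame(i_timepoint, coefs, timepoints, chan_locs_x, chan_locs_y):
    """Draw the interpolated coefficient map for one timepoint on the current figure."""
    plt.clf()  # clear the previous frame so the function can be called repeatedly

    # average over subjects (axis 0) and cross-val iterations (axis 1), then pick the timepoint
    coefs_avg = np.abs(coefs.mean(axis=(0, 1))[i_timepoint])

    # assumption: a coarser grid than in the post, to keep the sketch fast
    grid_x, grid_y = np.mgrid[-1:1:200j, -1:1:200j]
    interp = interpolate.griddata(
        (chan_locs_y, chan_locs_x), coefs_avg, (grid_x, grid_y), method="cubic"
    )

    # griddata returns NaN outside the convex hull of the electrodes; imshow leaves those blank.
    # Transposed so the first grid axis runs horizontally, as in the scatter plot.
    im = plt.imshow(interp.T, extent=(-1, 1, -1, 1), origin="lower", cmap="Blues")
    plt.colorbar(im, label="|coefficient|")
    plt.title(f"{timepoints[i_timepoint]} ms")
    plt.axis("off")
    return interp
```

Taking the frame index as the first argument is what lets the same function be handed to an animation later, since the animator supplies that index itself.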

create_frame(i_timepoint=48, coefs=coefs, timepoints=timepoints,chan_locs_x=chan_locs_x,chan_locs_y=chan_locs_y)

This is the same information as the previous plot, but it’s much easier to visualize. It’s clear that electrodes in the back of the head have higher coefficients than the rest. It’s worth noting that the electrode voltages were z-scored before the model was trained. This allows me to interpret these weights since the electrode voltages have the same scale.
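The z-scoring step is not shown in this post. As a rough sketch of the idea, assuming the voltages for one timepoint sit in an array of shape (n_trials, n_electrodes), each electrode can be standardized across trials. The function name, the array layout, and the synthetic data are all hypothetical.

```python
import numpy as np


def zscore_electrodes(X):
    """Z-score each electrode (column) across trials (rows)."""
    mean = X.mean(axis=0, keepdims=True)
    std = X.std(axis=0, keepdims=True)
    return (X - mean) / std


rng = np.random.default_rng(0)
# synthetic voltages: 200 trials x 30 electrodes, each electrode on a different scale
scales = rng.uniform(0.5, 20.0, size=30)
X = rng.normal(loc=5.0, scale=scales, size=(200, 30))
X_z = zscore_electrodes(X)
```

After this step every electrode has mean 0 and standard deviation 1, so a larger absolute coefficient reflects a larger contribution rather than a larger raw voltage range.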

But this is only one frame of data. In reality, this signal develops over time. This is a perfect excuse to use matplotlib’s animation functionality. And the create_frame() function is already set up in a way that works well with animation.FuncAnimation. First I will create the animator.

fig = plt.figure(figsize=(10,10))
ani = animation.FuncAnimation(fig, create_frame, fargs=(coefs, timepoints,chan_locs_x,chan_locs_y), frames=len(timepoints), repeat=True)

The above code basically passes animation.FuncAnimation a figure, a frame function, the parameters that get passed to the frame function, how many frames the animation should be, and if the animation should loop. Then, I need to create the writer and save the gif.

writer = animation.writers['pillow']
writer = writer(fps=5)
ani.save('coef.gif',writer=writer)

I opted to use Pillow just because I already had it installed. I tested a few frames-per-second and decided 5 was good. Then, I passed the writer to ani and saved the gif as “coef.gif”. Here is the final result below.
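For anyone who wants to try the same pattern without EEG data, here is a small self-contained sketch. It uses random values as a stand-in for the coefficients, a hypothetical draw_frame function, and a temporary output path, and it builds the writer with animation.PillowWriter directly, which should be equivalent to looking it up in animation.writers.

```python
import os
import tempfile

import matplotlib
matplotlib.use("Agg")  # render off-screen; no display is needed to save a gif
import matplotlib.pyplot as plt
import numpy as np
from matplotlib import animation

rng = np.random.default_rng(0)
n_frames, n_points = 5, 30
xs = rng.uniform(-1, 1, n_points)
ys = rng.uniform(-1, 1, n_points)
values = rng.random((n_frames, n_points))  # synthetic stand-in for the coefficients

fig = plt.figure(figsize=(3, 3))


def draw_frame(i_frame, values, xs, ys):
    # FuncAnimation passes the frame index first, then everything in fargs
    plt.clf()
    plt.scatter(xs, ys, c=values[i_frame], cmap="Blues", edgecolors="k", vmin=0, vmax=1)
    plt.title(f"frame {i_frame}")
    plt.axis("off")


ani = animation.FuncAnimation(
    fig, draw_frame, fargs=(values, xs, ys), frames=n_frames, repeat=True
)

# write to a temporary folder so the sketch does not clutter the working directory
out_path = os.path.join(tempfile.mkdtemp(), "demo.gif")
ani.save(out_path, writer=animation.PillowWriter(fps=5))
plt.close(fig)
```

Fixing vmin and vmax keeps the color scale constant across frames, which makes changes over time easier to compare.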

This is an interesting way of assessing my model. It allows me to see which regions of electrodes contribute the most to the model’s predictions at each timepoint. The coefficients are scattered before 0 ms because that is before the participant even sees the memory array. Around 150 ms I can see that rear electrodes are very heavily weighted. Then after around 400 ms this pattern becomes much more distributed.

I suspect variations on this visualization could be useful for any time-series classification/regression model that has spatial information. I’m glad that I tried this project because it got me using two tools I’ve been interested in for a while.


Calm Hands, help reduce nail-biting using computer vision and AI

3 minute read

Published: March 02, 2023

Calm Hands helps the user reduce nail-biting during computer use. It provides realtime feedback about nail-biting habits using a deep neural net that monitors images from your webcam stream. This process is entirely local and images are never saved. Feedback is provided through audio and visual cues to alert you when you are biting your nails. Realtime data visualization is provided as well. Check out the full repo here. Built with: Fastai, OpenCV, Tkinter, CustomTkinter, Matplotlib.

How I Made This

Step 1. Collect training and heldout test images

First I had to collect several hundred images of me biting my nails and not biting my nails. So I created camera.py and the Cam class, which I call in collect_training_data.ipynb. This allowed me to collect hundreds of photos in a variety of locations, lighting setups, and angles very easily.

from camera import Cam

cam = Cam()
# Collect 2 frames per second for 60 seconds.
cam.write_frame_stream(length = 60, wait = .5, path = 'frames/biting')

Step 2. Train the image classifier in Google Colab with fastai

I trained an edgenext_small model (imported from the timm library) using fastai. I used the proven method of finetuning a pretrained image classifier on this specific task.

learn = vision_learner(dls, 'edgenext_small', metrics=error_rate)
learn.fine_tune(3,base_lr = .001)

With ~1000 images and 3 cycles of training, I was at >90% accuracy. But I found there were specific positions and angles that the model was getting wrong.

Step 3. Collecting more data based on model mistakes

I created realtime_model_preds.ipynb to collect more data quickly, based on the predictions of the existing model. First I made a couple of functions to make predictions and print them out.

def do_prediction(frame):
    with learn.no_bar(), learn.no_logging():
        return learn.predict(frame)

def print_pred(pred):

    conf = round(float(pred[2][pred[1]])*100,2)
    output = f'The model is {conf}% confident that you are {pred[0]}'
    print(output)
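To make the indexing in print_pred concrete: a fastai prediction for a classifier comes back as a tuple of (decoded label, class index, per-class probabilities), so pred[2][pred[1]] picks out the probability of the predicted class. The sketch below shows the same logic on a hand-made tuple. The helper name format_pred, the label "biting", and the probabilities are made up for illustration, and a plain list stands in for the tensors fastai would return.

```python
def format_pred(pred):
    """Build the message that print_pred prints from a (label, index, probabilities) tuple."""
    label, idx, probs = pred
    # probability of the predicted class, as a percentage with two decimals
    conf = round(float(probs[idx]) * 100, 2)
    return f"The model is {conf}% confident that you are {label}"


# hand-made stand-in for a prediction; real fastai output holds tensors, not a list
fake_pred = ("biting", 0, [0.75, 0.25])
message = format_pred(fake_pred)
```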

Then, I fed the functions a stream of frames from the webcam. This would print out the model predictions of each frame from my webcam.

    # If I press 1, save the frame to the 'good' folder.
    # If I press 2, save it to the 'bad' (biting) folder.
    key_press_dict = {49:'train/good',50:'train/bad'}
    path = cam.get_path_from_keypress(key_press_dict)
    cam.write_frame(frame,labelled_folder_path=path)

I moved around until I found positions that the model was incorrectly predicting. Then, I pressed either ‘1’ or ‘2’ on the keyboard to save that frame to the correct folder and add it to the training set. After collecting several hundred more photos, I retrained the model. Now it was performing much better on the edge cases.
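The numbers 49 and 50 in key_press_dict are the character codes for '1' and '2', which is what key-reading calls such as OpenCV's waitKey typically report for those keys. The Cam.get_path_from_keypress method is not shown here, so the lookup below is only a hypothetical stand-in for the mapping step, written without a webcam so it can run anywhere.

```python
def path_from_keycode(keycode, key_press_dict):
    """Return the folder mapped to a key code, or None if the key is not mapped."""
    return key_press_dict.get(keycode)


# ord() gives the code for a character, so the dict can be written without magic numbers
key_press_dict = {ord("1"): "train/good", ord("2"): "train/bad"}

good_path = path_from_keycode(49, key_press_dict)
unmapped = path_from_keycode(ord("q"), key_press_dict)
```

Returning None for unmapped keys gives the caller an easy way to skip saving a frame when any other key is pressed.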

Step 4. Creating the app

I used tkinter and customtkinter to create the GUI for the app. It displays the webcam feed on one side, alongside a matplotlib plot of the model’s predictions. It also provides instant auditory feedback if I’m biting my nails. Creating the GUI was much more complicated than I expected. I’d never really created a user interface before, and getting all of the positioning and callbacks working correctly took a while. Getting the plot to update smoothly and correctly also took a lot of effort. But I think the final product looks and functions well!

Interesting and useful resources for data science, AI, neuroscience, statistics, and more

3 minute read

Published: March 18, 2022

A list of useful or interesting articles, courses, and blogs relating to data science, neuroscience, AI, and statistics (mostly).

Advice for becoming a data scientist as a psychology/neuroscience PhD student

2 minute read

Published: March 03, 2022

Here’s some advice if you’re considering moving into data science during a psychology PhD. I tried to make it as practical as possible.

1. Learn fundamentals of programming, statistics, and data science.

Learn Python or R, and probably not Matlab. I see way fewer positions that mention Matlab compared to R and especially Python. Take online courses in machine learning and statistics. I highly recommend The Missing Semester of Your CS Education if, like me, you have no CS background. Don’t start with deep learning and neural networks. Make sure you understand data science and machine learning fundamentals very well before moving on to deep learning. When you do move on, I like the fastai course. And check out some of my other recommended Courses & Textbooks.

2. Do data science.

This obviously happens in parallel to step 1. Most neuroscience/psychology research already involves lots of data science elements: building and cleaning data sets, data visualization, statistical testing, modelling, predictions, writing, science communication, etc. You’re probably already a better data scientist than you realize. Emphasize these aspects of your research. Projects and publications will help show your skills on your resume and during interviews. Make a website to showcase your research and side projects. I use the Minimal Mistakes GitHub Pages theme.

3. Communicate early and often.

Tell your PI/advisor that you’re considering doing a summer internship. Some advisors might not let you. I was lucky enough to have a supportive advisor. Tell your lab mates, fellow grad students, and collaborators. When people know you’re interested, they are more likely to tell you about interesting positions they know of.

4. Apply to internships.

Aim to do at least one summer internship during your PhD. Two is even better, ideally between your 3rd & 4th and 4th & 5th years. Apply to lots of internships. I think I applied to over 50. Use job posting sites like Indeed or Google Jobs. Check Twitter (search “PhD Internship”, “PhD Data Internship”, etc.). Ask friends and collaborators who you know went into industry. Connections can be extremely important because, honestly, it can be difficult to convince recruiters that a psychology PhD student would make a good data scientist.

5. Do your internships.

Do a data science internship! You’ll hopefully learn a lot about putting models into production, collaboration, and more structured development. Some internships turn into full-time offers. Once you have one or two internships under your belt, you’ll be in a much better position when applying for full-time jobs upon graduation.

Additional reading.

Advice for PhD Students Thinking about Data Science Internships
Crushed it! Landing a data science job
What candidates can and cannot control in their job hunt
Advice for Applying to Data Science Jobs

Competitive fighting game with heuristic-based AI

6 minute read

Published: January 04, 2022

I created a 1v1 fighting game in Python using PyGame. Included is a 1-player option that allows you to play against a challenging AI opponent. It can make fast decisions based on many different factors of the current game state. I personally lose to it almost half of the time.

William Thyer, PhD