Cluster analysis and spike sorting
Kenneth D. Harris, 15/7/15
Exploratory vs. confirmatory analysis
• Exploratory analysis
  • Helps you formulate a hypothesis
  • End result is often a nice-looking picture
  • Any method is equally valid – because it just helps you think of a hypothesis
• Confirmatory analysis
  • Where you test your hypothesis
  • Multiple ways to do it (Classical, Bayesian, Cross-validation)
  • You have to stick to the rules
• Inductive vs. deductive reasoning (K. Popper)
Principal component analysis
• Finds directions of maximum variance in a data set
• These correspond to the eigenvectors of the covariance matrix
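As a sketch of the idea above (the toy data and variable names here are illustrative, not from the slides), PCA can be done by diagonalising the covariance matrix of centred data:

```python
import numpy as np

# Toy data: 200 points with most variance along one direction.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2)) @ np.array([[2.0, 2.0], [0.5, -0.5]])

# Center the data, then take eigenvectors of the covariance matrix.
Xc = X - X.mean(axis=0)
cov = np.cov(Xc, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)   # ascending eigenvalue order

# Principal components: directions of maximum variance, largest first.
order = np.argsort(eigvals)[::-1]
components = eigvecs[:, order]
projected = Xc @ components              # data in the PCA basis
```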
Cluster analysis
Two main ways to do cluster analysis
• Model-free
  • Requires a distance measure between every pair of points
• Model-based
  • Assumes that points come from a probability distribution
Hierarchical clustering
• Model-free method
• Agglomerative
  • “Bottom up”
  • Sequentially merge similar points/clusters
• Divisive
  • “Top down”
  • Sequentially split clusters
  • Need to define how to split clusters
  • Can be slow, but can give better results
• Choose number of clusters by “slicing” the dendrogram
• Both are slow for large numbers of points: O(N³) unless you use tricks
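A minimal numpy sketch of the agglomerative (“bottom up”) variant with single linkage; the naive all-pairs search below is exactly why the cost is cubic without tricks. The function name and toy data are illustrative:

```python
import numpy as np

def single_linkage(X, n_clusters):
    """Naive O(N^3) agglomerative clustering: repeatedly merge the two
    closest clusters (single linkage) until n_clusters remain."""
    clusters = [[i] for i in range(len(X))]
    # Pairwise point distances, computed once.
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    while len(clusters) > n_clusters:
        best, best_d = (0, 1), np.inf
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single linkage: distance between the closest members.
                link = d[np.ix_(clusters[a], clusters[b])].min()
                if link < best_d:
                    best_d, best = link, (a, b)
        a, b = best
        clusters[a] += clusters.pop(b)
    return clusters

# Two well-separated blobs merge back into two clusters.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 0.1, (10, 2)), rng.normal(5, 0.1, (10, 2))])
labels = single_linkage(X, 2)
```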
Mean-shift clustering
• Compute a density estimate
• Compute its gradient
• Move each point “uphill”
• Number of clusters is set by the density-estimation parameters
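The three steps above can be sketched in numpy with a Gaussian kernel density estimate; the `bandwidth` argument is the density-estimation parameter that controls how many clusters emerge (data and names are illustrative):

```python
import numpy as np

def mean_shift(X, bandwidth, n_iter=50):
    """Move each point uphill on a Gaussian kernel density estimate
    until points pile up at the density modes (the clusters)."""
    Y = X.copy()
    for _ in range(n_iter):
        # Kernel weight of every data point, for every shifted point.
        d2 = ((Y[:, None, :] - X[None, :, :]) ** 2).sum(-1)
        w = np.exp(-d2 / (2 * bandwidth ** 2))
        # Each point moves to the weighted mean of its neighbours,
        # which is a step in the direction of the density gradient.
        Y = (w @ X) / w.sum(axis=1, keepdims=True)
    return Y

rng = np.random.default_rng(2)
X = np.vstack([rng.normal(0, 0.2, (30, 2)), rng.normal(4, 0.2, (30, 2))])
modes = mean_shift(X, bandwidth=0.5)
# Points from each blob converge towards that blob's density mode.
```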
Rodriguez-Laio clustering
• Number of clusters set by how many points you select
• Both Rodriguez-Laio and Mean Shift are O(N²) unless you use tricks
[Figure: decision plot of local density vs. distance to closest denser point]
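A sketch of the Rodriguez-Laio idea in numpy: compute each point's local density and its distance to the closest denser point, then pick cluster centres where both are large. The kernel density, the `rho * delta` ranking, and the simplified nearest-centre assignment below are illustrative choices, and the all-pairs distance matrix shows where the O(N²) cost comes from:

```python
import numpy as np

def density_peaks(X, d_c, n_clusters):
    """Pick centres with high local density AND high distance to the
    closest denser point, then assign points to the nearest centre
    (a simplification of the original neighbour-chain assignment)."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    rho = np.exp(-(d / d_c) ** 2).sum(axis=1)    # local density
    delta = np.empty(len(X))
    for i in range(len(X)):
        denser = np.where(rho > rho[i])[0]
        # Densest point overall gets the largest distance by convention.
        delta[i] = d[i, denser].min() if len(denser) else d[i].max()
    centres = np.argsort(rho * delta)[-n_clusters:]
    return np.argmin(d[:, centres], axis=1)

rng = np.random.default_rng(3)
X = np.vstack([rng.normal(0, 0.3, (40, 2)), rng.normal(5, 0.3, (40, 2))])
labels = density_peaks(X, d_c=0.5, n_clusters=2)
```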
Model-based clustering
• Fit a family of probability distributions, usually a “mixture model”:
  P(x) = Σ_k w_k P(x | θ_k)
• Example: mixture of circular Gaussians
  • P(x | k) = exp(−‖x − μ_k‖² / 2σ_k²) / (2πσ_k²)^(d/2), parameters θ_k = (μ_k, σ_k)
• Example: mixture of general Gaussians
  • P(x | k) = exp(−(x − μ_k)ᵀ Σ_k⁻¹ (x − μ_k) / 2) / ((2π)^d |Σ_k|)^(1/2), parameters θ_k = (μ_k, Σ_k)
How to fit?
• Usually by maximum likelihood: choose the parameters {w_k, θ_k} to maximize
  L = Σ_n log P(x_n) = Σ_n log Σ_k w_k P(x_n | θ_k)
• Can’t be done in one step.
E-M algorithm
• E (expectation) step: compute the probability r_nk that point n lies in cluster k:
  r_nk = w_k P(x_n | θ_k) / Σ_j w_j P(x_n | θ_j)
• M (maximization) step: update the cluster parameters using these probabilities as weights:
  w_k = Σ_n r_nk / N,  μ_k = Σ_n r_nk x_n / Σ_n r_nk  (and similarly for σ_k or Σ_k)
• Repeat until convergence
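The E and M steps above can be sketched for a mixture of circular (spherical) Gaussians; the deterministic initialisation and toy data are illustrative choices, not from the slides:

```python
import numpy as np

def em_gmm(X, K, n_iter=100):
    """EM for a mixture of K circular Gaussians.
    E step: responsibilities r[n, k]; M step: weighted ML updates."""
    N, D = X.shape
    # Illustrative init: means at the extremes of the first coordinate.
    mu = X[[np.argmin(X[:, 0]), np.argmax(X[:, 0])]].copy()
    var = np.full(K, X.var())
    w = np.full(K, 1.0 / K)
    for _ in range(n_iter):
        # E step: r[n, k] proportional to w_k N(x_n | mu_k, var_k I).
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(-1)
        log_p = np.log(w) - 0.5 * d2 / var - 0.5 * D * np.log(2 * np.pi * var)
        r = np.exp(log_p - log_p.max(axis=1, keepdims=True))
        r /= r.sum(axis=1, keepdims=True)
        # M step: re-estimate weights, means and variances.
        Nk = r.sum(axis=0)
        w = Nk / N
        mu = (r.T @ X) / Nk[:, None]
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(-1)
        var = (r * d2).sum(axis=0) / (D * Nk)
    return w, mu, var, r

rng = np.random.default_rng(4)
X = np.vstack([rng.normal(0, 0.5, (100, 2)), rng.normal(4, 0.5, (100, 2))])
w, mu, var, r = em_gmm(X, K=2)
```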
“Hard” EM algorithm
• E (expectation) step: assign each point to the single cluster k that maximizes w_k P(x_n | θ_k)
• Makes things much faster
• Hard EM with circular Gaussian clusters is called k-means
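A minimal k-means sketch, i.e. hard EM with circular Gaussians: a hard nearest-mean assignment replaces the E step, and the centroid update is the M step (the initialisation scheme and data are illustrative):

```python
import numpy as np

def kmeans(X, K, n_iter=50):
    """Hard EM with circular Gaussians: assign each point to its
    nearest mean (hard E step), then move each mean to the centroid
    of its assigned points (M step)."""
    # Illustrative init: means spread along the first coordinate.
    idx = np.argsort(X[:, 0])[np.linspace(0, len(X) - 1, K).astype(int)]
    mu = X[idx].copy()
    labels = np.zeros(len(X), dtype=int)
    for _ in range(n_iter):
        d2 = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(-1)
        labels = d2.argmin(axis=1)               # hard assignment
        for k in range(K):
            if np.any(labels == k):
                mu[k] = X[labels == k].mean(axis=0)
    return labels, mu

rng = np.random.default_rng(5)
X = np.vstack([rng.normal(0, 0.3, (100, 2)), rng.normal(4, 0.3, (100, 2))])
labels, mu = kmeans(X, K=2)
```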
How many clusters?
• Could choose by hand
• Or add a “penalty term” to the log likelihood and try many
• AIC (Akaike’s information criterion): AIC = 2k − 2 log L, where k is the number of parameters
• BIC (Bayesian information criterion): BIC = k log N − 2 log L, where N is the number of points
• AIC produces a lot more clusters than BIC
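The two penalties can be written out directly (using the standard AIC/BIC formulas, which I am assuming match the equations omitted from the slides); the log N factor in BIC penalizes parameters more heavily than AIC's constant 2 once N > e² ≈ 7.4, which is why AIC tends to keep more clusters:

```python
import numpy as np

def aic(log_likelihood, n_params):
    # AIC = 2k - 2 log L; lower is better.
    return 2 * n_params - 2 * log_likelihood

def bic(log_likelihood, n_params, n_points):
    # BIC = k log N - 2 log L; the log N factor penalizes extra
    # parameters more heavily for any realistic data-set size.
    return n_params * np.log(n_points) - 2 * log_likelihood
```

In practice one fits models with many different cluster counts and keeps the count minimizing the chosen criterion.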
Spike sorting
High dimensions
• The EM algorithm is order N. (Good!)
• But it does really badly in high dimensions. (As do others)
• No general solution
• Solution for spike sorting: “masked EM algorithm”
Local spike detection
Step 2: Masked EM algorithm
• Masked features are ignored
  – Solves the “curse of dimensionality”
• Scales with the number of unmasked features rather than the total number of features
• 1 million spikes, 128 channels: 1 day.
Kadir et al, Neural Computation 2014
Estimating performance
Manual verification essential
http://klusta-team.github.io/
https://github.com/kwikteam/phy
Recommended