The most up-to-date list of my publications can be found on Google Scholar.

I’ve contributed several articles to the Nature Methods Points of Significance column on statistics.

Logistic regression

Regression can be used on categorical responses to estimate probabilities and to classify.

Classification evaluation

It is important to understand both what a classification metric expresses and what it hides.

Model selection and overfitting

“With four parameters I can fit an elephant and with five I can make him wiggle his trunk.” (John von Neumann)


Regularization

Constraining the magnitude of parameters of a model can control its complexity.

Principal component analysis

PCA helps you interpret your data, but it will not always find the important patterns.