Writing
The most up-to-date list of my publications can be found on Google Scholar.
I’ve contributed several columns to the Nature Methods Points of Significance column on statistics.
Logistic regression
Regression can be used on categorical responses to estimate probabilities and to classify.
Classification evaluation
It is important to understand both what a classification metric expresses and what it hides.
Model selection and overfitting
“With four parameters I can fit an elephant and with five I can make him wiggle his trunk”. John von Neumann
Regularization
Constraining the magnitude of parameters of a model can control its complexity
Principal component analysis
PCA helps you interpret your data, but it will not always find the important patterns.