Posts
- Group by operations comparison in pandas, dplyr and SQL
- Non-normality
- Distance to the origin in multivariate normal distributions
- Debugging scikit-learn pipelines
- stackoverflow developer survey
- Handling imbalance in ML
- Github actions for automated testing
- Sample and effect sizes in hypothesis testing
- Survival analysis
- slider package
- A bayesian trick for feature engineering
- Feature selection part 2
- Feature selection part 1
- The Art of Readable Code