Skip to main content

Datascience

loading · loading ·
Organizing computational and research projects, an expanded guide
·640 words·4 mins
A series of principles and practical habits for building, managing, and sustaining computational and data science projects with clarity and reproducibility.
Cleaning Data Methodology
·1681 words·8 mins
Data is never clean, and that’s okay. Understanding its noise, origins, and structure tells us a lot about the data.
Domain-Driven Design for Data Scientists
·1204 words·6 mins
Data scientists should have domain-driven thinking and why understanding the problem domain still matters more than the model even in the rise of LLMs.
The Problem With Proprietary LLM Providers: Removing Model Access without recourse
·414 words·2 mins
OpenAI’s removal of GPT-4o, o3, and other models after GPT-5’s launch breaks fundamental MLOps principles. Without model versioning and control, data science workflows become unreliable. Local LLMs offer a better alternative for maintaining consistency.
Shape Up Method in Data Science Projects
·993 words·5 mins
Adapting Shape Up methodology for data science work, what works and what doesn’t
2022 - The year of my Azure certifications
·270 words·2 mins
I wanted to get good at Azure that I was using at work and decided to upskill and do a certification journey.