Blog
About Me
Systems, Data, and the Practice of Statistical Thinking
Categories
All
(18)
A/B Testing
(2)
Airflow
(2)
Animation
(1)
App
(1)
BI
(1)
Clustering
(1)
CompVis
(1)
Docker
(1)
ETL
(2)
Geo
(1)
Graphs
(1)
LLM
(2)
ML
(1)
Macroeconomics
(1)
Marketing
(1)
Matlab
(1)
Money
(1)
NLP
(2)
News
(1)
Product
(3)
Python
(7)
R
(6)
RAG
(1)
Spatial
(1)
Statistics
(2)
Time-Series
(2)
Visualization
(1)
Time-Series Clustering with R's dtwclust Package
Time-Series
Clustering
R
This article is a practical guide for time-series clustering using the
dtwclust
package. The
dtwclust
package in R (see vignette) provides a powerful and flexible framework…
Aug 27, 2025
Aleksei
Minimum Detectable Effect (MDE) Calculation
A/B Testing
Product
Statistics
R
Minimum Detectable Effect (MDE) is defined as the smallest difference between a control and a test group that your A/B test can reliably identify as statistically…
Aug 7, 2025
Aleksei
A/B Testing: Concepts and Techniques
A/B Testing
Product
Statistics
Key components, metrics, errors, CUPED, multiple testing, peeking, and Bayesian vs frequentist approaches.
Jul 29, 2025
Aleksei
Animation of Spatial Data
Visualization
Spatial
Animation
R
Creating an animated visualization of spatial data using R
Jul 4, 2025
Aleksei
Product Cards Creation Application
App
LLM
Python
This application is designed to create product cards for an online store. It utilizes a large language model (LLM) and programming libraries to generate detailed…
May 31, 2025
Aleksei
Creating Anki Flashcards From List of Words
NLP
Python
This notebook demonstrates how to create Anki flashcards from a list of words. The example uses a list of German words related to “Die Stadt” (The City) and translates them…
May 3, 2025
Aleksei
Implementing a Local Retrieval-Augmented Generation System
RAG
NLP
LLM
Python
RAG (Retrieval-Augmented Generation) is a technology that combines the search external sources and generation using large language models. It allows for more accurate and…
Mar 21, 2025
Aleksei
Nerdy Valentine's in Python, R, and Matlab
Python
R
Matlab
Let’s celebrate Valentine’s Day with some nerdy love! In this post, we will create heart-shaped plots using Python, R, and Matlab. These heart-shaped plots are a fun and…
Feb 14, 2025
Aleksei
Run Docker Containers Remotely with Airflow
Airflow
Docker
Airflow is a powerful tool for automating workflows. Once you start using it, you’ll likely find it a great replacement for cron jobs on Linux machines. One common use case…
Jan 8, 2025
Aleksei
BI System Blueprint
BI
ETL
In this post, I’d like to present a blueprint for a BI system that I’ve been working on. The system is designed to provide insights into the company’s performance and…
Jan 6, 2025
Aleksei
CV Week 2024
CompVis
ML
In November 2024, the Yandex School of Data Analysis (YSDA) and Yandex conducted an open online intensive course on computer vision, focusing on generative diffusion models…
Dec 18, 2024
Aleksei
Using Airflow FileSensor for Triggering ETL Process
ETL
Airflow
In this article, we will discuss how to use the Airflow FileSensor to trigger an ETL process. We will walk through the process of setting up a FileSensor in Airflow and…
Nov 5, 2024
Aleksei
European Tech Salaries
Money
R
Recently, I stumbled upon a Reddit post where someone was gathering salary data from the tech sector throughout Europe. It piqued my interest to explore how these salaries…
Sep 27, 2024
Aleksei
Python Library for Russian Macroeconomics Data
Time-Series
Macroeconomics
Python
The article introduces a Python library called
sophisthse
that provides access to Russian macroeconomic statistics time series published by the National Research University…
Aug 22, 2024
Aleksei
Kano Method for Prioritization of Features
Marketing
Product
Python
The Kano model is a theory for product development and customer satisfaction developed in the 1980s by Professor Noriaki Kano. The model classifies customer preferences into…
Aug 5, 2024
Aleksei
Merging Customers Records Using Graphs in Python
Python
Graphs
Traditional relational databases and spreadsheets fall short in capturing complex relationships among customers. Enter graph theory – a powerful framework for representing…
Jul 31, 2024
Aleksei
Exploring Geospatial Insights with R and rnaturalearth
R
Geo
The article showcases the utilization of the
rnaturalearth
package for handling geographical data. This package provides valuable tools and functions for working with…
Jul 25, 2024
Aleksei
Welcome To My Blog
News
This is the first post in my blog. Welcome!
Jul 21, 2024
Aleksei
No matching items