Probabilistic Data Structures and Algorithms for Big Data Applications.

A technical book about popular space-efficient data structures that are extremely useful in modern big data applications.

Read More

PyCon UA 2018, Kharkiv, April 28-29, 2018

PyCon UA 2018 on in Kharkiv, Ukraine. It is an independent, community-run, community-controlled and not-for-profit conference dedicated to the Python programming language.

I'm going to be a speaker with a talk about Time Series Forecasting with Python.

Read More

Load distribution with DNS Delegation

The talk is about the problem of balancing the load without a single point of failure with user geographics built-in support.

Read More

Implementing a Fileserver with Nginx and Lua.

Using the power of Nginx it is easy to implement the quite complex logic of file upload with metadata and authorization support and without the need of any heavy application server. In this article, you can find the basic implementation of such Fileserver using Nginx and Lua only.

Read More

An automatic terms extraction for Domain-specific corpora.

Using simple frequency-based methods, such as Domain Specificity method and Domain-Specific TF-IDF, it is possible to automatically extract and score terms for given domain-specific corpus. In this article, we will use Python and its ecosystem to illustrate such methods in action.

Read More

Recurrent Neural Networks. Part 1: Theory

In presentation I cover basic aspects of the popular RNN architectures: LSTM and GRU.

Read More

Data Mining 2014/2015 (Rus)

The course was offered in Fall 2014 to students of the School of Computer Science at V. Karazin Kharkov National University, Ukraine. It consists of 8 lectures and the final coursework task.

Read More

Probabilistic data structures. Quotient filter.

In this article, we continue our acquaintance with implementations of probabilistic sets and consider a modern successor of the Bloom filter that is called Quotient filter. Such data structures can effectively work in situations when we need to handle billions of elements and have optimized memory access.

Read More

A Simple Way to Find Turning points for a Trajectory with Python.

Using Ramer-Douglas-Peucker algorithm I construct an approximated trajectory and find valuable turning points.

Read More

A Simple Way to Find Outliers in an array with Python.

Using a basic definition of an outlier I show a simple Python function to detect such values and highlight them on a plot.

Read More