Engineering Skills for Data Scientists A Brief Introduction to HDF5 Data models and data formats are an easily overlooked but critical aspect of modern data infrastructure and development work. This post gives an introduction to HDF5 and how to get started using it in Go.
Computer Science Fundamentals Object-Oriented Programming: A Practical Introduction (Part 2) In Part 1 of this mini-series, you saw how OOP concepts can be used to structure and manipulate code. In this part, you'll see how these ideas are formally defined, and look at a couple of more advanced concepts too.
Engineering Skills for Data Scientists Fire: Simple CLIs done right Creating CLIs can help improve accessibility and reuse of your scripts and packages, but they can also be a bit of a pain to set up and maintain. Fire makes building CLIs for your latest ML pipeline a breeze.
Programming Cython: Lightspeed Python Python is a wonderful language for many applications, but it is not renowned for its speed. This post looks at how you can quickly and easily use Cython to dramatically accelerate your Python code (in some cases).
Technology Tracking Monster Jobs with TQDM TQDM is a tiny Python package that lets you add customisable progress bars to your code. Ideal for some those nasty multi-hour model training jobs.