usethe.computer - a blog by Derrick W. Turk

Inadvisable Unit Shenanigans

2026-06-02

It’s overwrought, hard to type check, confusing to read, and probably doesn’t meaningfully catch errors a human would actually make. Nonetheless, I like it.

Trie-ing Too Hard With 297 and 298 Files

2026-05-13

Once upon a time, there was a family of data formats called 297 and 298.

Blood From a Stone

2026-04-08

Reverse-engineering the Petra .grd grid format.

Fun With Simulated Typestate in Python 3.8

2020-07-21

“There’s an old saying in Tennessee—I know it’s in Texas, probably in Tennessee—that says, ‘Fool me once, shame on…shame on you. Fool me—you can’t get fooled again.’”
–George W. Bush

It’s true, you can’t get fooled again! Not any more than you can open an already-open door. But does your type system know that?

Today, in honor of the recent release of Python 3.8, we’ll introduce a fun type-level programming trick well-known already in other language communities, which will let us automatically check these and other invariants.

Irregular Expressions, Revisited

2020-06-26

Last time, we used a minimalist parser combinator library to build a parser for an oddly familiar language called OBAN. The problem with our previous parser is that it produces extremely unhelpful error messages. This is probably fine for a parser which runs as part of an automated toolchain and processes almost-always-valid input, but is completely unacceptable for a user-facing tool.

We’ll address this, while making only minimal changes to the parser’s structure, by tweaking the “base monad” on which it's built. In other words, we’ll change what it means to chain parsers together.

Irregular Expressions: You Need a Parser

2020-06-22

Some people, when confronted with a problem, think “I know, I’ll use regular expressions.” Now they have two problems.
–Jamie Zawinski

The fundamental problem with regular expressions is that they only recognize regular languages. This sounds, and is, tautological, but it has huge implications.

Minimalist DCA in Python

2020-05-20

It’s important, as a rule of thumb, when operating or investing in a firm which produces a physical commodity, to have the ability to reliably quantify the expected future production of the commodity given the firm’s assets. In the oil and gas industry, we have many different ways to forecast future production from an oil or gas well. Some rely on detailed measurements and explicitly incorporate detailed mathematical models of flow physics. Others use whatever historical data we can scrape together and a bit of curve-fitting.

Today, we’re talking about the second kind.

XMHell: Handling 38GB of UTF-16 XML with Rust

2020-05-11

A couple weekends ago, I found myself with the desire to fetch oil and gas production data for a specific county in New Mexico from the New Mexico Oil Conservation Division (OCD).

Fortunately, the OCD provides access to historical well production via an FTP server. The OCD doesn’t seem to provide a way to query a limited time- or area-based subset of production history data, so we’re stuck with a single ZIP file for “all of New Mexico since the dawn of time”. The result is a whopping 712MB ZIP file.

Here’s where I knew I was in trouble: the only thing inside was a single 38 GB file called wcproduction.xml.

Typing `group by`, Revisited

2020-04-29

If you wish to make an apple pie from scratch, you must first create the universe.
–Carl Sagan

Open Recursion: the Essence of Object Oriented Programming?

2020-04-19

What is object-oriented programming really about? What’s so special about “late binding”? And why do I have to pass self around everywhere in Python? We’ll take a meandering path in today’s post which will try to answer each of these questions, and build our own miniature object system along the way.

💻 usethe.computer

// a blog by Derrick W. Turk

Selected posts