Posts on Robin's blog

Posts on Robin's bloghttps://kaveland.no/posts/Recent content in Posts on Robin's blogHugo -- 0.126.0en-usSat, 21 Sep 2024 00:00:00 +0000Consider using array operators over the SQL in operatorhttps://kaveland.no/posts/2024-09-21-equals-any-over-where-in/Sat, 21 Sep 2024 00:00:00 +0000https://kaveland.no/posts/2024-09-21-equals-any-over-where-in/In my post about batch operations, I used the where id = any(:ids) pattern, with ids bound to a JDBC array. I’ve gotten questions about that afterwards, asking why I do it like that, instead of using in (:id1, :id2, ...). Many libraries can take care of the dynamic SQL generation for you, so often you can just write in (:ids), just like the array example. I would still prefer to use the = any(:ids) pattern, and I decided to write down my reasoning here.Batch operations using composite keys in postgres over jdbchttps://kaveland.no/posts/2024-08-30-multi-selecting-by-composite-key/Fri, 30 Aug 2024 00:00:00 +0000https://kaveland.no/posts/2024-08-30-multi-selecting-by-composite-key/Throughout a career as a software developer, you encounter many patterns. Some appear just often enough to remember that they exist, but you still need to look them up every time. I’ve discovered that writing things down helps me remember them more easily. This particular pattern is very useful for my current project. So, it’s time to write it down and hopefully commit it to memory properly this time. Although this post is specific to PostgreSQL, I’m sure other databases have the necessary features to achieve the same results efficiently.Norwegian Wild Salmon Fishing Ban of 2024https://kaveland.no/posts/2024-06-27-salmon-ban/Thu, 27 Jun 2024 00:00:00 +0000https://kaveland.no/posts/2024-06-27-salmon-ban/For this blog post, I’m trying something different. This is a jupyter notebook that I’m using to study some data, and just dumping my brain out in text. If I can easily export this to a format that works with hugo, this might become a common occurrence. For this one, I’m leaving the code in. There isn’t that much of it, but I think it’s fun to show how much visualization per line of code you can get with seaborn and pandas.Using short lived postgres servers for testinghttps://kaveland.no/posts/2024-05-27-shortlived-postgres-servers/Mon, 27 May 2024 00:00:00 +0000https://kaveland.no/posts/2024-05-27-shortlived-postgres-servers/Database servers are usually long-lived, and important parts of the infrastructure that we build on. We rarely set them up from scratch, because we have to take such good care of them over time. I think this causes a lot of people to think that setting up a database server is some mysteriously difficult ordeal. To be clear, that’s actually true, if you need high availability and a solid recovery point objective.Building documentation for Eugenehttps://kaveland.no/posts/2024-05-20-building-docsite-for-eugene/Mon, 20 May 2024 00:00:00 +0000https://kaveland.no/posts/2024-05-20-building-docsite-for-eugene/I’ve been busy working on a documentation site for eugene, and I think it’s starting to look pretty good. I wanted to write down some of my thoughts around the process so far, and some of the things I’ve learned. It’s just been a few days since I ported my blog to hugo, so since I was already feeling like I was up to speed on that, I decided I’d try using it for the eugene documentation too.Moving the blog to Hugohttps://kaveland.no/posts/2024-05-18-moving-to-hugo/Sat, 18 May 2024 00:00:00 +0000https://kaveland.no/posts/2024-05-18-moving-to-hugo/I’ve been using pelican for my blog for a while now, and I don’t really have anything negative to say about it. But for a while, I’ve been wanting a more minimal theme. I ended up on the front page of hacker news a couple of times, and the old theme had my face on all the pages, which made me feel a bit uncomfortable. I was looking at some other themes around the web, and I found PaperMod which I absolutely loved.Linting postgres migration scriptshttps://kaveland.no/posts/2024-05-16-linting-postgres-migration-scripts/Thu, 16 May 2024 00:00:00 +0000https://kaveland.no/posts/2024-05-16-linting-postgres-migration-scripts/I have been working quite a bit on picking up dangerous migration patterns in migration scripts over at the eugene repository lately. A major feature I’ve added is syntax tree analysis, so that we can pick up some patterns without having to run the SQL scripts. This isn’t quite as precise as running the scripts, but it’s a lot faster and can catch quite a few common mistakes. So let’s take a look at how it works!Porting an application from cats effects to ZIOhttps://kaveland.no/posts/2024-05-16-porting-from-cats-effects-to-zio/Thu, 16 May 2024 00:00:00 +0000https://kaveland.no/posts/2024-05-16-porting-from-cats-effects-to-zio/In my current project, we’re working on a large-ish code base that is written in Scala and uses cats effect as an effect system in large parts of the code base. If you’re not familiar with what an effect system is, I think the most important detail is that it’s a tool that gives you certain superpowers if you promise to be honest about it when your code does things that can be considered “effectful”, such as interacting with the network or reading files.Careful with That Lock, Eugene: Part 2https://kaveland.no/posts/2024-05-06-careful-with-that-lock-eugene-pt-2/Mon, 06 May 2024 00:00:00 +0000https://kaveland.no/posts/2024-05-06-careful-with-that-lock-eugene-pt-2/A while back, I wrote Careful with That Lock, Eugene about an idea for how to check if a database migration is likely to disturb production. That post came about after having an inspiring chat with a colleague about the advantages of transactional migration scripts and the ability to check the postgres system catalog views before committing a transaction. Over the past few weeks, I’ve been experimenting with this idea to test if I can use it to build valuable safety checks for DDL migrations.Careful with That Lock, Eugenehttps://kaveland.no/posts/2024-04-12-careful-with-that-lock-eugene/Fri, 12 Apr 2024 00:00:00 +0000https://kaveland.no/posts/2024-04-12-careful-with-that-lock-eugene/It is rewarding to work on software that people care about and use all around the clock. This constant usage means we can’t simply take the system offline for maintenance without upsetting users. Therefore, techniques that allow us to update the software seamlessly without downtime or compromising service quality are incredibly valuable. Most projects I’ve worked on use a relational database for persistence, and have some sort of migration tool like flyway or liquibase to make changes to the database schema.How to test for missing indexes on foreign keyshttps://kaveland.no/posts/2024-04-04-testcase-for-foreign-keys/Thu, 04 Apr 2024 00:00:00 +0000https://kaveland.no/posts/2024-04-04-testcase-for-foreign-keys/If you’re developing a transactional application backed by postgres, there’s a pretty cool trick you can use to check if you’re missing indexes that could potentially cause serious performance issues or even outages. In particular, I mean foreign keys where the referencing side of the constraint does not have an index. The idea is very simple, we can select all of the columns that take part in a foreign key, then remove the ones that take part in a complete index, and the remainder should be the empty set, or possibly match a known allowlist.Friends don't let friends export to CSVhttps://kaveland.no/posts/2024-03-10-friends-dont-let-friends-export-csv/Sun, 24 Mar 2024 00:00:00 +0000https://kaveland.no/posts/2024-03-10-friends-dont-let-friends-export-csv/I worked for a few years in the intersection between data science and software engineering. On the whole, it was a really enjoyable time and I’d like to have the chance to do so again at some point. One of the least enjoyable experiences from that time was to deal with big CSV exports. Unfortunately, this file format is still very common in the data science space. It is easy to understand why – it seems to be ubiquitous, present everywhere, it’s human-readable, it’s less verbose than options like JSON and XML, it’s super easy to produce from almost any tool.Isolating integration tests that commit transactionshttps://kaveland.no/posts/2024-03-10-testing-transactions-that-commit/Sun, 10 Mar 2024 00:00:00 +0000https://kaveland.no/posts/2024-03-10-testing-transactions-that-commit/For tests that need to touch the database, it is generally a really good idea to roll back transactions. That way, you can run lots of tests in parallell or in any arbitrary order and the tests won’t interfere with each other. But sometimes, that just isn’t possible. One reason for this could be that the code base handles transactions in a way that makes it really hard to get a handle on them in the right place, or it could be a legacy code base where everything is running with auto-commit or some other explanation.Protecting your postgres server from your applicationhttps://kaveland.no/posts/2023-05-09-configure-postgres/Tue, 09 May 2023 00:00:00 +0000https://kaveland.no/posts/2023-05-09-configure-postgres/There are 2 configuration options that every OLTP application that uses postgres should set, in order to protect the database from high load: statement_timeout idle_in_transaction_session_timeout These can both be set by client configuration and require no special permissions to set, and are easily overridden locally for transactions that have different requirements. They can be a bit scary to retrofit to existing applications, but we can activate two postgres extensions to help us measure our queries to find safe values to set: