Neil Ashford

A Path to a New Sudoku Algorithm

2021-12-08T00:00:00Z

A Path to a New Sudoku Algorithm

08/11/2021

Warning: This one's a pretty long read. I might add a table of contents to my blog generator one day, but don't hold out hope.
Sorry.

You're all familiar with Sudoku, the puzzle game? You're given a partially filled in grid of numbers, something like this:

+-----+-+-----+-+-----+
|     | |9    | |2 1 8|
|  7  | |    4| |     |
|2    | |  3  | |  4  |
+-----+-+-----+-+-----+
|9 2  | |     | |8    |
|  1  | |  6  | |  7  |
|    8| |     | |  9 4|
+-----+-+-----+-+-----+
|  3  | |  5  | |    9|
|     | |7    | |  2  |
|5 9 4| |    2| |     |
+-----+-+-----+-+-----+

And then you need to turn it into a fully filled in grid of numbers, like this:

+-----+-+-----+-+-----+
|3 4 5| |9 7 6| |2 1 8|
|8 7 9| |2 1 4| |3 5 6|
|2 6 1| |5 3 8| |9 4 7|
+-----+-+-----+-+-----+
|9 2 7| |1 4 5| |8 6 3|
|4 1 3| |8 6 9| |5 7 2|
|6 5 8| |3 2 7| |1 9 4|
+-----+-+-----+-+-----+
|7 3 2| |4 5 1| |6 8 9|
|1 8 6| |7 9 3| |4 2 5|
|5 9 4| |6 8 2| |7 3 1|
+-----+-+-----+-+-----+

Where the filled-in grid follows a few rules.

Each 3×3 box in the Sudoku needs to contain the digits 1–9;
Each row in the Sudoku needs to contain the digits 1–9; and
Each column in the Sudoku needs to contain the digits 1–9.

Sudoku is a very popular puzzle game, and a lot of very smart people have come up with algorithms to solve Sudokus either by hand or programmatically. When I decided I wanted to make my own solver, I started looking into those algorithms a lot. I nearly ended up just re-implementing an existing algorithm, but instead I ended up having a very interesting idea. I can't say that the idea I had was a particularly good one, and I haven't noticed anything spectacular in the performance of the solver that it led to, but it was a fun adventure.

What I'm about to describe isn't something I've seen before in other solvers, and so I hope that means I've come up with something unique. However, Sudoku solving algorithms are hardly a small or unexplored field, so it's quite possible that everything I'm about to do has already been done by someone else. If you can help direct me to a blog post or repo or person that's done this before, by all means go ahead, I'd be keen to see it.

Mainstream Solvers

Before we get into what I came up with, I want to quickly summarise how regular Sudoku solving algorithms work. This is partly so I can write some comparisons with them later⁰, but it's also because I did a lot of research into this before swapping over to my current algorithm and I don't want that knowledge to go to waste. So with that said, and at risk of sounding a lot like my algorithms lecturer from uni, there are two key facts about Sudoku as a puzzle that influence how most solvers are designed.

Sudoku (with an N×N grid, instead of 9×9) fits into the "Nondeterministic Polynomial" ( $NP$ ) complexity class of problems. That is to say, if you had a computer with infinite CPU cores¹, you could solve a Sudoku puzzle in polynomial time.
Any other $NP$ problem can be transformed into a Sudoku (again the N×N variant) and back again in polynomial time, with the solution to the Sudoku being transformed into a solution for the original problem.

The first of these properties is reasonably simple, at least compared to the other — you could write a polynomial time algorithm to see if an assignment of digits is valid, and then just get each core to pick a different permutation of the 81 digits to test, reporting back when it's done.

The second property is incredibly complex. There is a small group of problems that have been discovered with this property, but it's only ever been directly proven for one problem (that I know of). Everything else, including N×N Sudoku, is shown to have this property transitively: you can reduce any $NP$ problem to $x$ , and you can reduce $x$ to an N×N Sudoku, therefore you can reduce any $NP$ problem to an N×N Sudoku.

If you know both of these properties about a problem, it gives you a shortcut solution that you can use instead of writing your own algorithm. A problem with both of these properties is called " $NP$ Complete" and any $NP$ Complete problem can be solved by transforming it into a different $NP$ Complete problem without any real performance loss.

Property 1 of Sudoku, combined with property 2 of our target $NP$ Complete problem, shows that the transformation is possible in polynomial time.
Property 2 of Sudoku, combined with property 1 of our target $NP$ Complete problem, shows that the solution for our target problem is most likely at least as fast as the solution for any direct Sudoku algorithm, as you could always solve it by transforming it back at only polynomial cost.
The unproved yet quite likely hypothesis that $P != NP$ implies that the actual cost of solving the $NP$ Complete problem is way higher than the cost of the polynomial transformation between the problems, so the overhead isn't worth considering.

And all of this combines with the fact that the poster child of $NP$ Complete problems, Boolean Satisfiability (SAT), has been solved by dozens of hyper-optimised engines that are almost always going to be faster than anything you write for yourself, and you're left with a neat solution to any $NP$ Complete problem you're stuck with. You transform it into a SAT problem, you run the off-the-shelf SAT solver on it, and you transform your solution back.

And as far as I can tell, this is how most Sudoku-solving algorithms work today. There's some caveats where you might try to write a custom SAT solver that's more tuned to the exact type of SAT problems your Sudoku turns into, but it's all essentially the same underlying SAT algorithms underneath.

The First Breakthrough

So for a while, I was on a nice trajectory from reading how SAT algorithms work to implementing my own custom SAT solver. It was going to be a nice learning experience, and I'd have a shiny project to put on my GitHub. Where did it all go wrong?

What happened is, I was browsing the SAT-solver algorithm / Sudoku corner of Wikipedia, when I stumbled across a lovely page: Mathematics of Sudoku — Solutions. The discussion here is around the number of different ways to fill out a Sudoku grid. The first number they give is $6.67 \times 10^{21}$ , which is ridiculous. But later down on the page, they give a number for how many "essentially different" Sudoku solutions are available (essentially different meaning that isn't just another Sudoku with some rows swapped around or something like that). And that number is just under five and a half billion. This is where the breakthrough happened, and the downward spiral began. Because, when I saw that number on my screen, I had a wonderful thought:

That sounds like it would fit in ram!

Now there are some caveats here: you'd need a lot of ram. Unfortunately my first estimate of "5.5 billion solutions" is roughly "5.5 GigaBytes" was way off, because you'd need more than one byte per Sudoku puzzle. There's also generally some overhead involved with indexing the data for fast lookups, but the bottom line is still the same. It might not fit in the ram on my computer, but the GCP instance I'd need to hire to fit it all in RAM would probably not break the bank if I didn't leave it running 24/7. I started coming up with a rough algorithm:

Parse the Sudoku;
Perform a series of flips, row/column swaps and number-reassignments to get the board into a "canonical representation" that matches my database;
Perform a (hopefully) $\log{N}$ time lookup through the database to see which puzzle solution matched the hints I was provided with;
Apply the reverse of the transformations I did in step 2 to the canonical-form solution; and finally
Print the answer.

When you write it out like that, it honestly seems pretty simple. There are only two teeny tiny issues that I completely underestimated.

Indexing the solution database for fast lookups
Inventing a canonical representation of a solved Sudoku that I could transform partial puzzles into.

Everything Going Wrong

I promise, I was going to write something here. But by the time I got to the end of the blog post and was ready to come back to it, everything was already way too long. In summary, I couldn't find a solution to either problem. I also almost think it might not be possible to solve the second problem, but I'm not sure there.

Moving on…

A Path Forward

I went back to the drawing board, and started playing with some easy Sudokus to clear my mind while I thought. And as I was playing, I realised that the way I looked at Sudokus while I was playing was different to how I was thinking about them while writing code. I came up with a (very preliminary) brand new idea that would fit a few fun criteria that I was looking for in my algorithm:

Still involve generating a giant database in memory and searching it;
Stay close to how I think about Sudokus in my head while solving them; and
Not be too complicated to implement (or hopefully optimise).

And it all revolves around a concept of Paths (made-up term, because I couldn't find anything online of people taking the same approach as me). So let's get into this, with a small example from the easy-level Sudoku I was working on at the time.

+-----+-+-----+-+-----+
|  3  | |    4| |  7 8|
|     | |9 3  | |1    |
|2    | |1    | |  9 3|
+-----+-+-----+-+-----+
|3   9| |4    | |    7|
|  1  | |3 8 9| |5 4  |
|  2 8| |     | |  3  |
+-----+-+-----+-+-----+
|  9 7| |8 4 3| |    6|
|6 4 3| |5   1| |7    |
|  8  | |6    | |3 5  |
+-----+-+-----+-+-----+

At this point I've solved all the 3s, and I want to get started on the 1s. Let's take some of the clutter away from the board for a second, and have a new look.

+-----+-+-----+-+-----+
|? x ?| |    x| |  x x|
|     | |x x  | |1    |
|x    | |1    | |  x x|
+-----+-+-----+-+-----+
|x   x| |x ?  | |  ? x|
|  1  | |x x x| |x x  |
|  x x| |  ?  | |  x ?|
+-----+-+-----+-+-----+
|? x x| |x x x| |  ? x|
|x x x| |x   1| |x    |
|? x ?| |x    | |x x ?|
+-----+-+-----+-+-----+

I've put a 1 in all the squares that contain a 1, an x in all the squares that contain a 2-9, and a ? in all the squares that appear legally able to contain a 1 at this point in time. There are eleven question marks on the board so far, and of those five need to become 1s before the puzzle is completed. Naively, this is looking like (checks combinatorics notes that I haven't used in many years) 462 possible ways of filling in the question marks.

By thinking in Paths, we can reduce this to just three possibilities. I'm going to start by zooming in on the middle box, and the box immediately to its right.

+-----+-+-----+
|x ?  | |  ? x|
|x x x| |x x  |
|  ?  | |  x ?|
+-----+-+-----+

On their own, these two boxes have four question marks, and need two 1s to be complete. The maths says there's six ways this could go down, but practically you can look at the Sudoku puzzle and narrow it down to two possibilities. I'll redraw the two boxes using L to show where the 1s would go for one of the possibilities, and using R to show where they go for the other possibility.

+-----+-+-----+
|x R  | |  L x|
|x x x| |x x  |
|  L  | |  x R|
+-----+-+-----+

Any other way of choosing how to allocate the 1s aside from these two would lead to conflicts. Now that we know that the middle box depends entirely on the box to its right (or vice versa) I'm going to hide the middle box for a while, and focus on the center-right box and the bottom-right box.

+-----+
|  L x|
|x x  |
|  x R|
+-----+
|  ? x|
|x    |
|x x ?|
+-----+

The two possibilities we had from our previous exercise carry across here: we can assign L and R to the question marks in this new box as well. The problem of selecting three 1s from six ?s is ultimately still down to a binary choice between the L assignment and the R assignment.

+-----+-+-----+
|x L  | |  R x|
|x x x| |x x  |
|  R  | |  x L|
+-----+-+-----+
        |  L x|
        |x    |
        |x x R|
        +-----+

Once you add in the next connected box, the one on the bottom left (which shares rows with the bottom-right box, that you can use to cancel things out), things get more complicated. We go from two valid assignments of 1s among the question marks to three, and some of the assignments overlap, so I'm not going to be able to draw them in the same diagram any more. However, that makes this a good time to swap from talking about assignments to talking about Paths.

What is a Path?

A Path is a series of nine locations across the Sudoku grid, such that one location from the path is present in each row, column and box of the overall grid. In a solved Sudoku, you could assign one path to each digit, making the locations in the path the locations on the grid that that digit is present. Any set of nine paths that don't overlap with each other, and therefore fill up the board, would constitute a solution to the Sudoku.

Here's an example Path, which is the Path that I filled with 3s when I first started solving the example Sudoku. The exclamation marks show which locations in the grid are part of the Path.

+-----+-+-----+-+-----+
|  !  | |     | |     |
|     | |  !  | |     |
|     | |     | |    !|
+-----+-+-----+-+-----+
|!    | |     | |     |
|     | |!    | |     |
|     | |     | |  !  |
+-----+-+-----+-+-----+
|     | |    !| |     |
|    !| |     | |     |
|     | |     | |!    |
+-----+-+-----+-+-----+

My next breakthrough, and what actually led to the workable solver that I have now, was deciding to throw away the "fill the grid with numbers" problem and replace it with a "find 9 Paths that don't overlap" problem. And that brings us to coding.

Feasibility of Paths
I did stop to think, before diving straight into coding, about whether it would be feasible to solve path-based problems. I had an idea that my algorithm would probably involve looking at all possible different paths, so I did a quick calculation to see how many there were. I figured that any given Path would need one spot in each row, and those spots would need to be arranged so they each fell into a different column. The actual number is lower than this, because not all permuations are valid, but there are $9!$ ways to permutate the "put each position into a different column" problem, which means we have an upper bound of 362,880 different Paths. Definitely a small enough number to store in memory or loop through, so we move on.

Representing States (the Glue Code)

Before I could get hacking on anything like an algorithm, I needed some glue code and a bit of a foundation to build with. I started with these two abstractions:

A Bitfield abstraction to make all my "does this intersect" maths faster, and give me a compact way of storing things in-memory.
A generate_paths() function which lists all possible Paths through a Sudoku, because I'm going to need it sooner or later surely.

The code for these two steps is here. In particular look at src/bitfield.rs and src/path.rs. Fun fact, I did the testing and the upper bound of 362,880 valid Paths from before was very conservative. The actual number is only 46,656.

And then after that I needed a way to build and represent a board. A completed Sudoku could be represented as 9 Paths, one for each digit, but an incomplete Sudoku I don't have enough information to do that. Instead, I decided to represent it as 9 Bitfields — the idea being that I could turn them into Paths (or lists of potential Paths) easily enough with some bitwise arithmetic.

You can check out the board representation here. The important thing (to me) is that I can write code like this:

// Find a list of candidates for the Path that the digit 3 follows through the board
let potential_three_paths = generate_paths()
    .filter(|&path| path.contains(board[Digit::_3]))
    .filter(|&path| (path & board[Digit::_1]).is_empty())
    .filter(|&path| (path & board[Digit::_2]).is_empty())
    .filter(|&path| (path & board[Digit::_4]).is_empty())
    .filter(|&path| (path & board[Digit::_5]).is_empty())
    // and so on
    .filter(|&path| (path & board[Digit::_9]).is_empty())
    .collect::>();

Or this:

// We found a new cell to fill in
board[Digit::_3] |= Bitfield::new(5, 5);

And I'm hopeful that those two operations are all I'll really need to write a Sudoku solver. Oh, I also have helper code to parse and pretty-print boards, which is pretty important. So let's jump right in and start solving!

Some Experimentation with Real Sudokus

We're at the point where I'm gonna need to start looking at concrete Sudokus now, and so I've put together a small list of boards to test with. Here they are:

I've got a Sudoku app on my phone. It's called Sudoku Master Edition and it's available here. Most of the puzzles I've started testing with came straight from the generator in this app. This one comes from the "Easy" difficulty.

+-----+-+-----+-+-----+
|2   3| |    5| |8   1|
|  4 6| |     | |  3 2|
|1 5  | |    4| |7    |
+-----+-+-----+-+-----+
|    1| |8 4 7| |9 5  |
|    5| |  1  | |  2  |
|4    | |  2 3| |     |
+-----+-+-----+-+-----+
|     | |     | |1   5|
|     | |  6 9| |     |
|9 8  | |4 5  | |  7  |
+-----+-+-----+-+-----+

Moderate Sudoku

Same app, next difficulty up: "Moderate". This is the difficulty that I spent most of my hours grinding away Sudokus at when I was first getting into the game.

+-----+-+-----+-+-----+
|5    | |8   7| |2    |
|     | |9 2  | |6    |
|7 4 2| |     | |  8  |
+-----+-+-----+-+-----+
|     | |  9 5| |    6|
|9 3 5| |7 6  | |    2|
|1 7  | |4   2| |    8|
+-----+-+-----+-+-----+
|     | |     | |     |
|2 9  | |     | |4    |
|3   1| |    6| |     |
+-----+-+-----+-+-----+

Master Sudoku

This is the highest difficulty offered by the app. I skipped a few difficulties in the middle here, as I'm trying to keep the dataset down to just a few Sudokus so I can look at results by hand.

+-----+-+-----+-+-----+
|    6| |2    | |    9|
|  4  | |6 7  | |2   3|
|     | |    9| |    4|
+-----+-+-----+-+-----+
|4    | |     | |  2  |
|    5| |  8  | |1    |
|  7  | |     | |    6|
+-----+-+-----+-+-----+
|2    | |9    | |     |
|6   1| |  3 4| |  7  |
|5    | |    6| |4    |
+-----+-+-----+-+-----+

Now that I've got these puzzles, I wanted to get a feel for how helpful the Path approach actually is. Using all the glue code from before, I put together this little script:

fn analyse_board(board: &Board) {
    println!("{}", board);
    let total_clues = Digit::iter()
        .map(|digit| board[digit])
        .fold(Bitfield::default(), BitOr::bitor);

    for digit in Digit::iter() {
        let our_clues = board[digit];
        let opposing_clues = total_clues & !our_clues;

        let possible_paths = sudoku::generate_paths()
            .filter(|&path| path.contains(our_clues))
            .filter(|&path| (path & opposing_clues).is_empty())
            .count();

        println!("There are {} possible Paths for {}", possible_paths, digit)
    }
}

And so let's check out how many possible Paths there are for each digit across the sample Sudokus.

Easy Sudoku:

1 possible Paths for 1
3 possible Paths for 2
18 possible Paths for 3
2 possible Paths for 4
1 possible Paths for 5
12 possible Paths for 6
15 possible Paths for 7
5 possible Paths for 8
1 possible Paths for 9

Moderate Sudoku:

55 possible Paths for 1
2 possible Paths for 2
55 possible Paths for 3
2 possible Paths for 4
20 possible Paths for 5
1 possible Paths for 6
11 possible Paths for 7
12 possible Paths for 8
3 possible Paths for 9

Master Sudoku:

42 possible Paths for 1
2 possible Paths for 2
56 possible Paths for 3
2 possible Paths for 4
42 possible Paths for 5
2 possible Paths for 6
8 possible Paths for 7
154 possible Paths for 8
8 possible Paths for 9

All in all, this is awesome! The worst scenario is 154 possible paths for 8, in the master difficulty Sudoku — but looping over all 154 times… and looping over the total number of Paths for all the other digits… is still only… okay it's around seven billion iterations. Still, that means that the master Sudoku is gonna get solved in a handful of seconds, which is way faster than I can solve it by hand.

Just to confirm that this hypothesis that I can loop through possible path combinations to solve a Sudoku, I wrote up the following little solver:

fn solver_helper(mask: Bitfield, paths: &[Vec]) -> Option> {
    match paths {
        [] => panic!(),
        [one] => {
            let &choice = one.iter().find(|&&path| (mask & path).is_empty())?;
            Some(vec![choice])
        },
        [head, tail @ ..] => {
            head.iter().filter(|&&path| (mask & path).is_empty()).find_map(|&path| {
                let mut tail_solution = solver_helper(mask | path, tail)?;
                tail_solution.insert(0, path);
                Some(tail_solution)
            })
        }
    }
}

let solution = solver_helper(Bitfield::default(), &possible_paths_per_digit).unwrap();
for (digit, bits) in Digit::iter().zip(solution) {
    board[digit] = bits;
}

And then I ran it. Good news and bad news: It did work! But it took six to seven seconds for all three Sudokus.

Hey, what if you tried cargo run --release instead of cargo run?

We are now down to 260ms, 291ms, and 288ms per Sudoku². This has been yet another reminder that LLVM is magical, and that I should always run my code through its optimisers before I do anything.

Issues with the compiler aside, though, we've now got a proof of concept that Paths are viable as an option. Now it's time to go back to the drawing board and see if there's a way we can speed this up further.

Easy Wins

I was halfway through writing the section after this one, on the algorithmic improvements I figured would be the next step in making the solver fast, when I realised something: I forgot to actually profile my code. Tragically, clicking the "Profile" button in my IDE didn't help much — I am being too fancy with lazy iterators for the profiler to know where I'm actually spending time beyond "inside the iterator.next function" or "inside the Vec::from_iter function". However, that's not going to stop me, because I have cheap hacks on my side!

let start = Instant::now();

// run part of the code

let first_part_time = start.elapsed();

// run next part of the code

let second_part_time = start.elapsed() - first_part_time;
println!("First part took {:?}", first_part_time);
println!("Second part took {:?}", second_part_time);

And here's what I learned:

Easy Sudoku:

Finding total clues: 76ns
Finding possible paths: 279.055611ms
Solving: 20.696µs

Moderate Sudoku:

Finding total clues: 74ns
Finding possible paths: 307.142538ms
Solving: 86.654µs

Master Sudoku:

Finding total clues: 76ns
Finding possible paths: 317.606837ms
Solving: 85.373µs

I had really thought that the actual solving would take the bulk of the program time. Apparently that was wrong, and I guess that just goes to show why you should never trust your hunches about performance in code.

Alright so let's look at the "finding possible paths" code — and apply the traditional (and completely trustworthy) method of "staring at it until I think I can guess what's taking so long".

let possible_paths = Digit::iter()
    .map(|digit| {
        let &our_clues = &board[digit];
        let opposing_clues = total_clues & !our_clues;

        sudoku::generate_paths()
            .filter(|&path| path.contains(our_clues))
            .filter(|&path| (path & opposing_clues).is_empty())
            .collect::>()
    })
    .collect::>();

So my first thought is "why am I calling generate_paths() every single time?" Let's calculate that once at the start — cache it — and go again. It's worth noting here that I am still just guessing, which isn't a great way to go about optimisations, but at least I'm guessing and measuring. It's a step up. Anyway, here's our results:

Generation of all Paths: 32.999076ms

# Possible path times
Easy Sudoku: 279ms -> 656µs
Moderate Sudoku: 307ms -> 669µs
Master Sudoku: 317ms -> 712µs

On the bright side, I've cut down the slowest part of my solving process by a factor of approximately 450! On the confusing side, generating the path list on its own is ten times faster than generating it and immediately filtering it down. This is gonna have to be one of those "out of scope" things, I think. And now I think we've hit the "Neil, this blog post is too long" point, so let's wrap up where the solver is at.

State of the Solver

So the code is here. I've built it into a little command-line tool, that takes the Sudoku pattern in text representation as a command line argument and prints it out on screen once it's solved. The whole thing looks like this:

λ sudoku .......49.....38..7.6.2.1.....3...6.6..784..5.9...1.....2.5.4.8..84.....37.......
+-----+-+-----+-+-----+
|8 2 3| |1 7 5| |6 4 9|
|5 1 9| |6 4 3| |8 7 2|
|7 4 6| |8 2 9| |1 5 3|
+-----+-+-----+-+-----+
|4 8 5| |3 9 2| |7 6 1|
|6 3 1| |7 8 4| |9 2 5|
|2 9 7| |5 6 1| |3 8 4|
+-----+-+-----+-+-----+
|1 6 2| |9 5 7| |4 3 8|
|9 5 8| |4 3 6| |2 1 7|
|3 7 4| |2 1 8| |5 9 6|
+-----+-+-----+-+-----+
Solution took 845.83µs

Based on the limited testing I've done, most of the time taken by the solver is spent in one of two places:

Generating a list of Paths that fit the initial set of clues; and
Testing all selections of those Paths to see which set we can use to fill up the board.

Of those two, I'm very surprised to report that the first step takes much longer than the second. I'll need to look into that… And that brings me to my next point: future posts! I've never written a series of blog posts before, so I don't know what this is going to look like. However, here are some things I'd like to investigate in future. If I get the chance, I'll update the list below with some hyperlinks as we progress.

How does this Sudoku solver perform against larger (and harder) sets of puzzles? Was I just really lucky to find some testing Sudokus that work really well with my algorithm?
Can I index the Path database in a way that makes searching it for Paths that match the clues faster?
Can I do better than a brute-force search through the possible paths for the final part of my algorithm? Do I need to, to speed things up? And how come it isn't prohibitively slow at the moment?
How does this algorithm compare to mainstream solvers? From a performance perspective and a simplicity perspective.

A new, static, blog

2021-09-10T00:00:00Z

A new, static, blog

10/08/2021

It's time, finally, for me to do what every other developer with a blog does — write a blog post about my blog setup. I have unquestionably over-engineered this, but that's gotta be half the fun, right? So without any further ado, let's go through the process of how I put this together.

A window into the developer

The main reason this blog exists is to show people what sort of developer I am. Shockingly, that's not a constant. My old blog setup, which I've archived but not deleted for the sake of being honest with myself, was that of a React developer. It looks like a giant mess to me now, but I guess hindsight is always more critical of bundle size and code complexity or something like that. The point is, I'm not a React developer any more, so what sort of developer am I? And what sort of blog does that developer create?

One day, I'm going to be a practical sort of developer. The sort that clicks the "Can I have a Jekyll site please" button on GitHub Pages, and never thinks about it again. I'm not that developer yet though, so for now the fun (read: over-engineering) continues.

For the last year, I've been working for Tiny Technologies. I might even write a blog post about it some time. For now though, I'll say that building a rich text editor that's optimised for compatibility with different site setups has changed every view I held about browsers⁰. Some key differences in how I think now vs earlier:

Before	After
Nothing can be done without React or another framework.	The native browser APIs are very powerful these days.
Nesting 30 layers of divs to get the layout you want is normal.	Sometimes people actually want to look at the inspector to debug something.
I can just put a banner up telling people not to use old browsers.	I wish I could just put a banner up telling people not to use old browsers.
And most importantly…
I only care about the HTML that fits my use-case.	I need to understand everything a user could conceivably make out of HTML, and how it's meant to work.

Now that's not to say that I hate React now. I really miss using it, and having that comfort layer between me and the DOM — especially for things like forms or quickly reorganising HTML structures when I change my mind about something. What I'm trying to say here isn't that I hate having a framework between me and the DOM, I'm just not so scared of the alternative any more.

And that lack of fear is what led me to believe that this would be a good idea:

My website isn't going to render on the client side

That's not a particularly courageous statement though. Server side rendering is a well-established pattern, and even predates client side rendering. Except, because I'm pretty cheap when it comes to hosting for side projects, I don't want to have a server. So where will the rendering happen? In the compiler.

Blog-pack

At time of writing, "Blog-pack" doesn't really mean anything and doesn't come up when I search for it, if that's no longer the case then I'm sorry.

You've heard of webpack? That tool that you use to package things for the web? I set it up to package things for my blog. Writing it out, that sounds like such a simple task, but it was anything but. What it comes down to is that I wanted to operate on HTML files, and webpack wanted to operate on JavaScript files.

I really wanted 3 tools to come together and work for me with this:

And I gotta say, 2/3 of the way through that list we were having a real good time. Both of those plugins are built to emit good HTML files from webpack. Unfortunately, html-loader is built to turn your HTML file into a pretty JavaScript module. It does a fantastic job of finding all of the and elements that you want the build system to resolve… and then it turns them into require calls.

Immediately, my zero-practicality brain jumped in with a solution: just write your own HTML loader! How hard could it be to coerce webpack into importing those assets without shelling out to require, right? So I tried it. I really did. I read (some of) the source, I read the docs, I logged semi-private webpack objects and APIs at runtime to see how they worked. You ever tried swimming against a strong current? Webpack is not built for this use case, and it let me know every chance it got.

So we turn to my zero-practicality brain's least favourite solution: compromise. I did still need to do some webpack-based processing of certain elements (extracting SVG icons from the @fortawesome/... folders in my node modules, for example). But for some things, like stylesheets, I just had to go with the flow a little. Behold, my entire index.js:

import './styles/page_layout.scss';
import './styles/content.scss';
import './styles/navbar.scss';

// This file doesn't do any processing. It just plays nice with webpack.

And if you look in your inspector, you'll see a little JS bundle filled with sweet nothings that's been downloaded as part of this page.

`My templating system`

One other important thing I've learned at Tiny is how scary HTML is. You know the infamous stackoverflow post about how you shouldn't parse HTML with regex? Turns out you shouldn't parse it with anything else either if you can get away with it. HTML is incredibly complex to deal with programmatically, and the browser (which does not make it look easy) makes it look so much easier than it really is. And so you can imagine my shock to see that it's pretty much industry standard for static site generators to ask handlebars to interpolate some strings together and call it HTML. Like I'm sure it works, but at what cost? And how fragile is it?

I'm not a fan of treating HTML as strings, and I am a fan (now) of using DOM APIs to make the scary bits of HTML go away. So my "templating engine" just loads the HTML up in jsdom, and calls document.querySelector('main').innerHTML =¹. It seemed a little weird to not have any {{}} in my templates at all, but honestly the more I work with it the more I like it.

No escaping any special "template characters" in my template or in my content,
I can open the template.html file on its own in a browser and debug it without seeing any template issues,
No questions about whitespace and where it will or won't need to be removed, and
When I inevitably reached some disagreements with various parts of my stack, (over things like how to render a footnote in HTML) I already had the fake DOM loaded to do some quick tweaks.

`Performance`

The other nice advantage of pre-rendering everything is that suddenly my website is fast. I'm serving up² plain HTML files, with plain CSS, all from a plain old CDN. There's nothing here to slow it down.

I'm not going to quantify things, because it's not worth the effort it would take to set up a meaningful benchmark. But I will say this: there's no more loading spinners here.



Mychro
2020-05-09T00:00:00Z
Mychro
09/04/2020

An X-Ray for your DNA
Under the wing of the Aginic Ventures team, I've been developing a little something to help make life easier for geneticists. The idea started off as a fusion between Aginic's skill at getting insight from data and a geneticist's need to get insight from a whole lot of data (the human genome). Just over a year later and the product – Mychro – is my masters thesis topic, my day job, and the largest codebase I've had this much of a hand in building.
In building Mychro, I've experimented with GraphQL, Kubernetes, and even a brief foray into Rust. These days the GraphQL is still there (provided by postgraphile), but the backend is now provided by GCP managed App Engine and Cloud SQL. The frontend is all React, just like this site, only with a bit more technical debt and a bit less reliance on bleeding edge technology like Suspense. The frontend also messes with enough SVGs to make me give up hope on ever finding a browser that is able to follow the SVG spec.
But while I am the only developer committed to the project, I'm not building this alone. I'm lucky enough to have Jason working his magic in the product design department, and it's been a pleasure trying to use osmosis to learn just how he does it. In the management department, we have Alan Robertson running the show: our resident geneticist turned startup founder, as well as the Aginic Ventures team.
I've committed a lot of my time to this project since we kicked it off. I asked for it to be brought off the Aginic Ventures' back-burners and into the light of day so I could work on it for my thesis, and I've been giving the project all I have in technical skills since then. I've done this because while the risks are always high in the startup world, I believe the potential payoff for this is huge. If Mychro has the chance to help make the people out there fighting genetic diseases and cancers more efficient at the work they do, I can't not do my best to build it.
Anyway, check out the product and by all means get in touch with me or the rest of the team.



Tutoring
2020-05-09T00:00:00Z
Tutoring
09/04/2020

This is the story of how my life got turned upside down
At the beginning of my third year of university, I got my first in-industry job in software: I became an official Casual Demonstrator at The University of Queensland. I was hired by a lecturer to walk around during lab demonstrations in a common second year course: CSSE2010, and answer any questions people had. I stuck with the job until I graduated, six semesters later, and during that time things got a little out of hand.
While I started out with a pretty small role, the job gave me plenty of room to grow, and I took every chance I was given. By the end my regular job involved a little bit of:
Marking exam papers, and supervising / marking practical exams.
Presenting content in front of 100 student classes.
Marking assignments for 30-50 students.
And if that had been all I'd done as a tutor, I would have still gained a tonne of experience and learned a lot. But things didn't really end there. Twice during my time as a tutor, the lecturer supervising me (and running the course I was meant to be tutoring) got unwell enough to need considerable time off. As a result, the Casual Academic section of my resume looks a little more like this:
Prepared and presented lectures to a course of 500 students.
Supervised teams of tutors in the preparation of course content.
Collaborated on specifications and support code for major programming assignments.
Produced and presented an example solution to a major programming assignment.
Automated compilation and JUnit testing of 1200 student assignments.
It was a drain, but it was a blast. And I think my resume looks a little better for all this extra experience.
The Content
During my time as a tutor, I ended up working as a part of four different courses. They were all great learning experiences, but one holds a special place in my heart as being my best academic experience at uni (both as a student and a teacher).
Computer Systems Principles and Programming – CSSE2310 – was the course I got the most involved in. I tutored this one for three years, and it was here that I got my lecturing experience. This course could be a blog post all on its own, but in summary we taught the students a crash course in what the operating system (Unix) looks like from the user-space. We started the semester by teaching C, and ended it by teaching BSD sockets and the difference between a hard and a soft link in a unix filesystem. I definitely wasn't an expert on any of these topics when I got the job, but over three years I've seen a lot of weird things (and even found a way to explain or fix most of them).
Outside of 2310, I got involved in the high performance computing course, the intro to OOP course, and the intro to computer systems course. I explained the difference between white-box and black-box testing, and taught people the importance of basic cable management while they built logic circuits on breadboards.
Tutoring really kick-started my knowledge on development. By working through an assignment with 30-90 students, I'd get to see 30-90 ways of solving a problem (well, maybe fifteen to twenty), and pick up experience that much faster. I also was forced to figure out how to debug code that I hadn't written, and how to communicate my thought process as I worked through a problem (really helps during the technical interviews). But if nothing else, I'd recommend the tutoring experience to anyone just for the information you can pick up by working alongside talented lecturers and senior tutors.



University of Queensland Computing Society
2020-04-26T00:00:00Z
University of Queensland Computing Society
26/03/2020

As someone who spent the past five years getting my software engineering degree from UQ, I saw a fair few student societies. The computing society was hands down my favourite.
When I first joined UQCS I was just starting my second year of engineering. I skipped a lecture to show up to a presentation from some Atlassian employees on how to use git. A few years later I gave them my own spin on the git talk to a fresh bunch of first and second years.
Being a UQCS member really helped me build my professional network up from nothing. I got my first real industry related job as a tutor through people that I met in the UQCS slack. I can go into #webdev on slack and bounce my weird ideas off of high quality industry professionals working at Google and Microsoft and the like for free. I even get to share the #c and #rust channels with high quality embedded systems engineers and kernel developers.
For all that UQCS has given me, I'm really proud that I'm able to give back to the society now. Clear Sky Genomics might not be a big name company, but I'm now one of the alumni giving advice to bright-eyed young students. I also really enjoyed spending a year on the committee (taxing as it was) and helping to organise events and sponsors for the year.
So if you're reading this, and I've managed to interest you in UQCS, awesome! It'd be really cool if you were to join the society, or convince your workplace to sponsor them so you can get access to rooms full of grads like me.