Dercuano drawings ⁑ Dercuano

Kragen Javier Sitaker, 2019-04-30 (updated 2019-05-30) (18 minutes)

At this point I’ve imported 90 notes into Dercuano, and the lack of images of any kind is sort of annoying. My actual, paper notebooks have a little line-drawing “icon” for each note, maybe 10 millimeters square, but often some kind of graphic would help a lot both with visual appeal and with comprehensibility. But my total byte budget of some 5MB, so that Dercuano remains easily downloadable in full, makes this challenging; I need a way to make very compact graphics.

Example uses

Deep freeze could benefit from diagrams of one or more of the following: 11 tonnes of foods including soybeans; a cutaway freezer with labels for insulation, refrigeration, and passive thermal storage; freezers of different sizes, with different amounts of surface area per unit volume, including a cubic-meter sphere and a thousand-cubic-meter sphere; food stored at the bottom of a 30-meter well; mounds of sand and locally excavated earth; a heat–temperature curve for water, including enthalpy of fusion and vaporization; plots of heat loss versus surface area and insulation thickness; etc. It would also benefit, I think, from a goofy line drawing of a freezer with a smiley face on it. A three-dimensional rotating rendering of a freezer would maybe be a plus, too; and a model whose parameters you could adjust interactively and see the relationships between the other parameters in the neighborhood would be awesome.

Dehydrating processes and other interaction models could benefit from sketches of user interfaces; from diagrams of the interaction sequences and data flows being discussed, of FlatBuffers or binary array state, and of the timing of sequences; and from illustrations of punched cards and perhaps a dehydrated window or something.

3-D printing by flux deposition could benefit from an illustration/diagram of the binder/powder-bed/tray setup, and maybe of some grains being fluxed — generally mechanical things like this really need diagrams; maybe also a temperature scale showing where different mixes sinter and/or melt would be helpful.

IMGUI programming language would benefit a lot from some sketches of the UI components being programmed, and maybe also some diagrams of activation records allocated on the stack and of Golang-style interfaces.

Transactional event handlers would benefit from some timing diagrams showing priority problems, concurrent transactions, transaction conflicts with pessimistic and optimistic synchronization, and deadlock.

Possible tools

No photos

A thing I’ve done in the past is to draw diagrams on paper and photograph them. This allows for pleasingly direct feedback during the drawing process, but even with aggressive compression, the resulting image files are dozens of kilobytes each.

No bloated SVGs

The Web standard format for line drawings is SVG — even hand computers running iOS support it now. SVG is super cool; it supports alpha-blending, Bézier curves, gradients, arbitrary affine transforms, some degree of abstraction and reuse, and text. You can do a lot in very few bytes of SVG, and it’s a lot more readable and debuggable than PostScript, my pre-SVG favorite. The standard example is probably something like this 78-byte triangle:

An example diagram in handwritten SVG

The source code for that looks like this, which is somewhat verbose but arguably not unreasonably so.

However, you can probably kind of tell from reading that that it was pretty slow to write, and from looking at it that the workflow doesn’t really support rapid iteration to get it to look good.

What you can’t necessarily tell from looking was that when I added that diagram to this document, it broke the triangle example higher up, so I hacked it not to interfere, and then I had to spend some time debugging incompatibilities between Firefox and Chromium about when they applied certain styles (arguably a bug in Firefox).

SVG’s abstraction capability is very limited

That diagram exploits SVG’s capabilities for reducing duplication to the maximum, to the point that I had to bring in code from three separate parts of its source document to use it here (and then, as I said, hack them so they wouldn’t break other SVGs). It uses a stylesheet with overrides to specify how lines, arrows, and areas should be drawn, and it uses SVG’s <marker> and <symbol> facilities to define graphic elements that can be used many times in the same drawing or across many drawings.

But those facilities, though complex, expensive in terms of syntactic overhead, and hard to debug, are still very limited. You can reuse a <symbol> in different places, for example, but not with different line widths or colors. (In theory I think you can, but it doesn’t fucking work, at least in Firefox.) You can apply the same style to different pieces of text or different paths, but as far as I can tell, you can’t instantiate a rectangular component at different widths in different places, other than by stretching the whole component, including its line widths. The kind of simple constraint satisfaction we routinely apply to HTML with the CSS box model is entirely outside of our reach in SVG.

WYSIWYG SVG editors produce insanely bloated output

On the other hand, the standard tools for generating SVGs produce data that looks more like this:

That’s eight significant figures on every coordinate, absolute coordinates everywhere instead of relative, and using diagonal lines instead of H horizontal or V vertical lines even when the relevant coordinate difference is in the fifth significant figure, and also using unnecessary spaces. The extra significant figures are essentially random, so gzip isn’t going to be able to compress them, except by noting that, being digits, they only need four bits each.

The problem with this kind of thing is that, even when it’s hand-drawn, it mixes the actual desired signal with a lot of random noise which is hopefully imperceptible but still incompressible. The mouse produces, say, 100 positions per second, each with three significant figures in each of X and Y, though the number is smaller with relative positions. Crudely, that’s 6400 bits of data per second, 800 bytes per second, after compression, that can be added to the Dercuano download package.

I launched Inkscape, plugged in a mouse, and drew with the pencil tool for a while. Running Inkscape for 110 seconds, I produced a 54-kilobyte SVG that compresses to 15.6 kilobytes; most of it looks like this (line breaks added for clarity):

That is, Inkscape has converted my three-significant-figure mouse coordinates into coordinates with seven to nine significant figures. Just in case. How helpful.

Why this is too much bloat for Dercuano to tolerate

The trouble with this is that 15.6 compressed kilobytes per 110 seconds is 1.1 kilobit per second, and the entire five-megabyte target size for Dercuano — containing many years of notes — would be completely filled with SVG in less than ten hours of drawing time. I’ve already spent over 20 hours just getting the first 600 kilobytes of Dercuano to work so far.

The basic reason many years of notes fit into five megabytes is that I can only type about 90 words per minute, which is 72 bits per second uncompressed, 24 bits per second compressed — a bit rate 45 times lower. And then I spend time revising the notes, which often makes them better without making them larger, though as anyone can see, I don’t revise my notes nearly enough.

TikZ won’t fit

One of the SVGs I excerpted above is a sort of graphic of a comic-book-style explosion or impact, but with a gradient. I generated it with the TikZ graphics system for T_EX from the following input:

This uses (θ ÷ 17)³ % 57 (composed with an affine function) to generate a “random” radius for each of 90 different angles to generate a jagged outer polygon, then fills it with a gradient. This is not something you can do in SVG, although you can do it in something else (like TikZ or JS) and generate SVG programmatically. And TikZ comes with a massive built-in library of things like arrowheads, directed graph layout algorithms, tree layout, plotting math functions, calendars, finite state machines, ERDs, Petri Nets, and so on. The sections of the TikZ manual about arrowheads (and arrow tails, etc.) total about 10 pages. It’s a bit overwhelming, honestly.

Unfortunately, I can’t use TikZ directly in Dercuano, because that would involve embedding not only the 5 megabytes of TikZ, but also the rest of T_EX and L^AT_EX that it depends on to run, in some kind of browser-executable form, probably compiled with Emscripten. And, while that’s probably a feasible thing to do, I’m pretty sure would blow my space budget for Dercuano.

I could draw (“write”?) graphics in TikZ and generate SVGs from them, but that just puts us back at square one: the SVGs are bloated, and I’d blow my space budget even faster that way.

TikZ has another problem, too: it’s purely intended for static graphics. But in Dercuano much of the time a dynamic, interactively-responsive graphic would be better, and only marginally harder to write.

d3 may be a good option, but not for illustrations

d3.js is a JS library for dynamic data-driven graphics which generates SVG at runtime using the DOM. One of the minified copies of d3.js I have here is 151 kilobytes, and it gzips to 53 kilobytes; the current v5.9.2 gzips to 79 kilobytes. Another, non-minified copy I have is d3 v3, and it’s 315kB, gzipping to 68kB. It’s pretty easy to use, and in particular it’s pretty easy to get really nice graphical output from, and it’s amazing at interactivity. On the downside, the JS code to use it is usually pretty verbose, and it’s not well suited for the kinds of sketching and illustration I earlier said were most important, although it’s probably fine for timelines.

Some kind of restricted-bandwidth WYSIWYG drawing program

For some things, though, the closer to paper, the better. I just don’t want to pay half a megabyte per hour for the privilege of faithfully recording the shaking of my hands as I sketch; shaky lines can be good but we might as well produce the shakiness from a highly-compressible random number generator. One possibility here would be drawing with a line that thrashes around near the mouse, starting at low-entropy points in whatever encoding I end up with, and gradually moving to higher-entropy points. Or maybe you stroke the mouse along a line multiple times to coax it into the shape you want. Maybe snap to a grid that gets finer if you zoom in: then you’d need to zoom in. Or maybe you need to click to nail down a spline point, and there’s a snap grid that is finer for smaller displacements.

Another possibility would be something that encourages drawings to be highly factored, perhaps by having clumsy graphical primitives (though primitives that compose flexibly, e.g., adopting the line width and color from their use context) and really low reuse overhead.

A third possibility, and this is kind of cheating in a way, is just to use a textual language to describe the graphics, but shorten the feedback cycle as much as possible so that you can vary things interactively to see the results on the screen. To the extent that you can map backwards from the screen image to the textual source code, you may be able to alter the textual source code by direct manipulation of the screen image.

Ivan Sutherland’s SKETCHPAD was a visual programming language for interactively constructing a set of graphical objects and constraints for them to fulfill, progressively approaching fulfillment, and visualizing the results.

The problem with these approaches is that in some sense you’re deliberately impeding the drawing process, which you would think would make it clumsier. But maybe that clumsiness can be minimized or even reversed — the bandwidth from my brain to the mouse is definitely not 1100 bits per second and may not even be the 24 bits per second I get on the keyboard. If the drawing program can somehow filter out the other 1000+ bits per second of pure noise, it might actually make it easier to draw things rather than harder.

Penrose looks like an appealing approach, but would need reimplementing

The unfortunately named Penrose (GitHub) diagramming software project is not related to Penrose diagrams (also known as Penrose–Carter diagrams), the Penrose Graphical Notation, the Penrose Project (a band), or the OpenJDK project Penrose. Instead, it’s Katherine Ye’s research team at CMU, which has produced software that largely consists of three DSLs, with even more unfortunate names — “DDL” defines the primitive objects of some mathematical domain, such as the vector space ℝ² or ZFC set theory; “STYLE” explains the desired visual representation of each domain concept in terms of constraints and optimization objectives; and then “SUBSTANCE” describes the particular objects to put into a particular diagram. Further interactive tweaking is then used to adjust one of the various outputs from the constraint solver to look best.

The underlying approach sounds reasonable, and they’ve gotten some impressive results, but the implementation is 8000 lines of Haskell, plus substantial bits in Java, TypeScript (for a React UI), JS, and Emacs Lisp; they’re actually using Alloy (thus the Java) to reduce some kind of problem to SAT and solve it with SAT4J (maybe the layout optimization problem itself, although I don’t think SAT4J would be particularly good at that? Or maybe some other problem). Using Penrose itself for diagrams in Dercuano would thus involve generating vector graphics in SVG or whatever during the Dercuano build process, since running the existing Penrose software in the browser would not be practical, even with Emscripten, which itself would probably blow Dercuano’s space budget.

Something like Zdog might be interesting for 3D

Zdog is a browser-based 3-D engine using points, lines, arcs, and splines rendered with stroke thickness, with SVG (and, I think, <canvas>) output. Its implementation, though lightweight, doesn’t run in old browsers, and I’m not that fond of its API.

It does a great job of demonstrating the potential of the approach, which is pretty easy to implement; doing it just for points with diameter (“spheres”) took me just a few lines of JS on <canvas> a few years ago:

In that case, though, the “scenegraph” (coordinates and colors) was randomly generated at startup. It doesn’t really help with the question of how to get the 3-D models made in the first place.

Topics

Graphics (91 notes)
Human–computer interaction (76 notes)
Compression (28 notes)
Dercuano (16 notes)
Multitouch (12 notes)
JS (12 notes)
Constraint satisfaction (9 notes)
Sketchpad (3 notes)
TikZ
SVG
Inkscape
D3