www.marginalia.nu

You may be looking for one of the following things

🔗 Search Engine
🔗 Encyclopedia (not mobile-friendly!)
🔗 Website Explorer (improved)
🔗 Similar Website Finder
🔗 Server Status

My name is Viktor. I’m a Swedish software engineer and hypertext enjoyer. Marginalia is a website I’ve built. It’s really almost a bunch of websites on a common theme. If you find yourself clicking a link and ending up on a page that looks completely different, that’s just how things are.

🌎 Marginalia Search on GitHub
🦤 @MarginaliaNu on Twitter
🦤 @marginalia@mastodon.social
📚 @ViktorLofgren on YouTube
✉️ kontakt@marginalia.nu on Email

Site Index

Name                     Date
📁 Weblog/               2025-05-23
📁 Miscellaneous/        2025-01-18
📁 Release Notes/        2024-10-14
💭 Problems/             2024-04-03
📁 Recipes/              2023-08-31
🔧 Server Status Log/    2023-08-27
📁 Marginalia Search/    2023-03-28
📁 Links/                2022-09-15
🤖 Weird AI Crap/        2022-08-01
📄 Uses                  2024-02-01

Recent Updates

2025-05-23 A 2030 morning routine in log
You wake at 05:30 in the morning, feeling somewhat groggy. Instead of the alarm clock ringing like it normally does, a cheerful hologram appears: “Hi! I’m Kyle, your new alarm clock assistant!” You get dressed as Kyle explains all of the fantastic things he is capable of. You head over to the coffee machine. “Hey there! I’m Evan! Are you ready for AI in your coffee? But first - tell me about yourself!
2025-05-13 PDF to Text, a challenging problem in log
The search engine has recently gained the ability to index the PDF file format. The change will deploy over a few months. Extracting text information from PDFs is a significantly bigger challenge than it might seem. The crux of the problem is that the file format isn’t a text format at all, but a graphical format. It doesn’t have text in the way you might think of it, but more of a mapping of glyphs to coordinates on “paper”.
2025-04-22 Debugging A Crawler Stall in log
Some time ago, I migrated the crawler off the okhttp library, to use Java’s built-in HTTP client. This seemed like a good idea at the time, but has led to a fair number of headaches. Java’s HttpClient has one damning flaw, and that is that it doesn’t support socket timeouts. Its only supported timeout values are time to connect, and time until first byte of the response. This means the client can get stuck on a read call if a server stops responding, potentially for a very long time!
2025-03-27 Crawl Order and Disorder in log
A problem the search engine’s crawler has struggled with for some time is that it takes a fairly long time to finish up, usually spending several days wrapping up the final few domains. This has become more pressing recently: the migration to slop crawl data has dropped the crawler’s memory requirements by something like 80%, so I’ve been able to increase the number of crawling tasks, which has led to a bizarre case where 99.
2025-03-25 Marginalia Search receives second nlnet grant in log
I’m happy and grateful to announce that the Marginalia Search project has been accepted for a second nlnet grant. Not all the details are finalized yet, but tentatively the grant will go toward addressing most of the items in the project roadmap for 2025. I’ve already been working full time on the project since summer 2023, and this grant secures additional development time, and extends the runway to a comfortable degree.

Tags

Name                     Count
🏷️ ai/                   3
🏷️ bots/                 4
🏷️ cooking/              6
🏷️ memex/                2
🏷️ moral-philosophy/     7
🏷️ nlnet/                18
🏷️ platforms/            9
🏷️ programming/          24
🏷️ satire/               6
🏷️ search-engine/        70
🏷️ server/               2
🏷️ sleep/                2
🏷️ web-design/           12