From 5a671268340c107ee94f8244f01b6cb8ee9decd0 Mon Sep 17 00:00:00 2001 From: dvs1 Date: Sat, 25 Jan 2025 03:14:10 +1000 Subject: YAR see description for details. New everything page, lists all the pages and where they came from, with links. Now we know where the page came from, type and URL. Rejigged how metadata works. Do our own metadata substitution. Fixed up a bunch of bugs. General cleanups. --- TODO.md | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) (limited to 'TODO.md') diff --git a/TODO.md b/TODO.md index 60a2eca..84f62b8 100644 --- a/TODO.md +++ b/TODO.md @@ -6,26 +6,27 @@ Bugs - - /users/Plentyn/WebHome.HTML   has that twisty thing which looks not simple to remove. - /users/dunno/Devuan Cluster.HTML   A very lengthy and complex document, I'll likely miss something, but chip away at the obvious. - /users/Debdog.HTML   pandoc can't handle the background table cell colours in the "Background colours" table, which is kinda the point of it. -- PmWiki in it's current config needs that ?n=foo.bar nonsense for the Original page link. Which I'm currently neatly sidestepping, the scraped page has a similar thing. Check the timestamps on the files, only update if source is newer than destination. Meh, it's already 600 times faster than the pandoc version. + - One quirk to watch for is if a URL path changes, the docs that have that URL need to be redone. - pandoc is a lot slower though, so do this for sure when dealing with that. - When scraping the web sites, they tend to be dynamically generated with no useful timestamp on them. - The web site scrape happens locally anyway, I can compare source file timestamps. Add atom feed for single page. Alas cgit only seems to have ATOM feed on the whole repo, not individual files. + - git.devuan.org might have usable per page history. - However, once timestamps are sorted, I can use that code to generate RSS and ATOM feeds, and create page histories using diffs. Deal with complex directory trees. + - /testing/even should display as even/deeper on the testing page, coz even/ has no files, but even/deeper does. - On the other hand, I could just create an empty page with the sub directories shown as usual. - Scanning /usr/share/doc on my super desktop with looots of software installed will be fun. - On the gripping hand, this is where we want empty directories to vanish. - ## Maybe not Extract title from Fos and Pm, maybe even pagetitle if possible. Title is pointless, both of them just use the file name. @@ -34,12 +35,13 @@ Extract title from Fos and Pm, maybe even pagetitle if possible. Title is point ## Try out +Lua pattern matching sucks in general. Might be why people use lpeg instead. Try it. + htmx cgit has Lua - ## User system Reuse the user system from SledjChisl. -- cgit v1.1