Show HN: Blogs.hn – tiny blog directory
blogs.hnLike others on here, I was inspired by the "personal blogs" post :)
[0] https://news.ycombinator.com/item?id=36575081
In proper HN fashion, the site is open-source and requires no JS!
There's instructions on how to add/edit a blog on the Github README.
[1] https://github.com/surprisetalk/blogs.hn
Note that your blog might appear in blogs.json, but not on the site! Right now I'm using the following code to filter out blogs. Your blog might appear if you add more metadata:
if (
3 > 0
+ !!blog.title
+ (blog.desc.length > 40)
+ !!blog.about
+ !!blog.now
+ !!blog.feed
+ 3 * (blog.hn.length > 1)
)
continue;
As I mention on the /about page, if you don't already have a blog, I recently made a minimal static site generator! It's easier than ever to begin your writing journey :)
> https://github.com/surprisetalk/blogs.hn/commits/main
Uh, wanna talk about your commit messages? I mean I love it to bits, but I feel something needs to be said. Not sure what. It may be U+2603.
Ha! I added an “emoji roulette” script to my workflow for personal projects.
There’s something fun about seeing my experience with a commit randomly summarized by an emoji haha
What kind of algo is it that summarizes your coding session as "carnival tent emoji"?
Or does roulette imply (pseudo?) random?
Here's the entire script that determines the commit message:
Sometimes I'll see a carnival tent or horse face during a late night coding session and think to myself, "Wow this emoji perfectly captures how I'm feeling right now!"
I know it's stupid, but it makes me smile :)
I feel you may also like lolcommits[0], it doesn't pollute the public repository, but takes a webcam photo of you and can optionally add the commit message into the image. Every few months I would convert the images into an animation and it's interesting watching the facial expressions, change in appearance and locations.
[0] https://lolcommits.github.io/
I'd have a fucking aneurysm if someone tried to pass off those commit messages at work.
Imagine being a doctor's patient and someone write that in your medical file! U+1F631
Luckily, this is clearly not a professional project in any way, so no aneurysms were had this day.
Glad I'm not the only one.
You're not at work though, this is someone's personal project that they built for their own enjoyment.
Because I was curious, frequency counts:
Recent and related:
Show HN: Hacker News user blogroll - https://news.ycombinator.com/item?id=36605493 - July 2023 (183 comments)
Show HN: OPML list of Hacker News Users Personal Blogs - https://news.ycombinator.com/item?id=36627112 - July 2023 (83 comments)
Normally we downweight follow-ups but this is cool and I think there's room for one more!
If several people create this, it is quite a decentralized way to discover blogs. More people should do this.
Yeah it's been an extremely useful dataset.
I've been using it to calibrate my search engine's quality website detection since it's effectively a large set of high quality websites.
Aw, my blog is still not in there :(
Add it :)
https://github.com/MarginaliaSearch/submit-site-to-marginali...
https://github.com/MarginaliaSearch/submit-site-to-marginali...
Ahh I looked a few times for a way to submit but didn't find this, thanks!
Yeah I only added the link the other week. Was a bit skeptical about using github, but so far github seems like a very useful way of crowdsourcing index entries.
some personal notes you may find useful
Sweet, I'm always on the look collections of websites with consistently higher quality. Anything is better than the sketchy SQL queries I have to resort to in order to grow the index programmatically.
Probably anything from the planets (eg: planet-if, planet gnome) should be of reasonable quality too, and focused by general topic.
Thank you. With respect to the various individuals working to build their own directories, marginalia search is the missing element that allows me to sift through the register of blogs that are being collected.
This whole thing that's been happening the past few days feels like HN at its finest.
I think we call it the year 2002 and I love it. A normally distributed Internet is the way.
It feels like a single HN post a few days ago reinvented web rings and DMOZ for a whole new generation.
What is DMOZ ?
https://en.m.wikipedia.org/wiki/DMOZ
Pertinent that DMOZ launches just a few months before Google. It was basically an open version of Yahoo directories, right at the time when Google swooped in and ate Yahoo’s lunch.
Ontologies are hard. Search won.
https://dmoz-odp.org/docs/en/about.html
https://www.dmoz-odp.org/docs/en/about.html
One of the earlier attempts at building a taxonomic directory of websites.
Thanks for making me feel old.
The new name for newhoo
There's also https://ooh.directory/ which is basically just new DMOZ.
Up next: RSS feeds.
Great! Nice design as well. There were so many great blogs on that post, and scrolling 1000 comments got a bit tedious.
Made me want to create my own blog, I'll see what I can come up with. I'll try to submit PR when I do. Love taylor.town as well!
You might be interested in: https://indieblog.page/
see also
- https://gossipsweb.net
- https://xn--sr8hvo.ws
- https://personalsit.es
- https://readsomethinginteresting.com
- https://ooh.directory
Also related,
- https://search.marginalia.nu/
- https://neocities.org/
- https://tildeverse.org/
- https://handmade.network/
- https://1mb.club/
- https://no-js.club/
Additional recommendation:
Ye Olde Blogroll: https://blogroll.org/
Do you think blogs with most recent posts can be bumped up (instead of a static ordering)?
Like... an RSS feed keeping tabs on all of these blogs/their "last updated" field value?
Remember "folksonomy" ...? So I would like to see some kind of automated keyword extraction for each blog to allow selection of topical blogs.
OK, keywords would need to normalied (e.g.plural to singular, plus maybe a bit of abstraction) and this would be a first (tiny) step to make it a blog-centered search engine ... ;-0
I've been experimenting with techniques for figuring out the topic of a blog, and never quite gotten anything to work well that didn't involve extreme computational costs that just do not work on a search-engine scale without a serious hardware budget (zero shot classifiers did a decent job IIRC). An unga bunga technique like TF-IDF works well for a given document, but less well for a website.
Had some half-decent success finding similar blogs given a reference though.
Yes, I expect full text extraction and normalization to be computationally expensive. I did some simple minded experiments on Usenet feeds back in the 90ies and it was quite some work for the VAX it ran on.
You can get it to run fairly fast on modern hardware. Like run a text extraction, tokenization and POS-tagging workflow on a quarter billion documents on PC hardware, takes like 24-36 hours. That's doable and affordable. But ML-adjacent methods are not. Requires far too much GPU compute, have no A100s :-/
I originally implemented it by roughly sorting via HN karma, which you can see in my comment here:
[0] https://github.com/surprisetalk/blogs.hn/blob/22a471d56504d5...
To maximize serendipity, I ended up going with random ordering.
I think there are other RSS-focus solutions (like https://ooh.directory) that are probably a better fit for sorting by recent.
It has something listed under my blog (https://www.hankruiger.com) that doesn’t seem to link to my blog (something about “Hiring advice for startups from Hackruiter (YC S10)”).
Ah! Sorry about that. Looks like Algolia search did some fuzzy matching :) I'll fix that up.
No problem! Cheers and thanks for making this. :)
Made another site like this, that attempts to also generate combined RSS feeds for either all of the blogs or the top 100 users' (by karma) blogs: https://news.ycombinator.com/item?id=36643086
That said, blogs.hn looks way more aesthetically pleasing and usable, good job! Getting sites that are both pretty to look at but also comfortable to use has always been challenging for me.
Couple of notes, I'm not an Node person, but you need a later version of Node than what comes default with Ubuntu 22.04.2 LTS, and your instructions have us installing modules globally, which #1 didn't work for me, and #2 requires root in Linux (please don't require me to install as root to update a text file).
It actually worked when I did:
Yikes! I’ll fix the script and update the instructions when I get home later today.
I tried to make it so that the script wasn’t required, so people should also feel free to enter in their info manually :)
Awesome, thanks for doing this!
As long as we're talking directories: https://ooh.directory
"In proper HN fashion, the site is open-source and requires no JS!"
It seems unfortunate that opening a PR requires having npm installed.
Seems fair though. Given how adversarial the web is with botspam, you basically need some form of bot mitigation. GitHub does a decent enough job, and has the nice feature of being easy to migrate off, since git itself is designed to be decentralized.
I don't quite follow. I don't have a problem with git or Github. I just don't happen to have npm installed.
Ah nevermind, I misunderstood how it worked.
Opening a PR doesn't actually require npm! The npm script is just supposed to save a few keystrokes :) I'll update the readme to reflect that
If it matters, I don't use titles in any of my blog posts, so the feed has a bunch of <title></title> in it.