I’d like if there was some support for customising it without liking and disliking so I could push topics I’m interested in first (e.g. those tagged with emacs). It would also be nice to hide the like and dislike buttons in general as it gives more of a social media feel that the newspaper style UI does well to shake.
And even with margins turned off, stories are split "across" pages in a way that makes them useless for printing: https://i.imgur.com/SvmTGa8.png Need to pay more attention to your "break-inside" properties: https://developer.mozilla.org/en-US/docs/Web/CSS/break-insid... (and switch from using JS-generated absolute styles to using a CSS column layout or masonry grid)
Sadly, I can't remember the name of it but it was pretty great.
This post is not even on it.
It was the peak of RSS for me, beautiful UX, customizable, all the posts in sequential order if I wanted instead of algorithms…
I remember it because useless when web publishers realized they were losing ad views to apps like these and all the posts became previews with links.
https://chromewebstore.google.com/detail/hn-explorer/amiaaon...
There's probably no money in it, but a physical weekly customized RSS feed highlights newspaper would be neat.
https://github.com/gilesbowkett/hacker_newspaper/blob/master...
I kept it running for 5 or 10 years but eventually let it die.
edit: I'm not hating on OP btw. their version has pics, which mine doesn't. just agreeing that I believe the visual hierarchy inherent to newspaper title design is an important benefit of the format.
It gets the top 100 stories, sends their html to GPT-4 to extract the main content (this was not producing good enough results with html parsing) and then gets an embedding using the title and content.
Likes/dislikes are stored in local storage and compared against all stories using cosine similarity to find the most relevant stories.
It costs about $10/day to run. I was thinking of offering additional value for a small subscription. Maybe more pages of the newspaper, full story content/comments, a weekly digest or ePub export or something?
Agreed. This is also why old-school print design product catalogs often had superior presentation compared to today's web UIs for browsing hierarchically organized products. Everything is given the same visual weight and is formatted the same way.
Anyway, improving on what you did with the tooling that's easily available in 2024 but wasn't in 2009 seems like a fun challenge.
Not sure if it's a "premium feature" so to speak, but would be very cool to extend this to comments generally.
Maybe 'See Comments' here could load the comments on the same page? In a newspaper like style.
there was an app called Flipboard at the time which did something similar, but for different news sources, although its model of interactivity was a bit more gimmicky than the endless scroll. (which, for all its faults, is really simple and easy to use.)
You're referring to using the embeddings for cosine similarity?
I am doing something similar with stocks. Taking several decades worth of 10-Q statements for a majority of stocks and weighted ETF holdings and using an autoencoder to generate embeddings that I run cosine and euclidean algorithms on via Rust WASM.
The console error is: (index):464 Error loading stories: TypeError: Failed to construct 'URL': Invalid URL at (index):482:36 at Array.forEach (<anonymous>) at NewspaperApp.displayStories ((index):471:25) at NewspaperApp.loadStories ((index):461:26) at async NewspaperApp.initialize ((index):418:17)
Can anyone help? I really want to use this product it seems great.
I've wanted to take a stab at it because I think it would be "neat" but haven't actually found any good reference implementations.
also seems like with almost everyone on mobile it's just not worth it.
A few years ago, a similar project was posted on HN that I thought was really cool too - E Ink smart screen puts a newspaper on your wall (https://news.ycombinator.com/item?id=22831323).
Found it - it was Instapaper!
EDIT: Well maybe not, this one seems more like a replacement for ReadLater/GetPocket whereas the one I used was purely based off RSS feeds. I used it on the original iPad 1st gen so it's probably long gone. I give up.
I'm thinking specifically of DieWorkwear on twitter, but others too.
I would probably use this or at least play with it extensively if not for this "feature." I find that, unlike "real" newspapers, leading images in blog posts and even much larger sites are frequently a net negative (a trend greatly worsened with the advent of AI image generators).
That's probably closer to the editors choice in the context of HN.
Regarding HTML to GPT-4, I seem to remember commenters here saying they got better results by converting the HTML to Markdown first, then sending to an LLM. Might save a bit of money too.
The closest thing I’ve found to something actually resembling a proper masthead’s layout would actually be this from HN a few months ago: https://cybernetic.dev/grid
Balancing high information density and readability is the key to a proper layout. You’ve erred to far on minimalist “readability”
I’d suggest looking at older Indesign/Quark Express magazines if you want to see elements of publishing layouts done digitally
There’s going to be a great layout one day that fuses the news-editing with web, but this isn’t it
i certainly suspect that the 4chan and reddit datasets, combined with HN's, and building a LoRA that ranks the 4chan and reddit stuff lower and the good HN stuff higher. essentially, subtract all reddit and 4chan style comments from the set of HN comments' weights. Training SD loras was pretty quick but i haven't looked into LLM loras. regardless, the LLM with the HN-4chan&reddit can do sentiment analysis and use the votes; just feed it csv or json: votes, user, comment. I guess you could do votes/age as a cleanup, too.
All this to say i still wouldn't read or use it. I'm not a fan of robots entertaining me.
So the idea is to replicate that and use receipt paper, and thermal or what, dot matrix print onto the roll of paper your tweet stream. then you get something like those plastic M&M bottles, pill bottle, 35mm film bottle (boy i am full of ancient tech ideas)...
if you make some 3d printed cheap compliant mechanism[1] that snaps together and everything fits in a small tin or box...
[0] https://gajitz.com/paper-trails-auto-scrolling-1930s-in-car-... et https://i.imgur.com/WpyOGkI.jpeg (two separate links)
I'm often reading via an e-ink tablet. Whilst I can drop text quality to better support animations, the effect is a gross degredation of everything else, and of course, why the fuck would I want to see animations randomly?
"Animate on hover" is a setting I've long advocated for sites, and coded into CSS both for my own sites and as restylings of third-party sites. It's a compromise between constant distraction and being able to benefit from the very rarely actually useful animation. In the case of the passport photos story, the same effect could be achieved by a grid (2x2, 3x3) showing the variety of photos simultaneously. Detail isn't relevant, variety apparently is, and animation is a cheap eyeball-grabbing trick.
It's reminiscent in some ways of Slashdot of yore, which would include a slug describing the submission. One of my more persistent issues with HN is that the 80-character-limit title gives parlous thin information on whether or not a submission is worth reading. Additional microcontent, even just another 120-240 characters (10--20 words or so) often helps greatly, and your project demonstrates this.
Auto-selecting slugs is of course itself somewhat fraught, one example on the front page of YourHackerNews as I write has a slug beginning "This website is using a security service to protect itself from online attacks. The action you just performed triggered the security solution...", which is probably not what you'd prefer: <https://www.tokyodev.com/articles/the-english-paradox-four-d...>.
I'm not a fan of animations as noted in another comment. The "Passport Photos" story has a hero image which animates: <https://maxsiedentopf.com/passport-photos/>. One option would be to permit removing an item entirely from the layout. Hitting "X" on a story does not presently do this. HN itself has "hide" feature accomplishing this.
In general, I would strongly caution against auto-including images from sites, particularly as those can be pathways to future abuse, including the appropriation of the image-hosting site by unsavoury content. I'd run across an example from an earlier HN submission a few months ago, of the ever-more-aptly named "ShadyURL": <https://news.ycombinator.com/item?id=41002786>.
On layout: Traditional print newspapers aren't merely an assortment of articles, or a ranked placement of articles, but an edited presentation of them. There's usually a top story, of course, but gathered around those will be stories related to the primary feature. See a recent archive of the (online) NY Times homepage for an example: <https://archive.is/HO7xW>.
Layouts are also grouped by topical sections. Again the Times demonstrates this (top news, analysis, opinion, "the great read", "the athletic", "culture and lifestyle", etc. The Guardian similarly, with several news blocks for top news, "headlines", "in focus", "spotlight", "opinion", "sports", "climate crisis", etc.
HN's news breakdown differs, though looking at the submitted sites, title, and in your case the slug should give some options for largely-automated story placement. I've done my own analysis of HN front-pages, and came up with a list of 47 categories of sites with > 17 front-page appearances (and a great many more without), totaling 16,185 classified sites.
Categories: programming (7719), blog (5506), media (816), science (635), news (344), comm. (227), government (129), software (127), video (78), discussion (73), interest (72), design (60), database (57), cryptocurrency (49), law (41), cybersecurity (25), technology (25), commentary (24), recreation (23), hardware (22), medicine (15), documents (14), military (10), literature (9), economics (8), publications (8), list (7), crowdfunding (6), education (6), webcomic (6), (wiki) (5), books (5), info (5), entertainment (4), environment (4), journalism (4), organisations (3), support (3), information (2), translation (2), humour (1), images (1), n/a (1), networking (1), podcast (1), society (1), ui/ux (1).
(I can provide the classification file on request, username at protomail.com.)
That provides pretty comprehensive coverage of the actual stories submitted (I'd had the exact factor once, I believe it's in excess of 90%).
Again: parsing of the titles and/or slugs (perhaps with an AI assist?) could give better classification. Sites such as Lobsters (<https://lobste.rs>) include tags and often have similar submission selections to HN, which might also be used to organise placement.
Another characteristic of traditional layouts is that the horizontal line is reset periodically. If you look at the NYTimes, Guardian, or other traditional news sites you'll find frequent use of horizontal breaks. I don't know if this is a peculiarity of mine or not, but I find that card-style summaries which are not randomly aligned vertically on a page are much easier to read.
My feedback is that a "front page photo" is a very high bar and most of the images fall well below what deserves the space. I would avoid:
- boilerplate page previews for e.g. github
- generic site logos e.g. arxiv, aide
- stock images and ai equivalents e.g. the models in suits stuff
- "decorative" images e.g. prime number
- author photos
- hate autoplaying gifs in this context. Very distracting. They might be great content better "play on demand" for me
Images work better when it's a relevant and a unique visual e.g. the gladiators for the history article or the cockroach but they are still are not really front-page material.To truly earn their place, an image has to add information above and beyond the text. Identifying when an article is actually about an image e.g. space photo, data visualisation etc. would add some real value to the presentation.
- "Whole OS" is a standard alpine image (4MB) with just lynx installed via standard alpine package. Plus a layer for Lynx itself and entrypoint.sh script.
So a very standardized way to run it, with reusable popular base image, decent backbone for delivering it to the public, with ability to easily mirror and/or cache (done by default) each layer. Currently base Alpine has 0 known vulnerabilities, which may not be 0 tomorrow, but it's still a marvel that it ever has such low number. New versions are available instantly after developer creates new public image, without the need for maintainer of a distribution to look at it. Meanwhile your main OS can live it's own life in his own pace, without interference.
It doesn't sound scary at all, if you really have a closer look.
Slugs are chosen using AI, but it doesn't work too well, as you mentioned. I will work on iterating the prompt to try and get better results.
A 'remove' option would be a good addition. It would directly remove the element, and not track it as a 'disliked' story.
Since this whole thing is automated, and there's no curation involved beyond embeddings and HN points, it's hard to display it like a traditional newspaper, with proper vertical alignment (since images and blurbs are manually entered to fill the content there).
I think grouping the newspaper into categories is a great idea. I don't think it works with the daily frontpage, since there isn't enough content (just 30 stories on the front page), but it would work quite well in a monthly version. Your categories are very helpful and I will email you about them.
I now almost exclusively get my HN feed through a simple script I wrote to get desc sorted posts by score or trend (score/time): https://github.com/faroukfaiz10/hackernews-homepage
The result looks something like this ({score/time} - {score} - {link} - {comments link}):
59 - 1478 - Passport Photos - https://maxsiedentopf.com/passport-photos/ - https://news.ycombinator.com/item?id=42069646
16 - 790 - Useful built-in macOS command-line utilities - https://weiyen.net/articles/useful-macos-cmd-line-utilities- https://news.ycombinator.com/item?id=42057431
...
Slug-selection is a tough challenge. AI may not solve it, but at least it will make it more non-deterministic ;-) Reviewing the site again, you're largely doing quite well.
(I'd really like to see a widely-used abstract or summary semantic markup usage, though microformats seem not to have been widely adopted: <https://microformats.org/>.)
The layout / horizontal breaks observation is mostly something that's nagged me for ages in card-based layouts going back to Google+ in the early 2010s. It took me a while to realise that what nagged me about the layout was having multiple columns of cards with no vertical coherence. It just sort of jumbles in my mind, and I'm not sure if that's a cognitive defect of mine or a more general response. I've come to appreciate print-based layout practices increasingly with time, particularly as I find online / fluid layouts increasingly less satisfactory.
(I've made some past observations about layout and on-screen reading on traditional displays vs. a large-format e-ink display. I increasingly strongly prefer the latter for reading, though even there many issues remain. Search: <https://hn.algolia.com/?dateRange=all&page=0&prefix=false&qu...> and focusing on scroll-vs-paginated displays: <https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...>, if you feel like hunting through a bunch of my whinges ;-)
I want to help these businesses. Instagram and Facebook have shopping features, but only for a few brands, and they are not available in many regions like India. India has the world's second most active social media users and millions of businesses trying to acquire customers using social media. I want users and buyers to be able to shop directly from posts without leaving the app. I want my payment model on these platforms to create more convenience for customers and reduce business costs.
So, if someone who has worked at Facebook, Instagram, or any other company has valuable advice for me, please share.
How can I make this happen?
Will these platforms allow me to?
Thank you
You could maybe just redirect to the comment page?
Slug selection uses a better AI prompt now.
I agree with the lack of vertical alignment. It's not just you, many people have the same view, and there's a reason it's been used in newspapers. And there's a reason it's not used when the content and layout is generated automatically, but I'm going to work on a solution.
Thank you, I will browse through your comments :)
The code lives here: https://github.com/jzombie/etf-matcher
The ad-hoc vector DB I've created lives here: https://github.com/jzombie/etf-matcher/blob/main/rust/src/da...