
304 points by nimbusega | 1 comment

nimbusega
I made this to experiment with embeddings and explore how different ways of displaying information affect your perception.

It gets the top 100 stories, sends their HTML to GPT-4 to extract the main content (plain HTML parsing wasn't producing good enough results), and then computes an embedding from the title and content.
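A minimal sketch of that pipeline in Python, assuming the official OpenAI SDK and the public HN Firebase API; the model names, extraction prompt, and helper names here are illustrative, not the author's actual code:

```python
# Sketch: fetch top HN stories, extract content with GPT-4, embed title+content.
import requests
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def top_story_ids(n=100):
    # The public HN Firebase API returns up to 500 top story IDs.
    ids = requests.get("https://hacker-news.firebaseio.com/v0/topstories.json").json()
    return ids[:n]

def extract_main_content(html: str) -> str:
    # Ask the model to strip boilerplate and return only the article text.
    resp = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": "Extract the main article text from this HTML."},
            {"role": "user", "content": html[:50_000]},  # stay under context limits
        ],
    )
    return resp.choices[0].message.content

def embed(title: str, content: str) -> list[float]:
    resp = client.embeddings.create(
        model="text-embedding-3-small",
        input=f"{title}\n\n{content}",
    )
    return resp.data[0].embedding
```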

Likes/dislikes are stored in local storage and compared against all stories using cosine similarity to find the most relevant stories.
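The site presumably does this client-side in JavaScript against vectors cached alongside local storage; a Python sketch of one plausible scoring scheme (likes pull a story up, dislikes push it down, both assumptions on my part):

```python
# Sketch: rank stories by cosine similarity to liked/disliked story embeddings.
import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def score(story_vec, liked_vecs, disliked_vecs):
    # Nearest like raises the score; nearest dislike lowers it.
    pos = max((cosine(story_vec, v) for v in liked_vecs), default=0.0)
    neg = max((cosine(story_vec, v) for v in disliked_vecs), default=0.0)
    return pos - neg

# ranked = sorted(stories, key=lambda s: score(s.vec, likes, dislikes), reverse=True)
```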

It costs about $10/day to run. I was thinking of offering additional value for a small subscription. Maybe more pages of the newspaper, full story content/comments, a weekly digest or ePub export or something?

jzombie
> Likes/dislikes are stored in local storage and compared against all stories using cosine similarity to find the most relevant stories.

You're referring to using the embeddings for cosine similarity?

I am doing something similar with stocks: taking several decades' worth of 10-Q statements for a majority of stocks, along with weighted ETF holdings, and using an autoencoder to generate embeddings that I run cosine and Euclidean similarity computations on via Rust WASM.
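The commenter's implementation is Rust compiled to WASM; as a sketch of the core idea, here is a tiny PyTorch autoencoder whose bottleneck serves as the embedding (the layer sizes and feature count are assumptions, not the commenter's design):

```python
# Sketch: autoencoder over numeric 10-Q features; the bottleneck is the embedding.
import torch
import torch.nn as nn

class AE(nn.Module):
    def __init__(self, n_features: int, dim: int = 32):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_features, 128), nn.ReLU(),
                                     nn.Linear(128, dim))
        self.decoder = nn.Sequential(nn.Linear(dim, 128), nn.ReLU(),
                                     nn.Linear(128, n_features))

    def forward(self, x):
        z = self.encoder(x)          # bottleneck = the embedding
        return self.decoder(z), z

model = AE(n_features=64)            # e.g. 64 numeric fields per 10-Q filing
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

def train_step(batch: torch.Tensor) -> float:
    opt.zero_grad()
    recon, _ = model(batch)
    loss = loss_fn(recon, batch)     # reconstruction objective
    loss.backward()
    opt.step()
    return loss.item()

# After training, model.encoder(x) yields vectors to compare with cosine
# similarity or Euclidean distance, as in the snippet above.
```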

tiborsaas
> I am doing something similar with stocks.

How well does it work?