
257 points by pg | 23 comments
1. rarrrrrr ◴[] No.2121225[source]
Since no one has mentioned it yet: Varnish (varnish-cache.org), written by a FreeBSD kernel hacker, has a very nice feature in that it puts all overlapping concurrent requests for the same cacheable resource "on hold", fetches that resource from the backend only once, then serves the same copy to all. Nearly all the expensive content on HN would be cacheable by Varnish. Then you can get down to pretty close to one backend request per content change and stop worrying about how arbitrarily slow the actual backend server is, how many threads it has, how it deals with sockets, garbage collection, and all that.
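A minimal sketch of that setup, assuming Varnish 2.x syntax and a hypothetical local backend; the 30-second TTL is arbitrary, and in practice you'd also have to deal with cookies, since Varnish won't cache responses carrying Set-Cookie by default:

    backend arc {
        .host = "127.0.0.1";    # hypothetical: the Arc app server
        .port = "8080";
    }

    sub vcl_fetch {
        # Cache every page briefly; concurrent misses for the same URL
        # wait on a single backend fetch and all get the same copy.
        set beresp.ttl = 30s;
        return (deliver);
    }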
replies(4): >>2121261 #>>2121274 #>>2121319 #>>2122946 #
2. dauphin ◴[] No.2121261[source]
You clearly don't understand the problem. Even mod_pagespeed or memcached would be more appropriate here: they are rate-limited by the LISP kernel anyway (we are talking about dynamic content here).
replies(1): >>2121593 #
3. j_baker ◴[] No.2121274[source]
Can't you only use Varnish for mostly non-dynamic content? For example, wouldn't the fact that it displays my username and karma score at the top of the page make it so that you couldn't use Varnish (or at least make it more difficult)?
replies(2): >>2121326 #>>2121328 #
4. nuclear_eclipse ◴[] No.2121319[source]
Reverse proxies won't work for HN, because requests for the same resource from multiple users can't use the same results. Not only are certain bits of info customized for the user (like your name/link at the top), but even things like the comments and links are custom per user.

Things like users' showdead value, as well as whether the user is deaded, can drastically change the output of each page. E.g., comments by a deaded user won't show as dead to that user, but they will for everyone else...

replies(5): >>2121335 #>>2121419 #>>2121430 #>>2121544 #>>2122733 #
5. dminor ◴[] No.2121326[source]
Yes, you'd have to somehow separate the dynamic content from the static content so they could be fetched in different requests and then combined (probably via ajax). If it's just your username and karma then it's simple enough, but if the comments are displayed differently for different people then it could be a bear.
replies(2): >>2121485 #>>2121585 #
6. seiji ◴[] No.2121328[source]
Check out http://www.varnish-cache.org/trac/wiki/ESIfeatures

There could be a private internal URL to just return username and karma to populate the user info header.
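A sketch of what that might look like, with a hypothetical /userbar fragment URL (Varnish 2.x enables ESI with the `esi;` keyword in vcl_fetch; Varnish 3 uses beresp.do_esi instead):

    sub vcl_fetch {
        if (req.url ~ "^/(item|news)") {
            esi;    # process <esi:...> tags in these cached pages
        }
    }

The cached page template would then reference the per-user fragment:

    <esi:include src="/userbar"/>

Only the /userbar request needs to go to the backend (or a per-user cache entry); the surrounding page is served from the shared cache.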

replies(1): >>2121733 #
7. rfugger ◴[] No.2121335[source]
To make this work you could do all the per-user stuff with javascript and ajax calls in the browser. It would be quite a bit of revamping though.
8. Aaronontheweb ◴[] No.2121419[source]
So, you can't do donut caching in Varnish?
9. aonic ◴[] No.2121430[source]
Varnish supports edge-side includes. The header bar could be an ESI include, and the rest of the page could be cached.
replies(1): >>2121555 #
10. yummyfajitas ◴[] No.2121485{3}[source]
Comments are displayed differently for different people.

I once posted a comment that was immediately invisible to everyone besides me - I'm guessing it was marked as spam for some reason, but left visible to me so I'd think I had successfully posted it.

11. jjoe ◴[] No.2121544[source]
There's cookie-based caching in Varnish (and in some other proxy caches too). Essentially, the key is going to be made of the usual hash + the cookie like this:

    sub vcl_hash { set req.hash += req.http.cookie; }

What this means is that the cache is per-logged-in-user and pretty much personalized. The server's going to need a lot more RAM than usual. You can set a low TTL on the cache entries so they're flushed and not kept in memory indefinitely. But the performance boost is great.
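Spelled out slightly (same Varnish 2.x syntax as above; the 10-minute TTL is an arbitrary example):

    sub vcl_hash {
        # No return(hash) here, so Varnish's default hash (URL + host)
        # is still appended; the cookie makes each entry per-user.
        set req.hash += req.http.cookie;
    }

    sub vcl_fetch {
        set beresp.ttl = 10m;   # low TTL so per-user entries expire quickly
    }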

This is not recommended as an always-on measure. We wrote an entry about accomplishing something similar with Python and Varnish. Here it is if you're interested in reading about it: http://blog.unixy.net/2010/11/3-state-throttle-web-server/

Regards

12. nuclear_eclipse ◴[] No.2121555{3}[source]
> and the rest of the page could be cached

Except it can't, for the reasons I mentioned above. E.g., if my account is deaded, then when I view a thread with one of my own comments, it looks different than if someone else were viewing that same thread, especially for those of us with or without showdead checked in our profiles.

It's not as straightforward as you would like it to be.

replies(2): >>2121915 #>>2122019 #
13. rarrrrrr ◴[] No.2121585{3}[source]
I think the only way PG would try this is if it's clear that javascript, client-side hacks, etc. are not required -- and I think that's the case.

Aside from the header, there's only a relatively small number of variations for any given piece of content, right? Showdead, ability to downvote, etc.? So each of these variations gets a distinct ESI URL, like /item_$showdead_$downvote_$etc right in the internal URL, so any combination of these is a distinct URL. Only the first user to hit any particular combination would result in a request to the backend, and that could remain in cache until its content changed. No wizardry required.
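A sketch of how those variations could be folded into the cache key, assuming the app exposed the flags as cookies (the cookie names here are hypothetical; Varnish 2.x syntax):

    sub vcl_hash {
        # One cached variant per flag combination, not per user.
        if (req.http.Cookie ~ "showdead=1") {
            set req.hash += "showdead";
        }
        if (req.http.Cookie ~ "candownvote=1") {
            set req.hash += "candownvote";
        }
        # Fall through: the default hash (URL + host) is appended.
    }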

14. rarrrrrr ◴[] No.2121593[source]
Varnish sits in front of the backend, responding directly to all requests it has cached content for, without bothering the backend at all. It lives higher in the stack than pagespeed or memcache, and is not limited by the backend's speed in any way.
15. j_baker ◴[] No.2121733{3}[source]
Doesn't that kind of defeat the purpose though? The point of using varnish is that it keeps you from having to access the backend altogether. This is getting into an area where something like memcache might be more appropriate.
replies(1): >>2121765 #
16. danudey ◴[] No.2121765{4}[source]
Well, the point of using varnish is to keep you from having to access the backend any more than is absolutely necessary. It's incredibly trivial to generate HTML showing a user's username and karma, and even if it weren't it could be stored in memcached. Generating the front page, the comments pages, etc. is the hard part, and varnish can keep that from being generated any more than is necessary.
replies(1): >>2121924 #
17. aonic ◴[] No.2121915{4}[source]
Special cookies could be set for dead users and users who enable showdead to bypass the cache.

For example, one of the sites I run gets about 50K pageviews/day from logged-in users and another 600K pageviews/day from anonymous users coming from referrals or search engines. Logged-in users have similar customization options, so we bypass the cache for them by detecting a cookie.

Obviously going the cache route would require some changes to how things are set up; it's not a turn-key solution. But the small amount of work is well worth it for most content sites. For a user-generated-content site like HN, it would also depend on how the TTLs and cache purging are set up.
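The bypass itself is a couple of lines of VCL, assuming a hypothetical "nocache" cookie set by the app for dead or showdead accounts (Varnish 2.x syntax):

    sub vcl_recv {
        if (req.http.Cookie ~ "nocache=1") {
            return (pass);  # these users always go to the backend
        }
    }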

18. j_baker ◴[] No.2121924{5}[source]
Of course, but I seem to recall pg writing at some point that one of the goals of HN was to prove that "slow" languages can scale using caching. I assume, therefore, that he already has caching of some kind in place for those things. If Varnish isn't going to save an access to the server (which seems to be the primary thing slowing things down), what value does it provide above what pg already has in place?
replies(1): >>2121932 #
19. danudey ◴[] No.2121932{6}[source]
The requests won't queue up as badly, because the server can clear out 'simple' requests much faster than it can generate full pages. Since each request takes less time to handle, they can be cleared faster than they come in, whereas larger requests queue up faster than they can be handled.
20. nitrogen ◴[] No.2122019{4}[source]
The majority of requests probably come from live accounts in good standing or from people not even logged in, so the majority of requests could still be cached.
replies(1): >>2122860 #
21. piotrSikora ◴[] No.2122733[source]
Of course it will work. The whole point of a reverse proxy is to buffer slow requests and send them quickly over the LAN to back-end servers that cannot handle high concurrency efficiently.

FreeBSD's accept_filter() used by rtm does more or less that (you can think of it as a reverse proxy in the kernel), but it only works for plain HTTP and the HEAD/GET methods.

22. nkurz ◴[] No.2122860{5}[source]
Interesting point: what percentage of viewers are logged in? I was presuming it was high, but I guess I really don't know.
23. esbcupper ◴[] No.2122946[source]
Instead of having concurrent requests wait for the single request to the backend, it's almost always better to use stale-while-revalidate (http://tools.ietf.org/html/draft-nottingham-http-stale-while...). AFAIK both Varnish and Squid support this.
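As of Varnish 2.x there's no literal stale-while-revalidate implementation, but its "grace" mode is close: an expired object can still be served to waiting clients while a single fetch refreshes it. A sketch, with arbitrary durations:

    sub vcl_recv {
        set req.grace = 30s;    # accept objects up to 30s past their TTL
    }

    sub vcl_fetch {
        set beresp.ttl = 30s;
        set beresp.grace = 30s; # keep objects around 30s beyond the TTL
    }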