A related question - if you feed each comment into an LLM and asked it to classify into {human-produced, llm-produced, not-sure}, how many would it think are from LLMs? How could you try to investigate the true answer?
Credit to Claude Code for showing me how to search the HN database through Google BigQuery and for writing the HTML for the leaderboard.