Show HN: GitMCP is an automatic MCP server for every GitHub repo

(gitmcp.io)

185 points liadyo | 5 comments | 03 Apr 25 18:28 UTC | HN request time: 1.029s | source

Show context

the_arun ◴[03 Apr 25 18:40 UTC] No.43573672[source]▶

>>43573539 (OP) #

But why would we need an MCP server for a github repo? Sorry, I am unable to understand the use case.

replies(5): >>43573747 #>>43574513 #>>43576792 #>>43576809 #>>43580608 #

qainsights ◴[03 Apr 25 19:50 UTC] No.43574513[source]▶

>>43573672 #

Same here. Can't we just give the repo URL in Cursor/Windsurf to use the search tool to get the context? :thinking:

replies(3): >>43575149 #>>43575514 #>>43577370 #

1. cruffle_duffle ◴[04 Apr 25 01:20 UTC] No.43577370[source]▶

>>43574513 #

MCP servers present a structured interface for accessing something and (often) a structured result.

You tell the LLM to visit your GitHub repository via http and it gets back… unstructured, unfocused content not designed with an LLM’s context window in mind.

With the MCP server the LLM can initiate a structured interface request and get back structured replies… so instead of HTML (or text extracted from HTML) it gets JSON or something more useful.

replies(1): >>43578640 #

2. cgio ◴[04 Apr 25 05:29 UTC] No.43578640[source]▶

>>43577370 (TP) #

Is html less structured than json? I thought with LLMs the schematic of structure is less relevant than the structure itself.

replies(1): >>43584909 #

3. cruffle_duffle ◴[04 Apr 25 16:44 UTC] No.43584909[source]▶

>>43578640 #

Just trying to explain it to you made me think of a very good reason why an MCP is preferable to just telling it to fetch a page. When you tell ChatGPT or Sonnet or even cursor/windsurf/whatever to fetch a website do you know exactly what it is fetching? Does it load the raw html into the context? Does it parse the page and return just the text? What about the navigation elements, footer and other “noise” or does it have the LLM itself waste precious context window trying to figure the page out? Is it loading the entire page into context or truncating it? If it is truncated, how is the truncation being done?

With an MCP there is no question about what gets fed to the model. It’s exactly what you programmed to feed into it.

I’d argue that right there is one of the key reasons you’d want to use MCP over prompting it to fetch a page.

There are many others too though like exposing your database via MCP rather than having it run random “psql” commands and then parsing whatever the command returns. Another thing is letting it paw through splunk logs using an MCP, which provides both a structure way for the LLM to write queries and handle the results… note that even calling out to your shell is done via an MCP.

It’s also a stateful protocol, though I haven’t really explored that aspect.

It’s one of those things that once you play with it you’ll go “oh yeah, I see how this fits into the puzzle”. Once you see it though, it becomes pretty cool.

replies(1): >>43590149 #

4. cgio ◴[05 Apr 25 02:37 UTC] No.43590149{3}[source]▶

>>43584909 #

I don’t mind schemas and repositories, but I feel it’s a bit backwards. That’s the kind of work I would hope we can avoid with AI.

replies(1): >>43590729 #

5. cruffle_duffle ◴[05 Apr 25 04:23 UTC] No.43590729{4}[source]▶

>>43590149 #

MCP is written for the AI we’ve got not the ones doing all the hyping want us to believe exists.

With a long enough context window it wouldn’t matter the difference. But “long enough” in this context to me means where you view its length as big enough where size no longer matters. Kind of like modern hard drives that are “big enough that I don’t care about a 1gb file” (I was thinking megabyte files but that might be too large of an order of magnitude )

↑