

237 points jdkee | 17 comments
whoknowsidont ◴[] No.45948637[source]
MCP was a really shitty attempt at building a plugin framework, one vague enough to lure people in and then let other companies build plugin platforms to take care of the MCP nonsense.

"What is MCP, what does it bring to the table? Who knows. What does it do? The LLM stuff! Pay us $10 a month thanks!"

LLMs have function/tool calling built into them. No major model has any direct knowledge of MCP.

Not only do you not need MCP, but you should actively avoid using it.

Stick with tried and proven API standards that are actually observable and secure and let your models/agents directly interact with those API endpoints.

replies(8): >>45948748 #>>45949815 #>>45950303 #>>45950716 #>>45950817 #>>45951274 #>>45951510 #>>45951951 #
1. didibus ◴[] No.45950716[source]
> MCP was a really shitty attempt at building a plugin framework

Can you go more in depth? The protocol is relatively simple; what about it do you feel is "shitty" as a plugin framework?

replies(1): >>45950814 #
2. paulddraper ◴[] No.45950814[source]
The hate for MCP here is absurd.

It's JSON-RPC, with some descriptors.

And some comments about OAuth 2.

The value is in the consensus. You can make a tool that agents can connect to with no a priori knowledge.
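To illustrate that "no a priori knowledge" point: per the MCP spec, a server describes its tools in a tools/list JSON-RPC response that carries a name, a description, and a JSON Schema for the inputs. A rough sketch (the get_weather tool is illustrative, not from any real server):

```python
import json

# Sketch of a tools/list response: a JSON-RPC 2.0 result whose "tools" array
# is self-describing enough for an agent to construct a valid call from the
# schema alone, with no prior knowledge of this server.
tools_list_response = json.dumps({
    "jsonrpc": "2.0",
    "id": 1,
    "result": {
        "tools": [{
            "name": "get_weather",  # illustrative tool, not a real server's
            "description": "Fetch current weather for a city",
            "inputSchema": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        }]
    },
})

# An agent parses the descriptor and learns the callable surface at runtime.
parsed = json.loads(tools_list_response)
tool = parsed["result"]["tools"][0]
print(tool["name"], list(tool["inputSchema"]["properties"]))
```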

replies(3): >>45950849 #>>45953015 #>>45954572 #
3. whoknowsidont ◴[] No.45950849[source]
>It's JSON-RPC, with some descriptors.

That's not even true. It defines the lifecycle of tool calling.

JSON-RPC with some descriptors would have been fine and amazing.
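To be concrete about that lifecycle: the spec requires an initialize handshake and an initialized notification before discovery and tool calls. A minimal sketch of the client-side message sequence (transport elided; the version string is illustrative):

```python
# Sketch of the MCP session lifecycle as the spec lays it out:
# initialize -> initialized notification -> discovery -> tool call.
# Messages are just ordered JSON-RPC 2.0 payloads; the transport is elided.

def lifecycle_messages():
    yield {"jsonrpc": "2.0", "id": 0, "method": "initialize",
           "params": {"protocolVersion": "2025-03-26",  # illustrative version
                      "capabilities": {},
                      "clientInfo": {"name": "demo", "version": "0"}}}
    # Notifications carry no "id": no response is expected.
    yield {"jsonrpc": "2.0", "method": "notifications/initialized"}
    yield {"jsonrpc": "2.0", "id": 1, "method": "tools/list"}
    yield {"jsonrpc": "2.0", "id": 2, "method": "tools/call",
           "params": {"name": "get_weather", "arguments": {"city": "Lisbon"}}}

msgs = list(lifecycle_messages())
order = [m["method"] for m in msgs]
print(order)
```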

replies(2): >>45952839 #>>45954388 #
4. frumplestlatz ◴[] No.45952839{3}[source]
I’m struggling to understand where you’re coming from. Your hate for MCP seems grossly outsized relative to what it actually is.
5. manbitesdog ◴[] No.45953015[source]
Things like OpenAPI have existed for 15 years now, and they also offer standardization.

The value of MCP is not in its features or innovation, but in the rate of adoption it has had. Companies now have an incentive to open, document, and standardize their APIs to enable this new distribution channel.

6. paulddraper ◴[] No.45954388{3}[source]
This comment is unhinged.

https://modelcontextprotocol.info/docs/concepts/transports/

replies(1): >>45955076 #
7. rcarmo ◴[] No.45954572[source]
Actually, MCP wastes a lot of tokens when compared to regular tool calling. You might not notice it on more trendy models with large contexts, but for those of us trying to use locked down/local/cheap models it makes very little sense.

Also, MCP creates a new problem: providing the model with too much context when trying to combine tools across multiple servers. It works OK with small, very focused servers (like helpers for a specific data set), but if you try to mix and match servers things get out of hand really quickly and the entire workflow becomes very unreliable—too many options to digest and pursue, just like humans.
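To make the overhead concrete, here's a back-of-the-envelope sketch; the schema sizes and the ~4-characters-per-token ratio are assumptions, not measurements:

```python
import json

# Rough cost model: every connected server's tool schemas ride along in the
# prompt on each turn. Sizes and chars-per-token are illustrative assumptions.
CHARS_PER_TOKEN = 4

def schema_tokens(tool: dict) -> int:
    return len(json.dumps(tool)) // CHARS_PER_TOKEN

# One small, focused server with a handful of tools.
small_server = [{"name": f"dataset_query_{i}",
                 "description": "Query one specific dataset by key",
                 "inputSchema": {"type": "object",
                                 "properties": {"key": {"type": "string"}}}}
                for i in range(5)]

# Mixing and matching: 20 servers' worth of tool definitions, every turn.
many_servers = small_server * 20

per_turn_small = sum(schema_tokens(t) for t in small_server)
per_turn_many = sum(schema_tokens(t) for t in many_servers)
print(per_turn_small, per_turn_many)  # overhead scales linearly with tool count
```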

replies(1): >>45955557 #
8. whoknowsidont ◴[] No.45955076{4}[source]
You linked to one part of the spec and just wanted to ignore everything else? That's fine, but then you wouldn't be obeying the standard and wouldn't be an "MCP."

So, are you agreeing with me?

Respectfully I think I've engaged with you before and you just seem generally confused about nuanced concepts.

replies(1): >>45955624 #
9. didibus ◴[] No.45955557{3}[source]
Is that just bad implementation? Where are the wasted tokens?

I've noticed your second issue, but to me it's just bad implementation. For some reason people keep exposing generic, overlapping tools from multiple MCP servers.

I don't know that MCP causes this issue; any vendor offering a "tools API" who shoves too many APIs into it would bloat things up just the same.

replies(1): >>45957243 #
10. didibus ◴[] No.45955624{5}[source]
I'm open-mindedly hoping you'll be more specific about what you find problematic in the protocol.

The base protocol is just JSON-RPC, and then you have to implement initialize. Everything else is optional.

11. cstrahan ◴[] No.45957243{4}[source]
> Is that just bad implementation? Where are the wasted tokens?

How wouldn't it be wasteful?

I'll try to summarize a couple sources:

https://www.anthropic.com/engineering/code-execution-with-mc...

https://blog.cloudflare.com/code-mode/

Here's what Anthropic has to say about it: As MCP usage scales, there are two common patterns that can increase agent cost and latency:

    Tool definitions overload the context window;
    Intermediate tool results consume additional tokens.
    
    [...]
    
    Tool descriptions occupy more context window space, increasing response time and costs. In cases where agents are connected to thousands of tools, they’ll need to process hundreds of thousands of tokens before reading a request.
    
    [...]
    
    Most MCP clients allow models to directly call MCP tools. For example, you might ask your agent: "Download my meeting transcript from Google Drive and attach it to the Salesforce lead."
    
    The model will make calls like:
    
      TOOL CALL: gdrive.getDocument(documentId: "abc123")
              → returns "Discussed Q4 goals...\n[full transcript text]"
                 (loaded into model context)
      
      TOOL CALL: salesforce.updateRecord(
         objectType: "SalesMeeting",
         recordId: "00Q5f000001abcXYZ",
         data: { "Notes": "Discussed Q4 goals...\n[full transcript text written out]" }
      )
      (model needs to write entire transcript into context again)
    
    Every intermediate result must pass through the model. In this example, the full call transcript flows through twice. For a 2-hour sales meeting, that could mean processing an additional 50,000 tokens. Even larger documents may exceed context window limits, breaking the workflow.
    
    With large documents or complex data structures, models may be more likely to make mistakes when copying data between tool calls.

Now, if you were instead to have the LLM write code, that code can perform whatever filtering/aggregation/transformation it needs without round-tripping from LLM to tool(s) and back, and the only tokens consumed are those of the final result.

What happens with MCP? All of the text of each MCP call is flooded into the context, only for the LLM to have to make sense of what it just read. It then either regurgitates that into a file to post-process (very likely with differences/"hallucinations" slipped in), or, in the usual case (personifying the LLM here for rhetorical purposes), it simply tries to reason about what it read to give you the filtered/aggregated/transformed result you're looking for, again very likely with mistakes made.
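As a sketch of the code-execution pattern from the quoted Anthropic example (gdrive_get_document and salesforce_update_record are hypothetical stand-ins): the transcript flows between the two calls inside the script, and only a short confirmation ever needs to reach the model's context.

```python
# Hypothetical stand-ins for the two services in the quoted example. The
# point: the large transcript moves between them inside the script, never
# through the model's context.

def gdrive_get_document(document_id: str) -> str:
    # stand-in for a large fetched document (~50k tokens in the example)
    return "Discussed Q4 goals...\n" + "transcript line\n" * 5000

def salesforce_update_record(record_id: str, data: dict) -> str:
    return f"updated {record_id} ({len(data['Notes'])} chars attached)"

def run() -> str:
    transcript = gdrive_get_document("abc123")  # never enters model context
    return salesforce_update_record("00Q5f000001abcXYZ", {"Notes": transcript})

result = run()
print(result)  # only this short string would be fed back to the LLM
```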
replies(3): >>45959894 #>>45960427 #>>45970909 #
12. didibus ◴[] No.45959894{5}[source]
But none of the criticisms here is specific to MCP; they apply to tool calls in general. It wouldn't matter whether the agent used a custom tool protocol, plain OpenAPI, etc. These issues would still exist.
replies(1): >>45977799 #
13. anon84873628 ◴[] No.45960427{5}[source]
Sounds like what we need is for the MCP client to expose the tools as libraries for the agent's code interpreter. Then it can write code to wire them together without flowing through context.

We still get the benefits of standardization, higher level RPC endpoints, and vendor-supplied instructions.
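A rough sketch of that idea, with call_tool standing in for a real MCP client transport: the client turns each tool descriptor into an ordinary callable, so generated code can wire tools together locally.

```python
# Sketch: expose an MCP tool descriptor as an ordinary Python callable so
# the agent's generated code can chain tools without intermediate results
# passing through the model. `call_tool` is a stand-in for a real client.

def call_tool(name: str, arguments: dict) -> str:
    # a real client would send a tools/call JSON-RPC request here
    return f"<result of {name}({arguments})>"

def make_wrapper(descriptor: dict):
    params = set(descriptor["inputSchema"]["properties"])
    def wrapper(**kwargs):
        unknown = set(kwargs) - params
        if unknown:  # validate against the tool's schema-declared inputs
            raise TypeError(f"unexpected arguments: {unknown}")
        return call_tool(descriptor["name"], kwargs)
    wrapper.__name__ = descriptor["name"]
    wrapper.__doc__ = descriptor.get("description", "")
    return wrapper

get_doc = make_wrapper({"name": "gdrive_getDocument",  # illustrative tool
                        "description": "Fetch a document by id",
                        "inputSchema": {"type": "object",
                                        "properties": {"documentId": {"type": "string"}}}})
print(get_doc(documentId="abc123"))
```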

14. paulddraper ◴[] No.45970909{5}[source]
> Now, if you were to instead have the LLM write code

Okay....now what function calls or libraries should the API use to write that code?

Anthropic article describes a good approach, combining code gen with MCP.

replies(1): >>45977613 #
15. cstrahan ◴[] No.45977613{6}[source]
> Okay....now what function calls or libraries should the API use to write that code?

Whatever the most popular library is which provides a client to whatever "thing" (locally running process, or service, or whatever) you are trying to interact with via LLM.

I have had LLMs generate probably tens of thousands of lines of code, all without providing an MCP. LLMs can do that; they are trained on code.

What if there isn't a library available? Well, sure, then you could go implement an MCP server... or you could just write a library. It's practically the same effort.

Not that I'm 100% against MCP servers. But I think there's a misconception spreading across the software community that an MCP server is always inherently the optimal, obvious solution; all I'm suggesting is that people actually think about the problem instead of outsourcing that responsibility via a "no one ever got fired for buying IBM" style appeal to the bandwagon.

16. cstrahan ◴[] No.45977799{6}[source]
> These issues would still exist.

I just explained why these issues don't apply to the LLM writing and invoking code: code can apply successive transformations to the input without feeding the intermediate results into the LLM's context. That code can read a file that would weigh in at 50,000 tokens, chain 100 functions together, and produce a line that would be 20 tokens, and the LLM will only see that final 20-token result. That really is only 20 tokens for the entire result: the LLM never sees the 50,000 tokens from the file the program read, nor the tens of thousands of tokens of intermediate results between the successive transformations of the 100 functions.

With MCP, there's no way for the LLM to invoke one tool call that expresses "compose/pipeline these 100 tools, please, and just give me the final result" -- the LLM must make 100 individual tool calls, manually threading the results through each tool call, which represents 100 opportunities for the LLM to make a mistake.
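The 100-function chain is easy to sketch; the sizes here are stand-ins (a ~50,000-token file is roughly 200,000 characters):

```python
from functools import reduce

# Sketch of the claim above: 100 chained transformations run inside one
# script, so no intermediate value ever appears in the model's context.

big_input = "x" * 200_000  # stand-in for a ~50k-token file

# 100 identical "halve the data" steps, floored at a 20-char final result.
steps = [lambda s: s[: max(20, len(s) // 2)]] * 100

final = reduce(lambda acc, f: f(acc), steps, big_input)
print(len(final))  # only this 20-char final value is surfaced to the LLM
```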

It sounds like you are disagreeing with what I am saying, but it doesn't look like you're giving any reason why you disagree, so I'm a bit confused.

replies(1): >>45983657 #
17. didibus ◴[] No.45983657{7}[source]
I'm not exactly disagreeing; it's just that I don't think that's a criticism of MCP for tool calls.

You need a way to expose tools to the LLM, even for what you're talking about. The LLM can't write code without access to code-writing tools.

MCP is a protocol for exposing tools to an LLM agent in an extensible way.

If you didn't have MCP for this, the alternative wouldn't be a python script. It would be some other tool calling protocol.

Maybe a good criticism is that the protocol doesn't have a defined mechanism for piping tool calls together. It'd be nice to define, as part of the protocol, a standard way for the model to say: call tool1, then pass output.foo as input.bar of tool2.

I feel that can probably be an eventual extension.
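Such a piping mechanism might look something like this; the tools/pipeline method and the $ref output-reference syntax are pure invention, nothing like them exists in the MCP spec today:

```python
import json

# Pure invention: a hypothetical "tools/pipeline" request in which one call's
# output feeds the next call's input server-side, so the intermediate value
# never transits the model. Not part of the MCP spec.
pipeline_request = {
    "jsonrpc": "2.0",
    "id": 7,
    "method": "tools/pipeline",
    "params": {
        "calls": [
            {"name": "gdrive_getDocument",
             "arguments": {"documentId": "abc123"},
             "as": "doc"},  # bind this call's output to the name "doc"
            {"name": "salesforce_updateRecord",
             "arguments": {"recordId": "00Q5f000001abcXYZ",
                           # reference the previous output instead of inlining it
                           "data": {"Notes": {"$ref": "doc.text"}}}},
        ]
    },
}
print(json.dumps(pipeline_request)[:60])
```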

As an advanced user of coding agents that you run locally or have sandboxed yourself, you might get more power and mileage by having the agent implement and execute scripts, or use bash/curl and so on, to access, say, the standard Google Drive APIs for whatever you want to do, than by using the Google Drive MCP. I 100% agree. But that's not adequate for all users, and it's not really an alternative to supporting custom tools in an agent.