Claude 3.7 Sonnet and Claude Code

(www.anthropic.com)

bcherny ◴[24 Feb 25 19:04 UTC] No.43163488[source]▶

>>43163011 (OP) #

Hi everyone! Boris from the Claude Code team here. @eschluntz, @catherinewu, @wolffiex, @bdr and I will be around for the next hour or so and we'll do our best to answer your questions about the product.

replies(82): >>43163527 #>>43163532 #>>43163549 #>>43163554 #>>43163555 #>>43163576 #>>43163585 #>>43163588 #>>43163589 #>>43163592 #>>43163593 #>>43163632 #>>43163642 #>>43163664 #>>43163677 #>>43163733 #>>43163758 #>>43163789 #>>43163803 #>>43163813 #>>43163821 #>>43163893 #>>43163909 #>>43163915 #>>43163921 #>>43163957 #>>43163958 #>>43163992 #>>43164069 #>>43164089 #>>43164102 #>>43164103 #>>43164104 #>>43164111 #>>43164127 #>>43164158 #>>43164329 #>>43164353 #>>43164424 #>>43164482 #>>43164514 #>>43164585 #>>43164616 #>>43164768 #>>43164797 #>>43164819 #>>43164899 #>>43165002 #>>43165057 #>>43165065 #>>43165088 #>>43165091 #>>43165187 #>>43165308 #>>43165355 #>>43165409 #>>43165468 #>>43165499 #>>43165516 #>>43165570 #>>43165578 #>>43165592 #>>43165836 #>>43165884 #>>43165965 #>>43165976 #>>43165995 #>>43166183 #>>43166711 #>>43166748 #>>43167130 #>>43167804 #>>43168626 #>>43168836 #>>43169047 #>>43169107 #>>43169119 #>>43169294 #>>43169310 #>>43173097 #>>43174353 #>>43192161 #

pookieinc ◴[24 Feb 25 19:09 UTC] No.43163554[source]▶

>>43163488 #

The biggest complaint I (and several others) have is that we continuously hit the limit via the UI after even just a few intensive queries. Of course, we can use the console API, but then we lose the ability to have things like Projects, etc.

Do you foresee these limitations increasing anytime soon?

Quick Edit: Just wanted to also say thank you for all your hard work, Claude has been phenomenal.

replies(4): >>43163771 #>>43163889 #>>43164021 #>>43167940 #

1. punkpeye ◴[24 Feb 25 19:45 UTC] No.43164021[source]▶

>>43163554 #

If you are open to alternatives, try https://glama.ai/gateway

We currently serve ~10bn tokens per day (across all models). OpenAI compatible API. No rate limits. Built in logging and tracing.

I work with LLMs every day, so I am always on top of adding models. 3.7 is also already available.

https://glama.ai/models/claude-3-7-sonnet-20250219

The gateway is integrated directly into our chat (https://glama.ai/chat). So you can use most of the things that you are used to having with Claude. And if anything is missing, just let me know and I will prioritize it. If you check our Discord, I have a decent track record of being receptive to feedback and quickly turning around features.

Long term, Glama's focus is predominantly on MCPs, but chat, gateway and LLM routing are integral to the greater vision.

I would love feedback if you are going to give it a try: frank@glama.ai

replies(5): >>43164075 #>>43164764 #>>43167057 #>>43173593 #>>43174149 #

2. airstrike ◴[24 Feb 25 19:49 UTC] No.43164075[source]▶

>>43164021 (TP) #

The issue isn't API limits, but web UI limits. We can always get around the web interface's limits by using the Claude API directly, but then you need to have some other interface...

replies(1): >>43164415 #

3. punkpeye ◴[24 Feb 25 20:15 UTC] No.43164415[source]▶

>>43164075 #

The API still has limits. Even if you are on the highest tier, you will quickly run into those limits when using coding assistants.

The value proposition of Glama is that it combines UI and API.

While everyone focuses on either one or the other, I've been splitting my time equally working on both.

Glama UI would not win against Anthropic if we were to compare them by the number of features. However, the components that I developed were created with craft and love.

You have access to:

* Switch models between OpenAI/Anthropic, etc.

* Side-by-side conversations

* Full-text search of all your conversations

* Integration of LaTeX, Mermaid, rich-text editing

* Vision (uploading images)

* Response personalizations

* MCP

* Every action has a shortcut via cmd+k (ctrl+k)

replies(3): >>43164969 #>>43165930 #>>43166283 #

4. cmdtab ◴[24 Feb 25 20:47 UTC] No.43164764[source]▶

>>43164021 (TP) #

Do you have deepseek r1 support? I need it for a current product I’m working on.

replies(2): >>43165021 #>>43165209 #

5. airstrike ◴[24 Feb 25 21:10 UTC] No.43164969{3}[source]▶

>>43164415 #

Ok, but that's not the issue the parent was mentioning. I've never hit API limits but, like the original comment mentioned, I too constantly hit the web interface limits particularly when discussing relatively large modules.

replies(1): >>43165299 #

6. pclmulqdq ◴[24 Feb 25 21:16 UTC] No.43165021[source]▶

>>43164764 #

They are just selling a frontend wrapper on other people's services, so if someone else offers deepseek, I'm sure they will integrate it.

7. punkpeye ◴[24 Feb 25 21:35 UTC] No.43165209[source]▶

>>43164764 #

Indeed we do https://glama.ai/models/deepseek-r1

It is provided by DeepSeek and Avian.

I am also midway through enabling a third provider (Nebius).

You can see all models/providers over at https://glama.ai/models

As another commenter in this thread said, we are just a 'frontend wrapper' around other people's services. Therefore, it is not particularly difficult to add models that are already supported by other providers.

The benefit of using our wrapper is that you can use a single API key and you get one bill for all your AI usage. You don't need to hack together your own logic for routing requests between different providers, handling failovers, or keeping track of costs, and you don't need to worry about what happens if a provider goes down.

The market at the moment is hugely fragmented, with many providers unstable, constantly shifting prices, etc. The benefit of a router is that you don't need to worry about those things.

replies(1): >>43165480 #

8. glenstein ◴[24 Feb 25 21:42 UTC] No.43165299{4}[source]▶

>>43164969 #

Right, that's how I read it also. It's not that there's no limits with the API, but that they're appreciably different.

9. cmdtab ◴[24 Feb 25 22:05 UTC] No.43165480{3}[source]▶

>>43165209 #

Yeah, I am aware. I use OpenRouter at the moment, but I find it lacks a good UX.

replies(1): >>43165659 #

10. punkpeye ◴[24 Feb 25 22:25 UTC] No.43165659{4}[source]▶

>>43165480 #

OpenRouter is great.

They have a very solid infrastructure.

Scaling infrastructure to handle billions of tokens is no joke.

I believe they are approaching 1 trillion tokens per week.

Glama is way smaller. We only recently crossed 10bn tokens per day.

However, I have invested a lot more into the UX/UI of the chat itself, i.e. while OpenRouter is entirely focused on the API gateway (which is working for them), I am going for a hybrid approach.

The market is big enough for both projects to co-exist.

11. Aeolun ◴[24 Feb 25 22:56 UTC] No.43165930{3}[source]▶

>>43164415 #

> Even if you are on the highest tier, you will quickly run into those limits when using coding assistants.

Even heavy coding sessions never run into Claude limits, and I’m nowhere near the highest tier.

replies(1): >>43167484 #

12. m_kos ◴[24 Feb 25 23:41 UTC] No.43166283{3}[source]▶

>>43164415 #

Your chat idea is a little similar to Abacus AI. I wish you had a similarly affordable monthly plan for chat only, but your UI seems much better. I may give it a try!

13. thrdbndndn ◴[25 Feb 25 01:37 UTC] No.43167057[source]▶

>>43164021 (TP) #

Just tried it, is there a reason why the webUI is so slow?

Try to delete (close) the panel on the right on a side-by-side view. It took a good second to actually close. Creating one isn't much faster.

This is unbearably slow, to be blunt.

14. smokeydoe ◴[25 Feb 25 02:44 UTC] No.43167484{4}[source]▶

>>43165930 #

I think it’s based on the tools you’re using. If I’m using Cline I don't have to try very hard to hit limits. I’m on the second tier.

15. tesch1 ◴[25 Feb 25 16:01 UTC] No.43173593[source]▶

>>43164021 (TP) #

Who is glama.ai though? I could not find company info on the site, and the "Frank" name on the blog posts seems to be an alias for Popeye the sailor. Am I missing something there? How can a user vet the company?

16. Daniel_Van_Zant ◴[25 Feb 25 16:40 UTC] No.43174149[source]▶

>>43164021 (TP) #

I see Cohere, is there any support for in-line citations like you can get with their first party API?
