RBS, Ruby’s new type signature language

(developer.squareup.com)

511 points mootrichard | 3 comments | 29 Jul 20 18:04 UTC | HN request time: 0.625s | source

Show context

muglug ◴[29 Jul 20 19:15 UTC] No.23990520[source]▶

Can someone explain why the types cannot live in Ruby code itself (after an appropriate version bump)?

Python 3 incorporated types into the language itself, in a similar way (though non-reified) to PHP. This seems much easier to deal with than requiring two files (.rb and .rbs) to describe a single data structure.

replies(4): >>23990918 #>>23990947 #>>23991201 #>>23991455 #

burke ◴[29 Jul 20 19:59 UTC] No.23990918[source]▶

>>23990520 #

Because Matz won't let people add type annotations to the ruby grammar.

replies(1): >>23991128 #

jakearmitage ◴[29 Jul 20 20:19 UTC] No.23991128[source]▶

>>23990918 #

Which is great.

replies(2): >>23991239 #>>23991260 #

zeptonix ◴[29 Jul 20 20:31 UTC] No.23991239[source]▶

>>23991128 #

Yeah, it is.

I'm having a really hard time understanding this "I need types forced down my throat" and "I like typing 3x as much as I would otherwise need to" and "yes, I want half my screen obscured by the types of everything I'm doing, not the actual code" and the "adding types now means bugs are impossible" mass cult hysteria that's running so rampant. Typing very occasionally prevents bugs that are generally easy to catch/fix or show up straight away when running an app. It's mostly a documentation system. And it slows development down.

Especially in Ruby which is such an elegant "programmer's language" I think it would just be silly.

replies(7): >>23991394 #>>23991403 #>>23991487 #>>23991507 #>>23991525 #>>23991716 #>>23992038 #

mumblemumble ◴[29 Jul 20 20:58 UTC] No.23991525[source]▶

>>23991239 #

> And it slows development down.

This can depend really heavily on what you mean by "development." If it's just getting the first version banged out, sure. If it includes coming back to code a couple years later in order to incorporate a new business requirement, having that documentation present can be a really big deal. 2 seconds spent typing out a type hint now might, down the line, save several minutes on average. Even in a recent Python project I did over the course of just a couple weeks, when I got to the "clean the code up and get it ready to put on the shelf for now" phase of the project, I ended up wishing that I had bothered to use type hints just a wee bit more when I was banging it out in the first place. It would have been a net time saver.

I don't like static typing super a lot in all cases because it makes it hard to do data-level programming. Which I find to be the true productivity booster in dynamic languages. But optional typing seems to hit the sweet spot for a great many purposes.

replies(1): >>23991556 #

jolux ◴[29 Jul 20 21:01 UTC] No.23991556[source]▶

>>23991525 #

I'm curious what you mean by "data-level programming."

replies(2): >>23991731 #>>23992742 #

mumblemumble ◴[29 Jul 20 21:21 UTC] No.23991731[source]▶

>>23991556 #

Leaving the data in basic data formats.

For example, JSON describes a logical structure of nested lists and dictionaries. If you were doing data-level programming, you would just map the JSON into actual nested lists of dictionaries and get on about your business.

The alternative, which is more common in static languages like Java, is to transform it all into some set of domain model objects, and probably validate it up-front, too. Even the bits you don't actually need to look at in order to accomplish the job at hand. IMO, that approach tends to mean creating a lot of unnecessary work for oneself. It also makes it harder to obey Postel's law.

(The corollary to that last bit is that it is also possible for static typing to create bugs.)

replies(3): >>23991843 #>>23992160 #>>24007508 #

dropofwill ◴[29 Jul 20 22:13 UTC] No.23992160[source]▶

>>23991731 #

I'm skeptical of Postel's law, if you deviate from the spec how can the meaning be clear? It seems to me like you would have to go out of your way to implement a buggy version of the spec?

A personal example of this was Httpd used to accept standard headers with spaces instead of dashes, this leads to strange behavior if you accidentally include both. So they decided to stop doing that in a major version. This major version was opaquely included by ops accidentally into our base images. This lead to a very long day of debugging on our end.

Point is, by being liberal with what you accept you create ambiguity, which you may not totally understand at the time. By putting that out into the wild you basically are forced to keep this ambiguous, undocumented spec alive or you no doubt will end up breaking some client.

replies(1): >>23992303 #

mumblemumble ◴[29 Jul 20 22:30 UTC] No.23992303[source]▶

>>23992160 #

That's definitely a concern, but it's also way outside of what I was talking about. I would also expect any JSON parser, even one in a dynamic language, to fail on JSON that is straight-up malformed. And ambiguous formats are always bad news.

I'm talking about situations where the JSON is formatted fine, it's just that some field wasn't specified, so then the entire input gets rejected. Even though there was zero need to read the contents of that field in the first place. It just happened to be included in some domain object that gets re-used everywhere, including some other places where the field's contents do matter.

Keep in mind that, when we're dealing with anything that might be transmitted in JSON, thinking that there might be a published spec, and that it manages to accurately cover all these details, is really optimistic. I've honestly never seen it happen in the wild. Oftentimes, any validation rules you might try to impose are guesswork as much as they are anything else. So complaining that a piece of data didn't conform to the spec might not even be a valid thing to do. All you can say for sure is that the data didn't meet the needs of some piece of business logic.

It's not perfect, but it's life. This tension, for example, is at the heart of why proto2 got replaced with proto3, and why using proto3 is strongly encouraged if you're looking to build a robust infrastructure.

replies(1): >>23992613 #

jolux ◴[29 Jul 20 23:16 UTC] No.23992613[source]▶

>>23992303 #

There are huge debates at Google internally over required vs optional in proto2 and proto3.

Beyond that I think you’re operating from a misconception about JSON parsing in static languages. There’s no requirement to convert to domain objects and reject data that doesn’t fit on a triviality, you’re just required to specify explicitly what happens when you encounter unexpected structure or data.

replies(1): >>23992877 #

mumblemumble ◴[29 Jul 20 23:58 UTC] No.23992877[source]▶

>>23992613 #

Sorry if I wasn't being clear. I'm not saying that's the only way it can work in static languages. I'm saying that that's the way it tends to work out in practice, because the ergonomics of most popular static languages tend to discourage a less brittle approach.

Whereas the ergonomics of popular dynamic languages tend to favor an approach that I find, for this specific purpose, to be both less verbose and more robust.

replies(2): >>23993018 #>>24003850 #

1. jolux ◴[30 Jul 20 00:19 UTC] No.23993018[source]▶

>>23992877 #

What is the difference in approach between these? I've programmed extensively in dynamic and static languages, and don't understand what you're talking about. Less verbose, I might concede. More robust though, I need some more evidence.

replies(1): >>23995129 #

2. jbritton ◴[30 Jul 20 06:00 UTC] No.23995129[source]▶

>>23993018 (TP) #

Reminds me of Rich Hickey’s “Maybe Not” speech, which I understand him suggesting that programming with “sets” is better than programming with “records” that may contain optional values.

replies(1): >>24003776 #

3. jolux ◴[30 Jul 20 23:04 UTC] No.24003776[source]▶

>>23995129 #

Yes, I know it and he seems to mostly ignore the fact that you can still fall back to manual typechecking in a statically typed language. That’s the part I don’t get. There’s nothing stopping you from manipulating JSON structurally in a static language.

↑