Parse, don't validate (2019)

(lexi-lambda.github.io)

Show context

bruce343434 ◴[07 Mar 23 10:54 UTC] No.35053912[source]▶

Note that this basically requires your language to have ergonomic support for sum types, immutable "data classes", pattern matching.

The point is to parse the input into a structure which always upholds the predicates you care about so you don't end up continuously defensively programming in ifs and asserts.

replies(12): >>35054046 #>>35054070 #>>35054386 #>>35054514 #>>35054901 #>>35054993 #>>35055124 #>>35055230 #>>35056047 #>>35057866 #>>35058185 #>>35059271 #

jim-jim-jim ◴[07 Mar 23 11:15 UTC] No.35054070[source]▶

>>35053912 #

I think we'll eventually come to regard `if` as we do `goto`.

replies(4): >>35054298 #>>35054351 #>>35054456 #>>35054814 #

1. raincole ◴[07 Mar 23 12:11 UTC] No.35054456[source]▶

>>35054070 #

I don't know about this. We all have seen this kind of code:

    if(!needToDoTheThing()) return;
    
    DoTheThing();

We could have written it this way:

    if(needToDoTheThing()) {
        DoTheThing();
    }
    else {
        return;
    }

The later is closer to how pattern match looks like. But in my experience, the majority of programmers prefer early return. I regularly see people "refactor" if-else to if-early-return, but I've never seen the opposite.

replies(4): >>35054651 #>>35054833 #>>35065147 #>>35065313 #

2. RHSeeger ◴[07 Mar 23 12:39 UTC] No.35054651[source]▶

>>35054456 (TP) #

I prefer the former. It separates the pre-conditions from the algorithm/logic, using gate clauses. I find this makes it easier to reason about the algorithm.

replies(1): >>35055047 #

3. pjc50 ◴[07 Mar 23 13:01 UTC] No.35054833[source]▶

>>35054456 (TP) #

It keeps the code closer to the left. It also keeps it conceptually simpler if you can discard a bunch of "obvious" cases early on.

replies(1): >>35055215 #

4. Timon3 ◴[07 Mar 23 13:26 UTC] No.35055047[source]▶

>>35054651 #

It's much nicer, especially since it keeps the complexity down.

If you nest if/else, you'll quickly approach a point where you have to keep a complex logic tree in your head to determine which states the system could be in inside of any given branch. If you use guard clauses and return early, you'll keep this complexity down to a minimum, since the list of possible states changes linearly with your code instead of exponentially.

I know not everybody likes it, but I think this makes cyclomatic complexity an extremely valuable metric for measuring "ease-of-reading".

5. jakelazaroff ◴[07 Mar 23 13:43 UTC] No.35055215[source]▶

>>35054833 #

Yup, this is my exact rationale for preferring this too. Branches are a significant source of complexity and early returns are one way to tame it — have the “meat” of the function deal with as few invariants as possible.

6. quickthrower2 ◴[08 Mar 23 04:10 UTC] No.35065147[source]▶

>>35054456 (TP) #

The second looks a lot more elegant in Haskell though. Funny how syntax and influence choice of semantics!

7. ParetoOptimal ◴[08 Mar 23 04:38 UTC] No.35065313[source]▶

>>35054456 (TP) #

I prefer using early return in monads with guard like:

    safeDiv :: (Monad m, Alternative m) => Int -> Int -> m Int
    safeDiv x y = do
      guard (y /= 0)
      pure (x `div` y)

    main :: IO ()
    main = do
      print $ safeDiv @Maybe 1 0
      print $ safeDiv @[] 1 0
      -- print =<< safeDiv @IO 1 0 -- guard throws an error in IO

Try it out at https://play.haskell.org/saved/a6VsE3uQ

↑