Most active commenters

naasking(12)
rjbwork(3)
mike_hearn(3)

The world could run on older hardware if software optimization was a priority

(twitter.com)

Show context

caseyy ◴[13 May 25 12:56 UTC] No.43972418[source]▶

There is an argument to be made that the market buys bug-filled, inefficient software about as well as it buys pristine software. And one of them is the cheapest software you could make.

It's similar to the "Market for Lemons" story. In short, the market sells as if all goods were high-quality but underhandedly reduces the quality to reduce marginal costs. The buyer cannot differentiate between high and low-quality goods before buying, so the demand for high and low-quality goods is artificially even. The cause is asymmetric information.

This is already true and will become increasingly more true for AI. The user cannot differentiate between sophisticated machine learning applications and a washing machine spin cycle calling itself AI. The AI label itself commands a price premium. The user overpays significantly for a washing machine[0].

It's fundamentally the same thing when a buyer overpays for crap software, thinking it's designed and written by technologists and experts. But IC1-3s write 99% of software, and the 1 QA guy in 99% of tech companies is the sole measure to improve quality beyond "meets acceptance criteria". Occasionally, a flock of interns will perform an "LGTM" incantation in hopes of improving the software, but even that is rarely done.

[0] https://www.lg.com/uk/lg-experience/inspiration/lg-ai-wash-e...

replies(27): >>43972654 #>>43972713 #>>43972732 #>>43973044 #>>43973105 #>>43973120 #>>43973128 #>>43973198 #>>43973257 #>>43973418 #>>43973432 #>>43973703 #>>43973853 #>>43974031 #>>43974052 #>>43974503 #>>43975121 #>>43975380 #>>43976615 #>>43976692 #>>43979081 #>>43980549 #>>43982939 #>>43984708 #>>43986570 #>>43995397 #>>43998494 #

dahart ◴[13 May 25 14:34 UTC] No.43973432[source]▶

>>43972418 #

The dumbest and most obvious of realizations finally dawned on me after trying to build a software startup that was based on quality differentiation. We were sure that a better product would win people over and lead to viral success. It didn’t. Things grew, but so slowly that we ran out of money after a few years before reaching break even.

What I realized is that lower costs, and therefore lower quality, are a competitive advantage in a competitive market. Duh. I’m sure I knew and said that in college and for years before my own startup attempt, but this time I really felt it in my bones. It suddenly made me realize exactly why everything in the market is mediocre, and why high quality things always get worse when they get more popular. Pressure to reduce costs grows with the scale of a product. Duh. People want cheap, so if you sell something people want, someone will make it for less by cutting “costs” (quality). Duh. What companies do is pay the minimum they need in order to stay alive & profitable. I don’t mean it never happens, sometimes people get excited and spend for short bursts, young companies often try to make high quality stuff, but eventually there will be an inevitable slide toward minimal spending.

There’s probably another name for this, it’s not quite the Market for Lemons idea. I don’t think this leads to market collapse, I think it just leads to stable mediocrity everywhere, and that’s what we have.

replies(36): >>43973826 #>>43974086 #>>43974427 #>>43974658 #>>43975070 #>>43975211 #>>43975222 #>>43975294 #>>43975564 #>>43975730 #>>43976403 #>>43976446 #>>43976469 #>>43976551 #>>43976628 #>>43976708 #>>43976757 #>>43976758 #>>43977001 #>>43977618 #>>43977824 #>>43978077 #>>43978446 #>>43978599 #>>43978709 #>>43978867 #>>43979353 #>>43979364 #>>43979714 #>>43979843 #>>43980458 #>>43981165 #>>43981846 #>>43982145 #>>43983217 #>>44022403 #

1. naasking ◴[13 May 25 16:05 UTC] No.43974427[source]▶

>>43973432 #

> What I realized is that lower costs, and therefore lower quality,

This implication is the big question mark. It's often true but it's not at all clear that it's necessarily true. Choosing better languages, frameworks, tools and so on can all help with lowering costs without necessarily lowering quality. I don't think we're anywhere near the bottom of the cost barrel either.

I think the problem is focusing on improving the quality of the end products directly when the quality of the end product for a given cost is downstream of the quality of our tools. We need much better tools.

For instance, why are our languages still obsessed with manipulating pointers and references as a primary mode of operation, just so we can program yet another linked list? Why can't you declare something as a "Set with O(1) insert" and the language or its runtime chooses an implementation? Why isn't direct relational programming more common? I'm not talking programming in verbose SQL, but something more modern with type inference and proper composition, more like LINQ, eg. why can't I do:

    let usEmployees = from x in Employees where x.Country == "US";

    func byFemale(Query<Employees> q) =>
      from x in q where x.Sex == "Female";

    let femaleUsEmployees = byFemale(usEmployees);

These abstract over implementation details that we're constantly fiddling with in our end programs, often for little real benefit. Studies have repeatedly shown that humans can write less than 20 lines of correct code per day, so each of those lines should be as expressive and powerful as possible to drive down costs without sacrificing quality.

replies(7): >>43974948 #>>43975561 #>>43975743 #>>43976283 #>>43979978 #>>43981699 #>>44018060 #

2. bflesch ◴[13 May 25 16:48 UTC] No.43974948[source]▶

>>43974427 (TP) #

Your argument makes sense. I guess now it's your time to shine and to be the change you want to see in the world.

replies(1): >>43975853 #

3. rjbwork ◴[13 May 25 17:39 UTC] No.43975561[source]▶

>>43974427 (TP) #

I consider functional thinking and ability to use list comprehensions/LINQ/lodash/etc. to be fundamental skills in today's software world. The what, not the how!

replies(1): >>43976097 #

4. mike_hearn ◴[13 May 25 17:54 UTC] No.43975743[source]▶

>>43974427 (TP) #

Hm, you could do that quite easily but there isn't much juice to be squeezed from runtime selected data structures. Set with O(1) insert:

    var set = new HashSet<Employee>();

Done. Don't need any fancy support for that. Or if you want to load from a database, using the repository pattern and Kotlin this time instead of Java:

    @JdbcRepository(dialect = ANSI) interface EmployeeQueries : CrudRepository<Employee, String> {
        fun findByCountryAndGender(country: String, gender: String): List<Employee>
    }

    val femaleUSEmployees = employees.findByCountryAndGender("US", "Female")

That would turn into an efficient SQL query that does a WHERE ... AND ... clause. But you can also compose queries in a type safe way client side using something like jOOQ or Criteria API.

replies(1): >>43975843 #

5. naasking ◴[13 May 25 18:04 UTC] No.43975843[source]▶

>>43975743 #

> Hm, you could do that quite easily but there isn't much juice to be squeezed from runtime selected data structures. Set with O(1) insert:

But now you've hard-coded this selection, why can't the performance characteristics also be easily parameterized and combined, eg. insert is O(1), delete is O(log(n)), or by defining indexes in SQL which can be changed at any time at runtime? Or maybe the performance characteristics can be inferred from the types of queries run on a collection elsewhere in the code.

> That would turn into an efficient SQL query that does a WHERE ... AND ... clause.

For a database you have to manually construct, with a schema you have to manually and poorly to an object model match, using a library or framework you have to painstakingly select from how many options?

You're still stuck in this mentality that you have to assemble a set of distinct tools to get a viable development environment for most general purpose programming, which is not what I'm talking about. Imagine the relational model built-in to the language, where you could parametrically specify whether collections need certain efficient operations, whether collections need to be durable, or atomically updatable, etc.

There's a whole space of possible languages that have relational or other data models built-in that would eliminate a lot of problems we have with standard programming.

replies(2): >>43976186 #>>43977741 #

6. naasking ◴[13 May 25 18:05 UTC] No.43975853[source]▶

>>43974948 #

I wish I had the time... always "some day"...

replies(1): >>43977757 #

7. naasking ◴[13 May 25 18:28 UTC] No.43976097[source]▶

>>43975561 #

Agreed, but it doesn't go far enough IMO. Why not add language/runtime support for durable list comprehensions, and also atomically updatable ones so they can be concurrently shared, etc. Bring the database into the language in a way that's just as easily to use and query as any other value.

replies(1): >>43977042 #

8. mike_hearn ◴[13 May 25 18:39 UTC] No.43976186{3}[source]▶

>>43975843 #

There are research papers that examine this question of whether runtime optimizing data structures is a win, and it's mostly not outside of some special cases like strings. Most collections are quite small. Really big collections tend to be either caches (which are often specialized anyway), or inside databases where you do have more flexibility.

A language fully integrated with the relational model exists, that's PL/SQL and it's got features like classes and packages along with 'natural' SQL integration. You can do all the things you ask for: specify what operations on a collection need to be efficient (indexes), whether they're durable (temporary tables), atomically updatable (LOCK TABLE IN EXCLUSIVE MODE) and so on. It even has a visual GUI builder (APEX). And people do build whole apps in it.

Obviously, this approach is not universal. There are downsides. One can imagine a next-gen attempt at such a language that combined the strengths of something like Java/.NET with the strengths of PL/SQL.

replies(2): >>43978666 #>>43986115 #

9. ndriscoll ◴[13 May 25 18:48 UTC] No.43976283[source]▶

>>43974427 (TP) #

You can do this in Scala[0], and you'll get type inference and compile time type checking, informational messages (like the compiler prints an INFO message showing the SQL query that it generates), and optional schema checking against a database for the queries your app will run. e.g.

    case class Person(name: String, age: Int)
    inline def onlyJoes(p: Person) = p.name == "Joe"

    // run a SQL query
    run( query[Person].filter(p => onlyJoes(p)) )
    
    // Use the same function with a Scala list
    val people: List[Person] = ...
    val joes = people.filter(p => onlyJoes(p))

    // Or, after defining some typeclasses/extension methods
    val joesFromDb = query[Person].onlyJoes.run
    val joesFromList = people.onlyJoes

This integrates with a high-performance functional programming framework/library that has a bunch of other stuff like concurrent data structures, streams, an async runtime, and a webserver[1][2]. The tools already exist. People just need to use them.

[0] https://github.com/zio/zio-protoquill?tab=readme-ov-file#sha...

[1] https://github.com/zio

[2] https://github.com/zio/zio-http

replies(1): >>43978923 #

10. rjbwork ◴[13 May 25 20:00 UTC] No.43977042{3}[source]▶

>>43976097 #

Well, you can do that with LINQ + EF and embedded databases like SQL Lite or similar.

replies(1): >>43978858 #

11. jimbokun ◴[13 May 25 21:07 UTC] No.43977741{3}[source]▶

>>43975843 #

Why aren’t you building these languages?

12. jimbokun ◴[13 May 25 21:08 UTC] No.43977757{3}[source]▶

>>43975853 #

Thus the answer to your question of why those languages don’t exist.

replies(1): >>43978730 #

13. naasking ◴[13 May 25 22:52 UTC] No.43978666{4}[source]▶

>>43976186 #

> There are research papers that examine this question of whether runtime optimizing data structures is a win

If you mean JIT and similar tech, that's not really what I'm describing either. I'm talking about lifting the time and space complexity of data structures to parameters so you don't have to think about specific details.

Again, think about how tables in a relational database work, where you can write queries against sets without regard for the underlying implementation, and you have external/higher level tools to tune a running program's data structures for better time or space behavior.

> A language fully integrated with the relational model exists, that's PL/SQL

Not a general purpose language suitable for most programming, and missing all of the expressive language features I described, like type/shape inference, higher order queries and query composition and so on. See my previous comments. The tool you mentioned leaves a lot to be desired.

replies(1): >>43987827 #

14. naasking ◴[13 May 25 22:58 UTC] No.43978730{4}[source]▶

>>43977757 #

That would be an explanation if new object/functional/procedural languages weren't coming out every year.

15. naasking ◴[13 May 25 23:11 UTC] No.43978858{4}[source]▶

>>43977042 #

LINQ is on the right track but doesn't quite go far enough with query composition. For instance, you can't "unquote" a query within another query (although I believe there is a library that tries to add this).

EF code-first is also on the right track, but the fluent and attribute mapping are awkward, foreign key associations often have to be unpacked directly as value type keys, there's no smooth transition between in-memory native types and durable types, and schema migration could be smoother.

Lots of the bits and pieces of what I'm describing are around but they aren't holistically combined.

replies(1): >>43995820 #

16. naasking ◴[13 May 25 23:18 UTC] No.43978923[source]▶

>>43976283 #

Notice how you're still specifying List types? That's not what I'm describing.

You're also just describing a SQL mapping tool, which is also not really it either, though maybe that would be part of the runtime invisible to the user. Define a temporary table whose shape is inferred from another query, that's durable and garbage collected when it's no longer in use, and make it look like you're writing code against any other collection type, and declaratively specify the time complexity of insert, delete and lookup operations, then you're close to what I'm after.

replies(1): >>43979193 #

17. ndriscoll ◴[13 May 25 23:53 UTC] No.43979193{3}[source]▶

>>43978923 #

The explicit annotation on people is there for illustration. In real code it can be inferred from whatever the expression is (as the other lines are).

I don't think it's reasonable to specify the time complexity of insert/delete/lookup. For one, joins quickly make you care about multi-column indices and the precise order things are in and the exact queries you want to perform. e.g. if you join A with B, are your results sorted such that you can do a streaming join with C in the same order? This could be different for different code paths. Simply adding indices also adds maintenance overhead to each operation, which doesn't affect (what people usually mean by) the time complexity (it scales with number of indices, not dataset size), but is nonetheless important for real-world performance. Adding and dropping indexes on the fly can also be quite expensive if your dataset size is large enough to care about performance.

That all said, you could probably get at what you mean by just specifying indices instead of complexity and treating an embedded sqlite table as a native mutable collection type with methods to create/drop indices and join with other tables. You could create the table in the constructor (maybe using Object.hash() for the name or otherwise anonymously naming it?) and drop it in the finalizer. Seems pretty doable in a clean way in Scala. In some sense, the query builders are almost doing this, but they tend to make you call `run` to go from statement to result instead of implicitly always using sqlite.

replies(1): >>44000929 #

18. gus_massa ◴[14 May 25 02:07 UTC] No.43979978[source]▶

>>43974427 (TP) #

Isn't this comprehension in Python https://www.w3schools.com/python/python_lists_comprehension.... ?

19. dragandj ◴[14 May 25 07:06 UTC] No.43981699[source]▶

>>43974427 (TP) #

Clojure, friend. Clojure.

Other functional languages, too, but Clojure. You get exactly this, minus all the <'s =>'s ;'s and other irregularities, and minus all the verbosity...

20. neonsunset ◴[14 May 25 16:01 UTC] No.43986115{4}[source]▶

>>43976186 #

Funnily enough, the combination of .NET and PL/SQL already exists today, albeit in a literal sense:

https://pldotnet.brickabode.com/cms/uploads/pldotnet_v0_99_b...

21. mike_hearn ◴[14 May 25 18:38 UTC] No.43987827{5}[source]▶

>>43978666 #

I guess the closest to that I've seen would be something like Permazen with some nice syntax sugar on top. It's not the relational model, but it does simplify away a lot of the complexity of the object relational mismatch (for Java) whilst preserving the expressiveness of a 'full' mainstream language.

replies(1): >>44000862 #

22. rjbwork ◴[15 May 25 15:05 UTC] No.43995820{5}[source]▶

>>43978858 #

Can you elaborate on what you mean by "unquote"? I've not heard this term and I can't find any relevant info from search.

replies(1): >>44000783 #

23. naasking ◴[16 May 25 00:46 UTC] No.44000783{6}[source]▶

>>43995820 #

In F# and MetaOCaml it's called splicing:

https://learn.microsoft.com/en-us/dotnet/fsharp/language-ref...

24. naasking ◴[16 May 25 01:02 UTC] No.44000862{6}[source]▶

>>43987827 #

Yes, that's getting closer, but as you implied it still leaves something to be desired. Ironically what I'm describing is sort of an evolution of Access database programming from 20+ years ago. Everything old is new again.

25. naasking ◴[16 May 25 01:14 UTC] No.44000929{4}[source]▶

>>43979193 #

> In real code it can be inferred from whatever the expression is (as the other lines are).

What I meant is that there would be no explicit List<T> types, or array types, or hash tables, or trees, etc. Contiguity of the data is an implementation detail that doesn't matter for the vast majority of programming, much like how fields are packed in an object is almost completely irrelevant. Existing languages drive people to attend to these small details like collection choice that largely don't matter except in extreme circumstances (like game programming).

What it would have is something more like a Set<T ordered by T.X>, and maybe not even ordering should be specifiable as that's typically a detail of presentation/consumers of data. Restrictions are freeing, so the point is to eliminate many ill-advised premature optimizations and unnecessary internal details. Maybe the runtime will use one of those classic collections internally from the constraints you specify on the set, but the fundamental choice would not typically be visible.

> That all said, you could probably get at what you mean by just specifying indices instead of complexity and treating an embedded sqlite table as a native mutable collection type with methods to create/drop indices and join with other tables.

Yes, something like sqlite would likely be part of the runtime of such a language, and seems like the most straightforward way to prototype it. Anyway, I don't have a concrete semantics worked out so much as rough ideas of certain properties, and this is only one of them.

26. jg0r3 ◴[18 May 25 00:33 UTC] No.44018060[source]▶

>>43974427 (TP) #

Could you link any of these studies?

I couldn't find anything specific when searching.

replies(1): >>44075244 #

27. naasking ◴[23 May 25 18:31 UTC] No.44075244[source]▶

>>44018060 #

Might be a good place to start, some citations and calculations in the replies there:

https://softwareengineering.stackexchange.com/a/450699

I did read actual studies that were conducted years ago, but don't have access to them at this point.

↑