
The man who killed Google Search?

(www.wheresyoured.at)
1884 points by elorant | 22 comments
gregw134 ◴[] No.40136741[source]
Ex-Google search engineer here (2019-2023). I know a lot of the veteran engineers were upset when Ben Gomes got shunted off. Probably the bigger change, from what I've heard, was losing Amit Singhal who led Search until 2016. Amit fought against creeping complexity. There is a semi-famous internal document he wrote where he argued against the other search leads that Google should use less machine-learning, or at least contain it as much as possible, so that ranking stays debuggable and understandable by human search engineers. My impression is that since he left complexity exploded, with every team launching as many deep learning projects as they can (just like every other large tech company has).

The problem, though, is that the older systems had obvious problems, while the newer systems have hidden bugs and conceptual issues which often don't show up in the metrics, and which compound over time as more complexity is layered on. For example: I found an off-by-one error deep in a formula from an old launch that had been reordering top results for 15% of queries since 2015. I handed it off when I left but have no idea whether anyone actually fixed it.
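As an illustration only (a hypothetical sketch, with made-up names and numbers, nothing to do with Google's actual formula): an off-by-one in a positional-boost lookup can silently misapply boosts without crashing anything, so the results still look plausible and nothing obvious shows up in the metrics.

```python
def rerank_fixed(results, boosts):
    # Intended behaviour: the result at position i gets the boost boosts[i].
    scored = [(doc, score * (1.0 + boosts[i]))
              for i, (doc, score) in enumerate(results)]
    return sorted(scored, key=lambda r: r[1], reverse=True)

def rerank_buggy(results, boosts):
    # Off-by-one: reads boosts[i + 1], so every boost lands one position early
    # and the last position silently gets no boost at all.
    scored = [(doc, score * (1.0 + (boosts[i + 1] if i + 1 < len(boosts) else 0.0)))
              for i, (doc, score) in enumerate(results)]
    return sorted(scored, key=lambda r: r[1], reverse=True)

results = [("a", 1.00), ("b", 0.98), ("c", 0.95)]
boosts = [0.0, 0.10, 0.0]  # meant to boost the doc at position 1 ("b")

print([d for d, _ in rerank_fixed(results, boosts)])   # ['b', 'a', 'c']
print([d for d, _ in rerank_buggy(results, boosts)])   # ['a', 'b', 'c']
```

Both versions return well-formed, sensibly ordered results, which is exactly why this class of bug can sit in a ranking stack for years: there is no exception to page on, only a quietly wrong ordering.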

I wrote up all of the search bugs I was aware of in an internal document called "second page navboost", so if anyone working on search at Google reads this and needs a launch go check it out.

replies(11): >>40136833 #>>40136879 #>>40137570 #>>40137898 #>>40137957 #>>40138051 #>>40140388 #>>40140614 #>>40141596 #>>40146159 #>>40166064 #
JohnFen ◴[] No.40136833[source]
> where he argued against the other search leads that Google should use less machine-learning

This better echoes my personal experience with the decline of Google search than TFA: it seems to be connected to the increasing use of ML in that the more of it Google put in, the worse the results I got were.

replies(3): >>40137620 #>>40137737 #>>40137885 #
potatolicious ◴[] No.40137620[source]
It's also a good lesson for the new AI cycle we're in now. Often inserting ML subsystems into your broader system just makes it go from "deterministically but fixably bad" to "mysteriously and unfixably bad".
replies(5): >>40137968 #>>40138119 #>>40138995 #>>40139020 #>>40147693 #
1. munk-a ◴[] No.40138119[source]
I think - I hope, rather - that technically minded people who are advocating for the use of ML understand the short comings and hallucinations... but we need to be frank about the fact that the business layer above us (with a few rare exceptions) absolutely does not understand the limitations of AI and views it as a magic box where they type in "Write me a story about a bunny" and get twelve paragraphs of text out. As someone working in a healthcare adjacent field I've seen the glint in executive's eyes when talking about AI and it can provide real benefits in data summarization and annotation assistance... but there are limits to what you should trust it with and if it's something big-i Important then you'll always want to have a human vetting step.
replies(4): >>40138577 #>>40138723 #>>40138897 #>>40139084 #
2. acdha ◴[] No.40138577[source]
I’m not optimistic on that point: the executive class is very openly salivating at the prospect of mass layoffs, and that means a lot of technical staff aren’t quick to inject some reality – if Gartner is saying it’s rainbows and unicorns, saying they’re exaggerating can be taken as volunteering to be laid off first even if you’re right.
replies(1): >>40163488 #
3. munificent ◴[] No.40138723[source]
> I hope, rather - that technically minded people who are advocating for the use of ML understand the shortcomings and hallucinations.

The people I see who are most excited about ML are business types who just see it as a black box that makes the stock valuation go vroom.

The people who deeply love building things, who really enjoy the process of making itself, are profoundly sceptical.

I look at generative AI as sort of like an army of free interns. If your idea of a fun way to make a thing is to dictate orders to a horde of well-meaning but untrained, highly caffeinated interns, then using generative AI to make your thing is probably thrilling. You get to feel like an executive producer who can make a lot of stuff happen simply by prompting someone/something to do your bidding.

But if you actually care about the grit and texture of actual creation, then that workflow isn't exactly appealing.

replies(2): >>40138898 #>>40139496 #
4. jorblumesea ◴[] No.40138897[source]
> technically minded people who are advocating for the use of ML understand the short comings and hallucinations

Really, my impression is the opposite. They are driven by doing cool tech things and building fresh product while getting rid of "antiquated, old" product. Very little thought is given to the long-term impact of their work. Criticisms of the use cases are often hand-waved away because you are messing with their bread and butter.

5. spacemadness ◴[] No.40138898[source]
They wouldn’t think this way if stock investors weren’t so often such naive lemmings ready to jump off yet another cliff with each other.
6. godelski ◴[] No.40139084[source]
> but we need to be frank about the fact that the business layer above us (with a few rare exceptions) absolutely does not understand the limitations of AI and views it as a magic box where they type in

I think we also need to be aware that this business layer above us often sees __computers__ themselves as a magic box where they type in. There's definitely a large spectrum of how magical this seems to that layer, but the issue remains that there are subtleties which are often important but difficult to explain without detailed technical knowledge. I think there's a lot of good ML can do (being an ML researcher myself), but I often find it ham-fisted into projects simply to say that the project has ML. I think the clearest flag to any engineer that the layer above them has limited domain knowledge is how much importance it places on KPIs/metrics. Are they targets or are they guides? Because I can assure you, all metrics are flawed -- but some metrics are less flawed than others (and benchmark hacking is unfortunately the norm in ML research[0]).

[0] There's just too much happening too fast and too many papers to reasonably review in a timely manner. It's a competitive environment, where gatekeepers are competitors and everyone is absolutely crunched for time and pressured to feel like they need to move even faster. You bet reviews get lazy. The problems aren't "posting preprints on twitter" or "LLMs giving summaries"; it's that the traditional peer review system (especially in conference settings) scales poorly and is significantly affected by hype. Unfortunately I think this ends up railroading us into research directions and makes it significantly challenging for graduate students to publish without being connected to big labs (aka, requiring big compute) (tuning is another common way to escape compute constraints, but that falls under "railroading"). There are still some pretty big and fundamental questions that need to be chipped away at but are difficult to publish in the given environment. /rant

7. fragmede ◴[] No.40139496[source]
We get it, you're skeptical of the current hype bubble. But that's one helluva no-true-Scotsman you've got going on there. Because a true builder, one who deeply loves building things, wouldn't want to use text to create an image. Anyone who does is a business type or an executive producer. A true builder wouldn't think about what they want to do in such a nasty thing as words. Creation comes from the soul, which we all know machines, and business people, don't have.

Using English, instead of C, to get a computer to do something doesn't turn you into a bureaucrat any more than using Python or JavaScript instead does.

Only a person that truly loves building things, far deeper than you'll ever know, someone that's never programmed in a compiled language, would get that.

replies(4): >>40139565 #>>40139626 #>>40140078 #>>40140255 #
8. ethbr1 ◴[] No.40139565{3}[source]
> Using English, instead of C, to get a computer to do something doesn't turn you into a bureaucrat any more than using Python or JavaScript instead does.

If one uses English in as precise a way as one crafts code, sure.

Most people do not (cannot?) use English that precisely.

There's little technical difference between using English and using code to create...

... but there is a huge difference on the other side of the keyboard, as lots of people know English, including people who aren't used to fully thinking through a problem and tackling all the corner cases.

replies(1): >>40140179 #
9. pbar ◴[] No.40139626{3}[source]
Was it intentional to reply with another no true Scotsman in turn here?
replies(2): >>40139742 #>>40140352 #
10. satvikpendem ◴[] No.40139742{4}[source]
Yeah, I was also reading their response and was confused. "Creation comes from the soul, which we all know machines, and business people, don't have" ... "far deeper than you'll ever know", I mean, come on.
11. xarope ◴[] No.40140078{3}[source]
Using English has been tried many times in the history of computing; COBOL and SQL, just to name a very few.

We still needed domain experts back then and, IMHO, will for years/decades to come.

replies(1): >>40140262 #
12. dragonwriter ◴[] No.40140179{4}[source]
> Most people do not (cannot?) use English that precisely.

No one can, which is why, anywhere human interaction needs anything close to the determinacy of code, normal natural language is abandoned for domain-specific constructed languages: built from pieces of natural language, with meanings crafted especially for the particular domain, as the interface language between the people (and often with formalized domain-specific human-to-human communication protocols whose specs are as detailed as you'd see from the IETF).

replies(1): >>40142085 #
13. WWLink ◴[] No.40140255{3}[source]
Getting drunk off that AI kool-aid aren't ya
replies(1): >>40140366 #
14. WWLink ◴[] No.40140262{4}[source]
Or you can draw pretty pictures in LabVIEW lol
15. fragmede ◴[] No.40140352{4}[source]
If you have to ask, then you missed it
16. fragmede ◴[] No.40140366{4}[source]
the othering of creators because they use a different paintbrush was bothering me.
replies(2): >>40140526 #>>40143041 #
17. stavros ◴[] No.40140526{5}[source]
I can relate, AI is a tool, and if I want to write my code by LEGOing a bunch of AI-generated functions together, I should be able to.
18. cultofmetatron ◴[] No.40142085{5}[source]
I gotta say, I love how you used English to perfectly demonstrate how imprecise English is without pre-understood context to disambiguate meaning.
19. karma_pharmer ◴[] No.40143041{5}[source]
please go other yourself somewhere else
replies(1): >>40143823 #
20. fragmede ◴[] No.40143823{6}[source]
Hit a nerve, it seems. Apologies.
21. nebula8804 ◴[] No.40163488[source]
Yeah, but what comes after the mass layoffs? Getting hired to clean up the mess that AI eventually creates? Depending on the business, it could end up becoming more expensive than if they had never adopted GenAI at all. Think about how many companies hopped on the Big Data bandwagon when they had nothing even coming close to what "Big Data" actually meant. That wasn't as catastrophic as what AI could do, but it was still throwing money in the wrong direction.
replies(1): >>40165169 #
22. acdha ◴[] No.40165169{3}[source]
I'm sure we're going to see plenty of that, but from the perspective of a person who isn't rich enough to laugh off unemployment, how does that help? If speaking up got you fired, you won't get your old job back or compensation for the stress of looking in a bad market. If you stick around, you're under more pressure to bail out the business from the added stress of those bad calls, and you're far more likely to see retribution than thanks for having disagreed with your CEO. It takes a very rare person to appreciate criticism, and the people who don't aren't going to get into the situation of making such a huge bet on a fad to begin with – they'd have been more careful to find something it's actually good for.
I’m sure we’re going to see plenty of that but from the perspective of a person who isn’t rich enough to laugh off unemployment, how does that help? If speaking up got you fired, you won’t get your old job back or compensation for the stress of looking in a bad market. If you stick around, you’re under more pressure to bail out the business from the added stress of those bad calls and you’re far more likely to see retribution than thanks for having disagreed with your CEO: it takes a very rare person to appreciate criticism and the people who don’t aren’t going to get in the situation of making such a huge bet on a fad to begin with – they’d have been more careful to find something it’s actually good for.