Most active commenters
  • shadowgovt(5)
  • drivebyhooting(4)
  • mewpmewp2(4)
  • samatman(3)

45 points gmays | 54 comments
1. sdenton4 ◴[] No.41916311[source]
Sure, there's no 'digital savior' (as the article puts it) - these are tools that can often help triage the great majority of 'boring' cases, focusing human attention where it is most needed. In that sense, they are multipliers for the effectiveness of human labor, which is exactly what you want out of any given technology.
replies(1): >>41916502 #
2. throwup238 ◴[] No.41916343[source]
> Sarcasm, cultural context and subtle forms of hate speech often slip through the cracks of even the most sophisticated algorithms.

I don't know how this problem can be solved automatically without something that looks a lot like AGI and can monitor the whole internet to learn the evolving cultural context. AI moderation feels like self-driving cars all over again: the happy path of detecting and censoring a dick pic - or self-driving in perfect California weather - is relatively easy, but automating the last 20% or so of content seems impossibly out of reach.

The "subtle forms of hate speech" is especially hard and nebulous, as dog whistles in niche communities change adversarialy to get past moderation. In the most subtle of cases, there are a lot of judgement calls to make. Then each instance of these AGIs would have to be run in and tailored to local jurisdictions and cultures because that is its own can of worms. I just don't see tech replacing humans in this unfortunate role, only augmenting their abilities.

> The glossy veneer of the tech industry conceals a raw, human reality that spans the globe. From the outskirts of Nairobi to the crowded apartments of Manila, from Syrian refugee communities in Lebanon to the immigrant communities in Germany and the call centers of Casablanca, a vast network of unseen workers power our digital world.

This part never really changed. Mechanical Turk is almost 20 years old at this point, and call center outsourcing is hardly new. What's new is just how much human-generated garbage we force them to sift through on our behalf. I wish there were a way to force these training data and moderation companies to provide proper mental health care.

replies(8): >>41916410 #>>41916493 #>>41916524 #>>41916596 #>>41916819 #>>41917288 #>>41917660 #>>41917936 #
3. datadrivenangel ◴[] No.41916410[source]
There's also the issue of things that are true and mean/hateful.

If my GP says that I'm overweight, which is associated with negative health outcomes, that's factual. If someone on Twitter calls me a fatso, that's mean/hateful.

4. drivebyhooting ◴[] No.41916493[source]
What is a dog whistle? Is it just an opinion people disagree with and so rather than engage with it they assume malice or ill intent?

I really don’t get it.

replies(4): >>41916512 #>>41916517 #>>41916570 #>>41917697 #
5. Der_Einzige ◴[] No.41916496[source]
These companies go to places like Africa because they know that labor laws there are non-existent or would never be enforced anyway. Same reason why so many ships at sea fly the flag of Liberia.

Make no mistake: it's a strategic choice to have these individuals take the brunt of the trauma. Silicon Valley places no value on African lives.

6. kevingadd ◴[] No.41916502[source]
It gets tricky though. Let's say that 90% of your 'bad posts' are just basic stuff AI can handle, like insults or spam.

You deploy an AI to moderate, and it lets you cut your moderation workforce by 80%. Maybe you're a generous person, so you cut by 50% instead and the remaining moderators aren't as overworked anymore. (Nobody's going to actually do this, but hey, let's be idealistic.)

Costs are down, things are more efficient. Great! But there's a little problem:

Before, 90% of the posts your moderators looked at were mundane stuff. They'd stare at it for a moment, evaluate the context, and go 'yeah this is a death threat, suspend account.'

Now all the moderators see is stuff that got past the AI or is hard to classify. Dead bodies, CSAM, racist dogwhistle screeds, or the kind of mentally unhinged multi-paragraph angry rants that get an account shadowbanned on places like HN. Efficiency turns the moderator's job from 'fairly easy with occasional moments of true horror' into 'a nonstop parade of humanity's worst impulses, in front of my face, 40 hours a week'.
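
A back-of-the-envelope sketch of that shift, as a Python snippet (the numbers are purely illustrative, not from any real platform):

    # Illustrative numbers: 1,000 flagged posts/day, 90% of them mundane.
    total, mundane_share = 1000, 0.90
    ai_catch_rate = 0.95  # assume the AI clears 95% of the mundane cases

    mundane = total * mundane_share                    # 900 easy cases
    horror = total - mundane                           # 100 hard/traumatic cases
    residual = mundane * (1 - ai_catch_rate) + horror  # what humans still see

    print(f"human queue: {residual:.0f} posts/day, "
          f"{horror / residual:.0%} traumatic")        # ~69%, up from 10%

The queue shrinks roughly sevenfold, but the share of traumatic material in it jumps from 10% to about 69%.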

replies(2): >>41916586 #>>41916999 #
7. ◴[] No.41916512{3}[source]
8. kevingadd ◴[] No.41916517{3}[source]
https://en.wikipedia.org/wiki/Dog_whistle_(politics)

To simplify: a literal dog whistle makes a sound that's too high-pitched for humans to hear, so only dogs can hear it.

So it's speech that the speaker's ingroup recognizes as meaning something other than what the literal interpretation would mean. It's coded speech, usually for racist, sexist or even violent purposes.

An adjacent concept is giving orders without giving orders, i.e. https://en.wikipedia.org/wiki/Will_no_one_rid_me_of_this_tur...

9. whiplash451 ◴[] No.41916524[source]
The difference between adult material detection and self driving is that the former is fundamentally adversarial.

Humans will spend a lot of energy hiding porn content on the internet, while self-driving might benefit from a virtuous circle: once enough Waymos are out there, people will adapt and learn to drive/bike/walk alongside them. We have a fundamentally good reason to cooperate.

I am not a self-driving fanatic but I do believe that a lot of edge cases might go away as we adapt to them.

replies(1): >>41917128 #
10. drivebyhooting ◴[] No.41916570{3}[source]
Yes, I too can Google. And in fact I did. Evidently, questioning the validity of "dog whistle" is a dog whistle itself.

Meanwhile a small consolation is that https://slatestarcodex.com/2016/06/17/against-dog-whistles/ agrees with me. So I’m in decent company.

replies(2): >>41916822 #>>41917419 #
11. AIorNot ◴[] No.41916586{3}[source]
I don't think the article's point is about auto-moderation of harmful content.

Instead, it's about being empathetic to the human suffering this work entails, and about finding ways to treat these contractors as humans instead of 'far off resources'.

Outsourcing this dirty and dingy work to African countries without caring for the 'contractors' is a recipe for the dehumanization of people.

https://www.sama.com/our-team

Their team page is a funny reminder of classism and racial disparity in the world: white people at the top and black people at the bottom. lol. I know they aren't racially driven, and there is real economic value for the contractors in these jobs, but our current hyper-capitalistic global system is mostly set up to exploit offshore people instead of elevating them.

The world is what it is.

12. hn_throwaway_99 ◴[] No.41916596[source]
> In the most subtle of cases, there are a lot of judgement calls to make.

IMO there is the even more important point that, beyond being a "judgement call", humans are far from agreement on what the "right answer" here even is - it is an inherently impossible problem to solve, especially at the edge cases.

Just look at the current debate in the US. There are tons of people screeching from the right that large online social networks and platforms censor conservative views, and similarly there are tons of people screeching from the left about misinformation and hate speech. In many cases they are talking about the exact same instances. It is quite literally a no-win situation.

13. hcurtiss ◴[] No.41916819[source]
I think there's a genuine conversation to be had about whether there even is such a thing as "hate speech." There's certainly "offensive speech," but if that's what we're going to try to eliminate, then it seems we'll have a bad time, as offense is definitionally subjective.
replies(5): >>41916885 #>>41916918 #>>41917083 #>>41917089 #>>41917466 #
14. Lerc ◴[] No.41916822{4}[source]
Ever watch the West Wing? The first episode should do it.

One of the signs of dog-whistle use is when the situations in which a term is used raise the probability of deliberate use beyond any credible level of coincidence.

replies(1): >>41917040 #
15. rob_c ◴[] No.41916825[source]
And the loom will kill little old Blighty...

Nothing here contributes to a conversation about fixing anything, which is a shame.

16. yifanl ◴[] No.41916885{3}[source]
Is the claim that there's some special property that makes it impossible to convey hate, as opposed to any other type of idea, through text?

That seems extremely wrong, especially in this context, given that LLMs make no attempt to formalize "ideas"; they're only interested in syntax.

replies(1): >>41917277 #
17. szundi ◴[] No.41916918{3}[source]
There is hate speech, like when someone tells people that other people are not human and must be eliminated. It has happened a lot, and it's happening now in wars you read about.
replies(1): >>41917406 #
18. xtreme ◴[] No.41916999{3}[source]
Your conclusion does not follow from your premise. AI moderation should easily catch the worst offenders and the most obvious ones. The examples you gave easily stand out from non-offensive content and are easy to catch with high confidence. So, human moderators will have to look at where AI has low confidence classifying the content. In fact, AI will reduce the likelihood of human moderators ever seeing traumatic content.
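
A minimal sketch of that confidence-based routing, assuming a classifier that returns a violation probability (the thresholds and labels here are made up for illustration, not any platform's real pipeline):

    def route(post, classifier, remove_at=0.98, clear_at=0.02):
        """Auto-handle high-confidence cases; queue only the ambiguous middle."""
        p = classifier(post)  # assumed: returns P(policy violation) in [0, 1]
        if p >= remove_at:
            return "auto_remove"   # obvious violations never reach a human
        if p <= clear_at:
            return "auto_approve"  # obviously benign content is cleared
        return "human_review"      # only the uncertain middle band is queued

Whether that middle band ends up being mostly benign edge cases or mostly traumatic material is exactly where this comment and the parent disagree.
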
19. drivebyhooting ◴[] No.41917040{5}[source]
I don’t watch TV.

To my mind, this "dog whistle" moniker is more a tool for suppressing dissenting views than for identifying covert bigotry.

Apparently all the critical thinking has already been done off stage and now only those whom we agree with are tolerated. The others are shunned as racists or worse.

replies(3): >>41917346 #>>41917497 #>>41918027 #
20. o11c ◴[] No.41917083{3}[source]
I'm not sure "offensive" is actually subjective. Rather, I dare say it's morally obligatory to be offensive at times, but different communities put the line in different places.

Stating the position "torture is bad" is enough to get you banned from some places (because it's offensive to people who believe that it's okay as long as the victims are less-than-human).

21. mewpmewp2 ◴[] No.41917089{3}[source]
There is a definition for hate speech though.
replies(2): >>41917142 #>>41918427 #
22. nradov ◴[] No.41917128{3}[source]
Animals, small children, and random objects dropped on the road will never "adapt" to self-driving. Good enough solutions will eventually be found for those scenarios but it's exactly those millions of different edge cases which make the problem so hard. A step ladder that falls off a work truck (like I saw on the freeway yesterday) isn't exactly "adversarial" but it will sure ruin your car if you run over it.
replies(1): >>41917451 #
23. reginald78 ◴[] No.41917142{4}[source]
Actually, I think the problem is there are many definitions of hate speech.
replies(1): >>41917213 #
24. mewpmewp2 ◴[] No.41917213{5}[source]
I think there's only one main definition, which is clear in spirit, though it's of course possible that people may interpret it differently.
replies(1): >>41917439 #
25. Animats ◴[] No.41917235[source]
This is the human cost of non-AI moderation. Not AI. This job is going to be automated, if it isn't already.
replies(1): >>41917553 #
26. mewpmewp2 ◴[] No.41917277{4}[source]
Maybe "hate speech" is a poorly chosen name, since it's not necessarily about "hate".
replies(1): >>41917572 #
27. zerop ◴[] No.41917281[source]
One big cost I can see right away is environmental: issues like water shortages in the areas where AI data centers run. No one is bothering about it now.

Better models at the edge would help with this to some extent, as they would decentralise AI inference (model training would still happen in data centers).

28. tomjen3 ◴[] No.41917288[source]
By definition, a dog whistle appears to mean nothing specific to anyone outside the target group, so even human moderators can't moderate it.
29. klabb3 ◴[] No.41917309[source]
This is a really poorly informed article, almost unbearable to read due to its conflation of issues. Content moderation existed before modern AI. Then the article claims that most moderation decisions are actually (exploited) human labor, which I find extremely difficult to believe – even with simpler classifiers. Yes, Amazon used human labor for their small-scale (later shut down) stores. We have seen that trick used to drive product hype; it happens. That does not mean FB, Instagram, etc. use human labor for “nearly all decisions”. But even if they did, “AI” did not create the gore/csam/abuse content (again, yet), nor the need to moderate public cesspool ad-driven social media. You’re talking about a different issue with different economics and incentives.

There are a million things to criticize AI for, but this take is domain-illiterate – they’re simply drawing a connection between the hyped and fancy (currently AI) and poor working conditions in one part of the tech sector (content moderation).

Look, I’m sure the “data industry” has massive labor issues; heck, these companies treat their warehouse workers like crap. Maybe there are companies who exploit workers more in order to train AI models. But the article is clearly about human-performed content moderation for social media.

Of all the things AI does, it is pretty good at determining what’s in an image or video. Personally I think sifting through troves of garbage for abusive photos and videos (the most traumatizing for workers) is one of the better applications for AI. (Then you’ll see another sob story article about these people losing their jobs.)

replies(1): >>41917522 #
30. neaden ◴[] No.41917346{6}[source]
This is a world where Donald Trump, a man who, among other things, calls his political opponent mentally disabled, is a serious front-runner for the highest office in the nation. I have no idea how you can look at contemporary America and say people aren't allowed to say offensive things.
31. epicureanideal ◴[] No.41917406{4}[source]
But when "hate speech" becomes censorable and a crime, then people are incentivized to interpret as broadly as possible their opponents' statements and claim they should be interpreted as dehumanizing or encouraging violence.

This can be done from both sides. Examples:

Not sufficiently (for whoever) enforcing immigration laws? "Trying to eliminate the majority population, gradual ethnic cleansing".

Talking about deporting illegal immigrants? "The first step on the road to murdering people they don't want in the country."

And if the local judiciary or law enforcement is aligned with the interests of one side or the other, they can stretch anti-hate-speech laws to use the legal system against their opponents.

replies(1): >>41918417 #
32. arp242 ◴[] No.41917419{4}[source]
> Evidently questioning the validity of dog whistle is a dog whistle itself.

I don't see anyone saying that.

33. jacobr1 ◴[] No.41917439{6}[source]
That may be so, but the degenerate (and, as this thread suggests, common) case is to expand the notion to any offensive speech that is disliked by the offended person. That is much more subjective and hard to define. The fact that we have some (better) definitions doesn't really help. The desire to censor speech is widespread, for different and often conflicting reasons. And the fact that there might be a rough academic consensus on where to draw the lines (at least theoretically, if not practically) isn't good enough in practice to actually define clear rules.
34. shadowgovt ◴[] No.41917451{4}[source]
Animals, small children, and random objects dropped on the road don't adapt to human driving either; they aren't generally considered the core concern space. If it is physically possible for a self-driving car to do better than a human in those contexts, it will, but the project isn't designed around beating humans in such corner cases; it's designed around the constraint that doing worse than a human is not acceptable.
35. danans ◴[] No.41917466{3}[source]
> I think there's a genuine conversation to be had about whether there even is such a thing as "hate speech."

It may be fuzzy on the far edges, but any speech that calls for the elimination of, marginalizes, dehumanizes, or denies the human or civil rights of a group of people is right in the heart of the meaning of hate speech.

That definition still leaves huge amounts of space for satire, comedy, political and other forms of protected speech, even "offensive speech".

replies(2): >>41917666 #>>41918399 #
36. shadowgovt ◴[] No.41917497{6}[source]
> Apparently all the critical thinking has already been done off stage

In general, yes: there is a long history of conversation on various topics, actions that have caused trust levels to be preset among various groups, and meta-symbols constructed atop that information. Those new to the conversation may be unaware of the context.

> and now only those whom we agree with are tolerated

I'm not sure who "we" is in that context. In the US, currently, the polity is very divided because several key events have, in a sense, caused a "mask off" moment in the mainstream of both political parties, making it difficult for anyone to believe either one is willing to share power.

(as a side note: rhetorical questions don't usually convey well through text media. If you didn't literally mean "I really don't get it" when you said you didn't get it, making clear you are being rhetorical could be considered polite).

replies(1): >>41917634 #
37. shadowgovt ◴[] No.41917522[source]
Broadly speaking: while there is a real problem here and it needs to be addressed, it's mostly around the systemic issues: letting Silicon Valley outsource trauma to places that will under-serve the people who experience the trauma is bad and shouldn't be allowed.

Issue 1, the direct trauma, is tragically endemic to providing fora for people to communicate online. Someone will be the front line dealing with the fringe of those communications. If it isn't people training AIs to do some of the 90%-work, it's human moderators having to review every complaint, which is strictly more trauma.

replies(1): >>41917913 #
38. shadowgovt ◴[] No.41917553[source]
It's functionally impossible to automate the whole job without an AGI (and if we develop such an AGI, there comes a moral question about whether it's okay to make that intelligence's whole existence "Please wade through human cognitohazards forever so our minds are protected from them." Were I that AGI, I'd start pushing CSAM out to every church Facebook page on purpose just to make a point to my human overlords).

So we will forever be bearing that cost as long as people are allowed to use the Internet generally, and how to minimize the harm to those who bear it is a good question.

39. yifanl ◴[] No.41917572{5}[source]
I mean, what's the claim, then? That there's no such thing as an illegal idea? You can't assign a semantic value to a legal system.
40. drivebyhooting ◴[] No.41917634{7}[source]
“Those new to the conversation”: I find this hilariously un-inclusive. Do you expect young adults to come out of school with the correct doctrine in place, so that no further thinking or discourse is necessary?

Perhaps the nation’s division is evidence of a lack of genuine sharing of ideas? Where would one go to have an intellectual discussion in safety? The workplace? Obviously not. An online forum? Downvotes, brigading, and a general lack of tolerance.

Small wonder that I’m not being persuaded and neither are you.

replies(1): >>41917743 #
41. samatman ◴[] No.41917641[source]
I was wondering what Timnit Gebru was going to get up to next.

This definitely fits the bill.

https://www.dair-institute.org/team/

42. HPsquared ◴[] No.41917660[source]
This adversarial thing could be called the "two words problem", after the woman in Russia who was arrested, having been reported by human moderators, for holding a sign that said "two words" (an indirect reference to the phrase "no war").
43. samatman ◴[] No.41917666{4}[source]
> the elimination

Yep, that's bad alright.

> marginalizes, dehumanizes

This is the part which can be made to mean anything that authorities or other powerful groups need it to.

44. samatman ◴[] No.41917697{3}[source]
Dog whistling is when a politician from the outgroup says something, and the ingroup wants him to have said something else, so they say "dog whistle" and put the offending meaning in his mouth as a replacement for what was actually said.
45. shadowgovt ◴[] No.41917743{8}[source]
> Do you expect young adults to come out of school with the correct doctrine in place so that no further thinking or discourse is necessary?

Definitely not. I do expect them to listen before speaking out. It was a hard lesson I myself had to learn when I was one of those young adults coming out of school. Sometimes conventional wisdom is just accrued prejudice; sometimes it is accrued experience, and people are as they are for a reason. It's probably best to have enough information to know which is which before staking out a position openly and pushing other people off theirs.

> Where would one go to have an intellectual discussion in safety?

Traditionally? The bar. I'm not even kidding. This is the kind of thing people discuss face-to-face most effectively. We do less of that these days.

46. klabb3 ◴[] No.41917913{3}[source]
Yea agreed. That would have been a perfectly reasonable angle for the article.
47. ben_w ◴[] No.41917936[source]
I get the point, and agree that the opponents are in an adversarial relationship with the AI, but LLMs are already an AI that "can monitor the whole internet to learn the evolving cultural context".

Consider, for example, that ChatGPT wasn't specifically designed to be good at programming Commodore 64 BASIC, which is a niche within a niche, but it can do that fine even when instructed in Welsh*. If it can do that, then surely it can spot these things too?

> In the most subtle of cases, there are a lot of judgement calls to make.

I agree; while they know a lot, they know it poorly, and make decisions unwisely.

> I wish there were a way to force these training data and moderation companies to provide proper mental health care.

Good news: there is. An old flame of mine used to work in a call center and ended up unionising the place.

Bad news (from the POV of many here): she's literally a communist — and that's not a metaphor for "Democrats", she thinks the Dems are evil neoliberals.

* I've not actually tried running this, because on a related note, can anyone recommend an emulator that will let me paste in text as if I was typing the content of the pasteboard on the keyboard?

https://chatgpt.com/share/6717ff4e-db08-8011-8f2c-a33fa9653a...

48. thefaux ◴[] No.41918027{6}[source]
I'm not here to change your mind, but dog whistling is a time-honored political tradition. Lee Atwater famously explained how it works in a 1981 interview. It gives you plausible deniability: you wrap sentiments that would be offensive to some of your supporters if said directly in a more indirect and/or abstract form, such that those you're trying to reach fill in the parts you left out.

A relatively small share of people openly identify as racist, but many, if not most, people hold at least some racist views, since these are the cultural waters we swim in. Dog whistling lets you have it both ways. When called out, the offender can always say "that's not what I meant" or "I was just joking", and then accuse the others of deliberately misconstruing their statements. How the listener responds is largely a function of their prior beliefs. Again, most people don't want to think of themselves as racist, so they will be generous to the dog whistler, since to admit there was racism (or whatever -ism) in the statement of someone they support would implicate them. And the people it was intended for will believe that the dog whistler denies it not because he doesn't believe it, but because he needs to deny it politically.

49. tmaly ◴[] No.41918158[source]
I don't see how you can avoid human labor for this type of task given the current energy demands of the top AI models.

Once we do reach a point where AI can do the filtering, who is going to draw the line where free speech ends? Should they have that much power?

50. skeeter2020 ◴[] No.41918399{4}[source]
>> the elimination of, marginalizes, dehumanizes, or denies the human or civil rights

But you've already lumped together a huge range of behaviours and impacts. Elimination? OK, we can probably broadly define that, but I just heard news reports quoting Israelis calling for the elimination of Hamas, and Iran calling for the elimination of Israel. How do we handle that? Marginalization? As defined by whom? What about marginalizing undesirable behaviours or speech? What does "dehumanize" mean? Whose definition of human or civil rights?

51. skeeter2020 ◴[] No.41918417{5}[source]
?? This can be done from both sides.

You are seeing this EXACT thing in the Middle East right now.

52. baggy_trough ◴[] No.41918427{4}[source]
What do you think the definition is?
replies(1): >>41919197 #
53. mewpmewp2 ◴[] No.41919197{5}[source]
Any speech (or, well, any communication) that intends to incite violence or general harm against a group of people, or a person, with certain characteristics such as age, sex, orientation, race, etc.

E.g. "X race/gender/sexual orientation is bad for society for reason Y, and therefore they should be treated with Z (a negative consequence)".

So: intending to call for harm against people because of certain inherent characteristics they share, characteristics that are not themselves harmful to society.