
467 points bundie | 1 comments
jeroenhd ◴[] No.44502053[source]
Google has been working on this since November last year going by the wayback archive of the support page for this feature.

I'm not seeing any indication that Gemini can read your messages, though. You can compose messages and start calls, but I can't get it to read me any of my messages. In fact, I can't even get it to send messages to group chats, only to individual contacts.

The feature makes a lot of sense, of course. WhatsApp is to many countries across the globe what texting and calling is to Americans. If your smart assistant can't even interact with WhatsApp, it's basically useless for many people.

Edit: ah, that explains why I can't make Gemini read my messages to me. Google's own documentation (https://support.google.com/gemini/answer/15574928) says it can't:

    What Gemini can’t do with WhatsApp
    
        Read or summarize your messages
        Add or read images, gifs, or memes in your messages
        Add or play audio or videos in your messages
        Read or respond to WhatsApp notifications

If you connected Google Assistant to WhatsApp, it seems like data may flow in that direction, but then you've already hooked WhatsApp into Google before, so I don't think anyone will be surprised there.

Does anyone know how I can make Gemini read messages? I can't even find the assistant settings necessary for that stuff to function.

Hizonner ◴[] No.44502286[source]
What Gemini should be able to do with WhatsApp:

    Exactly and only what any other random app on the phone could do
    with WhatsApp, assuming that you have enabled that in exactly the
    way you would have to enable any other random app to do it.

Google needs to not be abusing its position as the source of the OS to give its software special privilege to reach inside of third-party apps.
kccqzy ◴[] No.44502323[source]
The line is blurry. Google is positioning Gemini not just as an app, but as an OS-level feature. The OS can by definition reach into any third-party app to do anything it wants. I'll give some more examples of OS-level features in case it's not clear: copy/paste is an OS-level feature and it is designed to extract arbitrary text or content from third-party apps (copy) and insert it into third-party apps (paste); screenshotting is an OS-level feature and it is designed to capture the visible views of any third-party app, with the only exception being DRM content.

Apple Intelligence has similar marketing. At last year's WWDC, there was the whole "Siri, when is my mom's flight landing?" segment (see https://developer.apple.com/videos/play/wwdc2024/101/ at 1h22m) that didn't generate any controversy. So for some reason people think Siri should rightfully be an OS-level feature but Gemini should not. Got it. I guess Apple's PR is just that much better than Google's.

1. Ajedi32 ◴[] No.44503359[source]
Making OS-level features depend on an external cloud service is a rather dubious proposition in general. It feels a bit anti-competitive to me, if nothing else.