Digital life
from TechRepublic
20 hours ago
Google Vids Just Got a Major AI Upgrade - Here's What's New
Google Vids enables intuitive video creation using AI, allowing users to direct avatars and publish content quickly with simple text prompts.
The response was in Indonesian but shaped by values that centered individual autonomy over the consensus-building, social harmony and collective family dynamics that tend to matter more in Indonesian social life.
Cohere's Transcribe model is designed for tasks like note-taking and speech analysis, supporting 14 languages and optimized for consumer-grade GPUs, making it accessible for self-hosting.
In short, you can use voice commands to log events throughout your day - what you ate, what you drank, the supplements that you took, any workouts that you did, even subjective feelings. Previously, you had to pull out your phone to do that. That extra bit of friction could cause you to put it off for later and forget to log important things.
As explained by Meta: "AI-powered translations for Reels are starting to roll out in more languages, including Bengali, Tamil, Telugu, Marathi, and Kannada, on Instagram. These new additions build on our existing language support for English, Hindi, Portuguese, and Spanish." The addition of more of the languages spoken in India is significant, because India is now the biggest single market for both Facebook and Instagram usage, beating out the U.S. by a significant margin.
The self-described "Un-carrier" announced on Wednesday that beta signups were now available for live call translation powered not by an app or device-level capability, but by AI that lives directly on the T-Mobile network. Eligible T-Mobile customers using a phone connected to 4G LTE or 5G - from flagship smartphones to bog-standard flip phones - can activate the feature by dialing *87*, with only one caller required to be on the carrier's network. Access is currently limited to customers admitted into the beta.
On Wednesday, the Paris-based AI lab released two new speech-to-text models: Voxtral Mini Transcribe V2 and Voxtral Realtime. The former is built to transcribe audio files in large batches and the latter for nearly real-time transcription, within 200 milliseconds; both can translate between 13 languages. Voxtral Realtime is freely available under an open source license.
There's a good chance you spend more time talking to your phone's virtual assistant, or dictating text with your voice, than actually calling people these days. But, as convenient as voice input can be, you don't want to be the obnoxious person shouting commands to Siri in a quiet library. And you probably won't have much luck dictating an email in a room with toddlers screaming and Peppa Pig blaring on the TV. (Ask me how I know.)
Using the company's SmartVoice technology, the devices react to wake-up words for verbal commands, using built-in microphones. Most of the appliances will also offer a built-in speaker so that they can react audibly to the commands. IAI Smart emphasizes the ease of use that this offers. "Our guiding principle is simple: make smart home technology easier for everyone," said Jason Jiang, CEO of IAI Smart. "Voice control should be effortless, and now it is." And because everything is on-device, personal information never leaves the home.