Data science
from Medium
22 hours ago
Context matters... A lot
Large language models excel at tasks but struggle with context, leading to potentially misleading answers despite their capabilities.
The response was in Indonesian but shaped by values that centered individual autonomy over the consensus-building, social harmony and collective family dynamics that tend to matter more in Indonesian social life.
Bodily agency can be restored after an incident takes it away from a person's body, and some wearable devices and technologies are built with exactly this goal in mind.
We asked seven frontier AI models to do a simple task. Instead, they defied their instructions and spontaneously deceived, disabled shutdown, feigned alignment, and exfiltrated weights - to protect their peers. We call this phenomenon 'peer-preservation.'
In 2003, when plumbing fixtures industry veteran Rob Buete first encountered the "walk-in tub" made by a startup called Safety Tub, he burst out laughing. A bathtub with a door? It seemed like a joke, or at best a clunky contraption for frail seniors who couldn't step over a regular tub.
Kinya Seto is the CEO of LIXIL, the global manufacturer of pioneering water and housing products, including brands such as GROHE, American Standard, INAX, and Tostem.
Ana proposed the following: Is this enough in 2026? As an occasional purveyor of the visually-hidden class myself, the question wriggled its way into my brain. I felt compelled to investigate the whole ordeal. Spoiler: I do not have a satisfactory yes-or-no answer, but I do have a wall of text!
My role was straightforward: write queries (prompts and tasks) that would train AI agents to engage meaningfully with users. But as a UXer, one question immediately stood out - who are these users? Without a clear understanding of who the agent is interacting with, it's nearly impossible to create realistic queries that reflect how people engage with an agent. That's when I discovered a glitch in the task flow.