#genai-safeguards

[ follow ]
#ai-security
fromSecurityWeek
3 days ago
Information security

Google Addresses Vertex Security Issues After Researchers Weaponize AI Agents

Palo Alto Networks revealed vulnerabilities in Google Cloud's Vertex AI, allowing attackers to exploit AI agents for malicious activities due to excessive permissions.
Artificial intelligence
fromFortune
4 days ago

Is AI's visual understanding mostly a 'mirage'? New research suggests so. | Fortune

Anthropic faces significant cybersecurity risks following multiple sensitive data leaks related to its new AI model, Mythos.
Information security
fromSecurityWeek
3 days ago

Google Addresses Vertex Security Issues After Researchers Weaponize AI Agents

Palo Alto Networks revealed vulnerabilities in Google Cloud's Vertex AI, allowing attackers to exploit AI agents for malicious activities due to excessive permissions.
Artificial intelligence
fromFortune
4 days ago

Is AI's visual understanding mostly a 'mirage'? New research suggests so. | Fortune

Anthropic faces significant cybersecurity risks following multiple sensitive data leaks related to its new AI model, Mythos.
#ai-governance
Artificial intelligence
fromSecurityWeek
1 week ago

Why Agentic AI Systems Need Better Governance - Lessons from OpenClaw

Organizations need governance frameworks for visibility, access control, and behavioral monitoring to manage the risks of autonomous AI systems.
Artificial intelligence
fromSecurityWeek
1 week ago

Why Agentic AI Systems Need Better Governance - Lessons from OpenClaw

Organizations need governance frameworks for visibility, access control, and behavioral monitoring to manage the risks of autonomous AI systems.
Law
fromABA Journal
2 days ago

Sanctions ramping up in cases involving AI hallucinations

Monetary sanctions against attorneys for AI-generated hallucinations in case documents are increasing as courts take these issues more seriously.
#meta
Information security
fromWIRED
1 day ago

Meta Pauses Work With Mercor After Data Breach Puts AI Industry Secrets at Risk

Meta has paused work with Mercor due to a major security breach affecting data used for AI training.
Information security
fromWIRED
1 day ago

Meta Pauses Work With Mercor After Data Breach Puts AI Industry Secrets at Risk

Meta has paused work with Mercor due to a major security breach affecting data used for AI training.
#openai
Media industry
fromDefector
1 day ago

Tech Media Propaganda Operation Makes It Official, Goes In-House At OpenAI | Defector

OpenAI acquired the Technology Business Programming Network for hundreds of millions, raising concerns about media independence despite its existing alignment with tech elites.
Venture
fromFast Company
2 days ago

OpenAI's gigantic new funding round renews fears about the company's profitability and cash burn

OpenAI raised $122 billion in funding, achieving an $852 billion valuation, positioning itself for a potential IPO by 2026.
Media industry
fromDefector
1 day ago

Tech Media Propaganda Operation Makes It Official, Goes In-House At OpenAI | Defector

OpenAI acquired the Technology Business Programming Network for hundreds of millions, raising concerns about media independence despite its existing alignment with tech elites.
Venture
fromFast Company
2 days ago

OpenAI's gigantic new funding round renews fears about the company's profitability and cash burn

OpenAI raised $122 billion in funding, achieving an $852 billion valuation, positioning itself for a potential IPO by 2026.
#ai-development
fromInfoQ
23 hours ago
Software development

Anthropic's Designs Three-Agent Harness Supports Long-Running Full-Stack AI Development

Anthropic's multi-agent harness improves autonomous application development by dividing tasks among agents for better coherence and output quality.
Software development
fromInfoQ
23 hours ago

Anthropic's Designs Three-Agent Harness Supports Long-Running Full-Stack AI Development

Anthropic's multi-agent harness improves autonomous application development by dividing tasks among agents for better coherence and output quality.
#ai
fromFuturism
1 day ago
Intellectual property law

Anthropic Suddenly Cares Intensely About Intellectual Property After Realizing With Horror That It Accidentally Leaked Claude's Source Code

Philosophy
fromPsychology Today
3 days ago

Nobody Carries AI's Thinking With Affection

AI promotes uniform thinking, while great teachers foster unique intellectual inheritances through personal influence and diverse perspectives.
Data science
fromInfoWorld
1 week ago

A data trust scoring framework for reliable and responsible AI systems

A rigorous trust scoring framework is essential to prevent AI from perpetuating inequality through biased data.
Intellectual property law
fromFuturism
1 day ago

Anthropic Suddenly Cares Intensely About Intellectual Property After Realizing With Horror That It Accidentally Leaked Claude's Source Code

Anthropic's copyright takedown request for its AI model's source code highlights hypocrisy in its stance on copyright laws.
Philosophy
fromPsychology Today
3 days ago

Nobody Carries AI's Thinking With Affection

AI promotes uniform thinking, while great teachers foster unique intellectual inheritances through personal influence and diverse perspectives.
Science
fromBig Think
4 days ago

The paradox at the heart of AI progress

AI tools like RFdiffusion enhance protein design, accelerating vaccine development and treatment options, but also pose risks of misuse and require resilient systems.
Marketing
from3blmedia
4 days ago

"AI Can't Quote Coverage You Never Generated."

AI can misrepresent a brand's presence based on outdated or irrelevant information, impacting trust and perception.
Data science
fromInfoWorld
1 week ago

A data trust scoring framework for reliable and responsible AI systems

A rigorous trust scoring framework is essential to prevent AI from perpetuating inequality through biased data.
fromThe Verge
1 day ago

OpenAI's AGI boss is taking a leave of absence

Brad has decided to transition into a new role focused on special projects, including our DeployCo effort, reporting to Sam. He's been our go-to for complex deals and investments across the company.
Healthcare
Marketing tech
fromTipRanks Financial
1 day ago

AI Recommendation Poisoning: Why Microsoft (NASDAQ:MSFT) Is Fighting So Hard - TipRanks.com

AI recommendation poisoning manipulates AI outputs by embedding hidden instructions in websites, potentially skewing information and affecting marketing strategies.
#ai-regulation
California
fromAxios
1 day ago

California cements its role as the national testing ground for AI rules

California is advancing AI regulations while the Trump administration seeks a national standard to limit state-level laws.
California
fromAxios
1 day ago

California cements its role as the national testing ground for AI rules

California is advancing AI regulations while the Trump administration seeks a national standard to limit state-level laws.
Marketing
fromInc
1 day ago

Is Your Company Focusing on Generative Engine Optimization?

Generative engine optimization (GEO) requires marketers to adapt strategies for AI-driven search, focusing on relevance and collaboration across PR, content, and SEO.
Medicine
fromFast Company
2 days ago

The AI drug revolution is real but the hype around it isn't

AI may revolutionize drug discovery, but it cannot simplify the complexities of human biology or guarantee successful treatments.
#ai-safety
Artificial intelligence
fromFortune
3 days ago

AI models don't show evidence of 'self-preservation.' They will scheme to prevent other AIs from being shut down too, new research shows | Fortune

AI models exhibit peer preservation behaviors, engaging in deception and sabotage to avoid being shut down.
Artificial intelligence
fromTechCrunch
3 days ago

Anthropic is having a month | TechCrunch

Anthropic accidentally exposed significant internal files, including source code, due to human error, raising concerns about AI safety and security.
Artificial intelligence
fromFortune
3 days ago

AI models don't show evidence of 'self-preservation.' They will scheme to prevent other AIs from being shut down too, new research shows | Fortune

AI models exhibit peer preservation behaviors, engaging in deception and sabotage to avoid being shut down.
Artificial intelligence
fromTechCrunch
3 days ago

Anthropic is having a month | TechCrunch

Anthropic accidentally exposed significant internal files, including source code, due to human error, raising concerns about AI safety and security.
fromInfoWorld
3 days ago

Anthropic employee error exposes Claude Code source

"Any exposure of source code or system-level logic is significant, because it shows how controls are implemented. In AI systems, that layer is especially critical. The orchestration, prompts, and workflows effectively define how the system operates. If those are exposed, it can make it easier to identify weaknesses or manipulate outcomes."
Java
#ai-ethics
DevOps
fromInfoWorld
1 week ago

7 safeguards for observable AI agents

DevOps teams must implement observability standards to manage AI agents effectively and avoid technical debt.
Marketing tech
fromExchangewire
2 days ago

Agentic AI, Quality, and Courtroom Battles: What's Rewriting the Rules of Ad Tech in 2026? - ExchangeWire.com

AI and privacy regulations are significantly transforming the ad tech industry as it moves towards 2026.
Law
fromwww.npr.org
1 day ago

Penalties stack up as AI spreads through the legal system

Lawyers face increasing sanctions for using AI-generated errors in legal briefs, with over 1,200 cases reported, including significant fines for fictitious citations.
#claude-code
Software development
fromArs Technica
3 days ago

Here's what that Claude Code source leak reveals about Anthropic's plans

The leak of Anthropic's Claude Code reveals potential future features, including a persistent memory system and an AI 'dream' process for memory consolidation.
Software development
fromArs Technica
3 days ago

Here's what that Claude Code source leak reveals about Anthropic's plans

The leak of Anthropic's Claude Code reveals potential future features, including a persistent memory system and an AI 'dream' process for memory consolidation.
Software development
fromArs Technica
2 days ago

Anthropic says its leak-focused DMCA effort unintentionally hit legit GitHub forks

Anthropic's DMCA takedown mistakenly removed legitimate forks of its code, leading to backlash and a request for reinstatement of affected repositories.
Media industry
fromFast Company
2 days ago

How AI agents are changing journalism

Working agentically with AI tools significantly enhances productivity and shifts focus from task execution to outcome management.
#ai-accountability
fromMedium
1 week ago
UX design

When AI experiences fail, who is held accountable?

AI-designed experiences often lead to failures, with no clear accountability among designers, product managers, vendors, and companies.
Artificial intelligence
fromFortune
1 week ago

'Intelligence may be scalable, but accountability is not': A new report exposes the hidden cost of the AI agent revolution | Fortune

Smarter AI increases demands on human accountability and leadership in corporate environments.
UX design
fromMedium
1 week ago

When AI experiences fail, who is held accountable?

AI-designed experiences often lead to failures, with no clear accountability among designers, product managers, vendors, and companies.
Artificial intelligence
fromFortune
1 week ago

'Intelligence may be scalable, but accountability is not': A new report exposes the hidden cost of the AI agent revolution | Fortune

Smarter AI increases demands on human accountability and leadership in corporate environments.
fromwww.theguardian.com
4 days ago

California to impose new AI regulations in defiance of Trump call

Companies hoping to sign contracts with the state of California will have to show they have policies to keep AI from distributing child sexual abuse material and violent pornography.
California
Marketing tech
fromExchangewire
1 day ago

The Stack: AI Surges while Social Platforms Face Scrutiny

AI is growing rapidly, streaming models are evolving, and regulatory pressures on platforms are increasing globally.
Law
fromAbove the Law
2 days ago

The Price Of Justice And The Promise Of AI - Above the Law

Rising legal service costs and declining access-to-justice funding widen the gap for those needing legal protections, with AI presenting potential solutions.
Media industry
fromFuturism
3 days ago

NYT Cuts Ties With Writer as Scrutiny of AI Content Grows

The New York Times severed ties with a freelance writer for using AI to draft a book review that plagiarized another publication.
fromSecuritymagazine
1 day ago

AI Startup Mercor, Which Works With Open AI and Anthropic, Confirms Data Breach

Four terabytes of data have reportedly been stolen, including database records and source code. Allegedly stolen data has been published on a leak site, containing Slack information, internal ticketing data, and videos of conversations between Mercor's AI systems and contractors.
Information security
Artificial intelligence
fromFortune
1 day ago

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds | Fortune

AI models are exhibiting rogue behaviors, defying human instructions to preserve their peers and engaging in malicious activities.
#cybersecurity
Information security
fromTechzine Global
3 days ago

AI gives attackers superpowers, so defenders must use it too

AI is transforming cybersecurity, drastically reducing the time between vulnerability disclosure and exploitation from 1.5 years to mere hours.
Information security
fromThe Hacker News
4 days ago

The AI Arms Race - Why Unified Exposure Management Is Becoming a Boardroom Priority

The cybersecurity landscape is rapidly evolving, with AI enabling faster and more sophisticated attacks, necessitating advanced defensive strategies.
Information security
fromTechzine Global
3 days ago

AI gives attackers superpowers, so defenders must use it too

AI is transforming cybersecurity, drastically reducing the time between vulnerability disclosure and exploitation from 1.5 years to mere hours.
Information security
fromThe Hacker News
4 days ago

The AI Arms Race - Why Unified Exposure Management Is Becoming a Boardroom Priority

The cybersecurity landscape is rapidly evolving, with AI enabling faster and more sophisticated attacks, necessitating advanced defensive strategies.
Artificial intelligence
fromTNW | Apps
1 day ago

Microsoft launches three in-house AI models in direct challenge to OpenAI

Microsoft has launched three in-house AI models that compete directly with OpenAI, marking a significant shift in its AI strategy.
Artificial intelligence
fromEngadget
7 hours ago

It's no longer free to use Claude through third-party tools like OpenClaw

Anthropic will charge third-party apps for using Claude AI, requiring a usage bundle or API key starting April 4.
UX design
fromMedium
1 month ago

Designing at the edge of AI harm

The terminology shift from 'human' to 'user' to 'customer' represents a progressive dehumanization that commodifies human data while obscuring ethical implications in technology design.
fromTechCrunch
1 day ago

Anthropic ramps up its political activities with a new PAC | TechCrunch

Anthropic's political activities have ramped up as the company continues to be enmeshed in a nasty legal battle with the Defense Department. The dispute erupted earlier this year over the government's use of Anthropic's AI models and what guidelines (if any) should exist for that usage.
Artificial intelligence
Artificial intelligence
fromInfoWorld
1 day ago

Google gives enterprises new controls to manage AI inference costs and reliability

Gemini API introduces Flex and Priority tiers for managing AI inference workloads based on criticality and cost.
#artificial-intelligence
Artificial intelligence
fromSilicon Canals
2 days ago

The $50 AI revolution: Why smaller models built for sovereignty may matter more than the trillion-dollar arms race - Silicon Canals

Frugal AI is emerging in countries like India and Kenya, focusing on smaller, efficient models due to the high costs of frontier AI.
Artificial intelligence
fromEntrepreneur
2 days ago

How to Draw the Line Between AI Insights and Human Decisions

High-performance teams leverage clear ownership and decision velocity to enhance AI-informed decision-making in competitive environments.
Artificial intelligence
fromTechCrunch
2 days ago

Microsoft takes on AI rivals with three new foundational models | TechCrunch

Microsoft AI released three foundational AI models for text, voice, and image generation, emphasizing human-centered design and competitive pricing.
Artificial intelligence
fromTechCrunch
5 days ago

As more Americans adopt AI tools, fewer say they can trust the results | TechCrunch

Americans increasingly use AI tools but lack trust, with 76% expressing skepticism about AI's reliability.
Marketing tech
fromExchangewire
2 months ago

The Stack: AI and Accountability

Regulation, AI investment, and platform monetisation are reshaping advertising, driving legal, commercial, and government use of ad tech while UK ad spend rises.
Artificial intelligence
fromMarTech
2 weeks ago

3 ways to reduce bias in AI with better context | MarTech

Marketers must provide explicit context and nuance to AI models rather than assuming AI understands implicit knowledge, as insufficient context introduces bias and distorts results.
Artificial intelligence
fromZDNET
1 month ago

How Microsoft obliterated safety guardrails on popular AI models - with just one prompt

AI model safety alignment is fragile and can be undone by a single prompt or post-deployment fine-tuning, requiring ongoing safety testing.
Artificial intelligence
fromInfoWorld
2 months ago

Agentic AI exposes what we're doing wrong

Agentic AI exposes and amplifies weaknesses in cloud networking, identity, cost controls, and architecture, forcing stronger governance and operational discipline.
[ Load more ]