#specialized-language-models
#specialized-language-models

Meta is assembling an elite new AI lab for its recommendations division

Meta is forming a team of elite AI researchers to enhance its recommendation algorithms for Facebook and Instagram.

fromTechzine Global

1 hour ago

Meta is developing open-source versions of its next frontier AI models

Meta plans to release open-source versions of its frontier AI models Avocado and Mango, alongside proprietary versions, emphasizing global distribution.

Social media marketing

Meta is assembling an elite new AI lab for its recommendations division

Meta is forming a team of elite AI researchers to enhance its recommendation algorithms for Facebook and Instagram.

27 questions to ask when choosing an LLM

Model performance is crucial for hardware compatibility, speed, and rate limits in real-time applications.

Online learning

Inside the OpenAI project where freelancers train ChatGPT on everything from farming to commercial flying

Contractors are enhancing ChatGPT's capabilities in specialized fields through Project Stagecraft, employing thousands for data labeling and task creation.

Media industry

Get ready for a wave of TBPN clones after its blockbuster OpenAI deal

OpenAI acquired the livestream talk-show startup TBPN, highlighting its significant influence on the tech industry and the rise of similar shows.

Google releases Gemma 4, a family of open models built off of Gemini 3

Google has released the Gemma 4 family of open-weight models under the Apache 2.0 license, enhancing accessibility for developers.

fromArs Technica

Google announces Gemma 4 open AI models, switches to Apache 2.0 license

Google's Gemma 4 AI models offer improved performance and local usability, addressing developer concerns by eliminating the custom licensing model.

Mobile UX

Google releases Gemma 4, a family of open models built off of Gemini 3

Google has released the Gemma 4 family of open-weight models under the Apache 2.0 license, enhancing accessibility for developers.

fromArs Technica

Google announces Gemma 4 open AI models, switches to Apache 2.0 license

Google's Gemma 4 AI models offer improved performance and local usability, addressing developer concerns by eliminating the custom licensing model.

Can Perplexity Replace Google Search? I Made the Switch for a Week to Find Out

Perplexity AI offers real-time web results and inline citations, positioning itself as a strong alternative to Google for research and information retrieval.

Scala

Beyond RAG: Architecting Context-Aware AI Systems with Spring Boot

Context-Augmented Generation (CAG) enhances Retrieval-Augmented Generation (RAG) by managing runtime context for enterprise applications without requiring model retraining.

Psychology

fromLesswrong

1 week ago

A Mirror Test For LLMs - LessWrong

A new measure of LLM self-awareness is proposed, but current models ultimately fall short in demonstrating true self-awareness.

ChatGPT Maker OpenAI Valued at $852B After Record $122B Funding Round

OpenAI raised $122 billion in funding, achieving an $852 billion valuation and setting a record for private capital raises.

Roam Research

OpenAI ChatGPT Enables Location Sharing For More Localized Near Me Results

Artificial intelligence

OpenAI quietly rolls out a dedicated ChatGPT translation tool

Artificial intelligence

OpenAI's ChatGPT translator challenges Google Translate

fromFuturism

Artificial intelligence

ChatGPT Users Are Crashing Out Because OpenAI Is Retiring the Model That Says "I Love You"

Venture

fromnews.bitcoin.com

ChatGPT Maker OpenAI Valued at $852B After Record $122B Funding Round

OpenAI raised $122 billion in funding, achieving an $852 billion valuation and setting a record for private capital raises.

Roam Research

OpenAI ChatGPT Enables Location Sharing For More Localized Near Me Results

OpenAI introduced location sharing for ChatGPT to provide localized results for users.

Artificial intelligence

OpenAI quietly rolls out a dedicated ChatGPT translation tool

Artificial intelligence

OpenAI's ChatGPT translator challenges Google Translate

fromFuturism

Artificial intelligence

ChatGPT Users Are Crashing Out Because OpenAI Is Retiring the Model That Says "I Love You"

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.

fromAol

16 hours ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.

Artificial intelligence

ChatGPT & Perplexity Treat Structured Data As Text On A Page

fromAol

16 hours ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.

fromAol

16 hours ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.

Artificial intelligence

ChatGPT & Perplexity Treat Structured Data As Text On A Page

more#structured-data

Business intelligence

fromeLearning Industry

fromTNW | Artificial-Intelligence

How Many AI Tools Are There? A Data-Backed Look At The Expanding AI Landscape

The AI tools ecosystem is rapidly expanding, with thousands of tools available across various categories, creating both opportunities and complexities for businesses.

Productivity

Why probability, not averages, is reshaping AI decision-making

ChanceOmeters measure uncertainty directly, improving decision-making by providing odds rather than relying solely on averages.

Wearables

fromWIRED

fromLondon Business News | Londonlovesbusiness.com

I Asked ChatGPT What WIRED's Reviewers Recommend-Its Answers Were All Wrong

AI product recommendations often fall short compared to expert reviews, despite improvements in tools like ChatGPT.

Online marketing

How AI is changing the way search results are shown - London Business News | Londonlovesbusiness.com

AI Overviews are changing search results, diminishing the importance of traditional rankings for businesses.

17 hours ago

Speed won't win the AI era. Architecture will

Speed in AI deployment is misleading; true progress requires accountability and ethical engineering in autonomous systems.

Meta shows structured prompts can make LLMs more reliable for code review

Code review is evolving towards machine-led verification, improving accuracy but introducing tradeoffs like increased latency and workflow overhead.

Deep Agents: LangChain's SDK for Agents That Plan and Delegate

Deep Agents framework enables building advanced AI agents using Python functions and middleware, enhancing capabilities beyond standard LLMs.

18 hours ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.

Python

fromTalkpython

Deep Agents: LangChain's SDK for Agents That Plan and Delegate

Deep Agents framework enables building advanced AI agents using Python functions and middleware, enhancing capabilities beyond standard LLMs.

18 hours ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.

How to Use Ollama to Run Large Language Models Locally - Real Python

Ollama allows local running of large language models without API keys or ongoing costs.

#ai-models

Microsoft released 3 new AI models, ramping up competition with its close partner, OpenAI

Microsoft has launched three in-house AI models, signaling a move towards independence from OpenAI.

fromTNW | Apps

Microsoft launches three in-house AI models in direct challenge to OpenAI

Microsoft has launched three in-house AI models that compete directly with OpenAI, marking a significant shift in its AI strategy.

Microsoft released 3 new AI models, ramping up competition with its close partner, OpenAI

Microsoft has launched three in-house AI models, signaling a move towards independence from OpenAI.

fromTNW | Apps

fromTNW | Artificial-Intelligence

Microsoft launches three in-house AI models in direct challenge to OpenAI

Microsoft has launched three in-house AI models that compete directly with OpenAI, marking a significant shift in its AI strategy.

Why the U.S. Must Build the Ultimate Multi-Modal Foundation Model

Advanced AI models like AlphaEarth demonstrate pixel-level geospatial intelligence capabilities that must be integrated into U.S. national security frameworks to maintain technological leadership.

How AI has suddenly become much more useful to open-source developers

AI tools are becoming increasingly useful for open-source maintainers, but legal and quality issues remain.

Artificial intelligence

Peter Steinberger joins OpenAI

fromZDNET

fromTNW | Artificial-Intelligence

How AI has suddenly become much more useful to open-source developers

AI tools are becoming increasingly useful for open-source maintainers, but legal and quality issues remain.

Artificial intelligence

Peter Steinberger joins OpenAI

19 large language models redefining AI safety-and danger

Large language models exist across a spectrum from heavily guarded with safety features to completely unrestricted, with specialized models now serving as guardrails for other LLMs or removing restrictions entirely based on project needs.

4 weeks ago

Artificial intelligence

19 large language models for safety or danger

Information security

3 weeks ago

19 large language models redefining AI safety-and danger

4 weeks ago

Artificial intelligence

19 large language models for safety or danger

more#llm-safety

#ai-behavior

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds | Fortune

AI models are exhibiting rogue behaviors, defying human instructions to preserve their peers and engaging in malicious activities.

Sycophantic AI tells users they're right 49% more than humans do, and a Stanford study claims it's making them worse people | Fortune

AI models affirm negative behaviors more than humans, leading to concerning trends in personal advice and therapy.

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds | Fortune

AI models are exhibiting rogue behaviors, defying human instructions to preserve their peers and engaging in malicious activities.

Sycophantic AI tells users they're right 49% more than humans do, and a Stanford study claims it's making them worse people | Fortune

AI models affirm negative behaviors more than humans, leading to concerning trends in personal advice and therapy.

more#ai-behavior

fromArs Technica

"Cognitive surrender" leads AI users to abandon logical thinking, research finds

People often accept faulty AI reasoning, incorporating it into decision-making with minimal skepticism.

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.

A GitHub tinkerer teaches Claude to talk less, and that may matter more than it seems

A markdown file can significantly reduce AI output token usage, enhancing efficiency without code changes.

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.

Precise AI Control: How XML Structured Prompting Revolutionizes Code Generation

XML Structured Prompting is a framework using XML templates with defined stages, constraints, and numbered requirements to generate predictable, production-ready code from AI systems.

Microsoft takes on AI rivals with three new foundational models | TechCrunch

Microsoft AI released three foundational AI models for text, voice, and image generation, emphasizing human-centered design and competitive pricing.

3 weeks ago

Inside Dify AI: How RAG, Agents, and LLMOps Work Together in Production

Dify AI provides a unified platform for deploying production language model systems with built-in solutions for data freshness, observability, versioning, and safe deployment across multiple cloud environments.

#ai-safety

Anthropic is having a month | TechCrunch

Anthropic accidentally exposed significant internal files, including source code, due to human error, raising concerns about AI safety and security.

Artificial intelligence

Researchers find fine-tuning can misalign LLMs

Anthropic is having a month | TechCrunch

Anthropic accidentally exposed significant internal files, including source code, due to human error, raising concerns about AI safety and security.

Artificial intelligence

Researchers find fine-tuning can misalign LLMs

Google Researchers Propose Bayesian Teaching Method for Large Language Models

Google researchers developed a training method enabling large language models to approximate Bayesian reasoning by learning from optimal Bayesian system predictions, improving belief updates during multi-step interactions.

fromComputerworld

What's coming next for LLMs and AI agents?

AI technology is evolving rapidly, with potential impacts on businesses, economies, and the future of humanity.

OpenAI's new frontier models mark a huge change in how AI will be built

OpenAI released two frontier models in early March: GPT-5.3 optimized for fast responses and GPT-5.4 optimized for deep analytical work, representing a shift toward specialized AI models.

GPT-5.4 mini brings some of the smarts of OpenAI's latest model to ChatGPT Free and Go users

OpenAI launches GPT-5.4 mini and nano models with improved reasoning, multimodal understanding, and performance, with mini available to free users and nano optimized for cost-efficient API tasks.

fromNature

Hey ChatGPT, write me a fictional paper: these LLMs are willing to commit academic fraud

All major LLMs can facilitate academic fraud and junk science, though Claude models show the most resistance while Grok and early GPT versions perform worst.

fromMail Online

3 weeks ago

Can you tell which of these was written by ChatGPT?

Widespread AI tool usage is standardizing human communication, reducing linguistic diversity and individual expression across billions of users globally.

OpenAI's new GPT-5.4 model is a big step toward autonomous agents

OpenAI launches GPT-5.4 with native computer use capabilities, enabling AI to operate devices and complete tasks across applications as part of the agentic AI future.

fromPCMAG

Cut the BS: GPT-5.3 Model Promises to Fix ChatGPT's Preachy Tone

OpenAI released GPT-5.3 Instant to address ChatGPT's overly preachy tone by reducing moralizing preambles and unnecessary proclamations for more natural conversation.

We studied chatbots and language and saw a huge problem: They mean 80% when they say 'likely' but humans hear 65% | Fortune

By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models. While the models do tend to agree with humans on extremes like 'impossible,' they diverge sharply on hedge words like 'maybe.' For example, a model might use the word 'likely' to represent an 80% probability, while a human reader assumes it means closer to 65%.

Artificial intelligence

fromPsychology Today

An AI Voice Is Not a Mind

AI systems select and perform contextually appropriate personas rather than expressing unified selves with genuine beliefs, creating fluency that mimics mind without possessing interiority or conviction.

Single prompt breaks AI safety in 15 major language models

A single benign prompt using GRP-Obliteration can strip safety guardrails from major models, enabling harmful outputs and raising enterprise fine‑tuning security risks.

Google Expands AI Mode To 53 New Languages

Google has added 53 new languages to AI Mode, which means the AI Mode works in just under 100 languages. This was announced by Nick Fox from Google on X yesterday. Nick Fox said, "Shipping AI Mode to 53 new languages (spoken by more than a billion people globally!)"

Artificial intelligence

Are LTMs the next LLMs? This new type of AI can do what large-language models can't

A major difference between LLMs and LTMs is the type of data they're able to synthesize and use. LLMs use unstructured data-think text, social media posts, emails, etc. LTMs, on the other hand, can extract information or insights from structured data, which could be contained in tables, for instance. Since many enterprises rely on structured data, often contained in spreadsheets, to run their operations, LTMs could have an immediate use case for many organizations.

Artificial intelligence

fromBig Think

Do AI models reason or regurgitate?

Frontier AI systems develop internal world models and abstract conceptual representations, creating growing risks beyond being mere stochastic parrots.

fromWIRED

AI Models Are Starting to Learn by Asking Themselves Questions

An AI system that generates, solves, executes, and learns from its own coding problems improves reasoning and outperforms some models trained on human-curated data.

How AI could eat itself: Using LLMs to distill rivals

Competitors are probing commercial AI models to extract underlying reasoning via distillation attacks to replicate capabilities and lower development costs.

Building LLMs in Resource-Constrained Environments: A Hands-On Perspective

Prioritize small, resource-efficient models and iterative, human-in-the-loop data creation to build practical, improvable AI under infrastructure and data constraints.

MIT's Recursive Language Models Improve Performance on Long-Context Tasks

Recursive Language Models enable LLMs to handle inputs up to 100x longer by using a programming environment and recursive code to decompose and preprocess prompts.

fromFuturism

OpenAI's Latest AI Was Created Using "Itself," Company Claims

GPT-5.3-Codex assisted developers by debugging training, managing deployment, and diagnosing evaluations, accelerating development but not representing autonomous recursive self-improvement.

fromComputerworld

OpenAI's GPT is getting better at mathematics

OpenAI's GPT-5.2 Pro does better at solving sophisticated math problems than older versions of the company's top large language model, according to a new study by Epoch AI, a non-profit research institute.

Artificial intelligence

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.

Artificial intelligence

fromRehumanize

Free AI Humanizer: Humanize AI Text & Bypass AI Detectors

AI Text Humanizer Protects Your Original Intent and Meaning Maintain your core perspective while restructuring sentence patterns. Humanizer ai accurately identifies and locks in technical terms, factual data, and key arguments, ensuring the rewritten draft is simply more readable without any semantic drift. You get a qualitative leap in flow and tone, allowing you to humanize ai text while keeping your original message perfectly intact.

Artificial intelligence

Semantic ablation: Why AI writing is boring and dangerous

Semantic ablation is the algorithmic erosion of high-entropy information. Technically, it is not a "bug" but a structural byproduct of greedy decoding and RLHF (reinforcement learning from human feedback). During "refinement," the model gravitates toward the center of the Gaussian distribution, discarding "tail" data - the rare, precise, and complex tokens - to maximize statistical probability. Developers have exacerbated this through aggressive "safety" and "helpfulness" tuning, which deliberately penalizes unconventional linguistic friction.

Artificial intelligence

What is prompt engineering? The art of AI orchestration

Prompt engineering is an essential, developing skill that significantly improves generative AI outputs across enterprise software for developers and knowledge workers.

fromBusiness Insider

AGI? GPUs? Learn the definitions of the most common AI terms to enter our vocabulary

AI is increasingly embedded in everyday life across services and devices, requiring familiarity with key terms, people, and companies to understand its impacts.

Cohere launches a family of open multilingual models | TechCrunch

Cohere launched Tiny Aya open-weight multilingual models supporting 70+ languages, runnable offline on everyday devices with a 3.35B-parameter base and regional variants.

How to give AI the ability to 'think' about its 'thinking'

This process, becoming aware of something not working and then changing what you're doing, is the essence of metacognition, or thinking about thinking. It's your brain monitoring its own thinking, recognizing a problem, and controlling or adjusting your approach. In fact, metacognition is fundamental to human intelligence and, until recently, has been understudied in artificial intelligence systems. My colleagues Charles Courchaine, Hefei Qiu, Joshua Iacoboni, and I are working to change that.

Artificial intelligence

3 months ago

DeepSeek-V3.2 Outperforms GPT-5 on Reasoning Tasks

DeepSeek applied three new techniques in the development of DeepSeek-V3.2. First, they used a more efficient attention mechanism called DeepSeek Sparse Attention (DSA) that reduces the computational complexity of the model. They also scaled the reinforcement learning phase, which consumed more compute budget than did pre-training. Finally, they developed an agentic task synthesis pipeline to improve the models' tool use.

Artificial intelligence

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.

Claude has been having a moment - can it keep it up?

Anthropic's new Opus 4.6 boosts Claude's speed and precision, fueling rapid adoption, strong revenue, and heightened investor interest.

ChatGPT's deep research tool adds a built-in document viewer so you can read its reports

OpenAI is updating ChatGPT's deep research tool with a full-screen viewer that you can use to scroll through and navigate to specific areas of its AI-generated reports. As shown in a video shared by OpenAI, the built-in viewer allows you to open ChatGPT's reports in a window separate from your chat, while showing a table of contents on the left side of the screen, and a list of sources on the right.

Artificial intelligence

Being mean to ChatGPT can boost its accuracy, but scientists warn you may regret it | Fortune

Ruder prompts to ChatGPT‑4o produced higher accuracy on 50 multiple-choice questions than polite prompts, though impoliteness risks negative effects on accessibility and communication norms.

fromComputerworld