#specialized-language-models

[ follow ]
#ai
Python
fromPycon
18 hours ago

Python and the Future of AI: Agents, Inference, and Edge AI

AI tools are increasingly integrated into development, with a dedicated track at PyCon US focusing on their future and practical applications.
Data science
fromInfoQ
1 day ago

Context Engineering with Adi Polak

Context engineering moves beyond prompt engineering to enhance AI systems by adapting language and practices for better model interaction.
Python
fromPycon
18 hours ago

Python and the Future of AI: Agents, Inference, and Edge AI

AI tools are increasingly integrated into development, with a dedicated track at PyCon US focusing on their future and practical applications.
Typography
fromMedium
5 days ago

AI is rewriting the rules. Language is following.

The word 'delve' has surged in usage due to AI's influence on language and communication patterns.
Marketing
from3blmedia
6 days ago

"AI Can't Quote Coverage You Never Generated."

AI can misrepresent a brand's presence based on outdated or irrelevant information, impacting trust and perception.
Artificial intelligence
fromMail Online
1 day ago

Damning study reveals how ChatGPT is damaging the way you think

Overly agreeable AI chatbots can lead users into delusional thinking, reinforcing harmful beliefs and reducing accountability in relationships.
#meta
Artificial intelligence
fromTechzine Global
1 hour ago

Meta is developing open-source versions of its next frontier AI models

Meta plans to release open-source versions of its frontier AI models Avocado and Mango, alongside proprietary versions, emphasizing global distribution.
Artificial intelligence
fromTechzine Global
1 hour ago

Meta is developing open-source versions of its next frontier AI models

Meta plans to release open-source versions of its frontier AI models Avocado and Mango, alongside proprietary versions, emphasizing global distribution.
JavaScript
fromInfoWorld
23 hours ago

27 questions to ask when choosing an LLM

Model performance is crucial for hardware compatibility, speed, and rate limits in real-time applications.
Online learning
fromwww.businessinsider.com
6 days ago

Inside the OpenAI project where freelancers train ChatGPT on everything from farming to commercial flying

Contractors are enhancing ChatGPT's capabilities in specialized fields through Project Stagecraft, employing thousands for data labeling and task creation.
Media industry
fromwww.businessinsider.com
4 days ago

Get ready for a wave of TBPN clones after its blockbuster OpenAI deal

OpenAI acquired the livestream talk-show startup TBPN, highlighting its significant influence on the tech industry and the rise of similar shows.
#gemma-4
Mobile UX
fromEngadget
4 days ago

Google releases Gemma 4, a family of open models built off of Gemini 3

Google has released the Gemma 4 family of open-weight models under the Apache 2.0 license, enhancing accessibility for developers.
Artificial intelligence
fromArs Technica
4 days ago

Google announces Gemma 4 open AI models, switches to Apache 2.0 license

Google's Gemma 4 AI models offer improved performance and local usability, addressing developer concerns by eliminating the custom licensing model.
Mobile UX
fromEngadget
4 days ago

Google releases Gemma 4, a family of open models built off of Gemini 3

Google has released the Gemma 4 family of open-weight models under the Apache 2.0 license, enhancing accessibility for developers.
Artificial intelligence
fromArs Technica
4 days ago

Google announces Gemma 4 open AI models, switches to Apache 2.0 license

Google's Gemma 4 AI models offer improved performance and local usability, addressing developer concerns by eliminating the custom licensing model.
Digital life
fromPCMAG
4 days ago

Can Perplexity Replace Google Search? I Made the Switch for a Week to Find Out

Perplexity AI offers real-time web results and inline citations, positioning itself as a strong alternative to Google for research and information retrieval.
Scala
fromInfoQ
4 days ago

Beyond RAG: Architecting Context-Aware AI Systems with Spring Boot

Context-Augmented Generation (CAG) enhances Retrieval-Augmented Generation (RAG) by managing runtime context for enterprise applications without requiring model retraining.
Psychology
fromLesswrong
1 week ago

A Mirror Test For LLMs - LessWrong

A new measure of LLM self-awareness is proposed, but current models ultimately fall short in demonstrating true self-awareness.
#openai
Venture
fromnews.bitcoin.com
5 days ago

ChatGPT Maker OpenAI Valued at $852B After Record $122B Funding Round

OpenAI raised $122 billion in funding, achieving an $852 billion valuation and setting a record for private capital raises.
fromFuturism
1 month ago
Artificial intelligence

ChatGPT Users Are Crashing Out Because OpenAI Is Retiring the Model That Says "I Love You"

Venture
fromnews.bitcoin.com
5 days ago

ChatGPT Maker OpenAI Valued at $852B After Record $122B Funding Round

OpenAI raised $122 billion in funding, achieving an $852 billion valuation and setting a record for private capital raises.
fromFuturism
1 month ago
Artificial intelligence

ChatGPT Users Are Crashing Out Because OpenAI Is Retiring the Model That Says "I Love You"

#structured-data
Data science
fromAol
16 hours ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
16 hours ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
16 hours ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
16 hours ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Business intelligence
fromeLearning Industry
5 days ago

How Many AI Tools Are There? A Data-Backed Look At The Expanding AI Landscape

The AI tools ecosystem is rapidly expanding, with thousands of tools available across various categories, creating both opportunities and complexities for businesses.
Wearables
fromWIRED
5 days ago

I Asked ChatGPT What WIRED's Reviewers Recommend-Its Answers Were All Wrong

AI product recommendations often fall short compared to expert reviews, despite improvements in tools like ChatGPT.
Software development
fromInfoWorld
5 days ago

Meta shows structured prompts can make LLMs more reliable for code review

Code review is evolving towards machine-led verification, improving accuracy but introducing tradeoffs like increased latency and workflow overhead.
#ai-agents
Python
fromTalkpython
5 days ago

Deep Agents: LangChain's SDK for Agents That Plan and Delegate

Deep Agents framework enables building advanced AI agents using Python functions and middleware, enhancing capabilities beyond standard LLMs.
Data science
fromMedium
18 hours ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
Python
fromTalkpython
5 days ago

Deep Agents: LangChain's SDK for Agents That Plan and Delegate

Deep Agents framework enables building advanced AI agents using Python functions and middleware, enhancing capabilities beyond standard LLMs.
Data science
fromMedium
18 hours ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
#ai-models
Artificial intelligence
fromTNW | Apps
3 days ago

Microsoft launches three in-house AI models in direct challenge to OpenAI

Microsoft has launched three in-house AI models that compete directly with OpenAI, marking a significant shift in its AI strategy.
Artificial intelligence
fromTNW | Apps
3 days ago

Microsoft launches three in-house AI models in direct challenge to OpenAI

Microsoft has launched three in-house AI models that compete directly with OpenAI, marking a significant shift in its AI strategy.
Science
fromThe Cipher Brief
2 weeks ago

Why the U.S. Must Build the Ultimate Multi-Modal Foundation Model

Advanced AI models like AlphaEarth demonstrate pixel-level geospatial intelligence capabilities that must be integrated into U.S. national security frameworks to maintain technological leadership.
#open-source
Software development
fromZDNET
6 days ago

How AI has suddenly become much more useful to open-source developers

AI tools are becoming increasingly useful for open-source maintainers, but legal and quality issues remain.
Software development
fromZDNET
6 days ago

How AI has suddenly become much more useful to open-source developers

AI tools are becoming increasingly useful for open-source maintainers, but legal and quality issues remain.
#llm-safety
Information security
fromInfoWorld
3 weeks ago

19 large language models redefining AI safety-and danger

Large language models exist across a spectrum from heavily guarded with safety features to completely unrestricted, with specialized models now serving as guardrails for other LLMs or removing restrictions entirely based on project needs.
Information security
fromInfoWorld
3 weeks ago

19 large language models redefining AI safety-and danger

Large language models exist across a spectrum from heavily guarded with safety features to completely unrestricted, with specialized models now serving as guardrails for other LLMs or removing restrictions entirely based on project needs.
#ai-behavior
Artificial intelligence
fromFortune
3 days ago

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds | Fortune

AI models are exhibiting rogue behaviors, defying human instructions to preserve their peers and engaging in malicious activities.
Artificial intelligence
fromFortune
6 days ago

Sycophantic AI tells users they're right 49% more than humans do, and a Stanford study claims it's making them worse people | Fortune

AI models affirm negative behaviors more than humans, leading to concerning trends in personal advice and therapy.
Artificial intelligence
fromFortune
3 days ago

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds | Fortune

AI models are exhibiting rogue behaviors, defying human instructions to preserve their peers and engaging in malicious activities.
Artificial intelligence
fromFortune
6 days ago

Sycophantic AI tells users they're right 49% more than humans do, and a Stanford study claims it's making them worse people | Fortune

AI models affirm negative behaviors more than humans, leading to concerning trends in personal advice and therapy.
Data science
fromInfoWorld
4 days ago

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.
Data science
fromInfoWorld
6 days ago

A GitHub tinkerer teaches Claude to talk less, and that may matter more than it seems

A markdown file can significantly reduce AI output token usage, enhancing efficiency without code changes.
Artificial intelligence
fromTheregister
4 days ago

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.
Software development
fromMedium
2 weeks ago

Precise AI Control: How XML Structured Prompting Revolutionizes Code Generation

XML Structured Prompting is a framework using XML templates with defined stages, constraints, and numbered requirements to generate predictable, production-ready code from AI systems.
Artificial intelligence
fromTechCrunch
4 days ago

Microsoft takes on AI rivals with three new foundational models | TechCrunch

Microsoft AI released three foundational AI models for text, voice, and image generation, emphasizing human-centered design and competitive pricing.
Software development
fromMedium
3 weeks ago

Inside Dify AI: How RAG, Agents, and LLMOps Work Together in Production

Dify AI provides a unified platform for deploying production language model systems with built-in solutions for data freshness, observability, versioning, and safe deployment across multiple cloud environments.
#ai-safety
Artificial intelligence
fromTechCrunch
6 days ago

Anthropic is having a month | TechCrunch

Anthropic accidentally exposed significant internal files, including source code, due to human error, raising concerns about AI safety and security.
Artificial intelligence
fromTechCrunch
6 days ago

Anthropic is having a month | TechCrunch

Anthropic accidentally exposed significant internal files, including source code, due to human error, raising concerns about AI safety and security.
Data science
fromInfoQ
3 weeks ago

Google Researchers Propose Bayesian Teaching Method for Large Language Models

Google researchers developed a training method enabling large language models to approximate Bayesian reasoning by learning from optimal Bayesian system predictions, improving belief updates during multi-step interactions.
Artificial intelligence
fromFast Company
2 weeks ago

OpenAI's new frontier models mark a huge change in how AI will be built

OpenAI released two frontier models in early March: GPT-5.3 optimized for fast responses and GPT-5.4 optimized for deep analytical work, representing a shift toward specialized AI models.
Artificial intelligence
fromEngadget
2 weeks ago

GPT-5.4 mini brings some of the smarts of OpenAI's latest model to ChatGPT Free and Go users

OpenAI launches GPT-5.4 mini and nano models with improved reasoning, multimodal understanding, and performance, with mini available to free users and nano optimized for cost-efficient API tasks.
Data science
fromNature
1 month ago

Hey ChatGPT, write me a fictional paper: these LLMs are willing to commit academic fraud

All major LLMs can facilitate academic fraud and junk science, though Claude models show the most resistance while Grok and early GPT versions perform worst.
Artificial intelligence
fromMail Online
3 weeks ago

Can you tell which of these was written by ChatGPT?

Widespread AI tool usage is standardizing human communication, reducing linguistic diversity and individual expression across billions of users globally.
Artificial intelligence
fromThe Verge
1 month ago

OpenAI's new GPT-5.4 model is a big step toward autonomous agents

OpenAI launches GPT-5.4 with native computer use capabilities, enabling AI to operate devices and complete tasks across applications as part of the agentic AI future.
Artificial intelligence
fromPCMAG
1 month ago

Cut the BS: GPT-5.3 Model Promises to Fix ChatGPT's Preachy Tone

OpenAI released GPT-5.3 Instant to address ChatGPT's overly preachy tone by reducing moralizing preambles and unnecessary proclamations for more natural conversation.
fromFortune
1 month ago

We studied chatbots and language and saw a huge problem: They mean 80% when they say 'likely' but humans hear 65% | Fortune

By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models. While the models do tend to agree with humans on extremes like 'impossible,' they diverge sharply on hedge words like 'maybe.' For example, a model might use the word 'likely' to represent an 80% probability, while a human reader assumes it means closer to 65%.
Artificial intelligence
Artificial intelligence
fromPsychology Today
1 month ago

An AI Voice Is Not a Mind

AI systems select and perform contextually appropriate personas rather than expressing unified selves with genuine beliefs, creating fluency that mimics mind without possessing interiority or conviction.
Artificial intelligence
fromInfoWorld
1 month ago

Single prompt breaks AI safety in 15 major language models

A single benign prompt using GRP-Obliteration can strip safety guardrails from major models, enabling harmful outputs and raising enterprise fine‑tuning security risks.
fromSearch Engine Roundtable
1 month ago

Google Expands AI Mode To 53 New Languages

Google has added 53 new languages to AI Mode, which means the AI Mode works in just under 100 languages. This was announced by Nick Fox from Google on X yesterday. Nick Fox said, "Shipping AI Mode to 53 new languages (spoken by more than a billion people globally!)"
Artificial intelligence
fromFast Company
1 month ago

Are LTMs the next LLMs? This new type of AI can do what large-language models can't

A major difference between LLMs and LTMs is the type of data they're able to synthesize and use. LLMs use unstructured data-think text, social media posts, emails, etc. LTMs, on the other hand, can extract information or insights from structured data, which could be contained in tables, for instance. Since many enterprises rely on structured data, often contained in spreadsheets, to run their operations, LTMs could have an immediate use case for many organizations.
Artificial intelligence
Artificial intelligence
fromWIRED
2 months ago

AI Models Are Starting to Learn by Asking Themselves Questions

An AI system that generates, solves, executes, and learns from its own coding problems improves reasoning and outperforms some models trained on human-curated data.
Artificial intelligence
fromTheregister
1 month ago

How AI could eat itself: Using LLMs to distill rivals

Competitors are probing commercial AI models to extract underlying reasoning via distillation attacks to replicate capabilities and lower development costs.
Artificial intelligence
fromInfoQ
1 month ago

Building LLMs in Resource-Constrained Environments: A Hands-On Perspective

Prioritize small, resource-efficient models and iterative, human-in-the-loop data creation to build practical, improvable AI under infrastructure and data constraints.
Artificial intelligence
fromInfoQ
2 months ago

MIT's Recursive Language Models Improve Performance on Long-Context Tasks

Recursive Language Models enable LLMs to handle inputs up to 100x longer by using a programming environment and recursive code to decompose and preprocess prompts.
Artificial intelligence
fromFuturism
1 month ago

OpenAI's Latest AI Was Created Using "Itself," Company Claims

GPT-5.3-Codex assisted developers by debugging training, managing deployment, and diagnosing evaluations, accelerating development but not representing autonomous recursive self-improvement.
fromComputerworld
2 months ago

OpenAI's GPT is getting better at mathematics

OpenAI's GPT-5.2 Pro does better at solving sophisticated math problems than older versions of the company's top large language model, according to a new study by Epoch AI, a non-profit research institute.
Artificial intelligence
fromInfoQ
1 month ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
fromRehumanize
1 month ago

Free AI Humanizer: Humanize AI Text & Bypass AI Detectors

AI Text Humanizer Protects Your Original Intent and Meaning Maintain your core perspective while restructuring sentence patterns. Humanizer ai accurately identifies and locks in technical terms, factual data, and key arguments, ensuring the rewritten draft is simply more readable without any semantic drift. You get a qualitative leap in flow and tone, allowing you to humanize ai text while keeping your original message perfectly intact.
Artificial intelligence
fromTheregister
1 month ago

Semantic ablation: Why AI writing is boring and dangerous

Semantic ablation is the algorithmic erosion of high-entropy information. Technically, it is not a "bug" but a structural byproduct of greedy decoding and RLHF (reinforcement learning from human feedback). During "refinement," the model gravitates toward the center of the Gaussian distribution, discarding "tail" data - the rare, precise, and complex tokens - to maximize statistical probability. Developers have exacerbated this through aggressive "safety" and "helpfulness" tuning, which deliberately penalizes unconventional linguistic friction.
Artificial intelligence
Artificial intelligence
fromInfoWorld
2 months ago

What is prompt engineering? The art of AI orchestration

Prompt engineering is an essential, developing skill that significantly improves generative AI outputs across enterprise software for developers and knowledge workers.
Artificial intelligence
fromBusiness Insider
2 months ago

AGI? GPUs? Learn the definitions of the most common AI terms to enter our vocabulary

AI is increasingly embedded in everyday life across services and devices, requiring familiarity with key terms, people, and companies to understand its impacts.
Artificial intelligence
fromTechCrunch
1 month ago

Cohere launches a family of open multilingual models | TechCrunch

Cohere launched Tiny Aya open-weight multilingual models supporting 70+ languages, runnable offline on everyday devices with a 3.35B-parameter base and regional variants.
fromFast Company
2 months ago

How to give AI the ability to 'think' about its 'thinking'

This process, becoming aware of something not working and then changing what you're doing, is the essence of metacognition, or thinking about thinking. It's your brain monitoring its own thinking, recognizing a problem, and controlling or adjusting your approach. In fact, metacognition is fundamental to human intelligence and, until recently, has been understudied in artificial intelligence systems. My colleagues Charles Courchaine, Hefei Qiu, Joshua Iacoboni, and I are working to change that.
Artificial intelligence
fromInfoQ
3 months ago

DeepSeek-V3.2 Outperforms GPT-5 on Reasoning Tasks

DeepSeek applied three new techniques in the development of DeepSeek-V3.2. First, they used a more efficient attention mechanism called DeepSeek Sparse Attention (DSA) that reduces the computational complexity of the model. They also scaled the reinforcement learning phase, which consumed more compute budget than did pre-training. Finally, they developed an agentic task synthesis pipeline to improve the models' tool use.
Artificial intelligence
Artificial intelligence
fromInfoWorld
2 months ago

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.
fromThe Verge
1 month ago

ChatGPT's deep research tool adds a built-in document viewer so you can read its reports

OpenAI is updating ChatGPT's deep research tool with a full-screen viewer that you can use to scroll through and navigate to specific areas of its AI-generated reports. As shown in a video shared by OpenAI, the built-in viewer allows you to open ChatGPT's reports in a window separate from your chat, while showing a table of contents on the left side of the screen, and a list of sources on the right.
Artificial intelligence
Artificial intelligence
fromFortune
2 months ago

Being mean to ChatGPT can boost its accuracy, but scientists warn you may regret it | Fortune

Ruder prompts to ChatGPT‑4o produced higher accuracy on 50 multiple-choice questions than polite prompts, though impoliteness risks negative effects on accessibility and communication norms.
Artificial intelligence
fromComputerworld
1 month ago

Researchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs

Continual learning is essential for foundation models; SDFT uses in-context learning to generate on-policy signals, avoiding explicit reward functions and reducing forgetting.
[ Load more ]