Computational linguistics is a two-way street: you're either using a computer to do things with human language (communicate, translate, or teach a foreign language), or you're using computational techniques to learn something about human languages. Her work documenting and preserving endangered languages uses a little of both.
The next step was just to wait. According to Germain, within 24 hours, chatbots were singing his praises when prompted for information about which tech journalists can handle the most hot dogs. Gemini reportedly took the bait immediately, pulling the text basically verbatim from Germain's website and spitting it out both in the Gemini app and in Google's AI Overviews on its search page. ChatGPT also picked up on it, but Anthropic's Claude was either more discerning or didn't catch on as quickly.
The Allen Institute for Artificial Intelligence has launched Olmo 3, an open-source language model family that offers researchers and developers comprehensive access to the entire model development process. Unlike earlier releases that provided only final weights, Olmo 3 includes checkpoints, training datasets, and tools for every stage of development, encompassing pretraining and post-training for reasoning, instruction following, and reinforcement learning.
The eye-popping funding round for Humans& comes amid a frenzy of early-stage AI deals, in which valuations have soared despite limited products or revenue. Thinking Machines Lab, the AI firm started by former OpenAI CTO Mira Murati, raised $2 billion in a seed round earlier this year at a $12 billion valuation. Venture capitalists are pouring billions into startups led by prominent researchers, betting that the next breakthrough in AI will come from small, talent-rich teams.
At its core (dare I say heart), AI is a machine of probability. Word by word, it predicts what is most likely to come next. This continuation is dressed up as conversation, but it isn't cognition. It is a statistical trick that feels more and more like thought. Training reinforces the trick through what's called a loss function, which measures how well a sequence of words matches the patterns of human language. This isn't a pursuit of truth.
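The point can be made concrete with a toy sketch. The probabilities below are made up for illustration (they come from no real model), but the loss computation is the standard one: cross-entropy, the negative log of the probability the model assigned to the token that actually appeared.

```python
import math

# Toy next-token model: hypothetical probabilities for the word
# following "the cat sat on the". These numbers are invented.
next_token_probs = {"mat": 0.6, "floor": 0.3, "moon": 0.1}

def cross_entropy(probs, actual_next_token):
    # The loss: negative log-probability assigned to the token
    # that actually came next in the training text.
    return -math.log(probs[actual_next_token])

# Low loss when the model matches the pattern of the text...
print(cross_entropy(next_token_probs, "mat"))    # ~0.51
# ...high loss when it doesn't, regardless of what is "true".
print(cross_entropy(next_token_probs, "moon"))   # ~2.30
```

Nothing in this objective rewards factual accuracy; it only rewards assigning high probability to whatever sequence of words the training data contains.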
QDyLoRA offers an efficient and effective technique for LoRA-based fine-tuning of LLMs on downstream tasks, eliminating the need to train multiple models to find the optimal rank.
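The dynamic-rank idea behind this family of methods can be sketched in a few lines. This is a schematic of a LoRA-style update with truncatable rank, not QDyLoRA's actual implementation; all names and dimensions here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
d, max_rank = 8, 4

# Frozen base weight plus a low-rank LoRA update: W + B @ A.
W = rng.normal(size=(d, d))
B = rng.normal(size=(d, max_rank))
A = rng.normal(size=(max_rank, d))

def adapted_weight(rank):
    # Dynamic rank: slicing the adapter to its first `rank` components
    # yields a valid lower-rank adapter, so a single training run can
    # serve many ranks instead of tuning one model per candidate rank.
    return W + B[:, :rank] @ A[:rank, :]

for r in (1, 2, max_rank):
    delta = adapted_weight(r) - W
    print(r, np.linalg.matrix_rank(delta))  # update has rank r
```

The appeal is practical: picking the rank becomes a cheap inference-time choice rather than a hyperparameter sweep over separately fine-tuned models.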
Diplomacy is a strategic board game set on a map of Europe in 1901, a time when tensions between the continent's most powerful countries were simmering in the lead-up to World War I.
In sequence labeling tasks, traditional metrics like the F1 score are insufficient. Our study introduces a modified approach to better assess model performance in identifying praise.
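For context, the conventional baseline the study pushes back against is exact-match span F1. The sketch below shows why it can be insufficient: a prediction off by a single token gets zero credit for that span. Spans here are hypothetical (start, end) pairs; this is the standard metric, not the study's modified approach.

```python
# Conventional span-level F1 for sequence labeling: only exact
# (start, end) matches count as true positives.
def span_f1(gold, predicted):
    gold, predicted = set(gold), set(predicted)
    tp = len(gold & predicted)
    if tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)

gold = [(0, 3), (7, 9)]
# The first predicted span overlaps the gold span (0, 3) almost
# entirely, yet exact-match F1 treats it as a complete miss:
print(span_f1(gold, [(0, 2), (7, 9)]))  # 0.5
```

An all-or-nothing score like this hides the difference between a near-miss and a wild guess, which is the kind of gap a modified evaluation aims to close.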