#data-parallelism

[ follow ]
Software development
fromInfoQ
15 hours ago

TigerFS Mounts PostgreSQL Databases as a Filesystem for Developers and AI Agents

TigerFS is an experimental filesystem that integrates PostgreSQL, allowing file operations through a standard filesystem interface.
#ai
Data science
fromTheregister
7 hours ago

PrismML debuts 1-bit LLM in bid to free AI from the cloud

PrismML's Bonsai 8B is a 1-bit language model that outperforms larger models, enhancing AI efficiency for mobile applications.
Data science
fromTheregister
2 days ago

TurboQuant is a big deal, but it won't end the memory crunch

TurboQuant is an AI data compression technology that reduces memory usage for KV caches but may not significantly alleviate memory shortages.
Software development
fromMedium
3 days ago

The AI Revolution in Development: Why Outer Loop Agents Are the Next Big Thing

AI is set to revolutionize post-code push processes, automating tasks like security fixes, error logging, and code reviews.
Artificial intelligence
fromZDNET
4 days ago

What Google's TurboQuant can and can't do for AI's spiraling cost

Google's TurboQuant significantly reduces AI memory usage, making AI more efficient and accessible by lowering inference costs.
Artificial intelligence
fromEngadget
4 days ago

Microsoft's research assistant can now use multiple AI models simultaneously

The upgraded Researcher tool combines ChatGPT and Claude models for improved research quality in Microsoft 365 Copilot.
Data science
fromTheregister
7 hours ago

PrismML debuts 1-bit LLM in bid to free AI from the cloud

PrismML's Bonsai 8B is a 1-bit language model that outperforms larger models, enhancing AI efficiency for mobile applications.
Data science
fromTheregister
2 days ago

TurboQuant is a big deal, but it won't end the memory crunch

TurboQuant is an AI data compression technology that reduces memory usage for KV caches but may not significantly alleviate memory shortages.
Software development
fromMedium
3 days ago

The AI Revolution in Development: Why Outer Loop Agents Are the Next Big Thing

AI is set to revolutionize post-code push processes, automating tasks like security fixes, error logging, and code reviews.
Artificial intelligence
fromZDNET
4 days ago

What Google's TurboQuant can and can't do for AI's spiraling cost

Google's TurboQuant significantly reduces AI memory usage, making AI more efficient and accessible by lowering inference costs.
Artificial intelligence
fromEngadget
4 days ago

Microsoft's research assistant can now use multiple AI models simultaneously

The upgraded Researcher tool combines ChatGPT and Claude models for improved research quality in Microsoft 365 Copilot.
Silicon Valley
fromSilicon Canals
1 day ago

Frugal AI wants to break the global compute hierarchy before it becomes permanent - Silicon Canals

The Soliga tribe's speech AI system exemplifies a new, decentralized approach to AI that challenges existing global tech hierarchies.
Science
fromNature
2 days ago

Breakthrough computer chip tech could help meet 'monumental demand' driven by AI

A new light source enables the creation of 8 nm wide structures on silicon wafers, increasing transistor density for advanced computer chips.
Scala
fromInfoQ
2 days ago

Beyond RAG: Architecting Context-Aware AI Systems with Spring Boot

Context-Augmented Generation (CAG) enhances Retrieval-Augmented Generation (RAG) by managing runtime context for enterprise applications without requiring model retraining.
Tech industry
fromWIRED
1 day ago

A New Google-Funded Data Center Will Be Powered by a Massive Gas Plant

A pragmatic 'all-of-the-above' strategy is essential for energy, with gas as a critical bridge while investing in renewables.
JavaScript
fromPythonSpeed
3 days ago

Timesliced reservoir sampling: a new(?) algorithm for profilers

Random sampling from an unknown-length event stream can effectively identify relevant information without storing all data.
#ai-infrastructure
fromFortune
2 days ago
Environment

Data centers are so hot, their 'heat island' effect is raising temperatures up to 6 miles away and impacting 343 million people worldwide, study finds | Fortune

Environment
fromFortune
2 days ago

Data centers are so hot, their 'heat island' effect is raising temperatures up to 6 miles away and impacting 343 million people worldwide, study finds | Fortune

AI infrastructure is creating a 'data heat island effect' that raises local temperatures and impacts millions of people.
Tech industry
fromTechzine Global
2 weeks ago

Cisco and Nvidia lower barrier to secure, full-stack AI infrastructure

Cisco and Nvidia expanded the Cisco Secure AI Factory to deliver a complete, integrated, and secure AI stack enabling faster customer adoption of AI infrastructure.
Tech industry
fromZDNET
2 weeks ago

Nvidia wants to own your AI data center from end to end

Nvidia expanded its AI infrastructure portfolio with five rack types, including a new LPX inference rack using Groq technology, positioning itself to control all data center processing.
Artificial intelligence
fromComputerWeekly.com
2 weeks ago

HPE taps Nvidia to transform distributed AI factories into intelligent AI grid | Computer Weekly

HPE launches AI Grid infrastructure powered by Nvidia GPUs to enable distributed, low-latency AI inference at edge locations for real-time applications across retail, manufacturing, healthcare, and telecommunications.
#snowflake
Artificial intelligence
fromTheregister
1 week ago

Snowflake's ongoing pitch: bring AI to data, not vice versa

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.
Artificial intelligence
fromTheregister
1 week ago

Snowflake's ongoing pitch: bring AI to data, not vice versa

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.
Information security
fromSecurityWeek
4 days ago

Google Slashes Quantum Resource Requirements for Breaking Cryptocurrency Encryption

Google's Quantum AI warns that cryptocurrencies are more vulnerable to quantum attacks than previously believed, shortening the timeline for potential threats.
European startups
fromTheregister
5 days ago

Rebellions eyes global expansion with rack-scale AI platform

Rebellions raised $400 million to expand globally with AI accelerators and a new compute platform for enterprises and sovereign clouds.
SF politics
fromFortune
4 days ago

Hyperscalers often lack the "aptitude" on power as the political push picks up to expedite grid connections and pipelines | Fortune

Federal efforts to expedite power grid interconnections face challenges due to hyperscalers' communication issues and lack of understanding of processes.
DevOps
fromInfoQ
19 hours ago

Replacing Database Sequences at Scale Without Breaking 100+ Services

Validating requirements can simplify complex problems, and embedding sequence generation reduces network calls, enhancing performance and reliability.
Software development
fromTechzine Global
1 day ago

Cursor updates its platform with a focus on autonomous AI agents

Cursor 3 enhances software development by integrating AI agents for collaborative coding, reducing manual programming and streamlining workflows.
Science
fromFuturism
1 day ago

There's a Blinking Warning Sign for the Data Centers in Space Industry

Elon Musk's plan for space-based data centers faces significant challenges similar to those encountered in previous failed projects.
#openai
Artificial intelligence
fromFuturism
6 days ago

OpenAI's Obsession With Data Centers Is Running Into Trouble

OpenAI has significantly reduced its AI infrastructure spending plans from $1.4 trillion to $600 billion amid financial pressures and market expectations.
Artificial intelligence
fromFuturism
6 days ago

OpenAI's Obsession With Data Centers Is Running Into Trouble

OpenAI has significantly reduced its AI infrastructure spending plans from $1.4 trillion to $600 billion amid financial pressures and market expectations.
Tech industry
fromComputerWeekly.com
2 days ago

Marvell scales up networking to extend Nvidia AI ecosystem | Computer Weekly

Marvell Technology joins Nvidia AI ecosystem to enhance infrastructure development with a $2bn investment.
Environment
fromTheregister
3 days ago

AI datacenters create heat islands around them, paper finds

Datacenters significantly raise surrounding temperatures, impacting communities up to 10 km away, with average increases between 1.5°C and 2.4°C.
fromInfoWorld
4 days ago

How Apache Kafka flexed to support queues

Apache Kafka has cemented itself as the de facto platform for event streaming, often referred to as the 'universal data substrate' due to its extensive ecosystem that enables connectivity and processing capabilities.
Scala
European startups
fromFortune
4 days ago

Emerald AI and Nvidia aim to offer the fast pass for data center grid connects, partnering with power producers and raising new funds | Fortune

Emerald AI aims to enhance grid flexibility for data centers, reducing peak power consumption while maintaining high reliability.
#apache-spark
Java
fromMedium
2 weeks ago

Spark Internals: Understanding Tungsten (Part 1)

Apache Spark revolutionized big data processing but faces challenges due to JVM memory management and garbage collection issues.
Java
fromMedium
2 weeks ago

Spark Internals: Understanding Tungsten (Part 2)

Catalyst Optimizer and Tungsten work together in Apache Spark to optimize data execution and manage raw binary data.
Java
fromMedium
2 weeks ago

Spark Internals: Understanding Tungsten (Part 1)

Apache Spark revolutionized big data processing but faces challenges due to JVM memory management and garbage collection issues.
Java
fromMedium
2 weeks ago

Spark Internals: Understanding Tungsten (Part 2)

Catalyst Optimizer and Tungsten work together in Apache Spark to optimize data execution and manage raw binary data.
DevOps
fromMedium
21 hours ago

Fair Multitenancy-Beyond Simple Rate Limiting

Fair multitenancy ensures equitable infrastructure access for customers, balancing simplicity, performance, and safety in shared environments.
Node JS
fromInfoWorld
2 weeks ago

Edge.js launched to run Node.js for AI

Edge.js is a WebAssembly-based JavaScript runtime that safely executes Node.js applications with faster startup times by sandboxing workloads through WASIX.
Scala
fromMedium
5 days ago

Data Extraction and Classification Using Structural Pattern Matching in Scala

Scala pattern matching enhances code readability and extensibility in real-world data engineering use cases.
DevOps
fromTechzine Global
1 day ago

OpenStack Gazpacho simplifies operations and VMware migrations

OpenStack 2026.1 emphasizes operational simplicity, live migration for VMware workloads, and hardware flexibility, positioning itself as a sovereign alternative to major cloud providers.
Software development
fromZDNET
3 days ago

How AI has suddenly become much more useful to open-source developers

AI tools are becoming increasingly useful for open-source maintainers, but legal and quality issues remain.
Business intelligence
fromInfoWorld
2 weeks ago

Snowflake's new 'autonomous' AI layer aims to do the work, not just answer questions

Project SnowWork is Snowflake's autonomous AI layer that automates data analysis tasks like forecasting, churn analysis, and report generation without requiring data team intervention.
Science
fromNature
1 week ago

Drowning in data sets? Here's how to cut them down to size

The Square Kilometre Array Observatory will generate massive data, but storage and retention pose significant challenges for researchers.
#ibm
DevOps
fromTheregister
1 day ago

IBM wants Arm software on its mainframes for AI support

IBM and Arm are collaborating to enhance enterprise systems for AI and data-intensive workloads using Arm chips.
DevOps
fromComputerWeekly.com
2 days ago

Arm works with IBM to deliver flexibility on mainframe | Computer Weekly

IBM and Arm are collaborating to create dual-architecture hardware for enterprise AI and data-intensive workloads.
DevOps
fromTheregister
1 day ago

IBM wants Arm software on its mainframes for AI support

IBM and Arm are collaborating to enhance enterprise systems for AI and data-intensive workloads using Arm chips.
DevOps
fromComputerWeekly.com
2 days ago

Arm works with IBM to deliver flexibility on mainframe | Computer Weekly

IBM and Arm are collaborating to create dual-architecture hardware for enterprise AI and data-intensive workloads.
#ollama
Data science
fromInfoWorld
2 weeks ago

The 'toggle-away' efficiencies: Cutting AI costs inside the training loop

Simple optimizations can significantly reduce AI training costs and carbon emissions without needing the latest GPUs.
DevOps
fromTechzine Global
2 days ago

Observability warehouses, the next structural evolution for telemetry

Observability is essential for real-time insights in cloud systems, helping to reduce downtime and improve performance.
Data science
fromMedium
3 weeks ago

Migrating to the Lakehouse Without the Big Bang: An Incremental Approach

Query federation enables safe, incremental lakehouse migration by allowing simultaneous queries across legacy warehouses and new lakehouse systems without risky big bang cutover approaches.
DevOps
fromTechzine Global
3 days ago

Harness adds four capabilities to close AI delivery gap

Harness is launching four new capabilities to enhance its Continuous Delivery platform, addressing the gap between code writing speed and release reliability.
#ai-efficiency
Artificial intelligence
fromInfoWorld
1 week ago

Google targets AI inference bottlenecks with TurboQuant

TurboQuant improves AI model efficiency by compressing key-value caches, reducing memory usage and runtime without accuracy loss.
Artificial intelligence
fromInfoWorld
1 week ago

Google targets AI inference bottlenecks with TurboQuant

TurboQuant improves AI model efficiency by compressing key-value caches, reducing memory usage and runtime without accuracy loss.
DevOps
fromInfoWorld
4 days ago

What front-end engineers need to know about AWS

Understanding AWS infrastructure improves front-end debugging and UI performance.
Miscellaneous
fromDevOps.com
1 month ago

I Learned Traffic Optimization Before I Learned Cloud Computing. It Turns Out the Lessons Were the Same. - DevOps.com

Cloud infrastructure requires understanding system behavior and costs to operate effectively at speed, similar to how skilled drivers anticipate conditions rather than simply driving fast.
DevOps
fromInfoWorld
4 days ago

Azure's new AI modernization tools

Microsoft's Azure Copilot aids in application migration and modernization, addressing technical debt and improving cloud infrastructure management.
DevOps
fromInfoQ
5 days ago

Optimization in Automated Driving: From Complexity to Real-Time Engineering

A production-grade AV stack is a distributed dataflow graph of components, optimized for resource management and real-time constraints.
Artificial intelligence
fromMedium
1 week ago

Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI

Model quantization and architectural optimization can outperform larger models, challenging the belief that more GPUs equal greater intelligence.
Data science
fromTechRepublic
1 month ago

Inside the Gas Engine Strategy Powering AI's Next Wave

Gas reciprocating engines are emerging as a critical power solution for AI data centers, with manufacturers like Caterpillar securing multi-gigawatt orders to meet demand that exceeds grid and turbine capacity.
Artificial intelligence
fromTechCrunch
2 weeks ago

Multiverse Computing pushes its compressed AI models into the mainstream | TechCrunch

Multiverse Computing offers on-device AI models that eliminate counterparty risk by running locally without requiring external compute infrastructure or cloud providers.
DevOps
fromInfoQ
3 weeks ago

Running Ray at Scale on AKS

Microsoft and Anyscale provide guidance for running managed Ray service on Azure Kubernetes Service, addressing GPU capacity limits, ML storage challenges, and credential expiry issues through multi-cluster, multi-region deployment strategies.
Silicon Valley
fromTheregister
1 month ago

Meta already deploying Nvidia's standalone CPUs at scale

Meta has deployed Nvidia's standalone Grace CPUs at scale and will deploy Vera CPUs and millions of Superchips to power general-purpose and agentic AI workloads.
#neoclouds
Data science
fromInfoQ
1 month ago

Databricks Introduces Lakebase, a PostgreSQL Database for AI Workloads

Databricks Lakebase is a serverless PostgreSQL OLTP database that separates compute from storage and unifies transactional and analytical capabilities.
Artificial intelligence
fromComputerWeekly.com
4 weeks ago

Edge AI: What's working and what isn't | Computer Weekly

Edge AI deployment success depends on identifying efficient, narrow use cases with manageable risks rather than pursuing sophisticated, large-scale models across all applications.
Artificial intelligence
fromInfoWorld
1 month ago

Why AI requires rethinking the storage-compute divide

AI workloads require continuous processing of unstructured multimodal data, causing redundant data movement and transformation that wastes infrastructure costs and data scientist time.
Artificial intelligence
fromEngadget
1 month ago

AI data centers could reduce power draw on demand, study says

AI data centers can dynamically reduce energy consumption by up to 40% without disrupting critical workloads, enabling grid stability and reducing infrastructure strain.
Tech industry
fromTheregister
2 months ago

How Nvidia is using emulation to turn AI FLOPS into FP64

Nvidia achieves higher FP64 throughput through software emulation on Rubin GPUs, trading hardware FP64 for emulated matrix performance up to 200 TFLOPS.
Data science
fromMedium
2 months ago

The Complete Guide to Optimizing Apache Spark Jobs: From Basics to Production-Ready Performance

Optimize Spark jobs by using lazy evaluation awareness, early filter and column pruning, partition pruning, and appropriate join strategies to minimize shuffles and I/O.
#spark
fromMedium
2 months ago
Software development

How I Fixed a Critical Spark Production Performance Issue (and Cut Runtime by 70%)

fromMedium
2 months ago
Data science

How I Fixed a Critical Spark Production Performance Issue (and Cut Runtime by 70%)

fromMedium
2 months ago
Software development

How I Fixed a Critical Spark Production Performance Issue (and Cut Runtime by 70%)

fromMedium
2 months ago
Data science

How I Fixed a Critical Spark Production Performance Issue (and Cut Runtime by 70%)

Artificial intelligence
from24/7 Wall St.
1 month ago

NVIDIA Cements Its Role as the Backbone of AI Infrastructure

NVIDIA's networking revenue grew 162% year-over-year to $8.2 billion, nearly tripling GPU growth, signaling a shift from chip seller to integrated infrastructure provider selling complete AI data center systems.
Software development
fromInfoQ
1 month ago

Are You Missing a Data Frame? The Power of Data Frames in Java

DataFrames and data-oriented programming promote modeling immutable data separately from behavior, making Java suitable for DataFrame-style data manipulation comparable to Python.
fromComputerWeekly.com
1 month ago

Neoclouds: Meeting demand for AI acceleration | Computer Weekly

ChatGPT, launched in 2022, began making a significant impact on the market by late 2023, according to Synergy Research Group. The company's chief analyst, John Dinsdale, points out that cloud market leaders have experienced accelerated revenue growth over time. Additionally, the emergence of numerous neocloud companies ( see box: What is a neocloud?) has further strengthened the already positive momentum in the market.
Artificial intelligence
Software development
fromInfoWorld
2 months ago

Why your next microservices should be streaming SQL-driven

Streaming SQL with UDFs, materialized results, and ML/AI integrations enables continuous, stateful processing of event streams for microservices.
Artificial intelligence
fromInfoQ
2 months ago

Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache Spark

A Q-learning agent autonomously learns and generalizes optimal Spark configurations by discretizing dataset features and combining with Adaptive Query Execution for superior performance.
fromCointelegraph
2 months ago

What Role Is Left for Decentralized GPU Networks in AI?

What we are beginning to see is that many open-source and other models are becoming compact enough and sufficiently optimized to run very efficiently on consumer GPUs,
Artificial intelligence
Artificial intelligence
fromInfoWorld
2 months ago

Edge AI: The future of AI inference is smarter local compute

Edge AI shifts computation from cloud to devices, enabling low-latency, cost-efficient, and privacy-preserving AI inference while facing performance and ecosystem challenges.
fromComputerworld
1 month ago

Intel sets sights on data center GPUs amid AI-driven infrastructure shifts

Intel is making a new push into GPUs, this time with a focus on data center workloads, as the chipmaker looks to reestablish itself in a market increasingly shaped by AI-driven demand and dominated by Nvidia. CEO Lip-Bu Tan said that after hiring a senior GPU architect, the company is working directly with customers to define requirements, signaling a more demand-driven approach as enterprises and cloud providers weigh their options for accelerated computing, according to a Reuters report.
Artificial intelligence
fromInfoQ
2 months ago

NVIDIA Dynamo Planner Brings SLO-Driven Automation to Multi-Node LLM Inference

The new capabilities center on two integrated components: the Dynamo Planner Profiler and the SLO-based Dynamo Planner. These tools work together to solve the "rate matching" challenge in disaggregated serving. The teams use this term when they split inference workloads. They separate prefill operations, which process the input context, from decode operations that generate output tokens. These tasks run on different GPU pools. Without the right tools, teams spend a lot of time determining the optimal GPU allocation for these phases.
Artificial intelligence
Artificial intelligence
fromTechRepublic
6 months ago

Google Launches New Server to Supercharge AI Agents

Data Commons MCP Server enables AI agents to access public datasets via the Model Context Protocol, reducing hallucinations and accelerating development of data-rich agent applications.
fromTechzine Global
2 months ago

Who will develop the OS for AI? VAST Data is going for it

In the early days, VAST Data's focus was primarily on storing enormous amounts of data. "Even before we talked about AI, data had to be stored somewhere," Pernsteiner notes. The company started out in the world of HPC (High Performance Computing). The choice of this sector was strategic: in that world, the scale and performance requirements are enormous. With this choice, VAST more or less forced itself to set the bar very high.
Artificial intelligence
fromTechzine Global
2 months ago

Neuromorphic computers prove suitable for supercomputing

Scientists are showing that neuromorphic computers, designed to mimic the human brain, are not only useful for AI, but also for complex computational problems that normally run on supercomputers. This is reported by The Register. Neuromorphic computing differs fundamentally from the classic von Neumann architecture. Instead of a strict separation between memory and processing, these functions are closely intertwined. This limits data transport, a major source of energy consumption in modern computers. The human brain illustrates how efficient such an approach can be.
Artificial intelligence
fromInfoWorld
1 month ago

Databricks adds MemAlign to MLflow to cut cost and latency of LLM evaluation

By replacing repeated fine‑tuning with a dual‑memory system, MemAlign reduces the cost and instability of training LLM judges, offering faster adaptation to new domains and changing business policies. Databricks' Mosaic AI Research team has added a new framework, MemAlign, to MLflow, its managed machine learning and generative AI lifecycle development service. MemAlign is designed to help enterprises lower the cost and latency of training LLM-based judges, in turn making AI evaluation scalable and trustworthy enough for production deployments.
Artificial intelligence
fromInfoQ
1 month ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
[ Load more ]