#extended-data-fig-2d

#data-visualization
Data science
fromMedium
4 days ago

DeepSeek and Grok Cloud Dancing Data Color Schemes

The 2026 Pantone Color of the Year, Cloud Dancer, presents unique challenges and opportunities for data visualization color schemes.
Science
fromNature
1 week ago

Drowning in data sets? Here's how to cut them down to size

The Square Kilometre Array Observatory will generate massive data, but storage and retention pose significant challenges for researchers.
#python
Python
fromMouse Vs Python
1 week ago

The Python Show - Python Illustrated - Mouse Vs Python

Two sisters collaborated on a beginner's book about Python, with one writing and the other illustrating.
Information security
fromTechzine Global
1 week ago

Databricks launches Lakewatch: agentic SIEM on the Lakehouse

Lakewatch is an open SIEM platform that consolidates security, IT, and business data, enabling rapid threat detection and response using AI agents.
Artificial intelligence
fromTheregister
1 week ago

Snowflake's ongoing pitch: bring AI to data, not vice versa

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.
Business intelligence
fromInfoWorld
2 weeks ago

Snowflake's new 'autonomous' AI layer aims to do the work, not just answer questions

Project SnowWork is Snowflake's autonomous AI layer that automates data analysis tasks like forecasting, churn analysis, and report generation without requiring data team intervention.
Data science
fromNature
1 week ago

How I squeeze fresh science from public data

Utilizing existing data can lead to significant discoveries and collaborations in research.
fromFlowingData
2 weeks ago

Subbed data source, lower inflation estimate

Data on legal services usually comes from the consumer index. But the Bureau of Labor Statistics, which has struggled with budget cuts and staff attrition, hasn't been able to collect enough data in recent years to publish the legal services index consistently. It has continued to provide the data to the Bureau of Economic Analysis, but the monthly readings have been volatile.
Law
#ydata-profiling
Python
fromRealpython
2 weeks ago

Episode #288: Automate Exploratory Data Analysis & Invent Python Comprehensions - The Real Python Podcast

YData Profiling enables quick exploratory data analysis reports with visualizations and statistics for easy sharing.
Django
fromRealpython
1 month ago

Automate Python Data Analysis With YData Profiling Quiz - Real Python

An interactive 8-question quiz assesses proficiency in YData Profiling for automating Python data analysis tasks including report generation, dataset comparison, and time series preparation.
Business intelligence
fromInfoWorld
2 weeks ago

Visualizing the world with Planetary Computer

Microsoft's Planetary Computer provides free geospatial data from multiple sources with standardized APIs for environmental research and analysis applications.
fromEngadget
3 weeks ago

ChatGPT will now generate interactive visuals to help you with math and science concepts

Starting today, ChatGPT will generate dynamic visuals when you ask it to explain select scientific and mathematical concepts, including the Pythagorean theorem, Coulomb's law and lens equations. When ChatGPT responds with an interactive visual, you'll be able to tweak any variables and the equation itself, allowing you to see how those changes affect the solution.
Online learning
fromInfoWorld
2 weeks ago

How to create AI agents with Neo4j Aura Agent

Neo4j Aura Agent is an end-to-end platform for creating agents, connecting them to knowledge graphs, and deploying to production in minutes. In this post, we'll explore the features of Neo4j Aura Agent that make this all possible, along with links to coded examples to get hands-on with the platform.
Data science
Django
fromRealpython
3 weeks ago

Introduction to Python SQL Libraries Quiz - Real Python

A 9-question interactive quiz assesses proficiency in Python SQL libraries for database connectivity, query execution, and cross-database scripting with SQLite, MySQL, and PostgreSQL.
Python
fromRealpython
2 weeks ago

Spyder: Your IDE for Data Science Development in Python - Real Python

Spyder is an open-source Python IDE optimized for data science, offering powerful plotting, profiling capabilities, and integration with the data science ecosystem.
Data science
fromMedium
4 weeks ago

Migrating to the Lakehouse Without the Big Bang: An Incremental Approach

Query federation enables safe, incremental lakehouse migration by allowing simultaneous queries across legacy warehouses and new lakehouse systems without risky big bang cutover approaches.
Digital life
fromFlowingData
1 month ago

Release your lantern

Taiwan Data Stories created an interactive tool allowing users to customize and release virtual lanterns during lunar new year celebrations, inspired by the traditional sky lantern festival in Pingxi.
Marketing
fromSkift Meetings
1 month ago

How to Make Event Data Matter in the Boardroom

Corporate events require data-driven measurement systems connecting to business outcomes to justify budgets and earn strategic credibility with executive leadership.
Artificial intelligence
fromInfoWorld
1 month ago

Why AI requires rethinking the storage-compute divide

AI workloads require continuous processing of unstructured multimodal data, causing redundant data movement and transformation that wastes infrastructure costs and data scientist time.
Python
fromTreehouse Blog
1 month ago

Python for Data: A SQL + Pandas Mini-Project That Actually Prepares You for Real Work

Effective data analysis requires combining SQL and Python skills in integrated projects that mirror real-world workflows, not learning them in isolation.
Python
fromRealpython
1 month ago

Automate Python Data Analysis With YData Profiling - Real Python

YData Profiling generates interactive exploratory data analysis reports with summary statistics, visualizations, and data quality warnings from pandas DataFrames in just a few lines of code.
Data science
fromRealpython
1 month ago

The pandas DataFrame: Make Working With Data Delightful Quiz - Real Python

An 11-question interactive quiz assesses proficiency in pandas DataFrame operations including creation, column manipulation, data sorting, NumPy array extraction, and missing data handling.
fromRednegra
1 month ago

Virtual Scrolling for Billions of Rows - Techniques from HighTable

The component also provides features for columns (sort, hide, resize), rows (select), cells (keyboard navigation, pointer interactions, custom rendering). Feel free to ask and look at the code if you're interested in knowing more. The <HighTable> component is developed at hyparam/hightable. It was created by Kenny Daniel for Hyperparam, and I've had the chance to contribute to its development for one year now.
Web frameworks
UX design
fromscikit-learn Blog
2 months ago

Enhancing user experience through interactive inspection

Scikit-learn added interactive HTML model inspections, including parameter tables, funded by a Wellcome/CZI EOSS grant to improve model inspection and UX.
Web development
fromInfoQ
3 months ago

DuckDB's WebAssembly Client Allows Querying Iceberg Datasets in the Browser

DuckDB-Wasm enables browser-based, serverless end-to-end query, read, and write access to Iceberg REST catalogs and object storage without infrastructure setup.
Tech industry
fromComputerworld
2 months ago

New Tableau AI features and Slack integration aim for data accessibility

Tableau added AI-powered personalization, automation, natural-language data stories, data mapping, and Slack integration to make data more accessible and actionable for business users.
Information security
fromSecuritymagazine
1 month ago

Product Spotlight on Analytics

Taelor Sutherland is Associate Editor at Security magazine covering enterprise security, coordinating digital content, and holding a BA in English Literature from Agnes Scott College.
fromHeat.js
2 months ago

Heat.js : JavaScript Heat Map

Completely free and open source (view our licence here). Supports export for integration with frameworks including React, Vue, and Angular. Fully configurable, featuring custom triggers and adjustable text to support multiple language locales. 60 languages supported by default (view the languages here). Includes multiple views, including Map, Line, Chart, Days, Months, and Color Ranges. Export data to multiple file formats (view the supported types here), with system clipboard setting support.
Web design
Toronto
fromEditor In Leaf
2 months ago

Maple Leafs should seek upgrade at crucial position despite deadline sell-off

Maple Leafs should use their sell-off to acquire a defensive upgrade to replace Chris Tanev, prioritizing long-term blue-line help for next season.
fromMoz
1 month ago

Why Export GA4 Data to BigQuery?

Then coming on to the next point, which is you can create your own sessions and user properties. Now you can do this in the GA4 interface under Explorations.
Marketing tech
Data science
fromInfoQ
1 month ago

Databricks Introduces Lakebase, a PostgreSQL Database for AI Workloads

Databricks Lakebase is a serverless PostgreSQL OLTP database that separates compute from storage and unifies transactional and analytical capabilities.
fromRaymondcamden
1 month ago

Building a Bluesky Sentiment Dashboard with Alpine and Chrome AI

Good morning, programs! Today I'm sharing yet another example of Chrome's on-device AI features, this time to demonstrate a "Bluesky Sentiment Dashboard". In other words, a tool that lets you enter terms and then get a report on the average sentiment for posts using that word. I actually did this before (and yes, I forgot until about a minute ago) last year using Transformers.js: Building a Bluesky AI Sentiment Analysis Dashboard.
Web development
Software development
fromInfoQ
1 month ago

Are You Missing a Data Frame? The Power of Data Frames in Java

DataFrames and data-oriented programming promote modeling immutable data separately from behavior, making Java suitable for DataFrame-style data manipulation comparable to Python.
fromThe Drum
2 months ago

Deeper data delivers more inspired partnership decisions

Imagine you're selecting an influencer to work with on your new campaign. You've narrowed it down to two, both in the right area, both creating the right sort of content. One has 24.6 million subscribers, the other 1.4 million. Which do you choose? Now imagine you could find out the first had 8.7 million unique viewers last month, while the second had 9.9 million. Do you want to change your mind?
Marketing
Business intelligence
fromNew Relic
2 months ago

Optimize Databricks: Full Visibility with New Relic

New Relic Databricks Integration provides unified telemetry, speeding troubleshooting, improving performance and resource utilization, and linking Databricks performance directly to cost.
fromInfoWorld
2 months ago

AI is changing the way we think about databases

Developers have spent the past decade trying to forget databases exist. Not literally, of course. We still store petabytes. But for the average developer, the database became an implementation detail; an essential but staid utility layer we worked hard not to think about. We abstracted it behind object-relational mappers (ORM). We wrapped it in APIs. We stuffed semi-structured objects into columns and told ourselves it was flexible.
Software development
Web development
fromCSS-Tricks
1 month ago

CSS Bar Charts Using Modern Functions | CSS-Tricks

New CSS features like sibling-index() and attr() enable creating declarative, efficient bar charts using CSS Grid and data-attributes with minimal markup.
fromTreehouse Blog
1 month ago

Portfolio Projects for Entry-Level Data Roles

Most beginner data portfolios look similar. They include a few cleaned datasets, some charts or dashboards, and a notebook with code and commentary. Again, nothing here is wrong. But hiring teams don't review portfolios to check whether you can follow instructions. They review them to see whether you can think like a data analyst. When projects feel generic, reviewers are left guessing:
Data science
#pandas
Business intelligence
fromTechzine Global
2 months ago

ClickHouse, the open-source challenger to Snowflake and Databricks

ClickHouse is a high-performance columnar OLAP database rapidly adopted by AI and enterprise users, now valued at $15B and acquiring Langfuse.
fromMedium
2 months ago

From Graphs to Generative AI: Building Context That Pays-Part 1

Every year, poor communication and siloed data bleed companies of productivity and profit. Research shows U.S. businesses lose up to $1.2 trillion annually to ineffective communication; that's about $12,506 per employee per year. This stems from breakdowns that waste an average of 7.47 hours per employee each week on miscommunications. The damage isn't only interpersonal; it's structural. Disconnected and fragmented data systems mean that employees spend around 12 hours per week just searching for information trapped in those silos.
Data science
fromHoloViz Blog
2 months ago

A Major Step Toward Structured, Auditable AI-Driven Data Apps: Lumen AI 1.0 - HoloViz Blog

When we announced the pre-release version of Lumen AI, our goal was ambitious: build a fully open, extensible framework for conversational data exploration that always remains transparent, inspectable, and composable, rather than opaque, closed and non-extensible. Today, with the full release of Lumen 1.0, that vision has been realized while also significantly evolving. This release represents a substantial re-architecture of both the UI and the core execution model, along with major improvements in robustness, extensibility, and real-world applicability.
Artificial intelligence
Data science
fromComputerworld
2 months ago

Tableau re-engineers dashboards, adds new analytics tools for business analysts

Tableau 2022.3 adds Data Guide and Table Extension, dynamic dashboards, event auditing, and performance/cost optimization to simplify self-service analytics for business users.
#streamlit
fromMedium
1 month ago

Why "Data Scientist" is Becoming "AI Engineer" and What That Actually Means

The title "data scientist" is quietly disappearing from job postings, internal org charts, and LinkedIn headlines. In its place, roles like "AI engineer," "applied AI engineer," and "machine learning engineer" are becoming the norm. This Data Scientist vs AI Engineer shift raises an important question for practitioners and leaders alike: what actually changes when a data scientist becomes an AI engineer, and what stays the same? More importantly, what skills matter if you want to make this transition intentionally rather than by accident?
Artificial intelligence
Data science
fromComputerworld
2 months ago

Great R packages for data import, wrangling, and visualization

A set of R packages (dplyr, purrr, readr/vroom, datapasta, Hmisc) streamline data wrangling, importing, and analysis with faster, standardized, and reproducible tools.
#geopandas
Artificial intelligence
fromInfoQ
2 months ago

Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache Spark

A Q-learning agent autonomously learns and generalizes optimal Spark configurations by discretizing dataset features and combining with Adaptive Query Execution for superior performance.
Data science
fromCIO
2 months ago

5 perspectives on modern data analytics

Data/business analytics is the top IT investment priority, yet analytics projects often fail due to poor data, vague objectives, and one-size-fits-all solutions.
Artificial intelligence
fromInfoWorld
2 months ago

Databricks says its Instructed Retriever offers better AI answers than RAG in the enterprise

Instructed Retriever combines deterministic database queries with RAG similarity search to improve relevance, control, and production readiness of enterprise AI.
Artificial intelligence
fromFortune
2 months ago

Want to get AI agents to work better? Improve how they retrieve data, Databricks says | Fortune

Engineering complete AI-agent workflows and providing access to correct information are essential for moving AI agents beyond pilot phase.
fromInfoWorld
1 month ago

AI-augmented data quality engineering

SHAP for feature attribution: SHAP quantifies each feature's contribution to a model prediction. LIME for local interpretability: LIME builds simple local models around a prediction to show how small changes influence outcomes. It answers questions like "Would correcting age change the anomaly score?" and "Would adjusting the ZIP code affect classification?" Explainability makes AI-based data remediation acceptable in regulated industries.
Artificial intelligence
Data science
fromMedium
2 months ago

The Complete Guide to Optimizing Apache Spark Jobs: From Basics to Production-Ready Performance

Optimize Spark jobs by using lazy evaluation awareness, early filter and column pruning, partition pruning, and appropriate join strategies to minimize shuffles and I/O.
fromNew Relic
2 months ago

The Power and Cost of Data Cardinality

The more attributes you add to your metrics, the more complex and valuable questions you can answer. Every additional attribute provides a new dimension for analysis and troubleshooting. For instance, adding an infrastructure attribute, such as region, can help you determine if a performance issue is isolated to a specific geographic area or is widespread. Similarly, adding business context, like a store location attribute for an e-commerce platform, allows you to understand if an issue is specific to a particular set of stores
Data science
Data science
fromInfoQ
1 month ago

Beyond the Warehouse: Why BigQuery Alone Won't Solve Your Data Problems

Data warehouses like BigQuery perform well initially but become slow, costly, and disorganized at scale, undermining low-latency operational use and innovation.