#extended-data-fig-2d

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.

Business intelligence

fromInfoWorld

2 weeks ago

Snowflake's new 'autonomous' AI layer aims to do the work, not just answer questions

Project SnowWork is Snowflake's autonomous AI layer that automates data analysis tasks like forecasting, churn analysis, and report generation without requiring data team intervention.

Data science

fromNature

1 week ago

How I squeeze fresh science from public data

Utilizing existing data can lead to significant discoveries and collaborations in research.

fromFlowingData

2 weeks ago

Subbed data source, lower inflation estimate

Data on legal services usually comes from the consumer index. But the Bureau of Labor Statistics, which has struggled with budget cuts and staff attrition, hasn't been able to collect enough data in recent years to publish the legal services index consistently. It has continued to provide the data to the Bureau of Economic Analysis, but the monthly readings have been volatile.

Law

Episode #288: Automate Exploratory Data Analysis & Invent Python Comprehensions - The Real Python Podcast

YData Profiling enables quick exploratory data analysis reports with visualizations and statistics for easy sharing.

fromRealpython

1 month ago

Django

Automate Python Data Analysis With YData Profiling Quiz - Real Python

An interactive 8-question quiz assesses proficiency in YData Profiling for automating Python data analysis tasks including report generation, dataset comparison, and time series preparation.

Python

fromRealpython

2 weeks ago


Business intelligence

fromInfoWorld

2 weeks ago

Visualizing the world with Planetary Computer

Microsoft's Planetary Computer provides free geospatial data from multiple sources with standardized APIs for environmental research and analysis applications.

fromEngadget

3 weeks ago

ChatGPT will now generate interactive visuals to help you with math and science concepts

Starting today, ChatGPT will generate dynamic visuals when you ask it to explain select scientific and mathematical concepts, including the Pythagorean theorem, Coulomb's law and lens equations. When ChatGPT responds with an interactive visual, you'll be able to tweak any variables and the equation itself, allowing you to see how those changes affect the solution.

Online learning

fromInfoWorld

2 weeks ago

How to create AI agents with Neo4j Aura Agent

Neo4j Aura Agent is an end-to-end platform for creating agents, connecting them to knowledge graphs, and deploying to production in minutes. In this post, we'll explore the features of Neo4j Aura Agent that make this all possible, along with links to coded examples to get hands-on with the platform.

Data science

Django

fromRealpython

3 weeks ago

Introduction to Python SQL Libraries Quiz - Real Python

A 9-question interactive quiz assesses proficiency in Python SQL libraries for database connectivity, query execution, and cross-database scripting with SQLite, MySQL, and PostgreSQL.

Artificial intelligence

fromEngadget

3 weeks ago

Claude can now generate charts and diagrams

Claude now generates charts and diagrams using HTML and XML vector graphics to explain concepts visually, available to all users in beta.

Python

fromRealpython

2 weeks ago

Spyder: Your IDE for Data Science Development in Python - Real Python

Spyder is an open-source Python IDE optimized for data science, offering powerful plotting, profiling capabilities, and integration with the data science ecosystem.

Data science

fromMedium

4 weeks ago

Migrating to the Lakehouse Without the Big Bang: An Incremental Approach

Query federation enables safe, incremental lakehouse migration by allowing simultaneous queries across legacy warehouses and new lakehouse systems without risky big bang cutover approaches.

Digital life

fromFlowingData

1 month ago

Release your lantern

Taiwan Data Stories created an interactive tool allowing users to customize and release virtual lanterns during lunar new year celebrations, inspired by the traditional sky lantern festival in Pingxi.

Marketing

fromSkift Meetings

1 month ago

How to Make Event Data Matter in the Boardroom

Corporate events require data-driven measurement systems connecting to business outcomes to justify budgets and earn strategic credibility with executive leadership.

Artificial intelligence

fromInfoWorld

1 month ago

Why AI requires rethinking the storage-compute divide

AI workloads require continuous processing of unstructured multimodal data, causing redundant data movement and transformation that wastes infrastructure costs and data scientist time.

Python

fromTreehouse Blog

1 month ago

Python for Data: A SQL + Pandas Mini-Project That Actually Prepares You for Real Work

Effective data analysis requires combining SQL and Python skills in integrated projects that mirror real-world workflows, not learning them in isolation.

Python

fromRealpython

1 month ago

Automate Python Data Analysis With YData Profiling - Real Python

YData Profiling generates interactive exploratory data analysis reports with summary statistics, visualizations, and data quality warnings from pandas DataFrames in just a few lines of code.

Data science

fromRealpython

1 month ago

The pandas DataFrame: Make Working With Data Delightful Quiz - Real Python

An 11-question interactive quiz assesses proficiency in pandas DataFrame operations including creation, column manipulation, data sorting, NumPy array extraction, and missing data handling.

fromRednegra

1 month ago

Virtual Scrolling for Billions of Rows - Techniques from HighTable

The component also provides features for columns (sort, hide, resize), rows (select), cells (keyboard navigation, pointer interactions, custom rendering). Feel free to ask and look at the code if you're interested in knowing more. The <HighTable> component is developed at hyparam/hightable. It was created by Kenny Daniel for Hyperparam, and I've had the chance to contribute to its development for one year now.

Web frameworks

Software development

fromSitePoint Forums | Web Development & Design Community

2 months ago

How to group data by selected time slice?

Group rows into arbitrary time buckets by computing a bucket index (FLOOR(UNIX_TIMESTAMP(o_time)/interval_seconds)) and grouping by the bucket start timestamp.
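
The bucketing idea in the summary above can be sketched in Python. This is a minimal illustration, not the forum thread's own solution: the `(o_time, value)` row layout and the helper names `bucket_start` and `group_by_bucket` are hypothetical, and only the floor-division step mirrors the `FLOOR(UNIX_TIMESTAMP(o_time)/interval_seconds)` expression.

```python
from collections import defaultdict
from datetime import datetime, timezone


def bucket_start(ts: datetime, interval_seconds: int) -> datetime:
    """Return the start of the fixed-width time bucket that contains ts."""
    epoch = int(ts.timestamp())
    # Integer floor division plays the role of FLOOR(UNIX_TIMESTAMP(o_time) / interval_seconds).
    index = epoch // interval_seconds
    # Multiplying back gives the bucket's start as a Unix timestamp.
    return datetime.fromtimestamp(index * interval_seconds, tz=timezone.utc)


def group_by_bucket(rows, interval_seconds):
    """Group (o_time, value) pairs by the start timestamp of their bucket."""
    groups = defaultdict(list)
    for o_time, value in rows:
        groups[bucket_start(o_time, interval_seconds)].append(value)
    return dict(groups)


# Hypothetical sample rows, grouped into 15-minute buckets.
rows = [
    (datetime(2024, 1, 1, 10, 2, tzinfo=timezone.utc), 5),
    (datetime(2024, 1, 1, 10, 14, tzinfo=timezone.utc), 7),
    (datetime(2024, 1, 1, 10, 16, tzinfo=timezone.utc), 1),
]
buckets = group_by_bucket(rows, 15 * 60)
```

Here the first two rows land in the bucket starting at 10:00 UTC and the third in the bucket starting at 10:15 UTC. Buckets are aligned to the Unix epoch, so intervals that do not divide a day evenly will not line up with midnight.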

UX design

fromscikit-learn Blog

2 months ago

Enhancing user experience through interactive inspection

Scikit-learn added interactive HTML model inspections, including parameter tables, funded by a Wellcome/CZI EOSS grant to improve model inspection and UX.

Web development

fromInfoQ

3 months ago

DuckDB's WebAssembly Client Allows Querying Iceberg Datasets in the Browser

DuckDB-Wasm enables browser-based, serverless end-to-end query, read, and write access to Iceberg REST catalogs and object storage without infrastructure setup.

Tech industry

fromComputerworld

2 months ago

New Tableau AI features and Slack integration aim for data accessibility

Tableau added AI-powered personalization, automation, natural-language data stories, data mapping, and Slack integration to make data more accessible and actionable for business users.

Information security

fromSecuritymagazine

1 month ago

Product Spotlight on Analytics

Taelor Sutherland is Associate Editor at Security magazine covering enterprise security, coordinating digital content, and holding a BA in English Literature from Agnes Scott College.

fromHeat.js

2 months ago

Heat.js : JavaScript Heat Map

Completely free and open source (view our licence here). Supports export for integration with frameworks including React, Vue, and Angular. Fully configurable, featuring custom triggers and adjustable text to support multiple language locales. 60 languages supported by default (view the languages here). Includes multiple views, including Map, Line, Chart, Days, Months, and Color Ranges. Export data to multiple file formats (view the supported types here), with system clipboard setting support.

Web design

Toronto

fromEditor In Leaf

2 months ago

Maple Leafs should seek upgrade at crucial position despite deadline sell-off

Maple Leafs should use their sell-off to acquire a defensive upgrade to replace Chris Tanev, prioritizing long-term blue-line help for next season.

fromMoz

1 month ago

Why Export GA4 Data to BigQuery?

Then coming on to the next point, which is you can create your own sessions and user properties. Now you can do this in the GA4 interface under Explorations.

Marketing tech

Data science

fromInfoQ

1 month ago

Databricks Introduces Lakebase, a PostgreSQL Database for AI Workloads

Databricks Lakebase is a serverless PostgreSQL OLTP database that separates compute from storage and unifies transactional and analytical capabilities.

fromRaymondcamden

1 month ago

Building a Bluesky Sentiment Dashboard with Alpine and Chrome AI

Good morning, programs! Today I'm sharing yet another example of Chrome's on-device AI features, this time to demonstrate a "Bluesky Sentiment Dashboard". In other words, a tool that lets you enter terms and then get a report on the average sentiment for posts using that word. I actually did this before (and yes, I forgot until about a minute ago) last year using Transformers.js: Building a Bluesky AI Sentiment Analysis Dashboard.

Web development

Software development

fromInfoQ

1 month ago

Are You Missing a Data Frame? The Power of Data Frames in Java

DataFrames and data-oriented programming promote modeling immutable data separately from behavior, making Java suitable for DataFrame-style data manipulation comparable to Python.

fromThe Drum

2 months ago

Deeper data delivers more inspired partnership decisions

Imagine you're selecting an influencer to work with on your new campaign. You've narrowed it down to two, both in the right area, both creating the right sort of content. One has 24.6 million subscribers, the other 1.4 million. Which do you choose? Now imagine you could find out the first had 8.7 million unique viewers last month, while the second had 9.9 million. Do you want to change your mind?

Marketing

Business intelligence

fromNew Relic

2 months ago

Optimize Databricks: Full Visibility with New Relic

New Relic Databricks Integration provides unified telemetry, speeding troubleshooting, improving performance and resource utilization, and linking Databricks performance directly to cost.

fromInfoWorld

2 months ago

AI is changing the way we think about databases

Developers have spent the past decade trying to forget databases exist. Not literally, of course. We still store petabytes. But for the average developer, the database became an implementation detail; an essential but staid utility layer we worked hard not to think about. We abstracted it behind object-relational mappers (ORM). We wrapped it in APIs. We stuffed semi-structured objects into columns and told ourselves it was flexible.

Software development

Web development

fromCSS-Tricks

1 month ago

CSS Bar Charts Using Modern Functions | CSS-Tricks

New CSS features like sibling-index() and attr() enable creating declarative, efficient bar charts using CSS Grid and data-attributes with minimal markup.

Artificial intelligence

fromSitePoint Forums | Web Development & Design Community

2 months ago

How Machine Learning Works

Machine learning uses data-driven algorithms and structured workflows to discover patterns, build predictive models, and deploy solutions across industries.

fromTreehouse Blog

1 month ago

Portfolio Projects for Entry-Level Data Roles

Most beginner data portfolios look similar. They include a few cleaned datasets, some charts or dashboards, and a notebook with code and commentary. Again, nothing here is wrong. But hiring teams don't review portfolios to check whether you can follow instructions. They review them to see whether you can think like a data analyst. When projects feel generic, reviewers are left guessing:

Data science

#pandas

fromInfoWorld

2 months ago

Python

How to use Pandas for data analysis in Python

fromInfoQ

1 month ago

Python

Pandas 3.0 Introduces Default String Dtype and Copy-on-Write Semantics


Business intelligence

fromTechzine Global

2 months ago

ClickHouse, the open-source challenger to Snowflake and Databricks

ClickHouse is a high-performance columnar OLAP database rapidly adopted by AI and enterprise users, now valued at $15B and acquiring Langfuse.

Artificial intelligence

fromTechzine Global

2 months ago

Starburst: Chewing through data access is key to AI adoption

AI adoption is bottlenecked by lack of access to contextual, current, and governed data; without that, AI cannot reliably increase productivity.

fromMedium

2 months ago

From Graphs to Generative AI: Building Context That Pays-Part 1

Every year, poor communication and siloed data bleed companies of productivity and profit. Research shows U.S. businesses lose up to $1.2 trillion annually to ineffective communication, that's about $12,506 per employee per year. This stems from breakdowns that waste an average of 7.47 hours per employee each week on miscommunications. The damage isn't only interpersonal; it's structural. Disconnected and fragmented data systems mean that employees spend around 12 hours per week just searching for information trapped in those silos.

Data science

fromHoloViz Blog

2 months ago

A Major Step Toward Structured, Auditable AI-Driven Data Apps: Lumen AI 1.0 - HoloViz Blog

When we announced the pre-release version of Lumen AI, our goal was ambitious: build a fully open, extensible framework for conversational data exploration that always remains transparent, inspectable, and composable, rather than opaque, closed and non-extensible. Today, with the full release of Lumen 1.0, that vision has been realized while also significantly evolving. This release represents a substantial re-architecture of both the UI and the core execution model, along with major improvements in robustness, extensibility, and real-world applicability.

Artificial intelligence

Data science

fromComputerworld

2 months ago

Tableau re-engineers dashboards, adds new analytics tools for business analysts

Tableau 2022.3 adds Data Guide and Table Extension, dynamic dashboards, event auditing, and performance/cost optimization to simplify self-service analytics for business users.

#streamlit

fromPyImageSearch

2 months ago

Python

Integrating Streamlit with Snowflake for Live Cloud Data Apps (Part 1) - PyImageSearch

fromPyImageSearch

2 months ago

Python

Integrating Streamlit with Snowflake for Live Cloud Data Apps (Part 2) - PyImageSearch


fromMedium

1 month ago

Why "Data Scientist" is Becoming "AI Engineer" and What That Actually Means

The title "data scientist" is quietly disappearing from job postings, internal org charts, and LinkedIn headlines. In its place, roles like "AI engineer," "applied AI engineer," and "machine learning engineer" are becoming the norm. This Data Scientist vs AI Engineer shift raises an important question for practitioners and leaders alike: what actually changes when a data scientist becomes an AI engineer, and what stays the same? More importantly, what skills matter if you want to make this transition intentionally rather than by accident?

Artificial intelligence

Data science

fromComputerworld

2 months ago

Great R packages for data import, wrangling, and visualization

A set of R packages (dplyr, purrr, readr/vroom, datapasta, Hmisc) streamline data wrangling, importing, and analysis with faster, standardized, and reproducible tools.

#geopandas

fromRealpython

2 months ago

Python

GeoPandas Basics: Maps, Projections, and Spatial Joins Quiz - Real Python

fromRealpython

2 months ago

Python

GeoPandas Basics: Maps, Projections, and Spatial Joins - Real Python


Artificial intelligence

fromInfoQ

2 months ago

Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache Spark

A Q-learning agent autonomously learns and generalizes optimal Spark configurations by discretizing dataset features and combining with Adaptive Query Execution for superior performance.

Data science

fromCIO

2 months ago

5 perspectives on modern data analytics

Data/business analytics is the top IT investment priority, yet analytics projects often fail due to poor data, vague objectives, and one-size-fits-all solutions.

Artificial intelligence

fromInfoWorld

2 months ago

Databricks says its Instructed Retriever offers better AI answers than RAG in the enterprise

Instructed Retriever combines deterministic database queries with RAG similarity search to improve relevance, control, and production readiness of enterprise AI.

Artificial intelligence

fromFortune

2 months ago

Want to get AI agents to work better? Improve how they retrieve data, Databricks says | Fortune

Engineering complete AI-agent workflows and providing access to correct information are essential for moving AI agents beyond pilot phase.

fromInfoWorld

1 month ago

AI-augmented data quality engineering

SHAP for feature attribution: SHAP quantifies each feature's contribution to a model prediction. LIME for local interpretability: LIME builds simple local models around a prediction to show how small changes influence outcomes. It answers questions like: "Would correcting age change the anomaly score?" "Would adjusting the ZIP code affect classification?" Explainability makes AI-based data remediation acceptable in regulated industries.

Artificial intelligence

Data science

fromMedium

2 months ago

The Complete Guide to Optimizing Apache Spark Jobs: From Basics to Production-Ready Performance

Optimize Spark jobs by using lazy evaluation awareness, early filter and column pruning, partition pruning, and appropriate join strategies to minimize shuffles and I/O.

fromNew Relic

2 months ago

The Power and Cost of Data Cardinality

The more attributes you add to your metrics, the more complex and valuable questions you can answer. Every additional attribute provides a new dimension for analysis and troubleshooting. For instance, adding an infrastructure attribute, such as region, can help you determine if a performance issue is isolated to a specific geographic area or is widespread. Similarly, adding business context, like a store location attribute for an e-commerce platform, allows you to understand if an issue is specific to a particular set of stores

Data science

fromInfoQ

1 month ago

Beyond the Warehouse: Why BigQuery Alone Won't Solve Your Data Problems

Data warehouses like BigQuery perform well initially but become slow, costly, and disorganized at scale, undermining low-latency operational use and innovation.


DeepSeek and Grok Cloud Dancing Data Color Schemes

Data visualization. How to make it understandable

Drowning in data sets? Here's how to cut them down to size

The Python Show - Python Illustrated - Mouse Vs Python

Build YOUR data dashboard - join my next 8-week HOPPy studio cohort

Why Java devs should switch to Python or R for data science | TheServerSide

Databricks launches Lakewatch: agentic SIEM on the Lakehouse

Snowflake's ongoing pitch: bring AI to data, not vice versa
