#big-data--spark

[ follow ]
#grafana
Software development
fromTheregister
6 hours ago

Grafana: Free AI for all - please don't bankrupt us

Grafana offers its AI assistant for free to open source and on-prem users, while introducing Grafana 13 and expanding into business analytics.
DevOps
fromDevOps.com
1 day ago

Grafana Labs Extends Observability Reach Deeper Into AI - DevOps.com

Grafana Labs has enhanced its observability platform with AI capabilities and introduced new tools for AI application monitoring and data collection.
Software development
fromTheregister
6 hours ago

Grafana: Free AI for all - please don't bankrupt us

Grafana offers its AI assistant for free to open source and on-prem users, while introducing Grafana 13 and expanding into business analytics.
DevOps
fromDevOps.com
1 day ago

Grafana Labs Extends Observability Reach Deeper Into AI - DevOps.com

Grafana Labs has enhanced its observability platform with AI capabilities and introduced new tools for AI application monitoring and data collection.
#snowflake
Artificial intelligence
fromInfoWorld
1 day ago

Snowflake offers help to users and builders of AI agents

Snowflake enhances its Intelligence and Cortex Code for better automation and data source access, aiming for a unified enterprise AI experience.
Artificial intelligence
fromInfoWorld
1 day ago

Snowflake offers help to users and builders of AI agents

Snowflake enhances its Intelligence and Cortex Code for better automation and data source access, aiming for a unified enterprise AI experience.
Artificial intelligence
fromTheregister
4 weeks ago

Snowflake's ongoing pitch: bring AI to data, not vice versa

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.
#ai-adoption
Business intelligence
fromZDNET
1 day ago

Scaling agentic AI demands a strong data foundation - 4 steps to take first

Trusted quality data is essential for scaling agentic AI adoption in organizations.
fromTechzine Global
2 months ago
Artificial intelligence

Starburst: Chewing through data access is key to AI adoption

AI adoption is bottlenecked by lack of access to contextual, current, and governed data; without that, AI cannot reliably increase productivity.
Business intelligence
fromZDNET
1 day ago

Scaling agentic AI demands a strong data foundation - 4 steps to take first

Trusted quality data is essential for scaling agentic AI adoption in organizations.
DevOps
fromMedium
1 day ago

Practical AgentOps: Getting Started with MLflow 3

MLflow 3.0 enhances generative AI support while ensuring compatibility with traditional ML workflows.
#python
Python
fromPycoders
1 day ago

PyCoder's Weekly | Issue #731

Wallaby enhances Python development with integrated testing, AI context, and tools for machine learning visualization and vector databases.
fromTheServerSide.com
2 months ago
Data science

Why Java devs should switch to Python or R for data science | TheServerSide

Python and R dominate data science front-end work, offering richer ecosystems and easier data analysis than Java for many statistical and machine learning tasks.
Python
fromPycoders
1 day ago

PyCoder's Weekly | Issue #731

Wallaby enhances Python development with integrated testing, AI context, and tools for machine learning visualization and vector databases.
Environment
fromwww.dw.com
1 day ago

Why cloud computing still runs on coal and gas

Data centers' energy demands are straining U.S. power grids, leading to reliance on fossil fuels and delaying renewable energy goals.
Business
from24/7 Wall St.
1 day ago

Forget Nvidia: Why HPE Could Be the Overlooked AI Infrastructure Play of 2026

Hewlett Packard Enterprise is an overlooked investment opportunity in AI infrastructure with strong financial growth and expanding margins.
Angular
fromMedium
2 days ago

Senior Scala Developer Resume Samples for Lead Positions

Senior Scala roles require resumes that demonstrate ownership, system-level decisions, and measurable outcomes.
Data science
fromMedium
3 days ago

What is a Datathon? And Why You Should Join One

Datathons are collaborative events where participants analyze real-world datasets to generate insights and solve practical problems.
Scala
fromInfoQ
5 days ago

Lakehouse Tower of Babel: Handling Identifier Resolution Rules Across Database Engines

Open table formats standardize data semantics but lack SQL dialect interoperability, complicating identifier resolution across different engines.
Software development
fromInfoWorld
10 hours ago

Google's Gemma 4 shines on local systems - both big and small

Gemma 4's mixture of experts design enhances performance by allowing CPU weight allocation, improving token generation speed significantly.
Data science
fromMedium
6 days ago

Is the Data Scientist Role Dead? No, it's Transforming

The data scientist role is evolving, not disappearing, as organizations demand broader skills and system-oriented thinking.
#scala
Scala
fromMedium
3 weeks ago

Data Extraction and Classification Using Structural Pattern Matching in Scala

Scala pattern matching enhances code readability and extensibility in real-world data engineering use cases.
Angular
fromMedium
6 days ago

Junior Scala Developer Resume Samples Optimized for Employers

Present existing work effectively to demonstrate capability for junior Scala roles.
Scala
fromMedium
3 weeks ago

Data Extraction and Classification Using Structural Pattern Matching in Scala

Scala pattern matching enhances code readability and extensibility in real-world data engineering use cases.
Artificial intelligence
fromMedium
1 day ago

Enterprise AI in Practice: 6 Must-Watch Sessions on Scaling Agentic Systems

Enterprise AI is transitioning from experimentation to execution, presenting challenges in governance, scaling, and measurable business impact.
DevOps
from24/7 Wall St.
6 days ago

Oracle's New AWS Partnership Just Put It Ahead of Azure and Google Cloud

Multicloud setups are essential for enterprise AI, enabling seamless data movement and integration across different cloud providers.
Software development
fromTechzine Global
6 days ago

Scale sets edge platform's software ever more free from hardware constraints

Scale Computing is reducing hardware requirements for its software, allowing more flexibility for partners and customers in choosing hardware platforms.
#ai
fromMedium
3 weeks ago
Software development

The AI Revolution in Development: Why Outer Loop Agents Are the Next Big Thing

Software development
fromMedium
3 weeks ago

The AI Revolution in Development: Why Outer Loop Agents Are the Next Big Thing

AI is set to revolutionize post-code push processes, automating tasks like security fixes, error logging, and code reviews.
Scala
fromInfoQ
2 weeks ago

Beyond RAG: Architecting Context-Aware AI Systems with Spring Boot

Context-Augmented Generation (CAG) enhances Retrieval-Augmented Generation (RAG) by managing runtime context for enterprise applications without requiring model retraining.
Data science
fromMedium
2 weeks ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
#databricks
Information security
fromInfoWorld
3 weeks ago

Databricks pitches Lakewatch as a cheaper SIEM - but is it really?

Translating benefits into buy-in from CIOs and CISOs may be challenging for Databricks despite its intent and acquisitions.
fromTechCrunch
2 months ago
Artificial intelligence

Databricks CEO says SaaS isn't dead, but AI will soon make it irrelevant | TechCrunch

Information security
fromInfoWorld
3 weeks ago

Databricks pitches Lakewatch as a cheaper SIEM - but is it really?

Translating benefits into buy-in from CIOs and CISOs may be challenging for Databricks despite its intent and acquisitions.
fromTechCrunch
2 months ago
Artificial intelligence

Databricks CEO says SaaS isn't dead, but AI will soon make it irrelevant | TechCrunch

DevOps
fromInfoQ
1 week ago

Uber's Hive Federation Decentralizes 16K Datasets and 10+ PB for Zero-Downtime Analytics at Scale

Uber redesigned its Hive data warehouse to decentralize datasets, enhancing scalability, security, and operational autonomy for teams.
Artificial intelligence
fromAxios
1 day ago

Anthropic bites back in the compute wars with Amazon partnership

Anthropic is investing heavily in compute capacity to enhance its Claude models, competing directly with OpenAI's infrastructure advantage.
#apache-spark
Java
fromMedium
1 month ago

Spark Internals: Understanding Tungsten (Part 1)

Apache Spark revolutionized big data processing but faces challenges due to JVM memory management and garbage collection issues.
Java
fromMedium
1 month ago

Spark Internals: Understanding Tungsten (Part 2)

Catalyst Optimizer and Tungsten work together in Apache Spark to optimize data execution and manage raw binary data.
Java
fromMedium
1 month ago

Spark Internals: Understanding Tungsten (Part 1)

Apache Spark revolutionized big data processing but faces challenges due to JVM memory management and garbage collection issues.
Java
fromMedium
1 month ago

Spark Internals: Understanding Tungsten (Part 2)

Catalyst Optimizer and Tungsten work together in Apache Spark to optimize data execution and manage raw binary data.
Data science
fromInfoWorld
2 weeks ago

Databricks launches AiChemy multi-agent AI for drug discovery

AiChemy integrates various data sources to enhance research efficiency in pharmaceutical companies.
fromInfoWorld
3 weeks ago

How Apache Kafka flexed to support queues

Apache Kafka has cemented itself as the de facto platform for event streaming, often referred to as the 'universal data substrate' due to its extensive ecosystem that enables connectivity and processing capabilities.
Scala
Node JS
fromInfoQ
1 month ago

Inside Netflix's Graph Abstraction: Handling 650TB of Graph Data in Milliseconds Globally

Netflix engineers developed Graph Abstraction to manage large-scale graph data in real time, enabling fast queries and supporting various internal services.
Science
fromNature
1 month ago

Drowning in data sets? Here's how to cut them down to size

The Square Kilometre Array Observatory will generate massive data, but storage and retention pose significant challenges for researchers.
Artificial intelligence
fromnews.bitcoin.com
2 days ago

Nvidia Releases Nemotron 3 Super, a 120B Open AI Model Built for Agentic Workloads

Nvidia launched Nemotron 3 Super, a 120 billion parameter model that significantly reduces AI compute costs and increases throughput.
Information security
fromTechzine Global
4 weeks ago

Databricks launches Lakewatch: agentic SIEM on the Lakehouse

Lakewatch is an open SIEM platform that consolidates security, IT, and business data, enabling rapid threat detection and response using AI agents.
Tech industry
fromTechzine Global
1 month ago

HPE offers AI at every scale for Nvidia's Vera Rubin portfolio

HPE offers plug-and-play Nvidia-powered AI solutions designed to integrate seamlessly with existing infrastructure across three customer groups: AI model builders, service providers, and regulated organizations.
Business intelligence
fromInfoWorld
1 month ago

Snowflake's new 'autonomous' AI layer aims to do the work, not just answer questions

Project SnowWork is Snowflake's autonomous AI layer that automates data analysis tasks like forecasting, churn analysis, and report generation without requiring data team intervention.
Python
fromRealpython
1 month ago

Spyder: Your IDE for Data Science Development in Python - Real Python

Spyder is an open-source Python IDE optimized for data science, offering powerful plotting, profiling capabilities, and integration with the data science ecosystem.
Data science
fromMedium
1 month ago

Migrating to the Lakehouse Without the Big Bang: An Incremental Approach

Query federation enables safe, incremental lakehouse migration by allowing simultaneous queries across legacy warehouses and new lakehouse systems without risky big bang cutover approaches.
Software development
fromMedium
1 month ago

Unified Databricks Repository for Scala and Python Data Pipelines

Databricks repositories require structured setup with Gradle for multi-language support, dependency management, and version control to scale beyond manual notebook maintenance.
Business intelligence
fromTechzine Global
1 month ago

Dataiku introduces platform for scalable enterprise AI

Dataiku launches Platform for AI Success with three new products designed to move AI initiatives from pilots to measurable business outcomes through unified orchestration across cloud providers.
DevOps
fromInfoQ
1 month ago

Running Ray at Scale on AKS

Microsoft and Anyscale provide guidance for running managed Ray service on Azure Kubernetes Service, addressing GPU capacity limits, ML storage challenges, and credential expiry issues through multi-cluster, multi-region deployment strategies.
Miscellaneous
fromTechzine Global
2 months ago

Klarrio uses open source expertise to build foundational data platforms

Klarrio builds compliant, scalable open-source data platforms and platform-engineering foundations, integrating and securing underlying infrastructure so customers can focus on analytics and data science.
#scala-interview-preparation
Tech industry
fromTheregister
2 months ago

Snowflake plugs PostgreSQL into its AI Data Cloud

Snowflake now offers a native PostgreSQL DBaaS in its AI Data Cloud to run transactional workloads alongside analytics and AI under unified governance.
#ai-automation
Artificial intelligence
fromTechzine Global
1 month ago

Snowflake's Project SnowWork targets autonomous enterprise AI

Snowflake launches Project SnowWork, an autonomous AI interface that performs enterprise tasks like forecasts and reports without data team involvement, expanding from backend infrastructure to front-office productivity tool.
fromInfoWorld
1 month ago
Artificial intelligence

Databricks launches Genie Code to automate data science and engineering tasks

Artificial intelligence
fromTechzine Global
1 month ago

Snowflake's Project SnowWork targets autonomous enterprise AI

Snowflake launches Project SnowWork, an autonomous AI interface that performs enterprise tasks like forecasts and reports without data team involvement, expanding from backend infrastructure to front-office productivity tool.
fromInfoWorld
1 month ago
Artificial intelligence

Databricks launches Genie Code to automate data science and engineering tasks

Software development
fromInfoQ
2 months ago

Are You Missing a Data Frame? The Power of Data Frames in Java

DataFrames and data-oriented programming promote modeling immutable data separately from behavior, making Java suitable for DataFrame-style data manipulation comparable to Python.
fromMedium
3 months ago

How I Fixed a Critical Spark Production Performance Issue (and Cut Runtime by 70%)

"The job didn't fail. It just... never finished." That was the worst part. No errors.No stack traces.Just a Spark job running forever in production - blocking downstream pipelines, delaying reports, and waking up-on-call engineers at 2 AM. This is the story of how I diagnosed a real Spark performance issue in production and fixed it drastically, not by adding more machines - but by understanding Spark properly.
Data science
fromDevOps.com
2 months ago

Why Data Contracts Need Apache Kafka and Apache Flink - DevOps.com

Data contracts formalize schemas, types, and quality constraints through early producer-consumer collaboration to prevent pipeline failures and reduce operational downtime.
Data science
fromInfoQ
1 month ago

Databricks Introduces Lakebase, a PostgreSQL Database for AI Workloads

Databricks Lakebase is a serverless PostgreSQL OLTP database that separates compute from storage and unifies transactional and analytical capabilities.
fromTechzine Global
2 months ago

Databricks makes serverless Postgress service Lakebase available

Databricks today announced the general availability of Lakebase on AWS, a new database architecture that separates compute and storage. The managed serverless Postgres service is designed to help organizations build faster without worrying about infrastructure management. When databases link compute and storage, every query must use the same CPU and memory resources. This can cause a single heavy query to affect all other operations. By separating compute and storage, resources automatically scale with the actual load.
Software development
Artificial intelligence
fromFast Company
1 month ago

Exclusive: Databricks launches 'Genie Code' to own the next frontier of vibe-coding

AI coding agents are evolving beyond software development to operate complex data systems in production, representing the next frontier for enterprise AI automation.
Software development
fromInfoWorld
2 months ago

Why your next microservices should be streaming SQL-driven

Streaming SQL with UDFs, materialized results, and ML/AI integrations enables continuous, stateful processing of event streams for microservices.
fromInfoWorld
2 months ago

AI is changing the way we think about databases

Developers have spent the past decade trying to forget databases exist. Not literally, of course. We still store petabytes. But for the average developer, the database became an implementation detail; an essential but staid utility layer we worked hard not to think about. We abstracted it behind object-relational mappers (ORM). We wrapped it in APIs. We stuffed semi-structured objects into columns and told ourselves it was flexible.
Software development
Data science
fromInfoWorld
2 months ago

Snowflake debuts Cortex Code, an AI agent that understands enterprise data context

Cortex Code enables developers to use natural language to build, optimize, and deploy governed, production-ready data pipelines, analytics, ML workloads, and AI agents.
Artificial intelligence
fromInfoQ
2 months ago

Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache Spark

A Q-learning agent autonomously learns and generalizes optimal Spark configurations by discretizing dataset features and combining with Adaptive Query Execution for superior performance.
Artificial intelligence
fromInfoWorld
1 month ago

Why AI requires rethinking the storage-compute divide

AI workloads require continuous processing of unstructured multimodal data, causing redundant data movement and transformation that wastes infrastructure costs and data scientist time.
fromTechzine Global
2 months ago

Databricks shows how AI strengthens the SaaS model

The rise of generative AI is often seen as an existential threat to the SaaS model. Interfaces would disappear, software would fade away, and existing players would become irrelevant. However, new figures from Databricks paint a different picture. Rather than undermining SaaS, AI appears to be increasing its use. This week, Databricks reported a revenue run rate of $5.4 billion, a 65 percent year-on-year increase. More than a quarter of that now comes from AI-related products.
Artificial intelligence
Artificial intelligence
fromTechzine Global
2 months ago

Snowflake launches Cortex Code agent for understanding data context

Cortex Code is an AI agent that converts complex data engineering, ML, and analytics tasks into natural-language workflows integrated into Snowflake and developer tools.
fromInfoWorld
2 months ago

Databricks adds MemAlign to MLflow to cut cost and latency of LLM evaluation

By replacing repeated fine‑tuning with a dual‑memory system, MemAlign reduces the cost and instability of training LLM judges, offering faster adaptation to new domains and changing business policies. Databricks' Mosaic AI Research team has added a new framework, MemAlign, to MLflow, its managed machine learning and generative AI lifecycle development service. MemAlign is designed to help enterprises lower the cost and latency of training LLM-based judges, in turn making AI evaluation scalable and trustworthy enough for production deployments.
Artificial intelligence
Artificial intelligence
fromInfoQ
2 months ago

Google BigQuery Adds SQL-Native Managed Inference for Hugging Face Models

BigQuery lets data teams deploy and run Hugging Face or Vertex AI open models with plain SQL, auto-provisioning compute and managing endpoints.
Artificial intelligence
fromTechRepublic
6 months ago

Google Launches New Server to Supercharge AI Agents

Data Commons MCP Server enables AI agents to access public datasets via the Model Context Protocol, reducing hallucinations and accelerating development of data-rich agent applications.
[ Load more ]