Data science

Data science
fromMedium
1 day ago

Translating Chinese with Data Science: StanfordNLP and Beyond

Translating Chinese texts requires strong programming skills and a broad understanding of IT tools like Stanford NLP.
Data science
fromTechzine Global
22 hours ago

Edge AI and private 5G are made for each other

Edge AI and private 5G are crucial for manufacturers to achieve real-time insights and operational efficiency on factory floors.
fromComputerWeekly.com
1 day ago

Glasgow researchers use machine learning to build network digital twin | Computer Weekly

Our results show that testing computer networks with automatically generated digital twins can achieve high accuracy and significantly faster speeds than traditional simulator-based testing.
Data science
#sap
Data science
fromTheregister
23 hours ago

SAP dives deeper into Iceberg with Dremio acquisition

SAP acquired Dremio to enhance data integration and analytics capabilities, focusing on unifying SAP and non-SAP data sources.
Data science
fromTechzine Global
1 day ago

SAP makes a double play in data and AI with acquisitions of Prior Labs and Dremio

SAP is acquiring Prior Labs and Dremio to strengthen its AI capabilities for structured business data, backed by an investment of more than one billion euros.
#artificial-intelligence
fromNature
1 day ago
Data science

AI agents in research: when productivity comes at the cost of apprenticeship

Data science
fromFortune
2 days ago

AI models are choking on junk data | Fortune

Quality data is crucial for advancing physical AI and world models, as junk data hampers development and potential.
Data science
fromFortune
1 week ago

Goldman tackles AI's missing link: the 'world model' that every AI godfather is racing to figure out | Fortune

The next leap in AI requires solving the 'world model' problem, which is essential for machines to achieve a fundamental understanding of reality.
Data science
fromTNW | Finance
2 weeks ago

How AI and human judgment combine in modern financial market analysis

Intelligent Investing AI enhances financial forecasting by processing large datasets while human interpretation remains crucial for meaningful market insights.
Data science
fromMarTech
3 weeks ago

How to make AI work with context instead of prompts | MarTech

AI struggles to operate reliably in enterprises due to its context-blind nature, leading to failures in scaling despite initial successes.
Data science
fromInfoWorld
2 days ago

Small language models: Rethinking enterprise AI architecture

Specialized small language models (SLMs) are emerging as efficient alternatives to large language models (LLMs) for specific workflows in autonomous enterprises.
Data science
fromJeff Gothelf
2 weeks ago

Strong opinions, loosely held - and what that means in the age of AI

Strong opinions should be based on informed hypotheses and adjusted with new evidence, emphasizing humility in product management.
Data science
from24/7 Wall St.
1 day ago

KLA Is Gaining Share as AI Chip Complexity Drives Up Yield Costs

KLA's metrology and inspection equipment is crucial for yield enhancement in semiconductor manufacturing, especially as AI drives process complexity.
Data science
fromInfoQ
4 days ago

DuckLake 1.0: Data Lake Format with SQL Catalog Metadata

DuckLake 1.0 introduces a data lake format that stores metadata in a SQL database, enhancing performance and simplifying operations compared to file-based systems.
Data science
fromFast Company
4 days ago

Stop letting ChatGPT and other AI chatbots train on your data. Here's why - and how

Chatbot interactions often expose personal data used for AI training, risking privacy, but users can opt out of data usage.
#ai
Data science
fromFast Company
4 days ago

Traditional forecasting still beats AI for the most extreme weather

AI weather models struggle to predict extreme weather events compared to traditional physics-based models.
Data science
fromTechRepublic
5 days ago

Cisco Introduces Model Provenance Kit to Strengthen AI Supply Chain Security

Cisco's Model Provenance Kit helps organizations verify AI model origins and trace lineage to enhance trust and security in the AI supply chain.
Data science
fromTheregister
1 week ago

Vintage chatbot lives in the past like an elderly relative

A new vintage language model, Talkie, has been created, trained on pre-1931 texts, offering a unique perspective on historical topics.
Data science
fromFast Company
1 week ago

AI traders are already testing prediction markets - and losing money

AI models struggle to profit in prediction markets, losing between 16% and 30.8% during a 57-day trading period.
Data science
fromFast Company
2 weeks ago

Your AI can't read an invoice. That should worry you more than whether it can pass a math exam

Advanced AI excels in structured reasoning tasks but struggles with messy, real-world data extraction like invoices.
Data science
fromTECHBOOK
2 weeks ago

Google Search Spreads Millions of Misinformation Pieces Every Hour

Google's AI search summaries have a 91% accuracy rate, but this still results in significant misinformation.
Data science
fromTechCrunch
6 days ago

Exclusive: Earth AI is vertically integrating the search for critical minerals | TechCrunch

Earth AI is establishing its own labs to reduce mineral sample processing delays from five months to five days.
#quantum-computing
Data science
fromComputerWeekly.com
1 week ago

HSBC collaborates on noisy qubit real-world application | Computer Weekly

HSBC collaborates on quantum computing to enhance financial modeling and secure transactions using advanced probability distributions and post-quantum cryptography.
Data science
fromTheregister
3 weeks ago

Nvidia slaps forehead: AI, that's what quantum needs!

Nvidia's AI models aim to reduce quantum processor error rates significantly, enhancing the reliability of quantum computing applications.
#large-language-models
Data science
fromInfoQ
1 week ago

Legare Kerrison and Cedric Clyburn on LLM Performance and Evaluations

Measuring LLM performance is essential for AI adoption, focusing on metrics like RPS, TTFT, and ITL while navigating trade-offs between quality, responsiveness, and cost.
Data science
fromMedium
3 weeks ago

The Top 10 LLM Training Datasets for 2026

Large language models require extensive training data, and practitioners can utilize ten leading public datasets for effective training and fine-tuning.
Data science
fromMedium
1 month ago

Context matters... A lot

Large language models excel at tasks but struggle with context, leading to potentially misleading answers despite their capabilities.
Data science
fromFast Company
1 week ago

Otter wants AI agents to mine your meetings for institutional knowledge

Otter enhances its transcription tool with AI features to integrate meeting knowledge with other software for better accessibility and insight retrieval.
Data science
fromTelecompetitor
1 week ago

FBA white paper addresses common engineering and construction questions

Accurate field data and base maps are crucial for efficient and resilient fiber broadband network deployment.
fromMarTech
1 week ago

Warehouse-native CDPs vs standalone platforms explained | MarTech

The case for a warehouse-native CDP starts with control and data centralization. In this model, the data warehouse becomes the single source of truth, with tools layered on top for identity resolution, segmentation and activation.
Data science
Data science
fromInfoQ
1 week ago

A Java Performance Quest: Taming Unsafe Code, Embracing Idiomatic Style & Debugging the Linux Kernel

QuestDB is a time-series database designed for high ingestion rates and efficient querying of time-based data.
Data science
fromNature
1 week ago

Telomere-to-Telomere Assembly Using HERRO-Corrected Simplex Nanopore Reads - Nature

Error-corrected ONT Simplex reads can lower sequencing costs and enhance genomic analysis quality.
fromnews.bitcoin.com
1 week ago

Polymarket Study Finds 3.14% Drive Accuracy

The study found that only 3.14% of Polymarket accounts qualified as skilled winners, who consistently earned profits that held up out of sample, trading across an average of 79 markets each.
Data science
Data science
fromTechzine Global
1 week ago

Pinecone On-Demand is thirsty for bursty workloads

Pinecone offers solutions for variable and sustained query workloads in AI, focusing on cost-effective and predictable performance.
Data science
fromInfoWorld
1 week ago

Why world models are AI's next frontier

World models learn the physical world, providing the common sense AI needs to achieve artificial general intelligence (AGI).
Data science
fromNextgov.com
1 week ago

NIST is giving fingerprint examiners better tools for a messy job

NIST aims to enhance forensic fingerprint examination accuracy and training through new resources, including a database and open-source software.
Data science
fromTheregister
1 week ago

DeepSeek's new models offer big inference cost savings

DeepSeek V4 introduces a new large language model that rivals top American models while reducing inference costs and supporting Huawei's AI accelerators.
Data science
fromNature
1 week ago

Wikipedia-based AI model reveals the 100 technologies to watch

Machine learning, blockchain, and 3D printing are predicted to be the fastest-growing technologies in 2026 according to the Momentum 100 list.
Data science
fromTheregister
2 weeks ago

LLMs fuel new generation of natural language query systems

Text-to-SQL tools may simplify data queries but can misinterpret business users' intentions, raising caution for organizations.
Data science
fromMedium
1 week ago

Entity Resolved Knowledge Graphs: The Foundation for Effective GraphRAG

GraphRAG enhances LLMs by using knowledge graphs for relationship-based queries, addressing limitations of vector-based retrieval methods.
Data science
fromRealpython
1 week ago

Altair: Declarative Charts With Python - Real Python

Altair simplifies data visualization in Python by allowing users to describe data meaning rather than scripting every detail.
Data science
fromInfoQ
2 weeks ago

Redesigning Banking PDF Table Extraction: A Layered Approach with Java

PDF table extraction in enterprise systems is an architectural problem requiring hybrid parsing and machine learning for effective handling.
Data science
fromInfoWorld
2 weeks ago

Addressing the challenges of unstructured data governance for AI

Enterprises must enhance data governance for unstructured data as AI transforms data management practices.
Data science
fromeLearning Industry
2 weeks ago

Multimodal AI For Instructional Designers: What It Is, How It Works, And Why It Changes Learning Design

Multimodal AI processes and generates multiple data types, enhancing understanding and output accuracy by mimicking human information processing.
Data science
fromNature
2 weeks ago

Got bugs? Here's how to catch the errors in your scientific software

Scientific coding is error-prone, often due to lack of training, making debugging an essential but under-taught skill for researchers.
Data science
fromMedium
2 weeks ago

What Can You Do With a Free ODSC AI East Expo Pass?

ODSC AI East 2026 offers free Expo Passes for attendees to experience the latest in AI and data science.
#data-centers
Data science
fromTechzine Global
2 weeks ago

Eaton: AI data centers need aerospace-grade engineering

AI demands require a complete overhaul of data center infrastructure, moving from traditional cooling methods to advanced systems-level designs.
fromTechCrunch
1 month ago
Data science

People would rather have an Amazon warehouse in their backyard than a data center | TechCrunch

Data science
fromThe Walrus
1 month ago

Data Centres Are on Track to Wreck the Planet. Can We Stop Them? | The Walrus

Hyperscaled data centers consume massive power and water, raising concerns about their environmental impact.
Data science
fromMedium
2 weeks ago

What is a Datathon? And Why You Should Join One

Datathons are collaborative events where participants analyze real-world datasets to generate insights and solve practical problems.
Data science
fromMarTech
2 weeks ago

Synthetic research is a promise with a catch | MarTech

Economic pressure for quick research results conflicts with the scientific demand for rigor, leading to potential biases in synthetic data outputs.
#ai-bias
Data science
fromNature
2 weeks ago

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.
Data science
fromNature
3 weeks ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
Data science
fromRealpython
2 weeks ago

Episode #291: Reassessing the LLM Landscape & Summoning Ghosts - The Real Python Podcast

Current techniques for LLMs focus on context engineering and multi-agent orchestration, moving away from traditional post-training methods.
Data science
fromMedium
2 weeks ago

Is the Data Scientist Role Dead? No, it's Transforming

The data scientist role is evolving, not disappearing, as organizations demand broader skills and system-oriented thinking.
fromTheregister
2 weeks ago

DuckDB uses RDBMS to tackle lakehouse 'small changes' issue

You make a small change to your table, adding a single row, and it affects data lake performance because, due to the way they work, a new file has to be written that contains one row, and then a bunch of metadata has to be written. This is very inefficient, because formats like Parquet really don't want to store a single row, they want to store a million rows.
Data science
Data science
fromInfoQ
3 weeks ago

Google's TurboQuant Compression May Support Faster Inference, Same Accuracy on Less Capable Hardware

TurboQuant compresses language models' Key-Value caches by up to 6x with near-zero accuracy loss, enabling efficient use of modest hardware.
#ai-development
Data science
fromTheregister
2 weeks ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
Data science
fromTheregister
4 weeks ago

UK National Data Library plan needs work, study finds

The UK's National Data Library needs improved dataset accessibility to support AI development and meaningful analysis.
Data science
fromNature
3 weeks ago

Dozens of AI disease-prediction models were trained on dubious data

Dubious data sets used in AI models for stroke and diabetes risk may lead to flawed clinical decisions.
Data science
fromComputerWeekly.com
2 weeks ago

Ordnance Survey works with Snowflake to tackle flood risk | Computer Weekly

Intelligent Flood Readiness model identifies 1.2 million people in England at risk of flooding, focusing on vulnerable areas for policymaking.
fromNature
3 weeks ago

Ancient DNA reveals pervasive directional selection across West Eurasia - Nature

Ancient DNA has transformed our understanding of population history, but its potential to reveal insights about human evolutionary biology has not been fully realized due to limited sample sizes and challenges in distinguishing between different types of selection.
Data science
Data science
fromMedium
3 weeks ago

How To Use Andrej Karpathy's Knowledge Wiki Approach for Product Research

Andrej Karpathy's approach to creating knowledge bases enhances AI's ability to remember and focus on data.
Data science
fromNature
3 weeks ago

AI needs solid botanical data more than ever

The disappearance of specialized botany programs threatens biodiversity research and the effectiveness of AI in biotechnology.
Data science
fromApp Developer Magazine
3 weeks ago

AccuWeather Launches ChatGPT Integration for Live Weather Updates

AccuWeather integrates localized weather forecasts into ChatGPT, allowing users to ask natural language questions for personalized weather insights.
Data science
fromInfoWorld
3 weeks ago

Google Cloud introduces QueryData to help AI agents create reliable database queries

QueryData enhances AI agents' accuracy in querying databases by translating natural language into precise database queries.
Data science
fromInfoQ
3 weeks ago

Lyft Scales Global Localization Using AI and Human-in-the-Loop Review

Lyft's AI-driven localization system enhances translation efficiency and quality for international expansion, processing 99% of user content with a 30-minute SLA.
Data science
fromFast Company
3 weeks ago

Data activation and Newton's first law

Data activation is essential for business success in the AI era, requiring accessible, governed, and contextualized information to overcome inertia.
Data science
fromFast Company
3 weeks ago

Your AI initiative may be failing because you're measuring it like a legacy business

Leadership often misjudges AI initiatives by applying mature-business metrics too early, leading to premature project cancellations.
Data science
fromMedium
3 weeks ago

Reasons Why an AI Conference is the Right Idea for Your Career

Good AI conferences focus on practical implementation and real-world applications rather than theoretical concepts or hype.
Data science
fromPsychology Today
3 weeks ago

Is Algorithmic Asymmetry Reshaping How We Think?

Algorithmic asymmetry creates unequal access to information and decision-making, impacting individuals across various aspects of life.
Data science
fromESPN.com
3 weeks ago

The data scientists trying to lift the USMNT to the World Cup

U.S. Soccer's analysts play a crucial role in decision-making and performance evaluation, significantly impacting the team's success.
fromArchDaily
4 weeks ago

From Data to Digital Twins: Japan's PLATEAU Project Offers Open-Access Models of More Than 250 Cities

Project PLATEAU, led by Japan's Ministry of Land, Infrastructure, Transport and Tourism, aims to develop and expand access to 3D models representing the diversity of cities across the country, enhancing urban resilience and addressing local challenges.
Data science
Data science
fromFast Company
4 weeks ago

Data, not infrastructure, must drive your AI strategy

Data centricity is essential for effective AI strategies, enabling collaboration and problem-solving across business units by making data accessible.
Data science
fromMedium
4 weeks ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
Data science
fromSilicon Canals
4 weeks ago

The one dataset that could predict AI job displacement barely exists - and nobody is collecting it - Silicon Canals

Price elasticity is crucial for predicting AI's impact on jobs, yet reliable data on this variable is largely unavailable.
#structured-data
Data science
fromAol
4 weeks ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromInfoWorld
4 weeks ago

Databricks launches AiChemy multi-agent AI for drug discovery

AiChemy integrates various data sources to enhance research efficiency in pharmaceutical companies.
fromMedium
1 month ago

IVF vs HNSW Indexing in Milvus

Brute-force exact search guarantees perfect recall but scales at O(n · d) per query, making it totally impractical at the scale modern applications demand. This is where Approximate Nearest Neighbor (ANN) indexes come into play: they trade a small amount of recall for dramatic speedups, often achieving over 95% recall at up to 100× higher throughput.
Data science
Data science
fromInfoWorld
1 month ago

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.
Data science
fromMedium
1 month ago

In-Silico Perturbation Meets Single-Cell Foundation Models: From Zero-Shot Potential to Fine-Tuned...

In-silico perturbation simulates cellular state changes, but biological trustworthiness remains a challenge despite advancements in single-cell foundation models.
Data science
fromTechzine Global
1 month ago

Datadog launches Experiments for A/B testing in observability

Datadog Experiments integrates A/B testing and product analytics into a single platform, addressing fragmentation in product development tools.