Data science

[ follow ]
#database-management
fromTechzine Global
1 day ago

Ataccama underlines AI data lineage for business users

Ataccama closes that gap by turning complex data logic into plain language. Business users can now trace a data point's origin and understand how it was profiled or flagged without relying on technical experts.
Data science
fromBarchart.com
4 days ago

Amazon.com Earnings Preview: What to Expect

Switch the Market flag for targeted data from your country of choice. This feature allows users to tailor their chart data to specific geographic markets.
Data science
#machine-learning
Data science
fromInfoWorld
1 month ago

Building an analytics architecture for unstructured data and multimodal AI

Organizations must adapt data pipelines for scalability and consistency to leverage AI effectively.
Flexible data preparation processes are essential for managing unstructured data and evolving with business needs.
fromMedium
1 week ago
Data science

The Data Science Playbook: Exploring Sports Analytics Through Real Datasets

Data science
fromInfoWorld
1 month ago

Building an analytics architecture for unstructured data and multimodal AI

Organizations must adapt data pipelines for scalability and consistency to leverage AI effectively.
Flexible data preparation processes are essential for managing unstructured data and evolving with business needs.
fromMedium
1 week ago
Data science

The Data Science Playbook: Exploring Sports Analytics Through Real Datasets

fromHackernoon
4 months ago

Redefining Data Operations With Data Flow Programming in CocoIndex | HackerNoon

In traditional systems, side effects lead to increased complexity, debugging challenges, and unpredictable behavior. CocoIndex adopts a pure data flow programming approach, ensuring reliability.
Data science
fromHackernoon
1 week ago

Effective Data Chunking and Querying with Pinecone and GPT-4o | HackerNoon

To improve data quality for ingestion into Pinecone, markdown was preprocessed to remove images, dividers, and excess whitespace, enhancing readability and relevance.
Data science
fromInfoWorld
1 year ago

Snowflake updates developer tools, adds observability features

Snowflake Trail enhances observability by allowing developers to monitor data quality, pipelines, and applications, ultimately improving workflow optimization and troubleshooting capabilities.
Data science
#data-management
#data-integration
fromHackernoon
1 year ago
Data science

A Developer's Guide to SeaTunnel and Hive Integration with Real-World Configs | HackerNoon

fromHackernoon
1 year ago
Data science

A Developer's Guide to SeaTunnel and Hive Integration with Real-World Configs | HackerNoon

fromHackernoon
2 years ago

Why No Single Algorithm Solves Deduplication - and What to Do Instead | HackerNoon

Effective blocking dramatically cuts comparisons while still grouping true duplicates together. Several blocking strategies can be applied in multi-pass to improve recall.
Data science
fromInfoWorld
1 year ago

What's new in MySQL 9.0

MySQL 9.0.0 introduces a new Vector datatype, JavaScript Stored Programs, updated library versions, and enhancements to the Event Scheduler, while deprecating old SHA-1 security.
Data science
#data-quality
fromTechCrunch
2 weeks ago
Data science

AI is forcing the data industry to consolidate - but that's not the whole story | TechCrunch

fromTechCrunch
2 weeks ago
Data science

AI is forcing the data industry to consolidate - but that's not the whole story | TechCrunch

#open-source
#decision-making
fromHackernoon
4 years ago

What If Your 'Messy' Data Is Actually Perfect? | HackerNoon

The Success Metrics layer transforms a vision from aspiration to action by defining what success looks like and how we'll know when we've achieved it.
Data science
#model-evaluation
#data-processing
#data-analysis
fromESPN.com
3 weeks ago
Data science

NHL draft grades: From the excellent (Islanders, Hurricanes) to the confusing (Maple Leafs)

fromESPN.com
3 weeks ago
Data science

NHL draft grades: From the excellent (Islanders, Hurricanes) to the confusing (Maple Leafs)

fromMedium
1 month ago

Frequent Spark Interview QuestionsPart 2

Both cache() and persist() store an RDD/DataFrame/Dataset in memory (or disk) to avoid recomputation. cache() is shorthand for persist(StorageLevel.MEMORY_ONLY), while persist() offers more control.
Data science
fromTheregister
3 weeks ago

A trip through vintage datacenter networking

Mainframe manufacturers defined their own proprietary network protocol stacks, e.g., IBM System Network Architecture, Digital's DECNet. These generally ran over leased lines between datacenters.
Data science
fromMedium
1 month ago

RDD vs DataFrame vs Dataset in Apache Spark: Which One Should You Use and Why

Spark offers three main APIs—RDD, DataFrame, and Dataset—each with unique advantages: RDD provides low-level control, DataFrames optimize performance, and Datasets bring type safety.
Data science
#research
fromwww.theguardian.com
3 weeks ago

Antarctic ice has grown again but this does not buck overall melt trend

Antarctic ice gained mass from 2021 to 2023, showing climate change follows a jagged path with temporary gains amid long-term losses.
Data science
fromInfoWorld
1 year ago

Snowflake's Data Clean Room promises to ease analysis of PII data

Samooha's acquisition has played a pivotal role in reducing complexity in the ability to granularly join data across multiple parties while protecting data privacy and making it easier for both business and technical users to navigate, utilize the technology effectively.
Data science
#software-engineering
#artificial-intelligence
#data-analytics
fromNature
4 weeks ago

Will Gates and other funders save massive public health database at risk from Trump cuts?

Ending the DHS would be catastrophic," says Peter Macharia, a spatial epidemiologist from Kenya, now at the Institute of Tropical Medicine in Antwerp, Belgium. Macharia says his PhD on child health interventions was based entirely on DHS data from Kenya. "Where would we get our new statistics from? We would not know what is happening in terms of health in the communities and the needs in each area," he says.
Data science
from24/7 Wall St.
4 weeks ago

Snowflake (NYSE: SNOW) Price Prediction and Forecast 2025-2030 (June 2025)

Shares of Snowflake Inc. surged 6.56% in the past month, achieving a year-to-date gain of 70.82%, with Q1 revenue exceeding $1 billion for the first time.
Data science
fromWIRED
4 weeks ago

India Is Using AI and Satellites to Map Urban Heat Vulnerability Down to the Building Level

Remote-sensing data and AI are being utilized to identify heat-vulnerable buildings in cities like Delhi, targeting efforts to provide relief during extreme temperatures.
Data science
fromHackernoon
1 year ago

Are Judeo-Christian Values the Foundation of American Democracy? | HackerNoon

There are some that claim the US Constitution is a product of a Judeo-Christian culture, asserting that democracy matured due to a Christian influence.
Data science
fromwww.npr.org
1 month ago

Greetings from Shenyang, China, where workers sort AI data in 'Severance'-like ways

Cities like Shenyang, once reliant on declining industries, are reinventing themselves by focusing on new tech initiatives, particularly in AI data processing to create new jobs.
Data science
fromTalkpython
1 month ago

10 Polars Tools and Techniques To Level Up Your Data Science

There are many benefits to Polars directly of course.
Data science
fromLos Angeles Times
1 month ago

'We are still here, yet invisible.' Study finds that U.S. government has overestimated Native American life expectancy

The findings of this study reveal that systemic misclassification is further exacerbating existing health disparities, leading to a tragic underrepresentation of the true mortality rates for American Indian and Alaska Native individuals.
Data science
fromBusiness Insider
1 month ago

Data centers' environmental impact is hard to quantify. Here's how we did it.

Tech companies are investing hundreds of billions into data centers for AI, but the environmental and economic costs are largely unaccounted for, raising critical questions.
Data science
fromeLearning Industry
1 month ago

Data-Driven L&D: Building Real-Time Learning Analytics Dashboards With No-Code

In today's hyper-digital workplace, the shift from traditional training methods to real-time insights through no-code analytics dashboards is revolutionizing Learning and Development.
Data science
fromHackernoon
1 month ago

The Data Science Behind r/antiwork's Upvotes | HackerNoon

The dataset for our analysis was shaped by filtering out potentially biased comments, ensuring that the final set was representative and valid for our study.
Data science
Data science
fromThe Verge
1 month ago

Google has a new AI model and website for forecasting tropical storms

Google's new AI model forecasts tropical cyclones more accurately than traditional models, promising improved storm tracking and preparation.
fromHackernoon
1 month ago

Why Data Lies (and Your Model Might Too): The Curious Case of Simpson's Paradox | HackerNoon

The conditional probability P(Admit∣ Female, Dept) is higher than P(Admit∣ Male, Dept) in Department A, but that advantage gets wiped out when we aggregate everything.
Data science
fromBusiness Matters
1 month ago

Mostly AI launches $100k global challenge to spotlight privacy-safe synthetic data for AI development

"Open data access is key to unlocking AI's full potential - but achieving that will require wider adoption of synthetic data tools."
Data science
fromYanko Design - Modern Industrial Design News
1 month ago

Economic and environmental data become tangible objects in creative art installation - Yanko Design

Fragapane transforms cold data into engaging, emotional forms, using beauty to connect viewers with living narratives behind stark statistics.
Data science
fromwww.npr.org
1 month ago
Data science

How a dog aging project can help pets and humans live healthier lives

The Dog Aging Project aims to uncover health trends in dogs to improve their longevity and gain insights applicable to human health.
Data science
fromwww.theguardian.com
1 month ago

Alzheimer's blood test can spot people with early symptoms, study suggests

A new blood test can accurately diagnose Alzheimer's with high sensitivity and specificity, suggesting a major advance in early detection.
Data science
fromZDNET
1 month ago

The hidden data crisis threatening your AI transformation plans

Siloed data limits holistic understanding, especially for AI applications.
Data science
fromFlowingData
1 month ago

Professor who studied honesty loses tenure over faked data

Francesca Gino lost Harvard tenure due to allegations of data falsification, despite her extensive research on honesty.
[ Load more ]