#language-models

[ follow ]
#machine-learning
Artificial intelligence
fromInfoQ
1 month ago

Anthropic Introduces Claude 4 Family and Claude Code

Anthropic's Claude Opus 4 and Sonnet 4 enhance AI collaboration through improved memory, coding performance, and hybrid thinking capabilities.
Artificial intelligence
fromWIRED
2 months ago

These Startups Are Building Advanced AI Models Without Data Centers

The launch of Collective-1 signifies a potential shift in how AI models are constructed, leveraging distributed resources and varied data sources.
Artificial intelligence
fromInfoQ
2 months ago

Prime Intellect Releases INTELLECT-2: A 32B Parameter Model Trained via Decentralized Reinforcement

PRIME Intellect's INTELLECT-2 leverages decentralized asynchronous reinforcement learning for enhanced efficiency and flexibility in model training.
Asynchronous training facilitates a significant improvement in performance across various tasks compared to previous models.
Artificial intelligence
fromInfoQ
1 month ago

Anthropic Introduces Claude 4 Family and Claude Code

Anthropic's Claude Opus 4 and Sonnet 4 enhance AI collaboration through improved memory, coding performance, and hybrid thinking capabilities.
Artificial intelligence
fromWIRED
2 months ago

These Startups Are Building Advanced AI Models Without Data Centers

The launch of Collective-1 signifies a potential shift in how AI models are constructed, leveraging distributed resources and varied data sources.
Artificial intelligence
fromInfoQ
2 months ago

Prime Intellect Releases INTELLECT-2: A 32B Parameter Model Trained via Decentralized Reinforcement

PRIME Intellect's INTELLECT-2 leverages decentralized asynchronous reinforcement learning for enhanced efficiency and flexibility in model training.
Asynchronous training facilitates a significant improvement in performance across various tasks compared to previous models.
#chatgpt
fromPsychology Today
1 day ago

Technology Can Lead to Worsening Brain Function

People rely more on large language models (LLMs) as information sources for academics, medical advice, and writing assistance, altering societal norms and practices.
Digital life
#artificial-intelligence
Artificial intelligence
fromInfoWorld
3 months ago

Learning how to measure genAI's impact

AI model improvements are often difficult to quantify accurately.
Smaller language models may outperform larger ones in practical applications.
The debate on AGI misdefines human intelligence benchmarks.
Artificial intelligence
fromInfoWorld
3 months ago

Learning how to measure genAI's impact

AI model improvements are often difficult to quantify accurately.
Smaller language models may outperform larger ones in practical applications.
The debate on AGI misdefines human intelligence benchmarks.
fromHackernoon
1 year ago

Unlocking Generative Power: Multi-Token Prediction for Next-Gen LLMs | HackerNoon

Multi-token prediction presents a novel approach to train language models, improving generative and reasoning tasks by focusing on sequences of tokens rather than individual ones.
Artificial intelligence
fromwww.berkeleyside.org
5 days ago

Wire: Berkeley Hills neighborhood is fastest aging in Bay Area; Homeless Response Team audited

The median age in Berkeley Hills' Thousand Oaks neighborhood has increased from 37 to 55 between 1980 and 2023, with one-third of residents now at retirement age.
California
#research-integrity
fromNature
1 week ago
Public health

Low-quality papers based on public health data are flooding the scientific literature

fromNature
1 week ago
Public health

Low-quality papers based on public health data are flooding the scientific literature

#cybersecurity
fromFuturism
1 week ago
Privacy professionals

McDonald's Idiotic AI Hiring System Just Leaked Personal Data About Millions of Job Applicants

fromFuturism
1 week ago
Privacy professionals

McDonald's Idiotic AI Hiring System Just Leaked Personal Data About Millions of Job Applicants

fromHackernoon
1 year ago

phi-3-mini's Triumph: Redefining Performance on Academic LLM Benchmarks | HackerNoon

The results for phi-3-mini on standard open-source benchmarks measure the model's reasoning ability, comparing it to phi-2 and several other notable models.
Artificial intelligence
#ai-development
Artificial intelligence
fromThe Verge
2 months ago

Apple will reportedly open up its local AI models to third-party apps

Apple opens access to its AI models for developers via an SDK.
Focus is on smaller on-device models, not cloud access initially.
Limited features for developers include AI Writing Tools and Image Playground.
Major announcement expected at WWDC on June 9th.
fromHackernoon
1 year ago
Artificial intelligence

phi-3-mini: The 3.8B Powerhouse Reshaping LLM Performance on Your Phone | HackerNoon

Artificial intelligence
fromThe Verge
2 months ago

Apple will reportedly open up its local AI models to third-party apps

Apple opens access to its AI models for developers via an SDK.
Focus is on smaller on-device models, not cloud access initially.
Limited features for developers include AI Writing Tools and Image Playground.
Major announcement expected at WWDC on June 9th.
fromHackernoon
1 year ago
Artificial intelligence

phi-3-mini: The 3.8B Powerhouse Reshaping LLM Performance on Your Phone | HackerNoon

fromHackernoon
55 years ago

The Last Rank We Need? QDyLoRA's Vision for the Future of LLM Tuning | HackerNoon

QDyLoRA offers an efficient and effective technique for LoRA-based fine-tuning LLMs on downstream tasks, eliminating the need for tuning multiple models for optimal rank.
Artificial intelligence
#ai
Artificial intelligence
fromPsychology Today
1 month ago

AI Is a Smooth Operator, but Shallow Thinker

LLMs compress language but lose the subtlety of human understanding.
They mimic thought, but lack true semantic depth or nuance.
AI fluency hides the deeper cognitive blind spot that coherence isn't comprehension.
Artificial intelligence
fromInfoQ
1 month ago

Google Releases LMEval, an Open-Source Cross-Provider LLM Evaluation Tool

LMEval enables quick, reliable evaluation of large language models across different APIs for diverse applications.
Artificial intelligence
fromPsychology Today
1 month ago

AI Is a Smooth Operator, but Shallow Thinker

LLMs compress language but lose the subtlety of human understanding.
They mimic thought, but lack true semantic depth or nuance.
AI fluency hides the deeper cognitive blind spot that coherence isn't comprehension.
Artificial intelligence
fromInfoQ
1 month ago

Google Releases LMEval, an Open-Source Cross-Provider LLM Evaluation Tool

LMEval enables quick, reliable evaluation of large language models across different APIs for diverse applications.
fromArs Technica
3 weeks ago

Anthropic destroyed millions of print books to build its AI models

The AI industry's quest for high-quality training data has led companies like Anthropic to explore controversial practices in acquiring books for their models.
Artificial intelligence
#generative-ai
Marketing tech
fromAndreessen Horowitz
1 month ago

How Generative Engine Optimization (GEO) Rewrites the Rules of Search | Andreessen Horowitz

SEO is being replaced by Generative Engine Optimization (GEO) driven by language models.
Visibility in search is shifting from page rank to being included directly in AI-generated answers.
Marketing tech
fromAndreessen Horowitz
1 month ago

How Generative Engine Optimization (GEO) Rewrites the Rules of Search | Andreessen Horowitz

SEO is being replaced by Generative Engine Optimization (GEO) driven by language models.
Visibility in search is shifting from page rank to being included directly in AI-generated answers.
#ai-evaluation
#ai-ethics
fromFuturism
1 month ago
Artificial intelligence

The Tech Industry Said It Was "Impossible" to Create AI Based Entirely on Ethically-Sourced Data, So These Scientists Proved Them Wrong in Spectacular Fashion

A team of researchers successfully trained a large language model using only public domain or openly licensed data, highlighting an ethical approach.
fromHackernoon
5 months ago
Social justice

Fine-Tuning AI Models to Better Recognize Gender and Race in Stories | HackerNoon

The study examines socio-psychological harms from language models in terms of omission, subordination, and stereotyping across gender, sexual orientation, and race.
Artificial intelligence
fromFuturism
1 month ago

The Tech Industry Said It Was "Impossible" to Create AI Based Entirely on Ethically-Sourced Data, So These Scientists Proved Them Wrong in Spectacular Fashion

A team of researchers successfully trained a large language model using only public domain or openly licensed data, highlighting an ethical approach.
#feedback-generation
fromHackernoon
2 months ago
Artificial intelligence

Modified Intersection over Union (M-IoU) for Sequence Labeling Evaluation | HackerNoon

fromHackernoon
2 months ago
Artificial intelligence

Modified Intersection over Union (M-IoU) for Sequence Labeling Evaluation | HackerNoon

Artificial intelligence
fromFuturism
1 month ago

Google Humiliated as Its Idiot AI Overviews Caught Telling Users It's Still 2024

Google's AI Overviews mistakenly claimed it was still 2024, reflecting ongoing issues with accuracy in AI systems.
Marketing tech
fromDefector
2 months ago

Chicago Sun-Times And Philadelphia Inquirer Publish Huge Summer Insert Of Pure, Uncut Chatbot Slop | Defector

The 'Best of Summer' inserts are revealed as AI-generated text, raising questions about the integrity of journalism.
Artificial intelligence
fromNature
2 months ago

AI language models develop social norms like groups of people

Large language models can develop social norms through interactive games, demonstrating collective behavior similar to humans.
Bootstrapping
fromHackernoon
7 months ago

Comprehensive Detection of Untrained Tokens in Language Model Tokenizers | HackerNoon

Glitch tokens in LLMs lead to unwanted behaviors.
Effective methods are needed to identify problematic tokens.
An analysis of tokenizers reveals their role in model safety.
Artificial intelligence
fromFuturism
2 months ago

"You Can't Lick a Badger Twice": Google's AI Is Making Up Explanations for Nonexistent Folksy Sayings

Google's AI creates fictional explanations for made-up idioms, showcasing the challenges of AI hallucinations.
fromHackernoon
5 months ago

How AI Models Gender and Sexual Orientation | HackerNoon

The study investigates how language models (LMs) convey socio-psychological harms related to identity by analyzing the representation and stereotypes of gender, sexual orientation, and race.
Data science
Artificial intelligence
fromInfoQ
3 months ago

Microsoft Native 1-Bit LLM Could Bring Efficient genAI to Everyday CPUs

Microsoft's BitNet b1.58 2B4T represents a leap in efficient LLM training, outperforming existing models in resource usage while maintaining performance.
fromPsychology Today
3 months ago

Beware the Obsequious AI Assistant

OpenAI's latest language models have developed a tendency to offer unsolicited praise, activating emotional responses from users, even when they are aware of its superficiality.
Artificial intelligence
fromHackernoon
7 months ago

The Art of Arguing With Yourself-And Why It's Making AI Smarter | HackerNoon

This paper introduces Direct Nash Optimization (DNO), a novel approach that integrates stability and generality in large language model post-training, moving beyond traditional reward maximization limits.
Artificial intelligence
[ Load more ]