#model-performance

[ follow ]
Artificial intelligence
fromTechCrunch
4 days ago

DeepSeek may have used Google's Gemini to train its latest model | TechCrunch

DeepSeek's R1 model may have been trained on outputs from Google's Gemini, raising ethical concerns regarding data sourcing.
#quantization
Scala
fromHackernoon
3 months ago

The Future of AI Compression: Smarter Quantization Strategies | HackerNoon

Impact-based parameter selection outperforms magnitude-based criteria in improving quantization for language models.
Scala
fromHackernoon
3 months ago

The Future of AI Compression: Smarter Quantization Strategies | HackerNoon

Impact-based parameter selection outperforms magnitude-based criteria in improving quantization for language models.
fromHackernoon
1 year ago

Fine-tuned GPT-3.5 Performance for Explanatory Feedback | HackerNoon

The fine-tuned GPT-3.5 model's performance was evaluated using M-IoU scores across multiple random seeds, demonstrating its efficacy in identifying praise in tutor responses with limited training data.
Online learning
fromHackernoon
1 month ago

How LightCap Sees and Speaks: Mobile Magic in Just 188ms Per Image | HackerNoon

In our experiments, we found that the LightCap model achieved efficient inference on mobile devices, processing images in about 188ms on the Kirin 990 CPU.
Artificial intelligence
Software development
fromInfoQ
2 weeks ago

Windsurf Launches SWE-1 Family of Models for Software Engineering

Windsurf's SWE-1 models support diverse software engineering tasks while improving performance and user experience.
#machine-learning
Artificial intelligence
fromTechCrunch
2 months ago

Researchers say they've discovered a new method of 'scaling up' AI, but there's reason to be skeptical | TechCrunch

Experts are skeptical of the newly proposed AI scaling law called 'inference-time search' despite its potential for improving model performance.
fromHackernoon
1 month ago
Artificial intelligence

Reconstruction Evaluations Across Varying Amounts of Training Data: Mindeye2 | HackerNoon

Artificial intelligence
fromTechCrunch
2 months ago

Researchers say they've discovered a new method of 'scaling up' AI, but there's reason to be skeptical | TechCrunch

Experts are skeptical of the newly proposed AI scaling law called 'inference-time search' despite its potential for improving model performance.
fromHackernoon
1 month ago
Artificial intelligence

Reconstruction Evaluations Across Varying Amounts of Training Data: Mindeye2 | HackerNoon

#openai
Artificial intelligence
fromFuturism
3 months ago

OpenAI May Have Really Screwed Up With GPT-4.5

OpenAI's GPT-4.5 is perceived as underwhelming despite claims of being the 'largest and most knowledgeable model'.
High costs and slow performance contribute to skepticism regarding GPT-4.5's value.
Artificial intelligence
fromInfoWorld
1 month ago

Vector Institute aims to clear up confusion about AI model performance

DeepSeek and OpenAI's o1 models excel in performance, yet AI models still face significant challenges across various tasks.
Artificial intelligence
fromFuturism
3 months ago

OpenAI May Have Really Screwed Up With GPT-4.5

OpenAI's GPT-4.5 is perceived as underwhelming despite claims of being the 'largest and most knowledgeable model'.
High costs and slow performance contribute to skepticism regarding GPT-4.5's value.
Artificial intelligence
fromInfoWorld
1 month ago

Vector Institute aims to clear up confusion about AI model performance

DeepSeek and OpenAI's o1 models excel in performance, yet AI models still face significant challenges across various tasks.
[ Load more ]