#model-optimization

[ follow ]
fromHackernoon
2 years ago

Using LLVM To Supercharge AI Model Execution On Edge Devices | HackerNoon

LLVM has quietly emerged as the secret sauce that makes AI workloads not just tolerable but genuinely exciting to optimize, turning legacy model execution pipelines into blazing-fast, hardware-friendly deployment flows.
Artificial intelligence
fromHackernoon
55 years ago

Keep the Channel, Change the Filter: A Smarter Way to Fine-Tune AI Models | HackerNoon

Efficient fine-tuning methods are critical to address the high computational and parameter complexity while adapting large pre-trained models to downstream tasks.
Artificial intelligence
#open-source
fromHackernoon
6 months ago
Artificial intelligence

Chinese AI Model Promises Gemini 2.5 Pro-level Performance at One-fourth of the Cost | HackerNoon

fromHackernoon
6 months ago
Artificial intelligence

Chinese AI Model Promises Gemini 2.5 Pro-level Performance at One-fourth of the Cost | HackerNoon

#machine-learning
Artificial intelligence
fromHackernoon
4 months ago

Rethinking AI Quantization: The Missing Piece in Model Efficiency | HackerNoon

Quantum strategies optimize LLM precision while balancing accuracy and effectiveness through methods like post-training quantization and quantization-aware training.
Artificial intelligence
fromHackernoon
4 months ago

Rethinking AI Quantization: The Missing Piece in Model Efficiency | HackerNoon

Quantum strategies optimize LLM precision while balancing accuracy and effectiveness through methods like post-training quantization and quantization-aware training.
Growth hacking
fromInfoQ
1 month ago

Scaling Large Language Model Serving Infrastructure at Meta

LLM serving is evolving into a foundational technology similar to an operating system.
fromInfoWorld
1 year ago

All the brilliance of AI on minimalist platforms

Fast forward to 2024, our reliance on massive data infrastructures is evaporating, with AI systems running on palm-sized devices. Apple & Qualcomm chips integrate AI for tasks like language translation and photo processing.
Digital life
Scala
fromHackernoon
4 months ago

The Hidden Power of "Cherry" Parameters in Large Language Models | HackerNoon

Parameter heterogeneity in LLMs shows that a small number of parameters greatly influence performance, leading to the development of the CherryQ quantization method.
[ Load more ]