#long-context-efficiency

Artificial intelligence
from InfoWorld
1 month ago

IBM launches Granite 4.0 to cut AI infra costs with hybrid Mamba-transformer models

Granite 4.0 combines Mamba state-space layers with transformers to reduce memory use, speed inference, and enable lower-cost long-context and edge enterprise deployments.