The quest for edge AI seems central to Apple's future approach, and to support it the company will consider acquiring smaller AI firms that can deliver optimized, compressed AI models. The company also intends to work with third-party models, shrinking and adapting them to run more fully on Apple's hardware. Doing so matters: the more intelligence Apple can put at the edge, the less demand it places on hosted cloud-based AI, which in turn reduces infrastructure costs.
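The article doesn't say how Apple would compress models, but one common technique for shrinking a model to fit on-device is int8 weight quantization. The sketch below is a minimal, generic illustration of that idea (not Apple's pipeline): float32 weights are stored as one byte each plus a single scale factor, cutting memory roughly 4x at the cost of a small rounding error.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-tensor int8 quantization: each float32 weight is
    stored in 1 byte instead of 4, plus one float scale for recovery."""
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Map int8 codes back to approximate float32 weights."""
    return q.astype(np.float32) * scale

# A toy float32 "layer" standing in for a real model tensor.
w = np.random.default_rng(0).normal(size=(256, 256)).astype(np.float32)
q, scale = quantize_int8(w)

print(w.nbytes // q.nbytes)  # storage shrinks 4x
print(float(np.max(np.abs(dequantize(q, scale) - w))))  # small rounding error
```

Real deployments (e.g. Core ML or llama.cpp quantization schemes) use per-channel or block-wise scales and lower bit widths, but the memory-versus-accuracy trade-off shown here is the same one that makes edge inference cheaper than hosted inference.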
On Thursday, the Laude Institute announced its first batch of Slingshots grants, aimed at "advancing the science and practice of artificial intelligence." Designed as an accelerator for researchers, the Slingshots program is meant to provide resources that would be unavailable in most academic settings, whether it's funding, compute power, or product and engineering support. In exchange, the recipients pledge to produce some final work product, whether it's a startup, an open-source codebase, or another type of artifact.
My name is Mark Kurtz. I was the CTO at a startup called Neural Magic. We were acquired by Red Hat at the end of last year, and I'm now working under the CTO arm at Red Hat. I'm going to be talking about GenAI at scale: essentially, what it enables, a quick overview of that, the costs, and generally how to reduce the pain. Running through a little bit more of the structure, we'll go through the state of LLMs and real-world deployment trends.