Hacker News 中文摘要

文章摘要

文章介绍了在苹果芯片Mac mini上配置Ollama和Gemma 4 26B模型的简化方法，包括自动启动、预加载和保持运行状态的技巧。

文章总结

2026年4月Mac mini（Apple Silicon）上Ollama + Gemma 4 26B的简明配置指南：自动启动、预加载与持久化

前提条件

配备Apple Silicon（M1/M2/M3/M4/M5）的Mac mini
至少24GB统一内存以运行Gemma 4 26B模型
已安装Homebrew的macOS系统

安装与配置步骤

安装Ollama
通过Homebrew安装Ollama应用（含自动更新和MLX后端支持）： shell brew install --cask ollama-app
启动Ollama
启动应用并验证服务运行： shell open -a Ollama ollama list
下载Gemma 4 26B模型
下载约17GB的模型文件： shell ollama pull gemma4:26b
测试模型
运行测试命令并检查GPU加速状态： shell ollama run gemma4:26b "Hello, what model are you?" ollama ps # 查看CPU/GPU使用比例
配置自动启动与持久化
- 登录时启动Ollama：通过菜单栏选项或系统设置添加。
- 预加载模型：创建LaunchAgent定时运行空提示以保持模型活跃。
- 禁用自动卸载：设置环境变量OLLAMA_KEEP_ALIVE="-1"。
验证配置
检查服务、模型状态及LaunchAgent注册情况： shell ollama list ollama ps launchctl list | grep ollama

其他功能

API访问：通过http://localhost:11434提供本地API服务。
常用命令：包括模型管理、交互式对话等操作。
卸载方法：移除LaunchAgent并卸载Ollama应用。

版本新特性（Ollama v0.19+）

MLX后端优化：Apple Silicon芯片自动加速。
NVFP4支持：提升NVIDIA设备性能。
缓存改进：降低内存占用，优化编码和代理任务效率。

注意事项

内存需求：Gemma 4 26B运行时占用约20GB内存，24GB内存设备需关闭其他高内存应用。

参考资源

Ollama官方博客及版本发布说明
Google DeepMind关于Gemma 4的公告

（注：原文中的GitHub界面操作及无关导航内容已省略，聚焦于核心配置流程和技术细节。）

评论总结

总结评论内容如下：

对Ollama的负面评价
- 认为Ollama性能较差且缺乏创新，只是llama.cpp的复制品
  "Ollama is slower and they started out as a shameless llama.cpp ripoff" (评论1)
  "Why are you using Ollama? Just use llama.cpp" (评论3)
- 认为其功能过于简化，不如其他替代品
  "feels overly stripped down/dumbed down vs nearly everything else" (评论2)
  "Lately I've been playing with Unsloth Studio...much better 'give it to a beginner' default" (评论2)
对Ollama的正面使用体验
- 在M4 Mac Mini上成功运行Gemma模型，小模型表现优异
  "The ~10 GB model is super speedy...giving responses almost instantly" (评论5)
  "could fit comfortably on a Mac Mini 24GB with plenty of RAM left" (评论5)
- 文章提供了有用的Ollama使用建议
  "The article has a few good tips for using Ollama" (评论8)
技术问题与兼容性质疑
- 存在版本兼容性问题
  "I had to install the VO.20 pre-release of ollama to use this model" (评论4)
- 不同硬件配置下的运行效果参差不齐
  "Run in LM Studio...failed over and over on tool calls" (评论7)
  "Anyone got some luck with mlx builds on oMLX so far?" (评论9)
模型适用性讨论
- Gemma 4模型更适合特定任务而非编程
  "Gemma 4 models are not really trained for strong performance with coding agents" (评论8)
- 专家建议根据任务选择不同模型
  "use strong models to build effective applications using small efficient models" (评论8)

关键引用保留： - 负面评价：
"they're just vibe code translating llama.cpp, bugs included" (评论1)
"feels overly stripped down/dumbed down" (评论2)

正面体验：
"loading in a few seconds and giving responses almost instantly" (评论5)
"a few good tips for using Ollama" (评论8)
技术问题：
"failed over and over on tool calls" (评论7)
"Anyone got some luck with mlx builds" (评论9)

2026年4月：Mac mini上Ollama与Gemma 4 26B的简明配置指南 -- April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini