Hacker News 中文摘要

文章摘要

通义千问推出新一代图像生成模型Qwen-Image-2.0，具备专业排版能力可生成PPT/海报等图文内容，支持2K高清图像生成，在文本渲染和语义理解方面显著提升，同时采用更轻量架构实现快速推理。测试显示该模型在文生图和图生图任务中均表现优异。

通义千问推出新一代图像生成模型Qwen-Image-2.0，该模型具备五大核心优势：

该模型已应用于实际场景，如自动生成杭州旅游攻略海报、古诗词水墨画、电影级宣传海报等。研究团队同时发布了技术报告供学术引用。

（注：原文中大量示例图片及详细提示词未在摘要中呈现，主要保留了技术特性和核心案例的说明）

评论总结：

图像质量评价正面观点认为文本渲染和复杂提示跟随能力令人印象深刻，接近OpenAI和谷歌水平： "The text rendering is quite impressive...complex prompt following ability and editing is seriously impressive" (cubefox) 负面观点指出生成图像存在"诡异感"，且信息图表质量普遍较差： "all these generated 'realistic' images have a distinctly uncanny feel...the infographics are 99% terrible" (Deukhoofd, wiether)
开放性问题部分用户对未公开模型权重表示失望： "unfortunately no open weights it seems...Another closed model dressed up as 'coming soon' open source" (dsrtslnd23, singularfutur) 也有用户询问Linux平台的图像生成工具： "What Linux tools are you guys using for image generation models" (inanothertime)
争议性示例 "骑马人"图片引发强烈反应，有用户认为其令人不安： "I found the horse revenge-porn image quite disturbing" (fguerraz) "What the actual fuck" (skerit) 另有用户解释这是源自中国网络的文化梗： "The 'horse riding man' is a Chinese internet meme...though the image per se is undeniably bizarre" (tianqi)
文化细节有用户指出中文竖排文本的标点问题： "The Chinese vertical typography is sadly a bit off...should be the characters specifically designed for vertical text" (sandbach) 也有用户质疑图片的种族代表性： "Why is the only image featuring non-Asian men the one under the horse?" (goga-piven)