Hacker News 中文摘要

文章摘要

文章强调在软件开发中应该优先进行测试而不仅仅是形式验证，指出虽然形式验证有其价值，但测试更能有效发现实际问题。作者认为AI正在推动形式验证的发展，但测试仍然是更实用的方法。

标题：测试优先，而非（仅）验证

作者：Alperen Keles

随着AI技术的崛起，形式化验证正逐渐成为主流。AI辅助的机械证明公司获得巨额融资，越来越多人尝试使用证明助手（尤其是Lean），AI模型在数学竞赛和开放性问题中取得突破性成果。从Terry Tao到Martin Kleppman，全球顶尖研究者都对AI辅助证明充满期待。

验证引导开发(VGD)模式通过构建两个版本系统： - 简单可验证的参考实现 - 高效的生产实现通过随机测试确保两者行为一致，兼顾正确性与性能。

作者强调： - 认可形式化验证的价值 - 但反对过度承诺 - 随机测试与形式化验证同等重要未来软件工程需要验证与测试的结合，才能实现"正确性成为常态"的理想。

（注：原文中的社交媒体链接、图片描述等非核心内容已省略，保留了主要技术论点和案例）

这篇评论围绕AI辅助编程与形式化验证展开讨论，主要呈现以下观点：

认为形式化验证尚未成为主流，实施难度大："No it isn't...I'm sure the people selling it wish it was"(badgersnake)
指出当前更现实的验证方式："Most people can't even get agents to robustly E2E QA code"(CuriouslyC)

强调验证概念的价值："learning the concept of invariants...makes reasoning about code easier"(zipy124)
建议从基础验证开始："start with something 'dumber' like Rust or any typed program"(anon-3988)

认可AI改变验证瓶颈："pushes the bottleneck to verification"(andrewmutz)
但指出局限性："adversarial agents...still get tripped up"(tgtweak)
担忧数据问题："AI need tons of training material...large scale synthetic generation"(ecocentrik)

区分验证级别："distinguish between a full specification...and a specification of certain properties"(vzaliva)
质疑AI验证效率："many possible ways...be terrible in terms of performance"(xp84)