Article Abstract

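The article presents email obfuscation techniques that are effective in 2026, including plain text, HTML entities, HTML comments, SVG, CSS hiding, JS concatenation, Rot18, JS conversion, and AES encryption. For each technique it gives the statistical probability of being broken by spammers, ranging from 0% to 100%.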
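As a rough illustration of two of the listed techniques, the sketch below encodes an address as HTML entities and as Rot18. It assumes that "Rot18" means the common combination of ROT13 for letters and ROT5 for digits; the summary does not say which variant the article uses. The function names `to_html_entities` and `rot18` are hypothetical.

```python
import html


def to_html_entities(address: str) -> str:
    """Encode every character as a decimal HTML character reference."""
    return "".join(f"&#{ord(ch)};" for ch in address)


def rot18(text: str) -> str:
    """Assumed variant: rotate ASCII letters by 13 and digits by 5."""
    out = []
    for ch in text:
        if "a" <= ch <= "z":
            out.append(chr((ord(ch) - 97 + 13) % 26 + 97))
        elif "A" <= ch <= "Z":
            out.append(chr((ord(ch) - 65 + 13) % 26 + 65))
        elif "0" <= ch <= "9":
            out.append(chr((ord(ch) - 48 + 5) % 10 + 48))
        else:
            out.append(ch)  # "@", ".", and anything else pass through
    return "".join(out)


address = "email@example.com"
entity_form = to_html_entities(address)
rot18_form = rot18(address)

# html.unescape decodes the references the way a browser would.
assert html.unescape(entity_form) == address
# This Rot18 variant is its own inverse, so decoding reuses the same routine.
assert rot18(rot18_form) == address
```

Because applying this Rot18 variant twice restores the input, a page could ship the encoded string and run the same rotation in the browser to recover the address.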

Article Summary

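The article examines several technical methods for protecting email addresses from spam harvesters and provides data on how well each one protects in practice.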

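One group of techniques protects email addresses displayed as plain text (such as "email@example.com"). The best practice is to combine several techniques: split the address into segments and protect each segment with a different method.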
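The idea of protecting different segments with different methods might look like the sketch below. The particular combination (HTML entities for the local part and the "@", an HTML comment, and a CSS-hidden decoy before the domain) is an assumption chosen for illustration, not the article's recommended recipe, and `protect_plain_text` and `visible_text` are hypothetical helpers.

```python
import html
import re


def entities(text: str) -> str:
    """Encode every character as a decimal HTML character reference."""
    return "".join(f"&#{ord(ch)};" for ch in text)


def protect_plain_text(address: str) -> str:
    """Split the address at "@" and protect the pieces in different ways."""
    local, domain = address.split("@", 1)
    return (
        entities(local)                               # local part as HTML entities
        + "<!-- mailbox -->"                          # HTML comment, never rendered
        + entities("@")                               # "@" as an HTML entity
        + '<span style="display:none">decoy.</span>'  # CSS-hidden decoy text
        + domain                                      # domain left readable here
    )


def visible_text(markup: str) -> str:
    """Rough stand-in for browser rendering of this specific markup only."""
    no_comments = re.sub(r"<!--.*?-->", "", markup, flags=re.S)
    no_hidden = re.sub(
        r'<span style="display:none">.*?</span>', "", no_comments, flags=re.S
    )
    return html.unescape(no_hidden)


markup = protect_plain_text("email@example.com")

# A human reader still sees the intact address.
assert visible_text(markup) == "email@example.com"
# A naive regex run over the raw markup finds nothing, since no literal "@" remains.
assert re.search(r"[\w.+-]+@[\w-]+\.[\w.-]+", markup) is None
```

A harvester that decodes entities and strips comments would still pick up the hidden decoy, which is the reason for layering methods rather than relying on one.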

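Methods that break usability:
- Symbol substitution (such as writing AT in place of @)
- Adding instructions for the reader
- Displaying the address as an image
- CSS content generation
- Reversing the text direction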

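Another group of techniques protects mailto links. Note that if the link text also contains the address, plain-text protection must be layered on top.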

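The article is itself a "honeypot" experiment: each technique protects a real address, so when one of those addresses receives spam, it reveals which technique was broken. The author collects the data through a self-hosted mail server (bypassing the spam filtering of mainstream mail services) and deduplicates as far as possible to keep the statistics accurate.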
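The attribution step of such a honeypot can be sketched as a lookup from recipient address to technique, followed by a deduplicated tally. Everything specific here is an assumption: the addresses in `HONEYPOTS` are made up, and deduplicating by Message-ID is one plausible choice, since the summary does not describe how the author removes duplicates.

```python
from collections import defaultdict

# Hypothetical mapping: one unique address per protection technique.
HONEYPOTS = {
    "entities@example.com": "HTML entities",
    "rot18@example.com": "Rot18",
    "svg@example.com": "SVG",
}


def tally_breaks(messages):
    """Count distinct spam messages per technique.

    `messages` is an iterable of (recipient, message_id) pairs.
    """
    seen = set()
    counts = defaultdict(int)
    for recipient, message_id in messages:
        recipient = recipient.lower()
        technique = HONEYPOTS.get(recipient)
        if technique is None:
            continue  # not one of the honeypot addresses
        key = (recipient, message_id)
        if key in seen:
            continue  # same message delivered again: count it once
        seen.add(key)
        counts[technique] += 1
    return dict(counts)


inbox = [
    ("entities@example.com", "<1@spam.invalid>"),
    ("entities@example.com", "<1@spam.invalid>"),  # duplicate delivery
    ("ROT18@example.com", "<2@spam.invalid>"),
    ("someone@example.com", "<3@spam.invalid>"),   # not a honeypot
]
result = tally_breaks(inbox)
```

Any technique that appears in `result` has been broken by at least one harvester; a technique that never appears has not been observed broken so far.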

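Note: the statistics become more precise as the article reaches more readers. The article's author is Spencer Mortensen, and all example code is in the public domain.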

Summary of the comments:

Supporters argue that simple HTML entity encoding is enough to stop most scrapers: "Anecdotal, but I’ve used HTML entities... and yet I’ve not seen any spam" (newscracker) "I'm surprised that html entity substitution performs so well" (siruwastaken)
Skeptics argue that these methods are ineffective against advanced scrapers: "HTML entities are often decoded automatically... this technique should be worthless" (newscracker) "AFAICT it could reveal anything that wasn't relying on CSS tricks or JavaScript" (badsectoracula)

A central argument is that data breaches are the main source of spam: "The data-source are the enormous data breach... more intensive to collect more information on someone you already know" (ache) "Your E-mail address is not and can't be a secret. It will get into spammer databases eventually" (jwr)

Using advanced filtering: "what works very well is spam filtering using LLMs... I see >97% accuracy" (jwr)
Using special address tags: "I filter everything that does NOT include '+asdf' in the to:" (gfody)
Creative obfuscation methods: "I use SVG... converted it to curves so the SVG doesn't have text any more" (fmajid) "obfuscate emails on my websites with [brainf*ck] language" (binaryturtle)
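The address-tag filter that gfody describes could be approximated as follows. This is a minimal sketch: the function name `accept_recipient` is hypothetical, and checking the local part of each address in the To: header is an assumed interpretation of the comment, which does not say how the filter is implemented.

```python
from email.utils import getaddresses


def accept_recipient(to_header: str, tag: str = "+asdf") -> bool:
    """Accept a message only if some To: address carries the tag in its local part."""
    for _display_name, addr in getaddresses([to_header]):
        local_part = addr.rpartition("@")[0]
        if tag in local_part:
            return True
    return False


tagged = accept_recipient("Me <me+asdf@example.com>")
untagged = accept_recipient("me@example.com")
```

Mail sent to the bare address is rejected, so only correspondents who were given the tagged form get through.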

Some consider heavy protection unnecessary: "I stopped being concerned about email harvesting years ago... Spam handling is okay enough" (ciroduran) "personally i haven't bothered by email harvesting for years now since spam filters seem to do a decent job" (badsectoracula)
A warning about the risks of overprotection: "make really sure you are in control... your account will be deleted" (jwr)

Tarpit technique: "having an tarpit email address... block that IP for 24h" (Croak)
Security vulnerability warning: "a 302 into a 'mailto:'... opens up my e-mail client without clicking" (xiconfjs)
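The tarpit idea from Croak's comment can be sketched as a small in-memory blocklist: any IP that sends mail to the tarpit address is blocked for 24 hours. The class name `TarpitBlocker`, its methods, and the address used are hypothetical, and a real mail server would enforce the block in its own configuration rather than in application code like this.

```python
BLOCK_SECONDS = 24 * 60 * 60  # 24 hours, as in the comment


class TarpitBlocker:
    """Track sender IPs that have written to a tarpit address."""

    def __init__(self, tarpit_address: str, block_seconds: int = BLOCK_SECONDS):
        self._tarpit = tarpit_address.lower()
        self._block_seconds = block_seconds
        self._blocked_until = {}  # IP -> timestamp when the block expires

    def observe(self, recipient: str, sender_ip: str, now: float) -> None:
        """Record a delivery attempt; mail to the tarpit starts a block."""
        if recipient.lower() == self._tarpit:
            self._blocked_until[sender_ip] = now + self._block_seconds

    def is_blocked(self, sender_ip: str, now: float) -> bool:
        """Return True while the IP's block has not yet expired."""
        return self._blocked_until.get(sender_ip, 0.0) > now


blocker = TarpitBlocker("tarpit@example.com")
blocker.observe("tarpit@example.com", "203.0.113.7", now=1000.0)
blocker.observe("real@example.com", "198.51.100.2", now=1000.0)
```

Timestamps are passed in explicitly so the behaviour can be checked without waiting; in practice `time.time()` would supply `now`.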