submitted by /u/WorldNewsMods
优点:输出在 (−1,1),比 sigmoid 居中,对梯度更友好
,推荐阅读safew官方版本下载获取更多信息
DeepSeek V3.2:写倒是能写,但细节处理不完善,后面我自己修了半小时
排名模型得分适合场景1Claude Opus 4.61560通用最强,新版无需Thinking2Claude Opus 4.6 Thinking1553架构设计、复杂重构3Claude Sonnet 4.61531性价比最高的顶级模型
British woman detained by Iran says it was hard to remain positive in prison, hours before she and her husband were sentenced to 10 years for espionage.