A Journal Through My Activities, Thoughts, and Notes
韦斯特奇尔提到,早在30年前(1990),捷克民主之父哈维尔的夫人就应台湾邀请到访,“这让我们男人知道,女人很多时候比男人更快认识到正确的事情,也比男人更勇敢”。
“英国和法国为了避免第二次世界大战爆发,同意牺牲捷克斯洛伐克,结果如英国首相邱吉尔所言,‘在战争与屈辱之间,他们选择了屈辱,但屈辱后还是要面临战争’。” #网摘
“英国和法国为了避免第二次世界大战爆发,同意牺牲捷克斯洛伐克,结果如英国首相邱吉尔所言,‘在战争与屈辱之间,他们选择了屈辱,但屈辱后还是要面临战争’。” #网摘
## Manually testing code is essential
Just because code looks good and runs without errors doesn’t mean it’s actually doing the right thing. No amount of meticulous code review—or even comprehensive automated tests—will demonstrably prove that code actually does the right thing. You have to run it yourself!
Proving to yourself that the code works is your job. This is one of the many reasons I don’t think LLMs are going to put software professionals out of work.
LLM code will usually look fantastic: good variable names, convincing comments, clear type annotations and a logical structure. This can lull you into a false sense of security, in the same way that a gramatically correct and confident answer from ChatGPT might tempt you to skip fact checking or applying a skeptical eye.
The way to avoid those problems is the same as how you avoid problems in code by other humans that you are reviewing, or code that you’ve written yourself: you need to actively exercise that code. You need to have great manual QA skills.
A general rule for programming is that you should never trust any piece of code until you’ve seen it work with your own eye—or, even better, seen it fail and then fixed it.
Across my entire career, almost every time I’ve assumed some code works without actively executing it—some branch condition that rarely gets hit, or an error message that I don’t expect to occur—I’ve later come to regret that assumption.
摘自 <https://simonwillison.net/2025/Mar/2/hallucinations-in-code/?ref=webtoart.com>
Just because code looks good and runs without errors doesn’t mean it’s actually doing the right thing. No amount of meticulous code review—or even comprehensive automated tests—will demonstrably prove that code actually does the right thing. You have to run it yourself!
Proving to yourself that the code works is your job. This is one of the many reasons I don’t think LLMs are going to put software professionals out of work.
LLM code will usually look fantastic: good variable names, convincing comments, clear type annotations and a logical structure. This can lull you into a false sense of security, in the same way that a gramatically correct and confident answer from ChatGPT might tempt you to skip fact checking or applying a skeptical eye.
The way to avoid those problems is the same as how you avoid problems in code by other humans that you are reviewing, or code that you’ve written yourself: you need to actively exercise that code. You need to have great manual QA skills.
A general rule for programming is that you should never trust any piece of code until you’ve seen it work with your own eye—or, even better, seen it fail and then fixed it.
Across my entire career, almost every time I’ve assumed some code works without actively executing it—some branch condition that rarely gets hit, or an error message that I don’t expect to occur—I’ve later come to regret that assumption.
摘自 <https://simonwillison.net/2025/Mar/2/hallucinations-in-code/?ref=webtoart.com>
#观点
“Fake it until you make it.” 直译是“假装成功,直到真正成功。”
这句话的核心意思是:在还没有掌握某项技能或达到某个目标之前,先以自信的姿态去行动,好像自己已经具备了所需的能力或身份。通过不断实践和适应,你最终会真正具备这些能力,实现目标。
“Fake it until you make it.” 直译是“假装成功,直到真正成功。”
这句话的核心意思是:在还没有掌握某项技能或达到某个目标之前,先以自信的姿态去行动,好像自己已经具备了所需的能力或身份。通过不断实践和适应,你最终会真正具备这些能力,实现目标。
#网摘
webto【道理】 The More You Own, The More You Maintain
If every new feature just meant one more thing to maintain, things might not be that bad. But ten design components don't create ten relationships, they create forty-five potential interaction points to consider. Each new addition multiplies a system's complexity, not just adds to it.
标题是一个简单的道理,不只在软件设计和开发领域适用,家里的很多东西也是这样。
webto【道理】 The More You Own, The More You Maintain
If every new feature just meant one more thing to maintain, things might not be that bad. But ten design components don't create ten relationships, they create forty-five potential interaction points to consider. Each new addition multiplies a system's complexity, not just adds to it.
标题是一个简单的道理,不只在软件设计和开发领域适用,家里的很多东西也是这样。
#书摘 当无人质疑管理者的决定时,他们就失去控制了。如果团队不再质疑权威的决定,管理者就会慢慢相信自己的决定永远正确。虽然永远正确的感觉很爽,但事实上人不可能永远正确,不论他是谁。
所以,我们要鼓励团队成员说不。这样才能迫使管理者停下来思考。《软件人才管理的艺术》
所以,我们要鼓励团队成员说不。这样才能迫使管理者停下来思考。《软件人才管理的艺术》
阿章人不笨,且常卖弄一下小聪明。
年将近,阿章算定贩一批桔子来卖。果聪明人也,桔子销量很好,生意火爆。一批桔子卖罢,盘算了几次,除了本钱费用,蝇利未获。章百思不得其解。夜间诉之于妻:“秤砣吾已偷加了铅块,本应会……”,言未毕,其妻暴怒。#laugh
年将近,阿章算定贩一批桔子来卖。果聪明人也,桔子销量很好,生意火爆。一批桔子卖罢,盘算了几次,除了本钱费用,蝇利未获。章百思不得其解。夜间诉之于妻:“秤砣吾已偷加了铅块,本应会……”,言未毕,其妻暴怒。#laugh
Rates的账单埋在gmail邮件里没注意到overdue了,下一期要被罚款10%。$100啊!我就恨自己为啥头脑一热取消了纸质账单。
#网友语录 米粒
看杨照的历史书,意外的被秀了一脸考古学,居然还发现考古学挺有意思。考古通过几个小探坑去猜测大片遗迹。知识也是如此。没学习某学科之前,怎么知道自己想学这个学科?得挖几个“探坑”去管窥一下。我上大学以后,看到建筑系学生写生,后悔没有报建筑系。如果报考前管窥一下建筑专业,我肯定就报考了
看杨照的历史书,意外的被秀了一脸考古学,居然还发现考古学挺有意思。考古通过几个小探坑去猜测大片遗迹。知识也是如此。没学习某学科之前,怎么知道自己想学这个学科?得挖几个“探坑”去管窥一下。我上大学以后,看到建筑系学生写生,后悔没有报建筑系。如果报考前管窥一下建筑专业,我肯定就报考了
娃离家一周多了,目测适应良好。(不过已经吃了一周面条)
## 苹果研究人员质疑 AI 的推理能力
> 例如,当研究人员给出一个简单的数学问题:“奥利弗星期五摘了 44 个奇异果,星期六摘了 58 个奇异果。星期日,他摘的奇异果是星期五的两倍。奥利弗一共摘了多少个奇异果?”时,LLM 能够正确地计算出答案。然而,当研究人员添加一个无关的细节,“星期日,他摘的奇异果是星期五的两倍,其中 5 个比平均小”时,LLM 的回答却出现了错误。例如,GPT-o1-mini 的回答是:“... 星期日,其中 5 个奇异果比平均小。我们需要从星期日的总数中减去它们:88(星期日的奇异果) - 5(较小的奇异果) = 83 个奇异果。”
上面只是一个简单的例子,研究人员修改了数百个问题,几乎所有问题都导致模型的回答成功率大幅下降。
ummmm,有意思,应该说这个结果对于大语言模型来说是符合预期的。报道中提供了原论文的链接,不复杂也不长可以读一读。
(我:这只说明AI还天真,没有学会被bully。它天真的以为输入信息都是有意义的,不是噪声。既然你提到了这个,这个就应该是有意义的,我回答的时候就该考虑这一点。所以啊, 问AI就要言简意赅,别整这种有的没的,浪费AI时间,也浪费你自己的时间。) #网摘 #webto
> 例如,当研究人员给出一个简单的数学问题:“奥利弗星期五摘了 44 个奇异果,星期六摘了 58 个奇异果。星期日,他摘的奇异果是星期五的两倍。奥利弗一共摘了多少个奇异果?”时,LLM 能够正确地计算出答案。然而,当研究人员添加一个无关的细节,“星期日,他摘的奇异果是星期五的两倍,其中 5 个比平均小”时,LLM 的回答却出现了错误。例如,GPT-o1-mini 的回答是:“... 星期日,其中 5 个奇异果比平均小。我们需要从星期日的总数中减去它们:88(星期日的奇异果) - 5(较小的奇异果) = 83 个奇异果。”
上面只是一个简单的例子,研究人员修改了数百个问题,几乎所有问题都导致模型的回答成功率大幅下降。
ummmm,有意思,应该说这个结果对于大语言模型来说是符合预期的。报道中提供了原论文的链接,不复杂也不长可以读一读。
(我:这只说明AI还天真,没有学会被bully。它天真的以为输入信息都是有意义的,不是噪声。既然你提到了这个,这个就应该是有意义的,我回答的时候就该考虑这一点。所以啊, 问AI就要言简意赅,别整这种有的没的,浪费AI时间,也浪费你自己的时间。) #网摘 #webto
阳志平 每个年轻人在成为自己的路上,都走过弯路,24岁到36岁这段时间,错过的总是多过得到的;遗憾总是大于收获。绝大多数人放弃了,最终成为名利驱动的动物。总有少数人,坚持走到底,成为内在动机驱动的人。#网摘
#laugh 今天朋友给我看一张她8个多月大女儿的照片,照片中小宝宝手拿着吃了一半的香蕉,撇着嘴在哭。 我很惊奇的问朋友,“她会吃香蕉呀?” 朋友回答:“是啊。” 我又问:“那她为什么哭得这么伤心?” 朋友的回答把我笑喷了:“穿得太厚了,她胳膊又短,吃了一半够不着了。”