展示HN:软件如何失败——一本关于复杂系统失败的书(样章)

2作者: enginyoyen23 天前原帖
根据理查德·库克关于复杂系统故障的研究,我写了一本书,探讨了当每个组件都正常运作时,为什么像奈特资本(Knight Capital)损失4.4亿美元这样的灾难仍然会发生。 免费章节样本: [https://leanpub.com/how-software-fails](https://leanpub.com/how-software-fails) 这本书探讨了复杂系统故障背后的模式——从宇宙射线在投票机中翻转比特,到Therac-25辐射过量事件。关键见解是:传统的“根本原因分析”根本误解了这些系统是如何实际发生故障的——故障是由各个组件之间的相互作用引发的,而这些组件本身都是正常工作的。 欢迎下载样本章节试读。
查看原文
Based on Richard Cook&#x27;s research on complex system failures, I wrote a book exploring why disasters like Knight Capital&#x27;s $440M loss happen when every component functions correctly.<p>Free sample chapters: <a href="https:&#x2F;&#x2F;leanpub.com&#x2F;how-software-fails" rel="nofollow">https:&#x2F;&#x2F;leanpub.com&#x2F;how-software-fails</a><p>The book explores patterns behind complex system failures—from cosmic rays flipping bits in voting machines to the Therac-25 radiation overdoses. Key insight: traditional &quot;root cause analysis&quot; fundamentally misunderstands how these systems actually fail—they emerge from interactions between components that were individually functioning correctly.<p>Feel free to grab the sample chapter and give it a try.