GPU 每小时 $2.25 或每小时 $12.29:决定价格的基础设施层
H100的价格差异达到9倍,这一现象确实存在,但进行比较时需要谨慎。每小时1.38美元的价格通常是预留或承诺的计算能力,而每小时12.29美元的价格则是在主要云服务提供商处按需使用,并包含了完全灵活性的溢价。
更有意义的比较是对于一个持续使用的团队来说,三年的总拥有成本(TCO)。在1000个GPU上以85%的利用率运行时,专用的共置基础设施在二级市场的成本通常为相应云服务成本的40-60%,这已经考虑了所有非计算成本。这个范围取决于你内部运营的开销和融资成本。
在规模化运营中,硅芯片本身占总成本的20-25%。其余的成本包括基础设施、电力、网络、运营和其他开销。这就是为什么设施位置比人们预期的更为重要。
查看原文
The 9x price spread on H100 is real but the comparison requires some care. The $1.38/hr end is typically reserved or committed capacity. The $12.29/hr end is on demand at major cloud providers with full flexibility premium built in.<p>The more meaningful comparison is 3-year TCO for a team running consistent utilization. At 85% utilization on 1,000 GPUs, dedicated colocated infrastructure in a secondary market typically runs 40-60% of equivalent cloud cost after accounting for all non-compute costs. That range depends on your internal ops overhead and financing cost.<p>The silicon itself is 20-25% of total cost at scale. The rest is infrastructure, power, networking, ops, and overhead. That's why facility location matters more than people expect.