Caffeine takes up nearly 100 million gigabytes of storage in one database
(100,000,000 * GB) / TB == 97,656.
So Caffeine’s current index could be stored non-redundantly on ~98 thousand 1TB drives. Newegg’s cheapest 1TB drive is currently $70, so that’s very-naively 6.8 million USD in HDD drive costs alone.
and adds new information at a rate of hundreds of thousands of gigabytes per day.
And it sounds like you’re on the hook for buying “hundreds” of 1TB drives each day, with a burn rate between $14,000/day (200 TB drives) to $63,000/day (900 TB drives).
My math is wildly naive as economies of scale push prices down and machine costs + redundancy + networking + labor + energy drive costs up, but it’s fun to think about.