Cloud storage for AI: Options, pros and cons


IT architects tasked with the design of storage systems for artificial intelligence (AI) need to balance capacity, performance and cost.

AI systems, especially those based on large language models (LLMs), consume vast amounts of data. In fact, LLMs or generative AI (GenAI) models often work better the more data they have. The training phase of AI in particular is very data hungry.

The inference phase of AI, however, needs high performance if systems are not to feel unresponsive or fail to work at all. That means high throughput and low latency.

So, a key question is: to what extent can we use a mix of on-premise and cloud storage? On-premise storage brings higher performance and greater security. Cloud storage offers the ability to scale, lower costs and, potentially, better integration with cloud-based AI models and cloud data sources.

In this article, we look at the pros and cons of each and how best to optimise them for storage for AI.

AI storage: On-premise vs cloud?

Enterprises typically look to on-premise storage for the best speed, performance and security – and AI workloads are no exception. Local storage can also be easier to fine-tune to the needs of AI models, and will likely suffer less from network bottlenecks.

Then there are the advantages of keeping AI models close to source data. For enterprise applications, this is often a relational database that runs on block storage.

As a result, systems designers need to consider the impact of AI on the performance of a system of record. The business will not want key packages such as ERP or CRM slowed down because they also feed data into an AI system. There are also strong security, privacy and compliance reasons for keeping core data records on site rather than moving them to the cloud.

Even so, cloud storage also offers advantages for AI projects. Cloud storage is easy to scale, and customers only pay for what they use. For some AI use cases, source data will already be in the cloud – in a data lake or a cloud-based SaaS application, for example.

Cloud storage is largely based around object storage, which is well suited to the unstructured data that makes up the bulk of information consumed by large language models.

At the same time, the growth of storage systems that can run object storage on-premise makes it easier for enterprises to have a single storage layer – even a single global namespace – to serve on-premise and cloud infrastructure, including AI. This is especially relevant for firms that expect to move workloads between local and cloud infrastructure, or operate “hybrid” systems.

AI storage and cloud options

Cloud storage is often the first choice for enterprises that want to run AI proofs-of-concept (PoCs). It removes the need for up-front capital investment and can be spun down at the end of the project.

In other cases, firms have designed AI systems to “burst” from the datacentre to the cloud. This makes use of public cloud resources for compute and storage to cover peaks in demand. Bursting is most effective for AI projects with relatively short peak workloads, such as those that run on a seasonal business cycle.

But the arrival of generative AI based on large language models has tipped the balance more towards cloud storage simply because of the data volumes involved.

At the same time, cloud providers now offer a wider range of dedicated data storage options focused on AI workloads. This includes storage provision tailored to different stages of an AI workload, namely: prepare, train, serve and archive.

As Google’s engineers put it: “Each stage in the ML [machine learning] lifecycle has different storage requirements. For example, when you upload the training dataset, you might prioritise storage capacity for training and high throughput for large datasets. Similarly, the training, tuning, serving and archiving stages have different requirements.”

Although this is written for Google Cloud Platform, the same principles apply to Microsoft Azure and Amazon Web Services. All three hyperscalers, plus vendors such as IBM and Oracle, offer cloud-based storage suitable for the bulk storage requirements of AI. For the most part, unstructured data used by AI, including source material and training data, will likely be held in object storage.

This could be AWS S3, Azure Blob Storage, or Google Cloud’s Cloud Storage. In addition, third-party software platforms, such as NetApp’s ONTAP, are also available from the hyperscalers and can improve data portability between cloud and on-premise operations.
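As a rough illustration, the sketch below uses the boto3 SDK for AWS S3 – the other object stores have equivalent client libraries – to stage a training corpus in the standard tier and write an archived checkpoint straight to a colder storage class. The bucket, file names and choice of storage class are illustrative assumptions, not recommendations.

    import boto3  # AWS SDK for Python; Azure and Google Cloud offer equivalent clients

    s3 = boto3.client("s3")
    BUCKET = "example-ai-training-data"  # placeholder bucket name

    # Bulk, unstructured training data goes to the default (standard) tier,
    # which favours capacity and throughput for the prepare/train stages.
    s3.upload_file("corpus/shard-0001.parquet", BUCKET, "training/shard-0001.parquet")

    # Archived outputs, such as old checkpoints, can be written straight to a
    # colder, cheaper storage class for the archive stage of the lifecycle.
    with open("checkpoints/epoch-10.pt", "rb") as f:
        s3.put_object(
            Bucket=BUCKET,
            Key="archive/checkpoint-epoch-10.pt",
            Body=f,
            StorageClass="GLACIER",
        )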

For the production, or inference, stage of AI operations, the choices are often even more complex. IT architects can specify NVMe and SSD storage with different performance tiers for critical parts of the AI workflow. Older “spinning disk” storage remains on offer for tasks such as initial data ingest and preparation, or for archiving AI system outputs.
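To give a hedged sense of how that tiering looks in practice, the sketch below uses boto3 to provision two AWS EBS volumes: a provisioned-IOPS SSD volume for the latency-sensitive inference path, and a throughput-optimised hard-disk volume for ingest or archive. The sizes, IOPS figure and availability zone are placeholder values.

    import boto3  # AWS SDK; other clouds expose similar disk-type choices

    ec2 = boto3.client("ec2", region_name="eu-west-1")

    # Fast SSD tier: provisioned-IOPS volume for the inference data path.
    ec2.create_volume(
        AvailabilityZone="eu-west-1a",   # placeholder zone
        Size=500,                        # GiB
        VolumeType="io2",                # provisioned-IOPS SSD
        Iops=16000,
    )

    # Cheaper "spinning disk" tier: throughput-optimised HDD for ingest or archive.
    ec2.create_volume(
        AvailabilityZone="eu-west-1a",
        Size=2048,                       # GiB
        VolumeType="st1",                # throughput-optimised HDD
    )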

This type of storage is also application neutral: IT architects can specify their performance parameters and budget for AI as they can for any other workload. But a new generation of cloud storage is designed from the ground up for AI.

Advanced cloud storage for AI

The specific demands of AI have prompted storage vendors to design dedicated infrastructure that avoids bottlenecks in AI workflows – bottlenecks found in on-premise systems but also in the cloud. Two approaches are key here: parallelism and direct GPU memory access.

Parallelism allows storage systems to handle what storage supplier Cloudian describes as “the concurrent data requests characteristic of AI and ML workloads”. By handling multiple data streams in parallel, the storage system speeds up both model training and inference.

An example here is Google’s Parallelstore, which launched last year to provide a managed parallel file storage service aimed at intensive input/output for artificial intelligence applications.
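At the application level, the same principle shows up as issuing many reads concurrently rather than one at a time. The minimal sketch below, using boto3 and a thread pool with placeholder bucket and object names, overlaps object fetches in flight; a managed parallel file system such as Parallelstore does the equivalent work transparently and at far greater scale.

    from concurrent.futures import ThreadPoolExecutor
    import boto3

    s3 = boto3.client("s3")
    BUCKET = "example-ai-training-data"                     # placeholder bucket
    keys = [f"training/shard-{i:04d}.parquet" for i in range(64)]

    def fetch(key):
        # Each worker issues its own GET, so requests overlap in flight.
        return s3.get_object(Bucket=BUCKET, Key=key)["Body"].read()

    # 16 concurrent readers keep the data pipeline busy instead of
    # waiting on one object at a time.
    with ThreadPoolExecutor(max_workers=16) as pool:
        shards = list(pool.map(fetch, keys))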

Direct GPU memory access, meanwhile, sets out to remove bottlenecks between the storage cache and GPUs, which are expensive and can be scarce. According to John Woolley, chief commercial officer at vendor Insurgo Media, storage must deliver at least 10GBps of sustained throughput to prevent “GPU starvation”.
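A back-of-the-envelope calculation shows why a figure like that matters. The sketch below uses purely illustrative, assumed numbers for batch size and training speed to estimate the sustained read bandwidth a GPU node would need from storage to avoid sitting idle.

    # Rough sizing sketch with illustrative, assumed numbers
    gpus = 8                      # GPUs in the node
    batch_gb_per_gpu = 0.5        # data read per GPU per training step (GB)
    steps_per_second = 3          # training steps each GPU completes per second

    required_gbps = gpus * batch_gb_per_gpu * steps_per_second
    print(f"Sustained read bandwidth needed: {required_gbps:.0f} GBps")
    # -> 12 GBps with these assumptions; fall much below roughly 10GBps and
    #    the GPUs start to sit idle waiting on data ("GPU starvation")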

Protocols such as GPUDirect – developed by Nvidia – allow GPUs to access NVMe drive memory directly, much as RDMA allows direct access between systems without involving the CPU or the operating system. The approach also goes by the name Direct GPU Support (DGS).
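In Python, one route to this is Nvidia’s cuFile/GPUDirect Storage interface as exposed by the KvikIO library. The sketch below – assuming a CUDA-capable system with KvikIO and CuPy installed, and using a placeholder file path – reads data from an NVMe-backed file straight into a GPU buffer rather than staging it in host memory first.

    import cupy
    import kvikio  # Python bindings for Nvidia's cuFile / GPUDirect Storage

    # Allocate the destination buffer directly in GPU memory.
    buf = cupy.empty(1024 * 1024, dtype=cupy.uint8)

    # Read from the NVMe-backed file into GPU memory; with GPUDirect Storage
    # enabled, the transfer bypasses the CPU bounce buffer.
    f = kvikio.CuFile("/data/shard-0001.bin", "r")  # placeholder path
    f.read(buf)
    f.close()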

Local cache layers between the GPU and shared storage can use block storage on NVMe SSDs to provide “bandwidth saturation” to each GPU, at 60GBps or more. As a result, cloud suppliers plan a new generation of SSDs optimised for DGS and likely to be based on SLC (single-level cell) NAND.

“Inference workloads require a combination of traditional enterprise bulk storage and AI-optimised DGS storage,” says Sebastien Jean, CTO at Phison US, a maker of NAND flash controllers and SSDs. “The new GPU-centric workload requires small I/O access and very low latency.”

As a result, the market is likely to see more AI-optimised storage systems, including those with Nvidia DGX BasePod and SuperPod certification, and AI integration.

Options include Nutanix Enterprise AI, Pure Storage’s Evergreen//One for AI, Dell PowerScale, Vast’s Vast Data Platform, cloud hybrid NAS provider Weka, and offerings from HPE, Hitachi Vantara, IBM and NetApp.


