CLIIntroduction

Welcome to CloudQuery

Open-source core
Blazing fast
Deploy anywhere
Pre-built queries
Eliminate data silos
Unlimited scale

CloudQuery is a high-performance, flexible data movement framework that runs entirely on your infrastructure. Extract data from hundreds of cloud and SaaS sources and load it into any data warehouse, lake, or database with speed, precision, and full control.

Built for developers and designed for the demands of AI-native applications, CloudQuery powers use cases from AI workflows to cloud security to asset inventories.

  • Integrate easily with your stack with a code-first interface (Go, Python, Java, and JavaScript)
  • Unmatched performance, powered by an open-source framework powered by Apache Arrow
  • Complete control, so your data is never exposed

Installation

Check out the quickstart guide for step-by-step instructions on completing your first sync with CloudQuery.

Why CloudQuery?

  • Composable and flexible - Use the languages, destinations, and orchestrators you want. CloudQuery is built to fit into your stack, not the other way around.
  • Runs on your infrastructure - Your cloud data never touches CloudQuery’s servers. Full privacy, built for regulated, secure, and performance-critical environments.
  • Built for developers - Code-first, extensible plugins, multi-language, open plugin system, no lock-in. Write it, extend it, ship it. No black boxes, no unexplained failures.
  • Fast, powerful data movement - Move large volumes of data with high performance and fine-grained control, powered by Apache Arrow. Perfect for feeding AI models, LLM pipelines, or large-scale data stores.
  • Specialized plugin coverage - Support for complex, unique data sources such as cloud infrastructure, security, and FinOps data.

Next steps