Rafay Systems Launches Token Factory to Help GPU Providers Monetize AI at Scale

Rafay Systems has introduced Token Factory, a new capability within its platform designed to help GPU infrastructure providers evolve into full-fledged AI service operators. The offering enables organizations to deliver and monetize AI models through token-based consumption, a model rapidly becoming central to how enterprises and developers access AI services.

With the general availability of Token Factory, Rafay equips neocloud providers and AI infrastructure operators with built-in metering, pricing, and access controls. This allows them to offer AI models as services without building complex monetization frameworks from scratch. By converting raw GPU capacity into tokenized consumption, providers can unlock new revenue streams while simplifying service delivery for end users.

The launch comes amid a shift in AI usage patterns, where enterprises increasingly rely on agentic frameworks such as OpenClaw and NVIDIA’s NemoClaw. These systems execute multi-step workflows and continuously interact with external tools, significantly increasing token consumption compared to traditional AI queries. As a result, demand for scalable, token-based access to AI models is accelerating, with most spending currently concentrated among hyperscalers.

Token Factory aims to redistribute that opportunity by enabling regional infrastructure providers to serve local demand. It allows operators to expose AI models through API endpoints, where usage is tracked and billed based on tokens. Built-in governance, quota management, and real-time monitoring ensure that both enterprises and individual users can manage consumption effectively.
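To make the idea of token-based metering concrete, the following is a minimal Python sketch of how an operator might track usage against a quota and compute a bill. It is an illustration only, not Rafay's implementation or API: the `TokenAccount` class, its field names, and the flat per-1,000-token price are all assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass
class TokenAccount:
    """Hypothetical per-customer account (illustrative, not Rafay's data model)."""
    quota_tokens: int            # assumed cap on tokens per billing period
    price_per_1k_tokens: float   # assumed flat price per 1,000 tokens
    used_tokens: int = 0         # tokens metered so far in this period

    def record_usage(self, prompt_tokens: int, completion_tokens: int) -> bool:
        """Meter one request; reject it if it would exceed the quota."""
        total = prompt_tokens + completion_tokens
        if self.used_tokens + total > self.quota_tokens:
            return False  # quota exceeded, nothing is recorded
        self.used_tokens += total
        return True

    def remaining(self) -> int:
        """Tokens still available under the quota."""
        return self.quota_tokens - self.used_tokens

    def amount_due(self) -> float:
        """Bill for the tokens consumed so far."""
        return self.used_tokens / 1000 * self.price_per_1k_tokens


# Example: a 10,000-token plan priced at 0.50 per 1,000 tokens (made-up numbers).
account = TokenAccount(quota_tokens=10_000, price_per_1k_tokens=0.50)
accepted = account.record_usage(prompt_tokens=1_200, completion_tokens=800)
rejected = account.record_usage(prompt_tokens=9_000, completion_tokens=0)
```

In this sketch the first request is metered and the second is refused because it would push usage past the quota. A production service would also need persistence, concurrency control, and per-model pricing, none of which are shown here.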

“Token Factories are the new cellphone companies,” said Haseeb Budhani, CEO and co-founder of Rafay Systems. “Similar to how cellphone companies used to sell pre- and post-paid minute plans, AI factories are beginning to sell pre- and post-paid token plans. Team Rafay is looking forward to supporting the success of a thousand AI factories across the world with our Token Factory offering.”

The platform integrates with leading agentic frameworks, letting developers connect to tokenized endpoints through self-service workflows without managing the underlying infrastructure. Rafay is positioning Token Factory for the growing AI services market, where consumption-based models are expected to dominate.

As global investment in sovereign AI and GPU-as-a-Service continues to rise, Rafay’s latest release addresses a critical gap: helping infrastructure providers move beyond capacity provisioning to sustainable monetization strategies.