Archives

FriendliAI Opens San Francisco Hub to Power Next-Gen AI Inference at Scale

FriendliAI

FriendliAI has expanded its global footprint with a new 7,000-square-foot office in San Francisco’s SoMa district, positioning itself closer to the core of the AI ecosystem as demand for production-grade inference infrastructure accelerates. The move comes amid a surge in AI agent adoption, which significantly increases token consumption due to multi-step reasoning and tool usage, while increasingly powerful open-weight models are challenging closed systems on performance and cost efficiency. This shift is making inference optimization a critical bottleneck and competitive advantage for AI companies.

Also Read: Anthropic, Blackstone, Hellman & Friedman, and Goldman Sachs Launch Enterprise AI Services Firm

“San Francisco is the epicenter of AI innovation, and a deeper presence here lets us partner with the customers and developers shaping what comes next,” said Byung-Gon Chun, CEO of FriendliAI. “The industry is no longer asking whether to build with AI — it’s asking how to run AI in production, profitably, at scale. FriendliAI, The Frontier AI Inference Cloud, was built for exactly that.” The company, famous for its continuous batching and fast inference systems, handles production workloads for clients like Twelve Labs and LG. It reports strong growth as enterprises scale AI-native applications. The new hub will aid hiring and serve as a collaboration space. Developers and enterprise partners can work on large-scale AI deployment challenges there.

Read More: FriendliAI Expands to San Francisco to Scale Frontier AI Inference for Open-Weight and Custom Models