OpenAI plants its first overseas lab. Anthropic’s $65B Series H. NVIDIA’s Vera Rubin cuts inference costs tenfold. What does this mean for AI founders?

Three Moves, One Signal: How OpenAI, Anthropic, and NVIDIA are Rewriting the AI Value Map

OpenAI plants its first overseas lab. Anthropic’s $65B Series H. NVIDIA’s Vera Rubin cuts inference costs tenfold. What does this mean for AI founders?

If you are building an AI native company in Southeast Asia, recent developments across major players in frontier AI shifted three variables that matter to your business: your cost structure, your investor universe, and the competitive distance between where you are and where the frontier labs are operating.

On May 20, OpenAI committed S$300 million to Singapore, announced its first Applied AI Lab outside the United States, and signed a memorandum of understanding with the Singapore government covering public service, finance, healthcare, and digital infrastructure [7]. On May 22, it filed its S-1 confidentially, targeting a valuation of up to $1 trillion on nearly $6 billion in Q1 2026 revenue, while remaining deeply unprofitable [1][2]. On May 28, Anthropic closed a $65 billion Series H at a $965 billion post-money valuation, with GIC as a co-lead investor and Temasek as a significant investor, Singapore’s two sovereign wealth funds anchoring the round together [6][8]. Two months prior, at NVIDIA GTC, Jensen Huang had announced the Vera Rubin platform: inference costs falling to one-tenth of the previous generation, available from every major hyperscaler in H2 2026 [3].

None of these events happened in isolation. Read together, they describe a specific set of conditions that SEA founders should understand. The implications differ depending on where you sit in the stack.

Your Cost of Intelligence Just Dropped by an Order of Magnitude

The most actionable development for applied AI founders is NVIDIA’s Vera Rubin, and most product teams have not worked through what it means for their business.

The NVL72 delivers up to 10x higher inference throughput per watt at one-tenth the cost per token compared to Blackwell [3]. Gartner projects inference costs falling 90% by 2030 even without further hardware improvements beyond what is already announced [4]. Vera Rubin ships H2 2026 from AWS, Google Cloud, Microsoft Azure, Oracle Cloud, and CoreWeave. 

For any company whose product runs on per-transaction AI processing, this is a direct COGS event. If your product processes documents, routes conversations, classifies transactions, or handles any high-volume inference workload, your input costs are about to fall materially. The question is not whether this changes your economics (it does), but whether you are positioned to capture that improvement as margin, use it to lower price and expand your addressable market, or let it pass through to customers as a competitive necessity.

The bifurcation that matters here is between commodity inference and frontier reasoning. Token prices for commodity tasks (document processing, classification, summarization, structured extraction) are falling and will continue to fall with Vera Rubin. Token prices for frontier reasoning, the kind required for complex multi-step agentic workflows, are stickier, because they require the most capable models and the most compute-intensive generation methods.

Founders building products on commodity inference are entering a structural tailwind. Founders building products that depend on frontier reasoning at every step need to think carefully about whether the workflow can be redesigned to use frontier models only where necessary and commodity models everywhere else.

Gartner analyst Will Sommer put the risk plainly: “Chief Product Officers should not confuse the deflation of commodity tokens with the democratization of frontier reasoning.” [4] The cost collapse is real, but it is not uniform. The founders who understand this distinction will make better product and pricing decisions than those who assume cheaper tokens means cheaper AI across the board.

Case Study: WIZ.AI operates on a per-conversation pricing model for enterprise voice AI across more than 300 clients in 17 countries [9]. The economics are direct: the company pays per token consumed and charges per conversation delivered. But the unit of sale is not a conversation — it is an organizational transformation.

WIZ.AI sells to enterprises, deploying multi-agent solutions that automate entire customer-facing workflows across sales, collections, and service functions. That architecture distributes the per-token cost across a much larger base of automated interactions, which changes the risk profile of the pricing model: the exposure is not a single call gone long but a portfolio of agent-handled conversations priced against measurable business outcomes.

As per-token costs fall with Vera Rubin, the spread between input cost and conversation value widens across that entire portfolio without any change to the product or the contracts. The constraint on WIZ.AI’s market expansion has not been model capability. It has been the price point at which enterprise voice AI becomes viable for mid-market customers across ASEAN. The H2 2026 inference cost shift moves that threshold.

Singapore’s Evolving Relationship with Frontier Labs

GIC and Temasek investing in Anthropic is significant not because of the capital ($65 billion is a large round with many investors) but because of what it signals about how Singapore’s institutional infrastructure will relate to frontier AI over the next decade [8].

Singapore’s sovereign funds do not make passive bets. When GIC takes a co-lead position in a frontier AI lab, it is making a statement about which AI infrastructure it expects to be commercially relevant to Singapore’s economy. The downstream effects of that bet are not just financial returns. They include which AI systems get deployed across government-linked companies, which models get integrated into Singapore’s financial and healthcare infrastructure, and which AI vendors receive favorable policy treatment as Singapore builds out its AI regulatory framework.

For SEA founders, this matters because Singapore is the region’s primary hub for enterprise sales, regional headquarters, and institutional partnerships. An AI company aligned with the infrastructure Singapore’s sovereign capital has backed is better positioned for enterprise deals in the region than one that is not. This is not about choosing sides between models. It is about understanding that the institutional environment in Singapore is now organized around frontier AI adoption, and that creates a more favorable enterprise sales environment than existed eighteen months ago.

OpenAI’s Singapore lab reinforces this. The Applied AI Lab’s mandate covers public service, finance, healthcare, and digital infrastructure, the same sectors where SEA founders are building [7]. The 200-plus technical roles OpenAI is hiring in Singapore will build local AI talent density. Some of that talent will eventually leave OpenAI and join or found applied AI companies. The government MOU creates procurement relationships and pilot opportunities that will make Singapore enterprises more accustomed to deploying AI systems, which lowers the sales cycle friction that has historically slowed enterprise AI adoption in the region.

The near-term effect for SEA founders is a more AI-ready enterprise customer base. The medium-term effect is a deeper local talent pool. Both are inputs that compounding AI companies need.

The Frontier Subsidy and Why Founders Should Understand It

OpenAI’s S-1 filing, what will become the first public window into the company’s financials, reveals something that every founder consuming frontier model APIs should understand: the product you are building on is being subsidized at scale.

The company remains unprofitable [1]. It has committed to hundreds of billions in future compute contracts. NVIDIA VP Bryan Catanzaro described the cost dynamic precisely: “For my team, the cost of compute is far beyond the costs of the employees.” [4] OpenAI’s capital structure reflects this. Microsoft’s approximately 27% stake has functioned as a financing mechanism for compute obligations the company cannot cover from revenue [1]. The IPO is a search for capital to fund those obligations.

Anthropic’s financials tell a structurally similar story. At $47 billion in annualized run-rate revenue, more than double OpenAI’s quarterly pace, Anthropic is growing faster, but it is signing the same category of commitments: five gigawatts with Amazon, five gigawatts of TPU capacity with Google and Broadcom, GPU access through SpaceX’s Colossus facilities [6]. Both companies are committing to infrastructure investments that their current revenue does not justify without the assumption that the cost curve will improve and token volumes will grow enough to reach profitability.

What this means for SEA founders is that the gap between what it costs to train and maintain a frontier model and what you pay to call its API is not just a pricing decision. It is a structural subsidy. The $130 billion raised across both companies in recent months is being deployed, in part, to keep frontier inference available and affordable long enough for applications built on top of it to generate the revenue that justifies the infrastructure. You are benefiting from that subsidy whether or not you have thought about it in those terms.

This creates a time-sensitive opportunity. The inference cost curve is falling, the frontier models are available at prices that do not fully reflect their development cost, and the hardware generation that will make commodity inference dramatically cheaper is arriving in H2 2026. Founders who build products that compound during this window, acquiring customers, embedding workflows, and deepening data advantages, will be harder to displace when the cost structure normalizes.

The Enterprise Deployment Gap Is Real, and It Is an Advantage

The past quarter produced evidence that should encourage SEA founders competing against global AI vendors for enterprise customers. Microsoft, the world’s largest enterprise AI distributor, has been canceling Claude Code licenses at scale because per-seat costs at enterprise deployment are too high [4]. Uber burned through its entire 2026 AI budget in four months [4]. These are not failures of AI capability. They are failures of cost structure fit.

The enterprises struggling with AI deployment economics are predominantly large Western organizations deploying AI for knowledge worker productivity, use cases where the value per interaction is hard to measure and the cost per interaction adds up quickly. The enterprises that SEA founders are typically selling to have different economic profiles. Manufacturing, logistics, financial services, and healthcare operations in Southeast Asia are deploying AI for workflows where the alternative is manual labor at a cost that AI undercuts even at today’s prices, and where the value per transaction is measurable.

A document processing company pricing per document processed in an ASEAN financial institution is not competing on the same cost-value equation as a coding assistant being deployed per seat at a Western tech company. The cost pushback that is hitting Microsoft’s enterprise AI business reflects Western enterprise procurement realities, not the procurement realities of the markets SEA founders are serving. This is a structural edge, not a temporary one.

Case Study: Surfin, which has disbursed over $5.3 billion in credit across 90 million users in 10 markets on three continents, and is on track to double revenue to USD 500 million this year, deploys AI for credit decisioning on populations without traditional financial history, using social behavior data and alternative data signals to price risk where bank records do not exist [10]. Each credit decision has a measurable outcome: a loan disbursed or declined, priced against the risk of default. The value per AI inference is not an abstraction. It is a recoverable margin on a loan.

Beyond its own lending operations, Surfin extends that infrastructure through B2B partnerships, offering its AI credit-scoring and underwriting capability as a service to other financial institutions that lack the data infrastructure to serve the same populations. That model means the cost of AI deployment is distributed across a partner network rather than absorbed by a single balance sheet, and each partner deployment adds to the pool of behavioral and repayment data that makes the models more accurate over time.

The dynamics creating budget problems at Microsoft and Uber — high per-seat cost, diffuse value, hard-to-measure ROI — are structurally absent here. Goldman Sachs estimates that agentic AI could drive a 24x increase in token consumption by 2030 as AI moves from single-shot queries to multi-step workflows [4]. For applied AI companies that charge per outcome rather than per token, increasing token consumption per workflow is a COGS increase, not a revenue constraint. The founders who design their pricing models to capture outcome value rather than token volume will be better positioned as agentic consumption grows.

What Founders Should Actually Do With This

The three moves from this week are not just news. They are inputs to product, pricing, and go-to-market decisions that compound over the next twelve to eighteen months.

On cost structure: model the impact of Vera Rubin pricing on your COGS before it ships. Identify which parts of your inference workload are commodity tasks that will benefit from the cost collapse, and which require frontier reasoning that will remain expensive. Design your product to use the right model for each task rather than defaulting to the most capable model for everything.

On market expansion: the customers you could not serve profitably at current inference costs may become viable in H2 2026. Map that population now. The founders who have a go-to-market motion ready for the newly addressable segment when the cost curve shifts will capture more of that expansion than those who react after the fact.

On enterprise relationships: Singapore’s institutional AI environment has become meaningfully more favorable in a single week. OpenAI’s government MOU and Anthropic’s GIC backing together create a policy and procurement environment that will accelerate enterprise AI adoption among Singapore-based and Singapore-linked companies. If your enterprise sales motion is not already working through Singapore as a regional entry point, the window to establish those relationships is now, while the institutional momentum is building rather than after it has crystallized around existing vendors.

On the frontier subsidy: use it. The gap between what frontier models cost to build and what you pay to access them will not stay this wide indefinitely. The companies that build the strongest customer relationships and the deepest workflow integrations during this window will retain those advantages when pricing normalizes.

The map is clear. Three of the world’s most consequential AI companies made moves in one week that, together, describe a specific set of conditions for applied AI in Southeast Asia. The conditions are favorable. They are also temporary. The founders who understand both things at once are the ones who will make the most of them.

References

[1] Beatrice Nolan, “The big questions looming over OpenAI’s trillion-dollar IPO,” Fortune, May 22, 2026. https://fortune.com/2026/05/22/openai-ipo-big-questions/

[2] The Information, “OpenAI Q1 2026 revenue nearly $6 billion,” cited in Fortune, May 22, 2026.

[3] NVIDIA Newsroom, “NVIDIA Vera Rubin Opens Agentic AI Frontier,” March 16, 2026. https://nvidianews.nvidia.com/news/nvidia-vera-rubin-opens-agentic-ai-frontier

[4] Jake Angelo, “Microsoft reports are exposing AI’s real cost problem,” Fortune, May 22, 2026. https://fortune.com/2026/05/22/microsoft-ai-cost-problem/

[5] WIZ.AI company profile, Insignia Ventures Partners portfolio, 2026.

[6] Anthropic, “Anthropic raises $65B in Series H funding at $965B post-money valuation,” May 28, 2026. https://www.anthropic.com/news/series-h

[7] OpenAI, “Introducing OpenAI for Singapore,” May 20, 2026. https://openai.com/index/introducing-openai-for-singapore/

[8] Duc Dao, “GIC, Temasek anchor Singapore’s investment in Anthropic’s $65B Series H round,” TechNode Global, May 29, 2026. https://technode.global/2026/05/29/gic-temasek-anchor-singapores-investment-in-anthropics-65b-series-h-round/

[9] “WIZ.AI: The Voice of AI-Powered Enterprise,” Insignia Business Review, February 19, 2026. https://review.insignia.vc/2026/02/19/wiz-ai/

[10] “Surfin: Financial Inclusion at Scale,” Insignia Business Review, February 12, 2026. https://review.insignia.vc/2026/02/12/surfin-nyse/

+ posts

Paulo Joquiño is a writer and content producer for tech companies, and co-author of the book Navigating ASEANnovation. He is currently Editor of Insignia Business Review, the official publication of Insignia Ventures Partners, and senior content strategist for the venture capital firm, where he started right after graduation. As a university student, he took up multiple work opportunities in content and marketing for startups in Asia. These included interning as an associate at G3 Partners, a Seoul-based marketing agency for tech startups, running tech community engagements at coworking space and business community, ASPACE Philippines, and interning at workspace marketplace FlySpaces. He graduated with a BS Management Engineering at Ateneo de Manila University in 2019.

***