tencent cloud

Service Registry and Governance

AI Gateway Overview

PDF
Mode fokus
Ukuran font
Terakhir diperbarui: 2026-05-07 17:26:54
AI gateway is a new-generation gateway product launched by Tencent Cloud Intelligent Gateway for large models and intelligent scenarios. It focuses on solving core issues faced by enterprises when enterprises access, schedule, and manage multiple AI models, such as complex protocols, governance difficulties, uncontrollable costs, and high barriers to transforming existing businesses.
AI gateway serves as the traffic entry and governance hub for enterprise intelligent architectures, enabling enterprises to efficiently, securely, and economically integrate and utilize AI capabilities through unified protocol adaptation, intelligent routing scheduling, and comprehensive observability capabilities, thereby accelerating business innovation and intelligent transformation.


Product Features

Intelligent Model Governance: Unified access and intelligent scheduling of Tencent Cloud Hunyuan, open-source models, and third-party commercial models. Through load balancing (CLB), circuit breaking and degradation, and cost optimization policies, it achieves the optimal balance of performance, stability, and cost.
Rapid AI Transformation for Legacy Business Systems: It features a powerful built-in protocol conversion engine that supports bidirectional conversion between AI ecosystem protocols such as MCP and OpenAI and traditional business protocols like HTTP/gRPC. This enables legacy business systems to quickly acquire AI capabilities, effectively protecting enterprises' existing IT investments.
Comprehensive End-to-End Security and Compliance: Build a multi-layered security protection system spanning from access authentication and parameter filtering to Data Masking (DMask). Integrated with capabilities such as WAF and DDoS protection (Anti-DDoS), it ensures AI applications operate compliantly, securely, and reliably.
Enterprise-Grade High Availability Assurance: Adopts a multi-AZ high-availability deployment architecture, supports automatic failover and elastic scaling of instances, ensuring service availability.

Business Scenarios

Scenario 1: Unified Governance and Intelligent Scheduling of Multiple Models

Core Issues: Enterprises need to use multiple AI models simultaneously but lack a unified platform, leading to chaotic model usage, inefficient resource allocation, and uncontrollable costs.
Solution: AI gateway serves as a unified entry point, enabling visual access and management of multiple models. Through intelligent routing policies, it automatically selects the optimal model based on factors such as request content, model performance, and cost, and incorporates rate limiting and circuit breaking mechanisms to ensure service stability, achieving cost reduction and efficiency improvement.


Scenario 2: AI Transformation of Legacy Business Systems

Core Issues: Traditional enterprises' legacy systems have outdated technology stacks; direct transformation requires significant investment and high risks, and they cannot adapt to modern AI application invocation protocols.
Solution: The AI gateway automatically packages standard APIs provided by legacy business systems into standardized tools (MCP Tools) that AI applications can call, through its protocol conversion engine. This enables the "zero-code transformation" of business capabilities into AI capabilities. Developers can quickly build intelligent applications without needing to focus on underlying integration details.


Features

Feature
Note
Unified Protocol Access
100% compatible with the open-source gateway ecosystem, fully adapts to standard AI protocols such as MCP, OpenAI, and SSE, provides seamless conversion for traditional protocols like RESTful and gRPC, and enables a single gateway to handle all traffic.
Model Service Management
Provides full lifecycle management for model services, supports configuring keys and API endpoints for multiple model vendors, and offers model-level traffic control, Fallback disaster recovery, and fine-grained monitoring.
Intelligent Routing & Orchestration
Supports intelligent routing based on policies such as content semantics, cost, and performance. Can orchestrate and chain multiple model invocations or business APIs to complete complex tasks.
Fine-grained Traffic Governance
Provides capabilities such as rate limiting, circuit breaking, and degradation across multiple dimensions from consumers and APIs to models, ensuring the stability of backend services and model APIs.
Out-of-the-box Security Protection
Integrates security capabilities such as authentication and authorization, access control, sensitive information desensitization, and replay attack prevention, provides enterprise-grade security protection, and meets compliance requirements.
End-to-end Observability
Provides end-to-end tracing from user requests to model responses, monitors multi-dimensional metrics such as API call latency, Token consumption, and model costs, supports intelligent diagnosis and alarms, and facilitates Ops and cost optimization.
Fine-grained Permission Management
Implements secure isolation and convenient sharing of AI capabilities across different teams and projects through a multi-level permission model based on consumers and consumer groups, supporting platform-based operations.


Bantuan dan Dukungan

Apakah halaman ini membantu?

masukan