Original article date: Feb 17, 2026

SoftBank and AMD Partner to Revolutionize AI Infrastructure with Intelligent GPU Orchestration


SoftBank Corp. and AMD have announced a collaboration to test AMD Instinct GPUs for next-generation AI infrastructure. The partnership focuses on optimizing GPU resource allocation for generative AI applications and large language models (LLMs) through SoftBank's Orchestrator system, with the aim of reducing both resource bottlenecks and wasted capacity in enterprise AI deployments.

Revolutionary GPU Resource Management

The core of the work is SoftBank's Orchestrator system, which uses the hardware partitioning capability of AMD Instinct GPUs to split a single physical GPU into multiple logical devices. The Orchestrator then allocates those logical devices dynamically according to each application's needs, improving efficiency for organizations that run multiple AI workloads at the same time.

Key Technical Advantages

  • Dynamic Workload Management: The Orchestrator adapts to different model sizes and user request volumes in real time
  • Multi-Application Support: Multiple AI applications can operate on one GPU while maintaining resource availability for all tasks
  • Intelligent Resource Distribution: Compute allocation adjusts automatically to each application's requirements instead of following fixed allocations
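
To make the idea of partition-based allocation concrete, here is a minimal sketch in Python. It is not SoftBank's Orchestrator logic or any AMD API: the `LogicalGPU` class, the `place` function, the eight-partition GPU, and the application names are all hypothetical, chosen only to illustrate how several applications might share one partitioned GPU and how capacity returns to the pool when an application is released.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class LogicalGPU:
    """Hypothetical model: one physical GPU split into equal logical partitions."""
    name: str
    total_partitions: int
    assignments: dict[str, int] = field(default_factory=dict)

    @property
    def free_partitions(self) -> int:
        # Partitions not currently reserved by any application.
        return self.total_partitions - sum(self.assignments.values())

    def allocate(self, app: str, partitions: int) -> bool:
        """Reserve partitions for an app; return False if they do not fit."""
        if partitions <= 0 or partitions > self.free_partitions:
            return False
        self.assignments[app] = self.assignments.get(app, 0) + partitions
        return True

    def release(self, app: str) -> None:
        """Return all of an app's partitions to the free pool."""
        self.assignments.pop(app, None)


def place(gpus: list[LogicalGPU], requests: dict[str, int]) -> dict[str, Optional[str]]:
    """First-fit placement, largest request first (an assumed policy)."""
    placement: dict[str, Optional[str]] = {}
    for app, need in sorted(requests.items(), key=lambda kv: -kv[1]):
        # Take the first GPU with enough free partitions, or None if none fits.
        placement[app] = next((g.name for g in gpus if g.allocate(app, need)), None)
    return placement


if __name__ == "__main__":
    gpu = LogicalGPU("gpu0", total_partitions=8)
    print(place([gpu], {"chat-llm": 4, "summarizer": 2, "embedder": 1}))
    print("free partitions:", gpu.free_partitions)
```

In this sketch three applications share one GPU, and a fourth request is rejected only when too few partitions remain. A production scheduler would also have to weigh request volume and latency targets, which this example leaves out.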

Addressing Critical Enterprise AI Challenges

Large language models require very different amounts of computing resources depending on their parameter count and complexity. A fixed allocation therefore tends to give a small model more GPU capacity than it can use and a large model less than it needs, and these inefficiencies slow adoption and increase costs. The collaboration addresses the problem with orchestration that aims to maximize hardware utilization while maintaining performance standards.
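
A rough calculation shows how widely resource needs can vary with parameter count. The sketch below estimates only the memory needed to hold model weights, as parameter count times bytes per parameter. The model sizes and the `weight_memory_gb` helper are illustrative assumptions, not figures from the announcement, and the estimate ignores the KV cache, activations, and runtime overhead.

```python
# Bytes needed to store one parameter at common numeric precisions.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(params_billions: float, dtype: str = "fp16") -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes); weights only."""
    return params_billions * BYTES_PER_PARAM[dtype]


if __name__ == "__main__":
    for size in (7, 70):  # illustrative model sizes, in billions of parameters
        print(f"{size}B parameters at fp16: ~{weight_memory_gb(size):.0f} GB")
```

By this estimate a 70-billion-parameter model needs ten times the weight memory of a 7-billion-parameter model at the same precision, which is the kind of spread a fixed allocation cannot serve well.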

Industry Impact: The solution is intended to make AI infrastructure more accessible to small and medium enterprises by improving cost-effectiveness and reducing the complexity of managing AI workloads at scale.

What's Next: MWC Barcelona 2026 Demonstration

The companies will showcase their joint verification work at Mobile World Congress Barcelona 2026, where they'll demonstrate real-world performance improvements. The SoftBank Research Institute of Advanced Technology has released detailed technical architecture specifications, indicating the partnership is moving beyond theoretical concepts to practical implementation.

According to Ryuji Wakikawa, SoftBank's Vice President, the orchestration logic implementation for AMD Instinct GPUs enables significantly better performance across multiple AI applications. AMD's Corporate Vice President Kumaran Siva emphasized that proper GPU resource allocation remains critical for successful AI inference deployment in high-performance enterprise environments.

The collaboration is a step toward making enterprise AI more efficient, cost-effective, and scalable for organizations of all sizes.

Read the full article on TechNetBooks