为何会提出光互连解决当前HBM封装的极限?
最本质的原因是:我们正在一步步接近HBM封装的极限。
一、物理规律的极限被逐渐触及:
极限1️⃣:垂直堆叠层数——已触顶
HBM的堆叠演进:HBM2(8层)→ HBM2E(8层)→ HBM3(12层)→ HBM3E(12/16层)→ HBM4(计划16–20层)。
每多叠一层,TSV(硅穿孔)的深宽比就要提高。目前16层的TSV深宽比约为20:1,这是电镀铜填充工艺的极限边缘——再往上,铜填充时气泡无法逸出,良率断崖式下跌。
更深的问题是热阻叠加。每层DRAM die的导热路径要穿越所有下层芯片才能到达散热器。12层堆叠时底层die的结温(junction temperature)比顶层高约15°C,20层时这个差距会超过25°C,已经超过DRAM工作安全边界。
这都导向一个事实——垂直方向的物理扩展空间已所剩无几。
极限②:硅中介层岸线——已触顶
GPU的「岸线」(shoreline)是GPU die四周边缘的物理周长。HBM堆栈通过硅中介层与GPU并排,HBM的数量上限就是:GPU周长能容纳多少个HBM接口宽度。
假设GPU die最终做到一个reticle极限,周长约为130mm,单个HBM4堆栈接口约需4mm,理论上限约为32个堆栈——但实际上电源/信号走线、角落利用率等因素会把这个数字压到16–20个以内。
极限③:带宽密度(单位面积IO数)——接近极限
Microbump的物理极限大约在25–30μm pitch,低于这个数字,焊锡球的表面张力和对准精度无法维持量产良率。
极限④:功耗密度——这是最被低估的极限
HBM3E每栈满载功耗约15W,8栈就是120W,加上GPU本身的600–700W,整个封装的功耗密度已经超过100 W/cm²,相当于火箭发动机喷口附近的热流密度。
散热才是最硬的物理墙。
HBM越叠越高,散热路径越长,这是垂直堆叠无法回避的热阻叠加问题,与材料科学的边界直接碰撞。
—————————————————————————
二、光连接的解法
电信号传输本质上是在用「极短距离」这个物理条件换取优势。
光的优势恰好在电的弱点处显现——光信号的带宽与距离几乎无关。当GPU与HBM的物理间距被迫增加,光互连的相对优势就从「理论上可行」变成「工程上合理」。
—————————————————————————
三、目前技术架构的可能形态
方案A:光学Bridge芯片
在硅中介层中嵌入硅光子波导层,GPU与HBM之间的信号不再走铜线,而是走片内波导。距离仍在毫米级,但打破了「必须紧邻」的约束,允许HBM在中介层上远离GPU边缘排列。
这是Ayar Labs(与Intel合作)的TeraPHY路线,已在2024年实现单chiplet 2Tbps光I/O,且可3D堆叠于逻辑芯片下方。
方案B:CXL over Optics(推理情况不合适)
将多个HBM堆栈聚合为独立的记忆体池模块,通过CXL协议与GPU通信。距离可达几厘米至数十厘米,直接接入背板。这不是HBM专属光互连,而是把HBM变成CXL记忆体节点。
延迟代价:每次E→O→O→E转换约增加5–10ns,相对于DRAM本身的~150ns访问延迟,约增加3–7%,在大模型训练的流式访问模式下可接受,但推理场景(延迟敏感)会更在意。
方案C:3D光学垂直互连(最激进)
将HBM置于GPU正下方,利用垂直光学通孔(Optical Through-Silicon Vias,OTSV) 实现Z轴方向的光互连。这在理论上消灭了岸线限制(HBM直接在GPU下方大面积铺展),同时保持极短传输距离。
方案D:光子织网
把光子互连fabric做成一个独立的2D芯片层,像三明治一样插在GPU和HBM(或其他加速器)之间,所有芯片通过这层光子层通信。它并不限定是HBM,而是一个通用的光互连基板。
—————————————————————————
四、这个方案的难点:激光源
这是目前工程界最头疼的问题。
硅不能发光。硅光子可以导光、调制光、探测光,但无法产生光。今天所有硅光子方案的激光源都是III-V族化合物半导体(InP、GaAs基材料),需要外置激光器,通过fiber coupling耦合进硅波导。
这带来:
可靠性问题:激光器是光互连系统中寿命最短的组件
良率问题:光纤与波导的耦合对准精度要求亚微米级,大规模封装良率极低
成本问题:III-V激光器目前仍比硅便宜不了多少,难以摊薄
功耗问题:激光源本身的电光转换效率约30–40%,是额外的能耗来源
————————————————————————
五、解决路径:
1️⃣直接在硅上外延生长III-V材料
Intel、MIT林肯实验室、UCSB都在研究,但良率与可靠性尚未达到量产标准。
2️⃣另一条路是量子点激光器直接长在硅上,理论上可行但仍是实验室阶段。
AXT(AXTI)的潜在价值就在这里——其磷化铟(InP)基片是高性能光互连激光器的关键衬底材料,这条需求链条目前仍处于早期。
————————————————————————
六、时间线
2026–2027:板级CPO(GPU与光引擎共封装)进入量产,主要用于scale-out网络,不直接涉及HBM
2027–2029:光学Bridge方案开始进入高端AI加速器概念验证,HBM可能扩展至更远位置但仍在同一封装内
2029–2032:CXL over Optics的记忆体池化开始规模部署,HBM功能性分离
2032 :真正意义上GPU-HBM光互连作为标准封装方案,仍存在不确定性
Breaking the "Memory Wall": Optical Interconnects Emerge in GPU–HBM Packaging
As a solution to the "memory wall," one of the chronic challenges in AI semiconductors, the memory and packaging industries at home and abroad are weighing an approach that decouples the GPU and high-bandwidth memory (HBM) and packages them separately. The core idea is to move the HBM—until now mounted right next to the GPU—a certain distance away, and bridge the gap with light (optics), allowing several times more HBM to be installed than is possible today.
On the 22nd, a researcher at a major domestic memory maker said, "We're currently struggling to expand HBM bandwidth and capacity, so we're discussing with customers a plan to overcome the GPU's shoreline limit through optical interconnects and mount more HBM." Shoreline refers to the length of the chip's perimeter.
In today's AI computing environment, the key factor dragging down compute efficiency is the data transfer speed of memory chips. While GPU performance has grown by leaps and bounds with each generation, the speed at which memory stores and supplies data has failed to keep pace—creating a structural performance barrier, the memory wall. The arrival of HBM, with its wide data pathways, put out the immediate fire, but critics continue to point out that bandwidth and transfer speeds still fall short of handling the explosive growth in AI compute.
Until now, the industry has focused on stacking HBM ever higher to increase memory capacity and bandwidth within a confined footprint. But as stack counts climbed past 12 and 16 layers toward 20 and beyond, process difficulty rose exponentially. The technology hit physical limits, including the growing difficulty of meeting fixed height specifications. Vertical stacking has reached an inflection point—so much so that the JEDEC standards body has relaxed its HBM height specifications.
The bigger problem is that if stack counts can't be raised, the alternative is to add more HBM horizontally around the GPU—but that, too, is impossible. In the current 2.5D packaging structure, the GPU and HBM are mounted tightly together on a single substrate. Within this structure, the number of HBM units that can be placed is strictly limited by the finite length of the GPU chip's perimeter—its shoreline. Even when more HBM is desired, there is physically no room to place it, leaving the industry in a structural deadlock.
The alternative now emerging across the semiconductor industry is to separate the GPU and HBM and package them independently. It overturns the conventional chip-design principle that components must sit close together to minimize data transfer time. Instead of keeping the two chips adjacent, the approach spaces them apart and links them with overwhelmingly fast optical signals to overcome the added physical distance.
Placing the HBM slightly away from the GPU within the board frees the design from the GPU's shoreline constraint. With the spatial limitation gone, far more HBM can be spread out laterally and packed into the board—several times more than today—without having to push stack heights to extremes. This means the total memory capacity and data bandwidth of the AI accelerator system would expand dramatically, on a scale incomparable to current systems.
"Discussing Placing HBM Beneath the GPU"… Form Factor Could Change
The industry is now producing a range of architectural design proposals over where exactly to place the HBM within the GPU board.
The same memory researcher said, "Options under discussion range from broadly utilizing the space immediately around the GPU to isolating the HBM beneath the GPU board." He added, "In the latter case—isolating it beneath the GPU board—the motherboard would have to be extended lengthwise, so we're discussing even an overall form-factor change with the GPU maker." Specifically, the HBM might surround the GPU from several centimeters away, or a separate HBM zone might be created in the center of the board.
"We're keeping every possibility open as we discuss the optimal layout," he said. "Nothing has been confirmed as an official roadmap yet, but as part of preliminary research toward next-generation AI accelerators, we're in talks with our partners."
The outsourced semiconductor assembly and test (OSAT) industry is also watching this trend closely. An executive at a global OSAT firm said, "Optical interconnects are a clear trajectory. The only question is timing," predicting that "rack-to-rack and server-to-server links will go optical first, and then chip-to-chip connections within the board will follow." He added, "The larger units will be connected by light first, but optical research is moving so fast that it may not be that far off."
Technically, the optical-interconnect technology linking GPU and HBM shares the same underlying principle as the technology connecting server to server inside a data center. The difference is the high technical barrier of shrinking optical-conversion technology—once used for communication between large pieces of equipment—down to the microscopic scale of a single board and chipset.
An executive at a domestic developer of co-packaged optics (CPO) components explained, "As HBM stack heights approach their limit, the industry is discussing spreading the memory out laterally to maximize how much can physically be mounted." He added, "The principle is the same as conventional data-center optical interconnects, but HBM optical links that have to operate within a confined board space require optical components to be miniaturized to far smaller sizes and far higher integration density—so the technical difficulty is greater."