Optoelectronic Module with Integrated Transceiver and Mux-Demux for Optical Interconnect Applications

Nga Thi Hang Nguyen¹, Ikechi Augustine Ukaegbu²,* , Jamshid Sangirov¹, Hyo-Hoon Park¹

¹Electrical Engineering Department, Korea Advanced Institute of Science and Technology (KAIST), Deajeon, South Korea
²Electrical and Computer Engineering Department, Nazarbayev University (NU), Nur-Sultan, Kazakhstan

Email address: nganth@smtron.com (Nga T. H. N.), ikechi.ukaegbu@nu.edu.kz (Ikechi A. U.), james.sangirov@q5x.com (Jamshid S.), parkhh@kaist.ac.kr (Hyo-Hoon P.)

*Corresponding author

To cite this article:
doi: 10.11648/j.ajop.20190702.11

Received: June 21, 2019; Accepted: July 12, 2019; Published: July 24, 2019

Abstract: The design, development and improvent in electronic devices and components have led to the further miniaturization of the system devices and their interconnecting interfaces. Hence, reducing the size of the transmitter (Tx) chips, receiver (Rx) chips and associated components play an important role in designing a reduced/small sized optoelectronic modules for optical interconnect applications. Some of the associated components include multiplexer, demultiplexer, clock and data recovery circuits (CDR), etc. Therefore, in this paper we present an optoelectronic module with integrated transceiver (Tx-Rx) and multiplexer-demultiplexer (mux-demux) with the aim of reducing the total area occupied by the chips. The topology is based on sharing common blocks between Tx and Rx as well as Mux and Demux for saving chip area, power consumption and board area. Based on this topology, a design of 2-channel Tx/Mux and Rx/Demux is realized in a 0.18 µm CMOS technology. The measurement results show clear output eye diagrams at 2.5 Gbps with the voltage swing of 375 mVpp from the Rx/Demux. The combined solution would save the chip area and power consumption of 10% and 23%, respectively, compared to discrete circuits implementations. Using this topology, a high efficient bidirectional optical link could be implemented in modern computer and other green IT applications.

Keywords: Multiplexer, Demultiplexer, Bidirectional, Transceiver, Integrated Optoelectronics

1. Introduction

System clock speed is rapidly increasing and the copper-based links on printed circuit boards (PCBs) constitute the bottle-neck of the high speed signal transmission between CPU and memory in modern computer systems. This is due to issues such as signal integrity, crosstalk, power consumption and electromagnetic interference. Optical links have been considered as one of the most promising candidates for replacing the electrical links [1-2]. Traditionally, optical links consist of a transmitter at the transmitting end and a receiver at the receiving end. A transmitter (Tx) module consists of a Tx chip and a Vertical Cavity Surface Emitting Laser (VCSEL) for transmitting and a receiver module (Rx) consists of an Rx chip and a photodiode (PD) for receiving.

Several implementations of data transfer (sending and receiving data) between the CPU and memory use time multiplexing. In traditional topologies, both the Tx and Rx are implemented on separate chips without sharing a common block for transmission. This situation may lead to increase in total power consumption and increase in the total chip area of the Tx and Rx combined. This issue has been resolved in some of our previous works for point-to-point, multi parallel point-to-point, chip-to-chip or board-to-board optical links applications [3-7], where the Tx and Rx share a common chip space while also sharing a common limiting amplifier stage as in the works of Kang et al and Ukaegbu et al [4-5] and switching between the Tx and Rx modes as in the works of
Sangirov et al and Ngo et al [6-7]. Operations of up to 2.5 Gbps was achieved in Kang’s work [4] and 7 Gbps in Ukaegbu’s work [5], respectively.

In conventional Tx-Rx topology, which in many cases have several data lines for exchanging data between CPU and memory, the cost of power, chip area and board area for sending and receiving data when every electrical line is replaced by each optical line is large. However, several solutions have been demonstrated and implemented. In the works by Nguyen et al and Sangirov et al [8-10], a single fiber was used to send and receive data simultaneously (in full duplex mode), where the Tx and Rx chips have been implemented on separate chips. For communication between CPU and memory, separate I/O pins would be required when using implementations of the works by Nguyen et al and Sangirov et al [8-10] and hence requiring a single pin for transmit and a single pin for receive. Half duplex operation was achieved in Kang’s work [4] while both full and half duplex operation were achieved in Ukaegbu’s work [4]. In Kang’s and Ukaegbu’s works [4-5], both the Tx and Rx are implemented on a single chip with a shared gain stage (limiting amplifier stage). However, in Ukaegbu’s work [5] a single pin is used for both transmit and receive. In order to increase throughput while transmitting data of the same type, multiple transmit pins and multiple receive pins need to be used over a single transmission line. To solve this problem, a multiplexer (Mux) and a demultiplexer (Demux) can be used to convert the slow speed parallel data to a serialized high speed data in the transmitting side and vice versa in the receiving side to replace many electrical data line using a single optical link. Several works have been reported on Mux and Demux designs by Chien et al, Luo et al, Sekiguchi et al, Rubiya et al, and Meghelli et al [11-15]. However, most of them are usually discrete implementations of Mux and Demux. Also, designs with the Mux, Tx, Rx, Demux are usually implemented on discrete chips thus, resulting to large chip area, more power consumption, while occupying larger board area. On the other hand, the wire bonding between these discrete chips leads to increased noise and this is undesirable for an optical link system. Moreover, discrete chips result to complicated DC/AC matching between the chips.

In this paper, an optoelectronic module with integrated Tx-Rx and Mux-Demux topology is proposed to multiplex/demultiplex multi electrical lines to/from a single optical signal line. With this topology, an optical link comprises of Mux, Tx, and VCSEL for the transmitting mode and PD, Rx, and Demux for the receiving mode, where the combined Mux-Demux and Tx-Rx are integrated on a single chip. Additionally, the combined Mux-Demux as well as Tx-Rx can share common blocks to further save chip area and power consumption. In our proposed topology, Mux-Tx and Rx-Demux chips are realized in a 0.18 μm TSMC CMOS technology. The chip design as well as the measurement results and chip photographs for integrating 2:1 Mux-Tx and 1:2 Demux-Rx chips are presented.

2. The Proposed Optoelectronic Module

In our proposed topology, the TRx (Tx-Rx) can share a common space on a single chip. The share of the common block of two circuits makes it possible to save 55% or 20% of power dissipation, depending on the operating mode. The chip is fabricated in a 0.18 μm CMOS technology and occupies an area of 0.82 × 0.82 mm², which is about 70% of the total area of a typical unshared Tx-Rx chip. This chip can be applied for point-to-point and parallel multi point optical links. In order to have a large bandwidth, where a single optical link can replace multiple electrical links, an optoelectronic module with integrated Tx-Rx and Mux-Demux is proposed. The proposed topology is shown in Figure 1.

![Figure 1](image-url)

The topology is special for replacing multi parallel time interleaving electrical links using a single optical link. The integration and combination have merits such as chip size and power consumption of chips for optical link system. On the other hand, the integration allows for a more reliable system as there is no need to interface between discrete chips using wire bonds. Additionally, many parallel electrical lines will be replaced by one optical line. The number of optical devices (VCSEL and PD), the space occupied on the board, as well as the complexity of the system will be reduced.

The Mux/Tx and Rx/Demux functions of this chip are controlled by switches (SWs) according to the status of control signal (Ctr). A high Ctr signal allows the chip to work in the Tx/Mux mode. The Mux, which works as a serializer, converts the parallel input data (Dn) to a high-speed serial data (Dn) which is then amplified by gain stage and modulated by VCSEL for transmission. A low Ctr signal will make the chip to operate in the Rx/Demux mode. The optical signal is converted by PD to a current signal. This small and noisy current signal then is amplified and reshaped by trans-impedance amplifier (TIA) and limiting amplifier (LA). Finally, the Demux converts this serial electrical data to parallel output data (Qn).

3. Circuit Design and Implementation

Based on the proposed topology, the design and implementation of Mux/Tx and Rx/Demux chips designs are...
introduced in this section.

3.1. Mux/Tx Design

3.1.1. 2:1 Mux Cell Design

Figure 2(a) is the schematic of a 2:1 Mux cell, consisting of five latches $L_1$-$L_5$ to avoid the simultaneous transition in both inputs of the selector (SEL) and a SEL for serializing input data. The inductorless CML circuit is used in designing the latches and selector for high speed operation. The latches and selector work based on the clock signal (Clk). When the Clk is high, the $L_1$, $L_3$ and $L_4$ are obscure and the SEL selects the output of $M_3$ while $L_2$ senses the output of $L_1$ and $L_5$ sense the output of $L_4$. When the Clk is low, the output of $L_2$ will be sensed whereas $L_5$ becomes obscure providing a stable level for the SEL. The 2 inputs are turn by turn selected to the output (serialized) based on the Clk.

3.1.2. The Tx Design

Figure 2(b) shows the block diagram and schematics of Tx. The Tx is made up of a gain stage and a driver stage. In the gain stage, three differential gain pairs ($M_1, M_2, M_5, M_6$; $M_7, M_8$) are used for high gain and large bandwidth. The $M_3, M_4$ pair forms an active feedback; $M_{11}, R_1$ and $M_{12}, R_2$ pairs work as active inductors. The $M_9, M_{10}$ pair with $C$ forms an active capacitor to enhance the bandwidth of the gain stage. The driver consists of two differential pairs for driving large current to the output and a source follower buffer for impedance matching to VCSEL. Figure 3 shows the frequency response of the Tx. A 3-dB bandwidth of 4.2 GHz with a gain of 39 dB were achieved.

![Figure 2](image1.png)  
**Figure 2.** (a) 2:1 Mux cell block diagram and schematics and (b) Tx block diagram and schematics.

![Figure 3](image2.png)  
**Figure 3.** Frequency response of the Tx.

![Figure 4](image3.png)  
**Figure 4.** (a) 1:2 Demux cell block diagram and schematics and (b) Rx block diagram and schematics.
3.2. Rx/Demux Design

3.2.1. 1:2 Demux Cell Design

Figure 4 (a) shows the 1:2 Demux cell schematic. The cell uses both rising and falling Clk edges and operates at half the clock speed of the input data rate $D_{in}$. During high interval of the Clk, the $L_3$ trace the first bit of $D_{in}$. When the Clk transitions from high to low, the stored bit at the input of $L_3$ is locked and $L_4$ will capture the signal from $L_3$ at the falling edge of the Clk. During the low interval of Clk, the first bit of $D_{in}$ is transferred to output $Q_2$ while $L_1$ traces the second bit of $D_{in}$. When the Clk transitions from low to high, the second stored bit at the input of $L_1$ is locked and $L_2$ captures the signal from $L_1$ at the rising edge of the clock. During the high Clk interval, the second bit of $D_{in}$ will be transferred to the output $Q_1$. This process will deserialze the $D_{in}$ to two parallel outputs.

3.2.2. The Rx Design

The Rx consists of a TIA, a gain stage limiting amplifier (LA) and an output buffer. Figure 4(b) presents the block diagram and schematics of the proposed optical receiver where two identical TIsAs are used for differential signal, one of them is a dummy with the same input capacitance as the photodiode capacitance.

The TIA is implemented using advanced CG topology for isolating the large input parasitic capacitance from the bandwidth degradation, according to Kang et al [4]. On the other hand, an NMOS active inductor and a resistor are used to strengthen the bandwidth.

The number of gain stages of LA have been considered for optimising the trade-off between gain, input-referred noise, power consumption and bandwidth. In Figure 4(b), a gain stage cell of LA is presented. A negative active feedback through differential transistor pair ($M_3$, $M_4$) and an active inductor load for the second stage of the gain cell are used to enhance the bandwidth. Additionally, a negative capacitance is used to reduce the high impedance between the cascading amplifiers as well as a slew-rate boosting to remove the low-setting of the output waveform through $M_8$, $M_{10}$, and the capacitance, $C$. Figure 5 shows the frequency response of the Rx. A 3-dB bandwidth of 4.3 GHz with a gain of 81dB were achieved.

4. Experimental Results

The optoelectronic module with integrated transceiver (Tx-Rx) and a Mux-Demux. The module was designed with the Mux/Tx on one side and the Rx/Demux on another side. They were designed and fabricated in a 0.18μm TSMC CMOS technology. For demonstration purposes, the chips were fabricated and measured separately (i.e. the Tx/Mux as a separate chip and the Rx/Demux as a separate chip). The chip photographs of Tx/Mux and Rx/Demux are presented in Figures 6(a) and (b) respectively. The layouts of digital part Mux-Demux and of analog part Tx-Rx are carefully isolated to minimize the noise from digital part interfering to analog signal. The chip area of the Tx/Mux and Rx/Demux are 540x1320 μm$^2$ and 594x1089 μm$^2$ (including pads), respectively. For measurement purposes, Teflon based evaluation boards were used. The chips were packaged on the evaluation board by die-bonding the chips on the boards and wire-bonding them where necessary. Figure 7 shows a photograph of the packaged chips. A power consumptions of 112 mA and 122 mA were achieved for the Tx/Mux and Rx/Demux, respectively, under a 1.8 V power supply. The total chip area and power consumption reduction of 10% and 23%, respectively, were achieved when compared to discrete chips.

![Figure 5. Frequency response of the Rx.](image-url)

![Figure 6. The chip photographs of the (a) Tx/Mux and (b) Rx/Demux.](image-url)

Figures 8(a), 8(c) and Figures 8(b), 8(d) are the measured results of Rx/Demux for the two output channels at the data rate of 3 Gbps and 5 Gbps, respectively. The results show that the chip works well up to 5 Gbps input speed and gives 2.5 Gbps open and clear eye diagram output signal with the voltage swing on 50 Ω impedance of 375 mV, large enough for the following chips.
5. Conclusion

In this paper, an optoelectronic module with integrated transceiver (Tx-Rx) and Mux-Demux for optical interconnect applications have been proposed and implemented. A combined 2:1 Mux/Tx for transmitting mode and combined Rx/1:2 Demux for receiving mode have been presented. Designed in a 0.18 µm CMOS technology, the chips work well up to 5 Gbps. Open and clear eye diagram output signal with the voltage swing on 50 Ω impedance of 375 mV were achieved. The combined solution would save the chip area and power consumption of 10% and 23%, respectively, compared to discrete circuits implementations. The optoelectronic module has applications for serializing and deserializing the multiple point-to-point links between CPU and memory using a single optical link. The module could be applied for optical interconnects as well as other green IT applications.

References


