The Future of Image Sensors is Chip Stacking

CMOS image sensors (CIS) have often been heralded as the first 3D devices in volume manufacturing. However, this is not really the case. Shellcase MVP, the first generation of CIS that used through silicon vias (TSVs) to form interconnects was still a 2D device. (Remember, TSV is not always synonymous with 3D). Chip stacking only came about with the advent of backside illuminated (BSI) CIS and involved reversing the order of the lens, circuitry, and photodiode from traditional front-side illumination so that the light coming through the lens first hits the photodiode and then the circuitry. This was accomplished by flipping the photodiode wafer, thinning the backside, and then bonding it on top of the readout layer using a direct oxide bond, with TSVs as interconnects around the perimeter. Even then, some don’t consider this a true 3D stack. According to Paul Enquist, CTO, Ziptronix, implementation of true 3D requires using wafer-to-wafer (W2W) hybrid bonding to create “interconnects across the wafer at will without hurting the function of the part.” He said what this means for image sensors is making the electrical connections with the bond without using TSVs, which reduces the pixel exclusion area and allows for increased density.

I caught Enquist’s presentation, and that of Piet De Moor, imec, at last week’s Image Sensors America conference, September 9-11, 2014, at the Hyatt Fisherman’s Wharf, San Francisco, CA. The two speakers comprised the session titled “Stacked Chip Sensors, Is there Any Going Back from Here”? I also caught a few other presentations and quickly realized that not everyone in the image sensor industry is developing 3D stacking technologies, but looking for other 2D solutions to improve pixel resolution.

According to De Moor, this is because many of the companies that were presenting and exhibiting target industrial, scientific and professional markets, and so produce low product volumes. Stacked chip image sensors require high volume manufacturing (HVM) to be cost-effective, explained De Moor, because of the cost of the manufacturing equipment lines. Because of this, to date stacked BSI CIS are only manufactured by Sony and TSMC (for OmniVision) targeting consumer products, where the volume requirements are higher. Enquist also pointed out that Sony is a vertically integrated IDMs and manufactures BSI CIS for use in its own products, and therefore can absorb the cost.

Advancements in 2D Architectures

image sensors — Figure 1: Taking the multiplexing approach achieved the design goal to minimize area and power, with the tradeoff of buffer
amp noise. (Image courtesy of Forza Silicon)

In his presentation, Daniel Van Blerkom, CTO, and founder of image sensor design house, Forza Silicon, noted that the consumer market requires a shift from what’s needed for professional sensors. His company’s goal is to bridge the gap between the two markets through the development of a new architecture for high-resolution image sensors that are scalable and can be applied to smaller pixel sizes.

He explained the pitch limitations of large format sensors due to the pixel array and electronics in a single column architecture. Forza’s solution is to break the sensor into multiple blocks of columns (he called it “multiplexing”) to differentiate the signal chain and analog to digital converters, as well as create independent readout blocks (Figure 1). The result is an architecture that can be applied to next-gen professional sensors and consumer sensors. But it’s still a 2D design. When asked about other ways of achieving this, Van Blerkom said, “There is a forward looking way where you could actually stack” but didn’t indicate whether Forza is developing a stacking approach in its design architecture.

The Advantages Stack Up
In his presentation, “Stacking Chip Technology for Imagers,” de Moor outlined an imager integration roadmap in which 2 layer BSI CIS are in production and we “dream of three layers or more.”

BSI Image Sensor — Figure 2: The top layer of Sony’s stacked images sensor is a BSI sense layer stacked on the readout/image processing. Large TSVs are used to create vertical interconnects to the peripheral electronics. (Image courtesy of imec and Sony)

The advantages of two-layer stacks —such as one developed and manufactured by Sony (Figure 2), in which the top layer is the illuminated imager and the bottom layer contains the readout electronics and image processing — is the small form factor, and the ability to stack different CMOS technology nodes optimized for functionality, which is a cost advantage. Another advantage is that wafers can be sourced from different vendors for more options and competition, which will bring the cost down.

Imec and the University of Ghent have collaborated to fabricate three-layer stacks having two layers of CMOS stacked with the imager on top. Moor said this configuration is helpful when you want to not only have the readout on its own layer, but also the memory for on-chip signal processing. This puts image processing like facial recognition, image compression, and depth right on the chip.

De Moor described, in detail, two significant technology “enablers” for two-layer BSI: backside thinning and direct low-temperature oxide bonding from Ziptronix.

He also described W2W hybrid bonding, which combines W2W oxide bonding with Cu-Cu permanent bonding to create high density interconnects, thereby eliminating the need for TSVs and creating the “True 3D stack” defined above by Enquist. Ziptronix has patented this process, called DBI Hybrid Bonding.

Taking it further, Enquist demonstrated how Ziptronix’ DBI Hybrid bonding is a low-cost solution for stacking without TSVs. He showed the table below as a cost comparison.

Reducing Image sensor cost — Of all the stacking technologies, DBI® Hybrid Bonding demonstrates the best pitch scaling, 3D interconnectivity, and cost. (Source: Ziptronix)

The company just announced an industry-first in demonstrating a 3 layer stacked image sensor device in collaboration with Fermilab National Laboratory, to improve the performance of high-end 3D sensor arrays, which are used for particle detection in large-scale particle physics and x-ray imaging experiments.

3D stacked Image sensor — Figure 3: Schematic drawing of Fermilab’s vertically integrated x-ray photon imaging chip (VIPIC) detector, using W2W and D2W DBI Hybrid Bonding.

The demonstrator, a vertically integrated x-ray photon imaging chip (VIPIC) detector, was manufactured using W2W DBI hybrid bonding to bond two ASIC wafers containing TSVs. The bonded wafer pair is thinned to expose the TSVs on one side, then singulated. The singulated die stacks are then bonded to an x-ray sensor wafer using die-to-wafer (D2W) DBI hybrid bonding. Subsequent thinning of the other side of the bonded wafer pair allows backside connections to the 3-layer assembly.

“The significance of the Fermilab announcement is that it shows what can be done with this technology, not what’s necessarily next for volume adoption,” said Enquist.

During his talk, Enquist pointed out that stacking technologies for image sensors have been around longer than we realize; since 1977 and commercialized since 1882. He said Ziptronix’ DBI Hybrid bonding was first licensed by Raytheon for focal plane arrays in 2009, and 2012 marked the implementation of Ziptronix’ direct oxide bonding (called Zibond) for consumer stacked BSI image sensors by Sony. (Sony has yet to license the DBI Hybrid Bonding to replace TSV interconnects.)

So If the technology has been available, why the delay in adoption of chip stacking by HVM foundries? “Timing is everything,” said Enquist.“If you consider the history of image sensors, people have been doing 3D in image sensors for decades, but its a matter of going into volume.” He added Sony was successful with its first-generation BSI because the design and the technology were ready. He believes that is just a stepping stone for them. The next step is to build 3D stacks without TSVs, says Enquist, which is one way to accelerate 3D adoption.

“There’s a difference between a technology being manufactured in high volume and being available for manufacture in volume,” noted Enquist. “It’s available but nobody wants to be the first to use it.” All it takes is that first order. And surely for image sensors, chip stacking is clearly a technology whose time has come. ~ F.v.T.

7 thoughts on “The Future of Image Sensors is Chip Stacking”

Dr. Dev Gupta says:
September 17, 2014 at 8:51 am
In contrast to the title of this Blog ” the future of image sensors is chip stacking ” the reality, as dictated by Physics ( another ” inconvenient truth ? ” ), is actually the other way around : “the ( near – ) future of chip stacking is image sensors “.
Thermal & stress ( CPI ) issues within the stack will continue to be concerns for Wide I/O type stacking even after TSV process issues / cost / availability is resolved or the world follows Tezzaron in dumping Cu filled vias for CVD W.
It has been nearly 2 years since SONY took intelligent technical decisions about using DWB and only peripheral TSVs to enable their BSI sensor products now used in many smartphones. I discussed this back in Feb in my talk to IMAPS Phx.
Thanks to Govt. grants not tied to performance, Labs in even small European countries with no sizable semiconductor mfg. of their own were able to grab attention in 3d TSV research. But they did not publicize / even ignored aforementioned critical technical issues of implementing 3-d TSV stacks, thereby launching a hype cycle.
Do we really now need these Labs who are in part responsible for the over-selling of 3-d TSVs and thus distorting R&D roadmaps to now sprinkle Holy Water on the intelligent choices made by SONY independently or the pioneering work done by its innovative suppliers like Ziptronix in the US ?
Gretchen Patti says:
September 25, 2014 at 1:45 pm
Dev – Thanks for the shout-out! Tezzaron does indeed use tungsten for most of its TSVs, including the ones in the Fermilab sensor chip shown in this post. However, we still support copper TSVs for special cases where the TSVs must be deeper than 20 microns. One example is TSVs that pierce interposers; these are too deep for tungsten. Another case is when the dies will be handled stand-alone and therefore cannot be aggressively thinned. Keeping both options open lets us use whichever material is best for the job.
Gretchen Patti says:
September 25, 2014 at 1:55 pm
Francoise – I am puzzled by the statement that hybrid bonding eliminates the need for TSV. You don’t need TSV in a face-to-face two-layer device where signals emerge from the chip edges; but isn’t that true with any bonding strategy? And if there are three layers, how will the signals get through without TSV? Another huge advantage of 3D is the ultra-wide interface created by signals that emerge from the face of the chip. Certainly that requires TSV, no matter the bonding method. Or am I missing something here?
Kathy Cook says:
September 25, 2014 at 4:04 pm
Gretchen — With face-to-face hybrid bonding, one of the wafers can be thinned so that the connection to the outside world can be made through a pad cut rather than a higher aspect ratio TSV. For three or more layers, you would still need TSV’s as deep as the thickness of the wafer.
1. Francoise von Trapp says:
  September 25, 2014 at 4:27 pm
  Thanks Kathy! Also, with regard to other bonding strategies being able to form interconnects without TSVs, its my understanding that you can interconnect out the side with other technologies, but thermo-compression may not withstand the thinning to enable the pad cuts as hybrid bonding can, since the entire wafer or die surface is bonded rather than a small fraction of the surface that is bonded with thermo-compression.
  1. Gretchen Patti says:
    September 29, 2014 at 2:53 pm
    Kathy, Francoise – Thank for the clarification. However, Cu-Cu thermo-compression does support the same kind of connection; we’ve done it when required, but it can require extra post-processing steps for assembly.
Ronald Lipton says:
September 30, 2014 at 9:41 am
Parts of the Fermilab wafer lot were bonded with Cu thermocompression and parts by DBI. Both worked and could be backthinned to reveal the TSVs. The DBI wafers typically had better alignment than the thermocompression wafers. The wafers that were further processed for the 3-tier sensor integration were DBI. All wafers relied on Tezzaron TSVs for connections between the three tiers. As Kathy said, with two tiers it is possible to thin the top wafer and contact what would have been the bottom metal on the down-facing tier. We have just received some wafers from Novati using this process. It is limited to larger features and takes more space than TSV-based connections, but could be an alternative when only backside IO connections are needed.

Comments are closed.

The Future of Image Sensors is Chip Stacking

7 thoughts on “The Future of Image Sensors is Chip Stacking”

Francoise von Trapp

Related Category Posts

Below the Surface: From Substrate to System—Why Integration Is the Real RF Breakthrough

Advanced Optical Metrology for Wire Bond Inspection Using 3D Area Multi-Focus (AMF) Imaging

Expanding the Chiplet Market: Processing Any Wafer from Any Foundry

Related Tag Posts

IFTLE 530: Do Chiplets Increase Security Risks? Sony and Samsung Battle for Leadership in CIS

Xperi Sets Out to Enable Immersive Experiences

Hugo Pristauz Drops the F-bomb at 3D ASIP 2016, You Won’t Believe What Happens Next!

Most Active Authors