VLSI Design课程笔记 (上)

water_yellow

已于 2024-09-07 18:37:56 修改

阅读量1k

点赞数 20

文章标签：笔记硬件工程 pcb工艺

于 2024-02-16 03:03:58 首次发布

本文链接：https://blog.csdn.net/water_yellow/article/details/135566072

版权

Overview of VLSI Design Flow

Chip manufacturing process：
在这里插入图片描述
$design\,\, style \begin{cases} full\,\,costume &\text{： } design\,\, everything \\ standard\,\, cell\,\, base \,\,design\,(ASIC) &\text{： } use\,\, pre-design\,\, gate \end{cases}$
full costume: minimized area, power, maximized speed, expensive and long-period
ASIC: just cheap and short-period

Cost of an Integrated Circuit

cost per IC = variable cost per IC + (fixed cost / volume)
variable cost per IC=(cost of die +cost of test +cost of packaging)/final test yield
$final\;test\;Yield\,\, Y=\frac {Number\,\, of\,\, good \,\,chips\,\, per\,\, wafer}{Total\,\, number\,\, of\,\,chip\,\,per\,\,wafer} *100\%\\ Die \,\,cost=\frac {Wafer \,\,cost}{Dies\,\, per \,\,wafer \times Die \,\,yield}\\ Dies\,\,per\,\,wafer=\frac{\pi \times (\frac {wafer\,\,diameter} 2)^2}{die\,\, area} - \frac {\pi \times wafer\,\,diameter}{\sqrt{2 \times die \,\,area}}\\ \\ Die \,\,yield=(1+\frac {defects\,\,per\,\,unit\,\,area\times die\,\,area}{\alpha})^{-\alpha}\\ \alpha\approx3$

Example：
we have a 4-inch wafer fab and an 8-inch wafer fab.
Calculate the cost per IC of each design at each fabrication plant (4 calculations, 2 per 4-inch and 2 per 8-inch). Identify the minimum cost fabrication facility for each design.
The microprocessor die size is 3.2 $cm^2$ and the projected volume is 50,000,000 units over the lifetime of the product. The cost of packaging the microprocessor is $25.00 per part, cost of testing per part is $2.78, and the non-recurring engineering design cost is $200,000,000.00.
The ASIC design is $0.25 cm^2$ and the projected volume is only 1,000,000 units. The cost of packaging the ASIC is $0.67, cost of testing is $0.69, and the design cost is $2,000,000.00.
For both designs, the functional test yield rate is 95% and α=3. Information on the fabrication options is listed in the table below.
在这里插入图片描述
Ans:
Microprocessor with 4-inch wafer:
cost per IC = variable cost per IC + (fixed cost / volume)=variable cost per IC + ( $2*10^8/5*10^7$ )
variable cost per IC= (cost of die +cost of test +cost of packaging)/final test yield
cost of test = 25
cost of packaging= 2.78
cost of die $=\frac {Wafer \,\,cost}{Dies\,\, per \,\,wafer \times Die \,\,yield}=\frac {150}{Dies\,\, per \,\,wafer \times Die \,\,yield}$
final test yield = 95%
$Dies\,\,per\,\,wafer=\frac{\pi \times (\frac {10.16} 2) ^2}{3.2} - \frac {\pi \times 10.16}{\sqrt{2 \times 3.2}}$
Die yield= $(1+\frac {defects\,\,per\,\,unit\,\,area\times die\,\,area}{\alpha})^{-\alpha}=(1+\frac {0.5\times3.2}{3})^{-3}$

Die yield=0.28
Dies per wafer=12
final test yield = 95%
cost of die=44.64
cost of packaging=2.78
cost of test = 25
variable cost per IC = 76.23
cost per IC = 76.23+ $2*10^8/5*10^7\approx$ $80.23

Microprocessor with 8-inch wafer:
Die yield= $(1+\frac {0.56\times3.2}{3})^{-3}$ = 0.245
Dies per wafer= $\frac{\pi \times (\frac {20.32} 2) ^2}{3.2} - \frac {\pi \times 20.32}{\sqrt{2 \times 3.2}}$ = 76
final test yield = 95%
cost of die= $\frac {800}{76 \times 0.25}$ =42.9
cost of packaging=2.78
cost of test = 25
variable cost per IC = (42.9+2.78+25) / 0.95 = 73.45
cost per IC = 74.45+ $2*10^8/5*10^7\approx$ $78.45

ASIC with 4-inch wafer:
cost per IC = variable cost per IC + (fixed cost / volume)=variable cost per IC + ( $2*10^6/10^6$ )
variable cost per IC= (cost of die +cost of test +cost of packaging)/final test yield
cost of test = 0.69
cost of packaging= 0.67
cost of die $=\frac {Wafer \,\,cost}{Dies\,\, per \,\,wafer \times Die \,\,yield}=\frac {150}{Dies\,\, per \,\,wafer \times Die \,\,yield}$
final test yield = 95%
$Dies\,\,per\,\,wafer=\frac{\pi \times (\frac {10.16} 2) ^2}{0.25} - \frac {\pi \times 10.16}{\sqrt{2 \times 0.25}}$
Die yield= $(1+\frac {defects\,\,per\,\,unit\,\,area\times die\,\,area}{\alpha})^{-\alpha}=(1+\frac {0.5\times0.25}{3})^{-3}$

Die yield=0.88
Dies per wafer= 279
final test yield = 95%
cost of die= 0.61
cost of packaging=0.69
cost of test = 0.67
variable cost per IC = 2.07
cost per IC = 2.07+ $2*10^6/10^6\approx$ $4.07

ASIC with 8-inch wafer:
Die yield= $(1+\frac {0.56\times0.25}{3})^{-3}$ = 0.87
Dies per wafer= $\frac{\pi \times (\frac {20.32} 2) ^2}{0.25} - \frac {\pi \times 20.32}{\sqrt{2 \times 0.25}}$ = 1206
final test yield = 95%
cost of die= $\frac {800}{1206 \times 0.87}$ = 0.76
cost of packaging= 0.69
cost of test = 0.67
variable cost per IC = (0.76+0.69+0.67) / 0.95 = 2.23
cost per IC = 2.23+ $2*10^8/10^6\approx$ $4.23

A new 12-inch fabrication facility for production of the microprocessor part. The 12-inch fabrication facility will cost an additional $1,000,000,000.00 to construct for this project. Develop a constraint on the defect density of this new facility that will ensure cost savings over the previous minimum cost solution. Volume cost for 12-inch wafers is estimated to be
$1500/wafer.

Microprocessor with 12-inch wafer(new):
find defect density $x$
Die yield= $(1+\frac {x\times3.2}{3})^{-3}$
Dies per wafer= $\frac{\pi \times (\frac {30.48} 2) ^2}{3.2} - \frac {\pi \times 30.48}{\sqrt{2 \times 3.2}}$ = 190
final test yield = 95%
cost of die= $\frac {1500}{190 \times Die \;yield}$
cost of packaging=2.78
cost of test = 25
variable cost per IC = (cost of die+2.78+25) / 0.95
cost per IC = variable cost per IC+ $2*10^8+10^9)/5*10^7<$ $78.45

$defect\;density \;x \le 0.42$

Fan-out： the number of load gates $N$ that are connected to the output of the
driving gate
Fan-in: the number of inputs to the gate

propagation delay $t_p$ :how quickly it responds to a change at its input(s)
measured between the 50% transition points of the input and output waveforms
在这里插入图片描述
$t_p=\frac {t_{pHL}+t_{pLH}} 2$

Period of ring oscillator $T$
在这里插入图片描述
odd number( $N$ ) of inverter
$T=2\times t_p \times N$

noise margin
在这里插入图片描述
$V_{OH}$ : Maximum output voltage when the output level is logic “1”
$V_{OL}$ : Maximum output voltage when the output level is logic “0”
$V_{IL}$ : Maximum input voltage which can be interpreted as logic “0”
$V_{IH}$ : Maximum input voltage which can be interpreted as logic “1”

Propagation delay of First-order RC network
在这里插入图片描述
The time to reach the 50% point: $t=ln(2)\tau=0.69\tau$
Time to the 90% point: $t=ln(9)\tau=2.2\tau$
$\tau=RC, time\;constant$
logic change from 0 to 1, this process needs power: $E_{0\rightarrow1}=C_L\cdot V_{dd}^2$
logic change from 1 to 0 do not need energy
power dissipation of capacitor: $E_{cap}=\frac 1 2C_L\cdot V_{dd}^2$

CMOS Fabrication Process Technology

Photolithography： The technique to accomplish selective masking so that a desired processing step can be selectively applied to the remainingregions.
Photolithography invovle:

Oxidation layering
Photoresist coating
Stepper exposure
Photoresist development and bake
Acid Etching
Spin, rinse, and dry
Various process steps
Photoresist removal (or ashing)

Some important techniques used：Diffusion and Ion Implantation, Deposition, Etching, Planarization

Simplified process:
在这里插入图片描述

MOS Transistor Theory

static behaviour

Built-in potential: under zero bias, there exists a voltage $\phi_0$ across the junction. $\phi_0=\phi_Tln[\frac {N_AN_D}{n_i^2}]$ $n_i$ : intrinsic carrier concentration in a pure sample of the semiconductor and equals approximately $1.5*10^{10} cm^{-3}$ at 300 K for silicon.

Ideal diode equation: $I_D=I_S(e^{V_D/ \phi_T}-1)$ $\phi_T$ : termal voltage, equal to 26 mV at room temperature.
$I_S$ : saturation current, constant value.

Junction Capacitance: the space-charge region contains few mobile carriers, it acts as an insulator with a dielectric constant $\varepsilon_{si}$ of the semiconductor material. The n- and p-regions act as the capacitor plates. $C_j=\frac {C_{j0}} {(1-V_D/\phi_0)^m}$ $m = 0.5$ :abrupt junction
$m = 0.33$ :linear junction
$C_{j0}$ : the capacitance under zero-bias conditions, $C_{j0}=A_D\sqrt{(\frac{\varepsilon_{si}q} 2\frac{N_AN_D}{N_A+N_D})\phi_0^{-1}}$
$\phi_0$ : Built-in potential (defined previously)

MOS Transistor under Static Conditions

Threshold Voltage: $V_T=V_{T0}+\gamma*(\sqrt{|-2\phi_F+V_{SB}|}-\sqrt{|-2\phi_F|})$
$V_{T0}$ : Threshold voltage for substrate bias voltage $V_{SB}=0$
$\phi_F$ : Fermi Potential, $\phi_F=-\phi_Tln(\frac {N_A}{n_i} )$ , sometimes 0.55V
$N_A$ : substrate doping
$\gamma$ : body-effect coefficient, $\gamma=\sqrt{2qN_A\varepsilon_{Si}}/C_{ox}$

the voltage-current relation of the transistor in linear region/resistive region ( $V_{DS}<V_{GS}-V_T$ ) $I_D =k'_n\frac W L[(V_{GS}-V_T)V_{DS}-\frac {V_{DS}^2} 2]\\ =k_n[(V_{GS}-V_T)V_{DS}-\frac {V_{DS}^2} 2]$
the voltage-current relation of the transistor in saturation region( $V_{DS}>V_{GS}-V_T$ ) $I_D=\frac {k'_n} 2 \frac W L(V_{GS}-V_T)^2$
$k'_n$ : process transconductance parameter, $k'_n=\mu_nC_{ox}$
$\mu_n$ : mobility parameter, expressed in $m^2/V\cdot s$
$C_{ox}$ : capacitance per unit area presented by the gate oxide. $C_{ox}=\frac {\varepsilon_{ox}} {t_{ox}}$
$\varepsilon_{ox}$ : oxide permittivity, $3.5\times 10^{-11}$
$t_{ox}$ : thickness of the oxide
$k_n$ : gain factor
$W$ : channel width
$L$ : channel length
$V_{GS}$ : voltage apply between gate and source
$V_{DS}$ : voltage apply between drain and source
$V_T$ :threshold voltage (define previously)

channel length modulation
increasing $V_{DS}$ causes the depletion region at the drain junction to grow,reducing the length of the effective channel, the current increases when the length factor $L$ decrease
$I_D=I_D'(1+\lambda V_{DS})$ $I_D'$ : the current expressions derived earlier
$\lambda$ : empirical parameter

short channel effect

velocity saturation
subthreshold current
hot electron
DIBL: brain induced barrier lowering

velocity saturation:

when the electrical field along the channel reaches a critical value $\xi_c$ , the velocity of the carriers tends to saturate due to scattering effects.
Velocity-saturation effects are more pronounced in NMOS short-channel transistors.
drain current in the resistive/linear region with velocity saturation effect
$I_{DSAT}=\upsilon_{sat}C_{ox}W((V_{GS}-V_T)-V_{DSAT})\\ =\kappa(V_{DSAT})\mu_nC_{ox}\frac WL[(V_{GS}-V_T)V_{DSAT}-\frac {V_{DSAT}^2}2]\\ V_{DSAT}=\kappa(V_{GS}-V_T)(V_{GS}-V_T)=L\xi_c$

$\upsilon_{sat}$ : saturation velocity, $=\mu_n\xi_c$ , approximately equals $10^5$ m/s.
$\kappa(V_{DSAT})$ : $=\frac 1 {1+(V_{DSAT}/\xi_cL)}$ , measure of the degree of velocity saturation

subthreshold conduction
the current does not drop abruptly to 0 at $V_GS=V_T$ . MOS transistor is already partially conductiong for voltages below the threshold voltage. This effect is called subthreshold or weak-inversion conduction.
current in subthreshold exponential region: $I_D=I_Se^{\frac {V_{GS}}{nkT/q}(1-e^{\frac {V_{DS}}{kT/q}})}$
$I_S, n$ : empirical parameters
slope factor $S$ : inverse rate of decline of the current with respect to $V_GS$ below $V_T$ , measures by how much $V_GS$ has to be reduced for the drain current to drop by a factor of 10: $S=n(\frac {kT} q)ln10$
unit: $mV / d ec a d e$

hot electron/ hot carrier
Caused by high electric fields. Electrons and holes gaining high kinetic energies in the electric field (hot carriers) may be injected into the gate oxide, and cause permanent changes in the oxide-interface charge distribution, degrading the current-voltage characteristics of the MOSFET.

DIBL: drain-induced barrier lowering
If the drain voltage is increased, the potential barrier in the channel decreases, leading to drain-induced barrier lowering. The reduction of the potential barrier eventually allows electron flow between the source and the drain, even if the gate-to-source voltage is lower than the threshold voltage. The channel current that flows under these conditions ( $V_{GS}<V_{T0}$ ) is called the sub-threshold current.

dynamic behaviour

channel capacitance
total: $C=C_{ox}\cdot W\cdot L_D$
$C_{ox}$ :gate oxide capacitance per unit area
$W$ : channel width
$L_D$ : lateral diffusion

operation region	$C_{gs}$ - between gate and source	$C_{gd}$ - between gate and drain	$C_{gb}$ - between gate and body
Cutoff	0	0	$C_{ox}W(L-2L_D)$
Resistive/linear	$\frac1 2 C_{ox}W(L-2L_D)$	$\frac1 2 C_{ox}W(L-2L_D)$	0
Saturation	$\frac 2 3 C_{ox}W(L-2L_D)$	0	0

$L-2L_D$ : effective channel length
在这里插入图片描述
p-mos pull up; n-mos pull down

Source-drain Resistance
$R_{S,D}=R_{\square}\frac {L_{S,D}} W+R_C$
$L_{S,D}$ : length of the source or drain region
$W$ : width of the transistor
$R_{\square}$ : sheet resistance
$R_C$ : contact resistance

Circuit Characteristic and Performance Estimation

latch up

must be prevented by reducing substrate noise, it will ruin all devices

Why latch up will cause problem: the creation of a low-impedance possitive-feedback path between power supply rails as a result of triggering a parasitic device. The noise in this circuit will be amplifid and produce excessisve current.

在这里插入图片描述

wire capacitancce

capacitance of wire interconnect:
wire parallel plate capacitance: $C=\frac {\varepsilon_{ox}} {t_{ox}}(W*L)$
L: wire length
W: wire width

$C_{total}=C_{parallel}+C_{fringe}=\frac {w\varepsilon_{ox}} {t_{ox}}+\frac {2\pi\varepsilon_{ox}}{lg(t_{ox}/H)}$
H: interconnect thickness
$w$ : $w = W - H /2$

在这里插入图片描述
$\frac {\varepsilon_{ox}} {t_{ox}}$ . area capacitance(parallel-plate) are expressed in $aF/\mu m^2$ , while the fringe capacitance(given in the shaded rows) are in $aF/\mu m$

wire resistance
$R=\frac {\rho L}{HW}=R_\square \frac LW$

resistance (per unit length) at high frequencies(with skin effect, consider skin depth) $r(f)=\frac {\sqrt{\pi f \mu \rho}}{2(H+W)}$
$\mu$ : permeability of the surrounding dielectric, $4\pi\times 10^{-7} H/m$

lumped model
在这里插入图片描述

how to calculate RC delay of a tree-structured network

在这里插入图片描述 $\tau=C_1R_1+C_2(R_1+R_2)+C_3(R_1+R_3)+C_4(R_1+R_3+R_4)+C_i(R_1+R_3+R_i)$

在这里插入图片描述

$\tau_{SD}=C_1R_1+C_2(R_1+R_2)+C_3(R_1+R_2+R_3)+C_4(R_1+R_2+R_3)+C_5R_1+C_6R_1+C_7(R_1+R_2)+C_8(R_1+R_2+R_3)$

Time-Constant of Resistive-Capacitive Wire
$\tau_{DN}=RC\frac{N+1}{2N}=rcL^2\frac{N+1}{2N}$
wire delay: $\frac {rc}2l^2$
$r$ : resistance per unit length
$c$ : capacitance per unit length

9 stages or 11 stage RC
simplified model: $\pi$ model or T model

in a chip:
top layer is the most thick layer
global signal travel same distance
If gate delay>wire delay, we can ignore wire delay

*diffusion equation of voltage node at node i of distributed RC line * $\frac {\partial^2V}{\partial x^2}=rc\frac{\partial V}{\partial t}$

impedence of transmission line is not related to the length
impedence: $Z=\frac{Z_L-Z_0}{Z_L+Z_0}$
$Z_L$ : load
$Z_0$ : characteristic impedence
telegraph equation:
$V_r=(\frac{Z_L-Z_0}{Z_L+Z_0})V_in$
when impedence matching: $V_r=0$

lattice diagram

在这里插入图片描述
Reflection Coefficient:
$T_{Z_L}=\frac{Z_L-Z_0}{Z_L+Z_0}=\rho_L\\ T_{Z_S}=\frac{Z_S-Z_0}{Z_S+Z_0}=\rho_S$
incoming signal:
$V_{in}=\frac{Z_0}{Z_S+Z_0}V_s=V_{initial}$
$R_L=Z_L ; R_S=Z_L)$

$a=V_{initial}\\ b=a\rho_L\\ c=b\rho_s\\ d=c\rho_L\\ e=d\rho_s\\ f=e\rho_L$

example:
$R_S=5Z_0,\;R_L=\infty,V_S=5V$
$a=V_{initial}=\frac{Z_0}{Z_S+Z_0}V_s=\frac 16*5V=0.8333V\\ \rho_L=\frac{Z_L-Z_0}{Z_L+Z_0}=1\\ b=a\rho_L=\frac 56*1=0.8333V\\ \rho_S=\frac{Z_S-Z_0}{Z_S+Z_0}=\frac23\\ c=b\rho_s=\frac56*\frac23=0.5556\\ d=c\rho_L=\frac59*1=0.5556\\ e=d\rho_s=\frac59*\frac23=0.3704\\ f=e\rho_L=0.3704 g=f\rho_s=0.2469\\ h=g\rho_L=0.2469$
$A=a=0.8333\\ B=a+b+c=2.2216\\ C=a+b+c+d+e=3.1476$
$A'=a+b=1.6666\\ B'=a+b+c+d=2.7772\\ C'=a+b+c+d+e+f=3.5180$
在这里插入图片描述

example:
$R_S=75\Omega,\;Z_0=50\Omega,\;\;R_L=\infty,V_S=2V$
$a=V_{initial}=\frac{Z_0}{Z_S+Z_0}V_s=\frac 25*2V=0.8V\\ \rho_L=\frac{Z_L-Z_0}{Z_L+Z_0}=1\\ b=a\rho_L=\frac 45*1=0.8V\\ \rho_S=\frac{Z_S-Z_0}{Z_S+Z_0}=\frac15\\ c=b\rho_s=\frac45*\frac15=0.16\\ d=c\rho_L=0.16*1=0.16\\ e=d\rho_s=0.16*\frac15=0.032\\ f=e\rho_L=0.032$
$A=a=0.8\\ B=a+b+c=1.96\\ C=a+b+c+d+e=2.152$
$A'=a+b=1.6\\ B'=a+b+c+d=1.92\\ C'=a+b+c+d+e+f=1.984$
在这里插入图片描述

Circuit Simulation and Combinational Circuit Design

CMOS saturation diagram of NOT gate

在这里插入图片描述

在这里插入图片描述
$V_{IL}:$ PMOS in linear NMOS in saturation
$V_{IH}:$ PMOS in saturation NMOS in linear
$V_{OH}:$ PMOS in linear NMOS off
$V_{OL}:$ PMOS off NMOS linear

for fast raise time and middle logic threshold point: $\frac {w_p}{L_n}=2.5*\frac{w_n}{L_n}$

multi-stage fanout

capacitance between driver and receiver: $C_L=_{gsp}+c_{gdp}+c_{gdn}+c_{gsn}+c_{w}$
$C_{gp}=2.5C_{gn}$
total delay: $\tau_p=0.69c_L(\frac{R_{eqN}+R_{eqP}}2)$
width: $w_p=\beta*w_n$
$t_p=0.345[(1+\beta)(c_{gdN}+c_{gsN})+c_w]R_{eqN}(1+\beta)$
$\tau=\tau_{int}+S\cdot C_L$
$t_p=0.69R_{eq} c_{int} (1+C_{ext}/C_{int} )$
$C_{ext}$ : external load capacitance

energy store in cap
$\frac 12C_LV_{DD}^2$

if we need big driver, we should gradually increase size of TR, optimize total delay gate 1 gate 2 gate 3 gate 4 … gate n
$\tau=N\cdot R_1SC_1$
$R_1$ : the resistor of the first transistor
$C_1$ : capacitor of the first transistor

maximum number of fan-out(also the minimum delay): 4

sizing factor $F$ and effective fanout $f$
$f=\sqrt[N]{C_L/C_{g,1}}=\sqrt[N]F$
$N$ : number of fan-out
$C_{g,1}$ :input capacitance of the first inverter (minimally-sized device)
minimum delay
$t_p=Nt_{p0}(1+\sqrt[N]F/\gamma)$
$t_{p0}$ : intrinsic delay of inverter (independent of sizing)
$\gamma$ : $\frac {c_{int}}{c_g}$ , proportionality factor, which is only a function of technology and is close to 1

在这里插入图片描述

fanout for best power consumption

$C_{tot}=C_{g1}[(1+\gamma)(1+f)+F]\\ \text{energy dissipation:}\\ E=V_{dd}^2C_{g1}((1+\gamma)(1+f)+F)$

CMOS circuit power consumption

dynamic power/switching power
short circuit power
leakage power/static power
- subthreshold current
- BTBT current (diode leakage current)
- Gate tunneling

energy consumed per switching period:
$E_{dp}=t_{sc}V_{DD}I_{peak}$
$t_sc$ : time both devices are conducting, $\frac {V_{DD}-2V_T}{V_{DD}}t_{rising}$
average power consumption:
$P_{dp}=C_{sc}V_{DD}f$

power delay product PDP
$C_LV_{DD}^2/2$
energy delay product EDP
$PDP*t_p=\frac {\alpha*C_L^2V_{DD}^3}{2(V_{DD}-V_T)}$
$\alpha$ : technology parameter

techniques to reduce power

MTVT: multi-threshold voltage technique
clock gating
- pipelining
- parallel processing
coding technique
- thermometer code

MTVT: threshold voltage higher, harder to turn on transistor, react slower, but less leakage (high threshold can reduce leakage).

technology scaling

Parameter	Relation	Full Scaling	General scaling	Fixed-Voltage scaling
Area/Device	$W L$	$1/S^2$	$1/S^2$	$1/S^2$
Intrinsic Delay	$R_{on}C_{gate}$	$1/ S$	$1/ S$	$1/ S$
Intrinsic Energy	$C_{gate}V^2$	$1/S^3$	$1/SU^2$	$1/ S$
Intrinsic Power	Energy/Delay	$1/S^2$	$1/U^2$	1
Power Density	P/Area	1	$S^2/U^2$	$S^2$