Engineer's Planet

Design of VLSI Circuits and Systems

Design of VLSI (Very Large Scale Integration) Circuits and Systems refers to the process of designing and developing integrated circuits (ICs) that consist of a large number of transistors and other electronic components. VLSI circuits and systems are used in a wide range of electronic devices, from computers and smartphones to medical equipment and automotive systems.

MTech VLSI Projects | MS, IEEE, ME, BE Projects

Course Overview

This course delves into the advanced principles of VLSI (Very Large Scale Integration) circuit and system design using the latest CMOS (Complementary Metal-Oxide-Semiconductor) technologies. Students will learn:

The course will explore how these concepts apply to a range of applications, including microprocessors, signal and multimedia processors, portable devices, memory, and periphery. Throughout the course, there will be a special emphasis on circuit optimization and designing for ultra-low power, which will be integrated into both the lectures and project work.

Project

“Energy-Efficient FPGA Configurable Logic Block (CLB) Design”

The goal of this project is to build a configurable logic block (CLB) for FPGAs with the minimum energy and worst-case delay of 2ns. The architecture of the CLB is shown in Fig. 1.

Energy-Efficient FPGA Configurable Logic Block (CLB) Design

The CLB can function as one 8-bit adder, two 4-bit adders, or four 4-input look-up tables (LUTs). Each LUT requires 16 configuration bits of storage elements. Assume these bits to be available as inputs, so the LUT functions as a 16:1 multiplexer. You may use gate sizing and supply voltage scaling as variables. No extra pipelining registers are allowed.

[Constraints]

Project Report

Title: Energy-Efficient FPGA Configurable Logic Block (CLB) Design

Abstract:

This work presents an energy-efficient FPGA configurable logic block with energy of 286.1 fJ in VDD of 460mV. The indicate size is 58.2 um by 87.7 um with 1.506 of the aspect ratio. The static CMOS type of lower power (LP) full-adder is implemented in this design, and VDD and size optimization have been performed through the critical path.

Background Knowledge

Full Adder Summary

Design Summary

The overall summary is shown in below:

Critical-Path Analysis

As shown in Fig. 1, the critical path is obviously through the 8-bit adders when the all carries are rippled. From Cin to OutD, there is no internal buffer since MUXs are implemented using a transmission gate and the LP adder does not have one inside. Thus, we add up some buffers at each intermediate nodes to mitigate delay penalty from Elmore delay.

This buffer could be replaced with level-conversion block if we need to implement multi-VDD scheme to perform the optimization. Also, each region will have separated VDD-domain in that case. By the way, the critical path delay is the sum of carry-ripple delay of full-adder blocks and MUXs.

Design Optimization

The gate-level schematic is shown

Before doing the optimization following the method in reference (2) : D. markovic et al, “Methods for True Energy-Performance Optimization” IEEE JSSC, Vol. 39, No. 8, Aug. 2004, all devices are sized arbitrarily along with the critical path. Including parasitic and branch efforts, the optimization can be done in Matlab.

In this optimization, the minimum delay is around 430 psec with maximum energy. The overall results from the optimization are :

Because the target delay is 2 nsec and the minimum delay from the analysis is around 430 psec, we add up the delay around three times of the minimum delay. The figure shows the energy reduction in percentage when we increase the delay.The X-axis is the additional delay with respect to the minimum delay. The Y-axis is the energy reduction at the critical path, not overall energy reduction. Interestingly, our given target delay is too long to make a good optimization because before touching the target delay, all the physical constraints are ended up. First of all, minimum size constraint limits around 0.9 time of additional delay. Then, VDD constraint limits around 1.7 times of additional delay.

Therefore, except the last buffer for the output load, all sizes are the minimum, and the VDD is around 0.5 V, which barely make the transition for an inverter. Also, using multi-VDD scheme, almost same VDD, which is the minimum, are obtained. Thus, we only exploit a single-VDD with regular buffers.

Note that using the optimization, we can reduce the energy at the critical path almost 90 percent.

Above figures show the delay and energy characteristics at each stages. When we increase the delay and optimize the energy in a same time, the stage number 2 and 4 reveal the maximum energy and delay contributors. The stage number 2 and 4 are the adder blocks, which consist of 4 transmission gates in series without any buffers. Thus, a couple of buffers are added up at the full-adder blocks.

Physical Layout

1. Full Adder 1-bit

2. Entire Layout

The overall layout floorplan can be done in center-oriented manner. The inputs, AdrA~AdrD[3:0], and DA~DD[15:0], are fed from left and right sides, then the internal signal transmissions are done through toward the center. The final results are captured at the center, and the last buffers are placed at the bottom. In fact, we can place the last buffer at either the top or bottom. There is no description regarding to floorplan and the input/output ports. The overall layout have done manually, and the total area is 5104 um^2. The critical path is highlighted using blue line.

Overall blocks are very compact, and they shared same VDD and GND lines.

 

Exit mobile version