A VLSI DESIGN CONCEPT FOR PARALLEL ITERATIVE ALGORITHMS
#1

Presented by
M.V.SUBBA LAKSHMI
B.MOUNIKA

[attachment=10321]
A VLSI DESIGN CONCEPT FOR PARALLEL ITERATIVE ALGORITHMS
Abstract:

Modern VLSI manufacturing technology has kept shrinking down to the nanoscale level with a very fast trend.
Integration with the advanced nano-technology now makes it possible to realize advanced parallel iterative algorithms directly which was almost impossible 10 years ago. In this paper,
we want to discuss the influences of evolving VLSI technologies for iterative algorithms and present design strategies
from an algorithmic and architectural point of view. Implementing an iterative algorithm on a multiprocessor array,there is a trade-off between the performance/complexity of processors and the load/throughput of interconnects. This is due to the behavior of iterative algorithms. For example, we could simplify the parallel implementation of the iterative algorithm(i.e., processor elements of the multiprocessor array) in any way as long as the convergence is guaranteed. However, the modification of the algorithm (processors) usually increases the number of required iterations which also means that the switch activity of interconnects is increasing. As an example we show that a 25×25 full Jacobi EVD array could
be realized into one single FPGA device with the simplified μ-rotation CORDIC architecture.
1 Introduction
Modern VLSI manufacturing technology has kept shrinking down to Deep Sub-Micron (DSM) with a very fast trend
and Moore’s law is expected to hold for the next 10 years (Gelsinger, 2008). Now, since the DSM nano-technology allows the integration of an ever-increasing number of IP macro-cells on a single silicon die, parallel multiprocessor platforms have received great attention and have been realized into several state-of-the-art applications (e.g., Dual-Core CPU, MPSoC and Parallel Computing) (Vangal et al., 2007; Wolf, 2004; Vitullo et al., 2008). 10 years ago, for 0.35μm technology, design engineers were focusing on reducing the area size. Later, when it came Correspondence to: C. C. Sun to 0.13μm technology they paid huge efforts to improve the signal delay and reduce the power consumption. As the VLSI
manufacturing technology keeps shrinking down into 65 nm,the design methodology for nano-circuits poses new challenges: area requirements of the wire interconnections are increasing explosively in relation to the area of processor elements, bus transmission bottleneck in the million transistors SoC designs, and leakage current is now dominating the
power consumption (Sainarayanan et al., 2007; Stine et al.,2007). These changes bring us to analyze the impacts on parallel iterative algorithms as VLSI technology keeps evolving. As long as the convergence properties of the iterative algorithms are guaranteed, it is possible to modify/simplify the architecture during the iteration steps and reduce the computational complexity significantly with regard to the implementation.
However, this simplification will usually cause an increased number of iterations for convergence. Therefore, it actually becomes a trade-off problem between the performance/ complexity of the hardware, the load/throughput of interconnects and the overall energy/power consumption due
to the behavior of parallel iterative algorithms.Computing the Eigenvalue Decomposition (EVD) with the parallel Jacobi method is used as an example since the convergence of this methodology is very robust to modification of the processor elements. Finally, a VLSI design concept for parallel iterative algorithms is presented which takes into account the influence of the modifications on area, timing delay and power consumption.The paper is organized as follows: in Sect. 2 we will first describe the design concepts for parallel iterative algorithms. After that, we will further clarify the definition of the serial and parallel Jacobi method, respectively, in Sect. 3. Then,in Sect. 4 the design issues of the Jacobi EVD array and
their suitability for different hardware implementations are discussed, which lead to the simplified μ-rotation CORDIC
processor. Section 5 shows the experimental and syntheses results. Section 6 concludes this paper.Published by Copernicus Publications on behalf of the URSI Landesausschuss in der Bundesrepublik Deutschland e.V. 96 C. C. Sun and J. G¨otze: A VLSI design concept for parallel iterative algorithms
2 Design concept and implementation issues
A design concept for parallel iterative algorithms, is presented taking into consideration the influences of different
VLSI technologies in terms of area, power and timing delay.Implementing an iterative algorithm on a multiprocessor array, there is a trade-off between the complexity of an iteration step (assuming that the convergence of the algorithm is retained) and the number of required iteration steps. For example,suppose we have a hardware platform, which requires an iteration step of the iterative algorithm to be executed K times in order to obtain the convergence.The iteration step is executed in parallel on the platform. If we simplify the processors in order to improve the logical utilization of the platform, the number of required iterations usually increase from K to K+L. It also means that the switch activity of interconnects between these processor elements is increasing due to the behavior of iterative algorithm. How to find a superior solution to balance the design criteria is the major
issue of this paper, especially for low-power or limited-area devices. In this paper, we selected the Jacobi EVD method as a typical iterative algorithm since the convergence of this methodology is very robust to modification of the processor elements (Brent and Luk, 1985; Gotze et al., 1993; Goetze and Hekstra, 1995; Klauke and Goetze, 2001). We have investigated the influences in DSM design with different sizes of multiprocessor
arrays (i.e., 4×4, 16×16 and 25×25). After that, several modifications of the algorithm/processor were studied
and their impacts on different FPGA devices were investigated (e.g., Xilinx Virtex series in 0.22μm, 0.15μm and
65 nm). According to these analyses, we present an efficient strategy to comply with the design criteria, especially in balancing the number of iterations and the computational complexity.
3 Eigenvalue decomposition
An Eigenvalue decomposition of a real symmertric n×n matrix A is obtained by factorizing A into three matrices
A=Q ^ QT , where Q is an orthogonal matrix (QQT=I ) and ^ is a diagonal matrix which contains the eigenvalues of
A.
Reply

Important Note..!

If you are not satisfied with above reply ,..Please

ASK HERE

So that we will collect data for you and will made reply to the request....OR try below "QUICK REPLY" box to add a reply to this page
Popular Searches: matlab flowchart for gauss seidel iterative method, confluent eigenvalues, the timeboxing process model for iterative software development, lossy compression and iterative reconstruction for encrypted image ppt, iterative regularization matlab code, dsm ycmou, iterative deepining search technique,

[-]
Quick Reply
Message
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Possibly Related Threads...
Thread Author Replies Views Last Post
  Exploring the design space of social network-based Sybil defenses 1 919 15-02-2017, 02:55 PM
Last Post: jaseela123d
  DYNAMIC SEARCH ALGORITHM IN UNSTRUCTURED PEER-TO-PEER NETWORKS--PARALLEL AND DISTRIBU electronics seminars 9 7,362 14-07-2015, 02:25 PM
Last Post: seminar report asees
  Design of Intranet Mail System nit_cal 14 11,436 19-05-2015, 11:17 AM
Last Post: seminar report asees
  Design and Implementation of TARF: A Trust-Aware Routing Framework for WSNs Projects9 6 3,572 10-01-2015, 11:13 PM
Last Post: Guest
  Persuasive Cued Click-Points: Design, Implementation, and Evaluation of a Knowledge-B Projects9 3 3,024 15-04-2013, 11:14 AM
Last Post: computer topic
  Cooperative Caching in Wireless P2P Networks: Design, Implementation, and Evaluation seminar class 2 3,315 02-02-2013, 02:08 PM
Last Post: seminar details
  Design of Shopping Mall Management System seminar class 1 4,219 12-01-2013, 10:26 AM
Last Post: seminar details
  Brain Computer Interface Design Using Band Powers Extracted During Mental Tasks seminar class 1 2,026 01-01-2013, 10:48 AM
Last Post: seminar details
  Internet Security and Firewall Design smart paper boy 1 1,856 10-12-2012, 01:23 PM
Last Post: seminar details
  DESIGN OF AN INTRANET MAIL CLIENT SYSTEM Electrical Fan 1 3,073 06-12-2012, 01:35 PM
Last Post: seminar details

Forum Jump: