Andrew B. Kahng

ORCID: 0000-0002-4490-5018
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • VLSI and FPGA Design Techniques
  • VLSI and Analog Circuit Testing
  • Low-power high-performance VLSI design
  • Advancements in Photolithography Techniques
  • 3D IC and TSV technologies
  • Interconnection Networks and Systems
  • Embedded Systems Design Techniques
  • Advancements in Semiconductor Devices and Circuit Design
  • Integrated Circuits and Semiconductor Failure Analysis
  • Semiconductor materials and devices
  • Manufacturing Process and Optimization
  • Parallel Computing and Optimization Techniques
  • Advanced Surface Polishing Techniques
  • Copper Interconnects and Reliability
  • Industrial Vision Systems and Defect Detection
  • Electromagnetic Compatibility and Noise Suppression
  • Computational Geometry and Mesh Generation
  • Optimization and Packing Problems
  • Advanced Memory and Neural Computing
  • Advanced Multi-Objective Optimization Algorithms
  • Modular Robots and Swarm Intelligence
  • Electronic Packaging and Soldering Technologies
  • Radiation Effects in Electronics
  • Physical Unclonable Functions (PUFs) and Hardware Security
  • Metaheuristic Optimization Algorithms Research

University of California, San Diego
2016-2025

UC San Diego Health System
2002-2025

Association for Computing Machinery
2012-2015

Penn Center for AIDS Research
2012-2015

Cedars-Sinai Medical Center
2015

University of Utah
2012

University of Michigan
2006-2012

University of California, Los Angeles
1998-2008

Silicon Labs (United States)
1998-2008

University of California, Berkeley
2005-2008

10.1023/a:1008855018923 article EN Autonomous Robots 1997-01-01

Partitioning of circuit netlists in VLSI design is considered. It shown that the second smallest eigenvalue a matrix derived from netlist gives provably good approximation optimal ratio cut partition cost. also demonstrated fast Lanczos-type methods for sparse symmetric problem are robust basis computing heuristic cuts based on eigenvector this eigenvalue. Effective clustering an immediate by-product computation and very successful difficult input classes proposed CAD literature. The...

10.1109/43.159993 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 1992-01-01

As industry moves towards many-core chips, networks-on-chip (NoCs) are emerging as the scalable fabric for interconnecting cores. With power now first-order design constraint, early-stage estimation of NoC has become crucially important. ORION [29] was amongst first models released, and since been fairly widely used NoCs. However, when validated against recent prototypes -- Intel 80-core Teraflops chip Scalable Communications Core (SCC) we saw significant deviation that can lead to erroneous...

10.5555/1874620.1874721 article EN Design, Automation, and Test in Europe 2009-04-20

As industry moves towards many-core chips, networks-on-chip (NoCs) are emerging as the scalable fabric for interconnecting cores. With power now first-order design constraint, early-stage estimation of NoC has become crucially important. ORION was amongst first models released, and since been fairly widely used NoCs. However, when validated against recent prototypes - Intel 80-core Teraflops chip communications core (SCC) we saw significant deviation that can lead to erroneous choices. This...

10.1109/date.2009.5090700 article EN 2009-04-01

Approximation can increase performance or reduce power consumption with a simplified inaccurate circuit in application contexts where strict requirements are relaxed. For applications related to human senses, approximate arithmetic be used generate sufficient results rather than absolutely accurate results. Approximate design exploits tradeoff of accuracy computation versus and power. However, required varies according applications, 100% still some situations. In this paper, we propose an...

10.1145/2228360.2228509 article EN 2012-05-31

There has been increased research interest in systems composed of multiple autonomous mobile robots exhibiting collective behavior. Groups are constructed, with an aim to studying such issues as group architecture, resource conflict, origin cooperation, learning, and geometric problems. As yet, few applications robotics have reported, supporting theory is still its formative stages. In this paper, the authors give a critical survey existing works discuss open problems field, emphasizing...

10.1109/iros.1995.525801 article EN 2002-11-19

Historically, server designers have opted for simple memory systems by picking one of a few commoditized DDR products. We are already witnessing major upheaval in the off-chip hierarchy, with introduction many new products—buffer-on-board, LRDIMM, HMC, HBM, and NVMs, to name few. Given plethora choices, it is expected that different vendors will adopt strategies their high-capacity systems, often deviating from standards and/or integrating functionality within systems. These likely differ...

10.1145/3085572 article EN ACM Transactions on Architecture and Code Optimization 2017-06-28

As industry moves towards multicore chips, networks-on-chip (NoCs) are emerging as the scalable fabric for interconnecting cores. With power now first-order design constraint, early-stage estimation of NoC has become crucially important. In this work, we present ORION 2.0, an enhanced and area simulator, which offers significant accuracy improvement relative to its predecessor, 1.0.

10.1109/tvlsi.2010.2091686 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 2011-03-15

This work focuses on congestion-driven placement of standard cells into rows in the fixed-die context. We summarize state-of-the-art after two decades research recursive bisection and implement a new placer, called Capo, to empirically study achievable limits approach. From among recently proposed improvements bisection, Capo incorporates leading-edge multilevel min-cut partitioner [7], techniques for partitioning with small tolerance [8], optimal partitioners end-case min-wirelength placers...

10.1145/337292.337549 article EN Proceedings of the 40th conference on Design automation - DAC '03 2000-01-01

The deferred-merge embedding (DME) algorithm, which embeds any given connection topology to create a clock tree with zero skew while minimizing total wirelength, is presented. algorithm always yields exact trees respect the appropriate delay model. Experimental results show an 8% 15% wire length reduction over some previous constructions. DME may be applied either Elmore or linear model, and optimal wirelength for delay. very fast running in time number of synchronizing elements. A unified...

10.1109/82.204128 article EN IEEE Transactions on Circuits and Systems II Analog and Digital Signal Processing 1992-01-01

Elmore delay has been widely used to estimate interconnect delays in the performance-driven synthesis and layout of very-large-scale-integration (VLSI) routing topologies. For typical RLC interconnections, however, can deviate significantly from SPICE-computed delay, since it is independent inductance rise time input signal. Here, we develop an analytical model based on first second moments incorporate effects into for interconnection lines under step input. Delay estimates using our are...

10.1109/43.664231 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 1997-01-01

The International Technology Roadmap for Semiconductors (ITRS) is a collaborative effort within the semiconductor industry to confront challenges implicit in Moore's law. roadmap's goal present an industry-wide consensus on "best current estimate" of its R&D needs out 15-year horizon. As such, ITRS provides guide efforts companies, research organizations, and governments improve quality investment decisions made at all levels. 2001 notable because it was developed with truly international...

10.1109/2.976918 article EN Computer 2002-01-01

Digital system designs are the product of valuable effort and know-how. Their embodiments, from software HDL program down to device-level netlist mask data, represent carefully guarded intellectual property (IP). Hence, design methodologies based on IP reuse require new mechanisms protect rights producers owners. This paper establishes principles watermarking-based protection, where a watermark is mechanism for identification that (i) nearly invisible human machine inspection, (ii) difficult...

10.1145/277044.277240 article EN 1998-01-01

The deferred-merge embedding (DME) algorithm is presented. In linear time, it embeds any given connection topology into the Manhattan plane to create a clock tree with zero skew while minimizing total wirelength. Experimental results show that yields exact trees 9% 16% wirelength reduction over previous constructions. DME may be applied either Elmore or delay model and optimal for delay.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">&gt;</ETX>

10.1109/asic.1992.270316 article EN 2003-01-02

Digital system designs are the product of valuable effort and know-how. Their embodiments, from software hardware description language program down to device-level netlist mask data, represent carefully guarded intellectual property (IP). Hence, design methodologies based on IP reuse require new mechanisms protect rights producers owners. This paper establishes principles watermarking-based protection, where a watermark is mechanism for identification that is: (1) nearly invisible human...

10.1109/43.952740 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2001-01-01

A fast approach to the minimum rectilinear Steiner tree (MRST) problem is presented. The method yields results that reduce wire length by up 2% 3% over previous methods, and first heuristic which has been shown have a performance ratio less than 3/2; in fact, or equal 4/3 on entire class of instances where c(MST)/c(MRST) exactly 3/2. algorithm practical asymptotic complexity owing an elegant implementation uses methods from computation geometry parallelizes readily. randomized variation...

10.1109/43.144853 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 1992-07-01

The authors propose a provably good performance-driven global routing algorithm for both cell-based and building-block design. approach is based on new bounded-radius minimum tree formulation. first present several heuristics with performance, an analog of Prim's spanning construction. Next, they give which simultaneously minimizes cost the longest interconnection path, so that are bounded by small constant factors away from optimal. They also show geometry helps in routing: Manhattan plane,...

10.1109/43.137519 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 1992-06-01

Recent work has illustrated the promise ofmultilevel approaches for partitioning large circuits. Multilevel partitioningrecursively clusters instance until its size is smallerthan a given threshold, then unclusters while applyinga refinement algorithm. Our multilevel partitioner usesa new technique to control number of levels in matching-basedclustering phase and also exploits recent innovations classiciterative partitioning. heuristic outperforms numerousexisting bipartitioning heuristics,...

10.1145/266021.266275 article EN 1997-01-01

Automated cell placement is a critical problem in very large scale integration (VLSI) physical design. New analytical methods that simultaneously spread cells and optimize wirelength have recently received much attention from both academia industry. A novel simple objective function for spreading over the area described patent of Naylor et al. (U.S. Pat. 6301693). When combined with function, this allows efficient simultaneous optimization using nonlinear techniques. In paper, we implement...

10.1109/tcad.2005.846366 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2005-04-25

The Nesterov's method approach to analytic placement has recently demonstrated strong solution quality and scalability. We dissect the previous implementation strategy show that can be significantly improved using two levers: 1) constraint-oriented local smoothing 2) dynamic step size adaptation. propose a new density function comprehends overflow of area resources; this enables at per-bin granularity. Our adaptation automatically determines effectively allocates optimization effort improve...

10.1109/tcad.2018.2859220 article EN publisher-specific-oa IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2018-07-24

Modern digital IC designs have a critical operating point, or “wall of slack”, that limits voltage scaling. Even with an error-tolerance mechanism, scaling below - so-called overscaling results in more timing errors than can be effectively detected corrected. This the effectiveness trading off system reliability and power. We propose design-level approach to (power) in, e.g., microprocessor designs. increase range values at which (timing) error rate is acceptable; we achieve this through...

10.5555/1899721.1899911 article EN Asia and South Pacific Design Automation Conference 2010-01-18
Coming Soon ...