*** This old style webpage is no longer maintained (as of 9/2008). Please go to my new wiki pages for up-to-date information.)
Research Interests
I am interested in many aspects of computer architecture and digital
hardware design. My current research focuses on computer architecture, processor
microarchitecture, simulation and FPGA emulation of computer systems, and tools for high-level hardware design and synthesis.
My current and past program committee service include CAD and architecture conferences such as DAC, FPGA, HPCA, ICCAD, ICCD, ICS, ISCA, MEMOCODE, and SC. My current and past research sponsors include (alphabetically) C2S2, DARPA, IBM, Intel, ITRI@CMU, NSF (including a CAREER Award), SRC, and SUN (OpenSPARC Center of Excellence).
How to contact me.
(** I apologize I cannot reply to email queries about graduate school admission given the traffic volume. I do look at them, especially those with something specific to comment. If you are interested in my projects, please submit an application to ECE and reference me in your statement of purpose. If you want to know the truth about graduate school, read this.)
Reliable Processors and Systems:
This research investigates the impact of soft-error tolerance in
future deep-submicron microprocessor designs. The study
investigates different options to achieve the desired level of
protection against soft errors.
This
research effort is in part supported by NSF through a CAREER Award.
The TRUSS project
(Total Reliability Using Scalable
Servers) develops a reliable,
available, and serviceable (RAS) hardware platform based on a
distributed cluster of commodity blade servers. The goal of the
project is to leverage the cost-effectiveness of commodity processor
and memory modules in a reliable server design that achieves both
performance and cost scalability. This research effort is in part
supported by NSF through an ITR Award and by Intel Corp. (TRUSS Project Page)
- OpenSPARC: An Open Platform for Hardware Reliability Experimentation.
Ishwar Parulkar, Alan Wood, James C. Hoe, Babak Falsafi, Sarita V. Adve and Josep Torrellas. Fourth Workshop on Silicon Errors in Logic-System Effects (SELSE), April 2008.
[pdf]
- Multi-bit Error Tolerant Caches Using Two-Dimensional Error Coding.
Jangwoo Kim, Nikos Hardavellas, Ken Mai, Babak Falsafi and James C. Hoe.
ACM/IEEE International Symposium on Microarchitecture (MICRO), December 2007.
[pdf]
- PAI: A Lightweight Mechanism for Single-Node
Memory Recovery in DSM Servers.
Jangwoo Kim, Jared C. Smolens, Babak Falsafi and James C. Hoe.
IEEE Pacific Rim International Symposium on Dependable Computing (PRDC), December 2007.
[pdf]
- Detecting Emerging Wearout Faults.
Jared C. Smolens, Brian T. Gold, James C. Hoe, Babak Falsafi, and Ken Mai. The Third Workshop on Silicon Errors in Logic - System Effects (SELSE), April 2007.
[pdf]
- Reunion: Complexity-Effective Multicore Redundancy.
Jared C. Smolens, Brian T. Gold, Babak Falsafi, and James C. Hoe.
International Symposium on Microarchitecture (MICRO), December 2006.
[pdf]
-
TRUSS: Reliable, Scalable Server Architecture.
Brian T. Gold, Jared C. Smolens, Jangwoo Kim,
Eric S. Chung, Vasileios Liaskovitis, Eriko Nurvitadhi,
Babak Falsafi, James C. Hoe, and Andreas G. Nowatzyk.
IEEE Micro, Volume 25,
Number 6, November/December 2005.
[pdf]
-
Understanding the Performance of Concurrent Error Detecting Superscalar
Microarchitectures.
Jared C. Smolens, Jangwoo Kim, James C. Hoe, and Babak Falsafi.
Invited paper at IEEE Symposium on Signal Processing and Information Technology, December 2005.
[pdf]
-
Fingerprinting: Bounding Soft-Error-Detection Latency and Bandwidth.
Jared C. Smolens, Brian T. Gold, Jangwoo Kim, Babak Falsafi, James C. Hoe, and Andreas G. Nowatzyk.
IEEE Micro, Volume 24, Number 6, November/December 2004.
[pdf]
-
Efficient Resource Sharing in Concurrent Error Detecting Superscalar Microarchitectures.
Jared C. Smolens, Jangwoo Kim, James C. Hoe, and Babak Falsafi. International Symposium on Microarchitecture (MICRO), November 2004.
[pdf]
-
Fingerprinting: Bounding Soft-Error Detection Latency and Bandwidth.
Jared C. Smolens, Brian T. Gold, Jangwoo Kim, Babak Falsafi, James C. Hoe, and Andreas G. Nowatzyk.
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS),
October 2004.
[pdf]
-
Dual Use of Superscalar Datapath for Transient-Fault Detection
and Recovery.
Joydeep Ray, James C. Hoe and Babak Falsafi. International Symposium on Microarchitecture (MICRO), December 2001.
[pdf]
Mathematical Approach to High-Level Synthesis and Optimization of DSP Hardware:
This research develops a domain-specific hardware synthesis framework
for digital signal processing (DSP) computations. By incorporating
domain-specific knowledge of mathematics and algebra into a synthesis
tool, the proposed framework can manipulate a math-level transform
description to optimize a DSP transform implementation at the
algorithmic and architectural design level. This research is a part of the
SPIRAL project. This
research is in part supported by NSF through an ITR Award and by DARPA through the DESA program.
- Formal Datapath Representation and Manipulation for Implementing DSP Transforms.
Peter A. Milder, Franz Franchetti, James C. Hoe, Markus Pueschel.
Proc. ACM/IEEE Design Automation Conference (DAC), June 2008.
[pdf]
- Multiplexed Multiple Constant Multiplication.
Peter Tummeltshammer, James C. Hoe, and Markus Pueschel.
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Volume 26, Number 9, September 2007. [pdf]
- Generating FPGA-Accelerated DFT Libraries.
Paolo D'Alberto, Franz Franchetti, Peter A. Milder, Aliaksei Sandryhaila, James C. Hoe, Jeremy Johnson, Jose M. F. Moura and Markus Pueschel.
IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), April 2007.
[pdf]
- Discrete Fourier Transform Compiler: From Mathematical Representation to Efficient Hardware.
Peter A. Milder, Franz Franchetti, James C. Hoe, and Markus Pueschel. CSSI Technical Report, No. CSSI 07-01, January 2007.
[pdf]
- Spiral: Joint Runtime and Energy Optimization of Linear Transforms.
Marek Telgarsky, James C. Hoe, and Jose Moura. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2006.
[pdf]
- Fast and Accurate Resource Estimation of Automatically
Generated Custom DFT IP Cores.
Peter Milder, Mohammad Ahmad, James C. Hoe,
and Markus Pueschel. International Symposium on Field Programmable Gate Arrays (FPGA), February 2006.
[pdf]
[try out the tool dftgen]
-
Automatic Generation of Customized Discrete Fourier Transform IPs.
Grace Nordin, Peter Milder, James C. Hoe, and Markus Pueschel.
Design Automation Conference (DAC), June 2005.
[pdf]
[try out the tool dftgen]
-
Custom Optimized Multiplierless Implementations of DSP Algorithms.
Markus Pueschel, Adam Zelinski, and James C. Hoe.
International Conference on Computer Aided Design (ICCAD), November 2004.
[pdf]
- Multiple Constant Multiplication by Time Multiplexed Mapping of Addition Chains.
Peter Tummeltshammer, James C. Hoe, and Markus Pueschel. Design Automation Conference (DAC), June 2004.
[pdf]
- Automatic Cost Minimization for Multiplierless Implementations of Discrete Signal Transforms.
Adam Zelinski, Markus Pueschel, Smarahara Misra, and James C. Hoe.
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2004.
[pdf]
FPGA Prototyping and Emulation of Computer Systems
This research explores the application of Field Programmable Gate Arrays (FPGA) and High-level Hardware Synthesis technologies in computer systems prototyping and emulation. The goal of ProtoFlex FPGA-accelerated hybrid simulation is to extend the capability of
SimFlex multiprocessor simulation to hundreds to thousands of nodes. My students and I are members of the multi-university RAMP project.
-
A Complexity-Effective Architecture for Accelerating
Full-System Multiprocessor Simulations Using FPGAs
.
Eric S. Chung, Eriko Nurvitadhi, James C. Hoe, Babak Falsafi and Ken Mai. International Symposium on Field Programmable Gate Arrays (FPGA), February 2008.
[pdf]
-
RAMP: A Research Accelerator for Multiple Processors.
John Wawrzynek, David A. Patterson, Mark Oskin, Shih-Lien Lu, Christoforos Kozyrakis, James C. Hoe, Derek Chiou, Krste Asanovic. IEEE Micro, Volume 27, Number 2 , March/April 2007.
[pdf]
-
RAMP: Research Accelerator for Multiple Processors - A Community Vision for a Shared Experimental Parallel HW/SW Platform.
Arvind, Krste Asanovic, Derek Chiou, James C. Hoe, Christoforos Kozyrakis, Shih-Lien Lu, Mark Oskin, David Patterson, Jan Rabaey, John Wawrzynek. September 2005.
[pdf]
(Tech report version of original 2005 NSF proposal.)
-
In-System FPGA Prototyping of an Itanium Microarchitecture.
Roland Wunderlich and James C. Hoe. International Conference on Computer Design (ICCD), October 2004.
[pdf]
-
High-Level Modeling and FPGA Prototyping of Microprocessors. Joydeep Ray and James C. Hoe. International Symposium on Field Programmable Gate Arrays (FPGA), February 2003.
[pdf]
SMARTS
Simulation Sampling:
SMARTS is a framework that applies statistical sampling to
accelerate microarchitecture simulation. It employs systematic
sampling to measure only a very small portion of the entire
application being simulated.
The accelerated simulation rate allows complete
benchmarks to be used in performance studies. SMARTS is a part of
SimFlex.
- SimFlex: Statistical Sampling of Computer System Simulation.
Thomas F. Wenisch,
Roland E. Wunderlich,
Michael Ferdman,
Anastassia Ailamaki,
Babak Falsafi, and
James C. Hoe.
IEEE Micro, Volume 26,
Number 4, July/August 2006.
[pdf]
- SMARTS: Accelerating Microarchitecture Simulation
via Rigorous Statistical Sampling.
Roland E. Wunderlich, Thomas F. Wenisch, Babak Falsafi, and James C. Hoe.
ACM Transactions on Modeling and Computer Simulation, Volume 16, Number 3, June 2006.
[pdf]
- Simulation Sampling with Livepoints.
Thomas F. Wenisch, Roland E. Wunderlich, Babak Falsafi and James C. Hoe. International Symposium on Performance Analysis of Systems and Software (ISPASS), March 2006.
[pdf]
- SMARTS: Accelerating Microarchitecture Simulation
via Rigorous Statistical Sampling.
Roland E. Wunderlich, Thomas F. Wenisch, Babak Falsafi, and James C. Hoe. International Symposium
on Computer Architecture (ISCA), June 2003.
[pdf]
-
Applying SMARTS to SPEC CPU2000.
Thomas F. Wenisch, Roland E. Wunderlich, Babak Falsafi, and James C. Hoe. CALCM Technical Report 2003-1, June 2003.
[pdf]
(Tech report version of 2003 ISCA paper with complete data set.)
-
Download the new TurboSMARTS (includes SMARTS).
Operation-Centric Hardware Description and Synthesis:
This research investigates a high-level hardware design framework that
supports a new hardware design abstraction that is fundamentally
different from RTL. This framework is based on an
operation-centric abstraction for hardware description. In an
operation-centric description, the behavior of a system is decomposed
and described as a collection of atomic operations.
Check out Bluespec.com if you want to try out a commercial implementation.
-
Operation-Centric Hardware Description and Synthesis.
James C. Hoe and Arvind. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Volume 23, Issue 9, September 2004.
[pdf]
-
Synchronous Extensions to Operation-Centric Hardware Description Languages.
Grace Nordin and James C. Hoe. International Conference on Formal Methods and Models for Codesign (MEMOCODE), June 2004.
[pdf]
-
Synthesis of Operation-Centric Hardware Descriptions.
James C. Hoe and Arvind. International Conference on Computer Aided Design (ICCAD), November 2000.
[pdf]
- Hardware Synthesis from Term Rewriting Systems.
James C. Hoe and Arvind. IFIP International Conference on VLSI, December 1999.
[pdf]
- An Exercise in High-Level Architectural Description using a Synthesizable Subset of Term Rewriting Systems.
James C. Hoe, Martin Rinard and Arvind. MIT-CSGMEMO-403, February 1998.
[pdf]
User-Level Network Interface for Cluster Computing:
Once upon a time, I developed the
StarT-X
and
StarT-Jr
parallel systems. In 1998, a StarT-X cluster with 32 400MHz Pentium-II
Xeon processors (Hyades) was delivered to
MIT's Earth Atmosphere and Planetary Science Department. For several years, the cluster
was their primary facility for running MITgcmUV (a general circulation model for
climate simulation) in research and courses.
- A Personal Supercomputer for Climate Research.
James C. Hoe, Chris Hill and Alistair Adcroft. Supercomputing Conference (SC), November 1999.
[pdf]
- MPI-StarT: Delivering Network Performance to Numerical Applications. Parry Husbands and James C. Hoe. Supercomputing Conference (SC), November 1998. SC'98 Best Architecture Paper.
[html]
- StarT-X: A One-Man-Year Exercise in Network Interface Engineering.
James C. Hoe. Hot Interconnects VI, August 1998.
[pdf]
- StarT-JR: A Parallel System from Commodity
Technology. James C. Hoe and Mike Ehrlich.
Transputer/Occam International Conference,
November 1996. [pdf]
- StarT-NG: Delivering Seamless Parallel
Computing. Derek Chiou, Boon S. Ang, Arvind, Michael J.
Beckerle, Andy Boughton, Robert Greiner, James E. Hicks and James C.
Hoe. Euro-Par, August 1995.
[pdf]
- Network Interface for Message Passing Parallel Computation
on a Workstation Cluster. James C. Hoe.
Hot Interconnects II, August 1994. [pdf]
How to contact me.