Continuous Online Self-Monitoring Introspection Circuitry for Timing Repair by Incremental Partial-Reconfiguration (COSMIC TRIP)
Hans Giesen,
Benjamin Gojman,
Raphael Rubin,
Ji Kim,
and André DeHon
ACM Transactions on Reconfigurable Technology and Systems (TRETS) ,
Volume 11, Number 3, Article No. 3, DOI: 10.1145/3158229, January, 2018.
|
We show that continuously monitoring on-chip delays at the LUT-to-LUT link
level during operation allows a field-programmable gate array to
detect and self-adapt to aging and environmental timing
effects. Using a lightweight (>4% added area) mechanism for
monitoring transition timing, a Difference Detector with First-Fail
Latch, we can estimate the timing margin on circuits and identify
the individual links that have degraded and whose delay is
determining the worst-case circuit delay. Combined with
Choose-Your-own-Adventure precomputed, fine-grained repair
alternatives, we introduce a strategy for rapid, in-system
incremental repair of links with degraded timing. We show that
these techniques allow us to respond to a single aging event in
less than 190ms for the toronto20 benchmarks. The result is a step
toward systems where adaptive reconfiguration on the time-scale of
seconds is viable and beneficial.
Copyright
Geisen, Gojman, Rubin, Ji, DeHon 2018. Publication rights licensed to ACM.
This is the author's version of the work. It is posted here for your
personal use.
Not for redistribution. The definitive version was published in
ACM Transactions on Reconfigurable Technology and Systems (TRETS) ,
http://dx.doi.org/10.1145/3158229 .
|