Metastability in Latches, Arbiters and Data-convertors

Article (PDF Available) · August 1999 with 53 Reads

Anthony C. Davies
23.73
King's College London

Real-time digital systems and asynchronous computer systems have an inherent risk of occasional failures due to metastability in various components used to control data-transfers. This possibility has been known for a long time, although it is sometimes forgotten or overlooked. There is renewed interest in asynchronous computer systems because of the prospects they offer for lower power operation and improved electromagnetic compatibility compared to conventional fully synchronous systems. As a result, design approaches to minimise metastability effects need to be adopted. Recent increased understanding of non-linear dynamics and the availability of software for the accurate simulation and visualisation of dynamic behaviour enables metastability to be investigated and demonstrated much more readily. This paper provides a mainly-tutorial review of how metastability arises in various commonly-used components, illustrated with the results of simulations. I. INTRODUCTION Capturing extern...

Content uploaded by Anthony C. Davies

Metastability in Latches, Arbiters and

Data-convertors

Anthony C. Davies

Dept. of Electronic Engineering, King’s College, University of London, Strand, London, WC2R 2LS, England,

ph: +44-171-848-2441, fax: +44-171-836-4781, e-mail: tonydavies @ kcl.ac.uk

Abstract - Real-time digital systems and asynchronous

computer systems have an inherent risk of occasional

failures due to metastability in various components used to

control data-transfers. This possibility has been known for

a long time, although it is sometimes forgotten or

overlooked. There is renewed interest in asynchronous

computer systems because of the prospects they offer for

lower power operation and improved electromagnetic

compatibility compared to conventional fully synchronous

systems. As a result, design approaches to minimise

metastability effects need to be adopted. Recent increased

understanding of non-linear dynamics and the availability

of software for the accurate simulation and visualisation of

dynamic behaviour enables metastability to be investigated

and demonstrated much more readily. This paper provides

a mainly-tutorial review of how metastability arises in

various commonly-used components, illustrated with the

results of simulations.

I. INTRODUCTION

Capturing external data by a real-time digital system

typically involves clocking the data into some form of

latch, such as a D-type flip-flop. To guarantee the proper

operation of the flip-flop, data set up and hold times have

to be complied with, but since the data and clock sources

are independent, inevitably these timing requirements are

occasionally violated: such violations can lead to

metastability [1,2]. The same problem occurs in data

transfers between subsystems within asynchronous

computers.

Metastability arises from unstable equilibrium states in

digital system components, which can result in the

assumption that signals are two-level becoming invalid

for brief but possibly significant periods of time. A

simple example is the conventional set-reset latch (flip-

flop) - it has two state-variables and two stable equilibria

(from which its ability to store a ‘one’ or a ‘zero’ arises).

Each stable state is surrounded by its basin of attraction

but between these there is always an unstable equilibrium

point. If the state-variables closely approach this point,

the transient leading to either stable states may be

substantially slowed down compared to normal, and this

departure from expected timing may result in system

failure.

Fig. 1 shows superimposed simulated trajectories from a

simple latch when data-pulse timing is such as to take

the latch very close to its unstable equilibrium state.

Fig. 2 illustrates the flows in the state-space, from which

the three equilibria can be seen, two stable, one unstable

(arrow length and direction represent the gradient at each

state).

To handle two or more concurrent requests to a shared

resource in a computer system, it is usual to use some

form of arbiter. Arbiters are also liable to metastability,

and occasional operational problems can arise when the

requests are near-to-simultaneous. The first explicit

report of these effects seems to have been by Catt [3].

Analogue-to-digital convertors are relied upon to carry

out accurate conversions of analogue signal values to

digital form, but are subject to occasional data-

conversion failures for similar reasons.

Observing metastability experimentally has always been

difficult, which has encouraged a tendency to ignore it.

Some occasional ‘failures’ in real-time computing

systems may be attributed to metastability, but because of

the unrepeatability of the circumstances, verification is

seldom possible, so other kinds of timing errors or

software errors may be blamed.

Nowadays, accurate computer models of realistic digital

circuits are available, together with powerful simulators

which enable dynamic behaviour to be easily

investigated. Most design engineers have access to

powerful desktop computing resources. This makes

possible the modelling and accurate dynamic simulation

of complex configurations of gates and their timing

behaviour using a variety of advanced software packages.

Consequently, metastability may now be easily

investigated for both realistic gate models and simplified

idealised gate models.

Numerical solution of the non-linear differential

equations of simple ‘idealised gates’ was used for the

results illustrated in this paper.

A data-base of references to metastability accessible by

WWW has been assembled [5].

II. ASYNCHRONOUS DIGITAL SYSTEMS

It has long been recognised that asynchronous computer

systems should have some advantages over the

conventional synchronous digital computers. In a

synchronous system, the maximum clock frequency is

limited by the need for every gate’s worst-case time delay

to be complied with. In an asynchronous system, there is

no need to ‘wait for the clock’. This offers a potential

speed-increase, because behaviour is dependant on the

average and not the worst-case processing delay of each

subsystem, but also increases the risk of metastability.

Recently, the needs for low-power systems (especially for

hand-held battery operated mobile communications

systems components capable of high performance digital

data handling and processing, within the popular concept

of ‘multi-media’), has led to renewed interest in adopting

asynchronous designs [6,7]. The European Commission

funds work in this area through ESPRIT (the ACiD

Working Group [8]), and the UK Asynchronous Forum

now meets bi-annually.

Clock distribution to all parts of a synchronous system

requires a high power signal liable to radiate troublesome

interference at the clock frequency and its multiples, and

assuring adequate electromagnetic compatibility is not

easy. Also, most parts of such a system consume power

continuously, whether or not they are occupied in useful

processing operations.

An asynchronous system often incorporates locally-

synchronous subsystems, which need consume significant

power only when they are doing needed work - for the

rest of the time they can be put into a power-down

(‘sleep’) mode. Because each such subsystem is self

timed, sometimes with its own local clock, the radiation

from a shared clock is avoided, and since the clocks of

the various subsystems are not synchronised, any radiated

interference tends to be at a much lower level and

broadband, so avoiding the spectral peaks of the noise

emission from synchronous systems.

A commercial example is the asynchronous re-design by

Philips Semiconductor of the 80C51 microcontroller.

This uses one quarter of the power and has dramatically

less clock-radiation compared with the synchronous

version fabricated by the same technology [9].

Shrinking of integrated-circuit component-dimensions, a

major factor in the steady increase in microprocessor

performance, reduces gate delays but relatively increases

interconnect delays. This makes it increasingly difficult

to distribute high-speed clocks across the whole of a

complex chip, leading to synchronisation problems,

clock-skew, etc. in synchronous systems which may be

circumvented by asynchronous designs.

III. ARBITER CIRCUITS

An arbiter is used to select between two or more

concurrent requests for service or for access to a shared

resource. This is an inherent requirement in many kinds

of data-transfers and interrupt-handing systems. It has

been known for a long time that all arbiters are subject to

the risk of metastability [10, 11]

The simplest example of an arbiter is a set-reset flop-flop

(normally followed by a ‘filter’ which is a digital circuit

to reduce the probability of a metastable transient

propagating to the output). The metastable level

(between the high and low logic levels) can persist at the

output of a flip-flop for a significant (and theoretically

unbounded) time, and the effect of the ‘filter’ is to hide

this metastable level from the following circuits -

however, it can achieve this only by delaying the time at

which the final output can be ‘trusted’ - so that at best

an ‘uncertain amplitude’ is exchanged for an ‘uncertain

time-delay’.

A simple ‘filtering’ idea often advocated is simply to

follow the metastable output by a gate with a very low (or

high) threshold. The intention is to keep the output

constant until the input has departed sufficiently from the

intermediate metastable level for a ‘clean’ digital level to

be reached by a fast transition at the final output.

However, this can reduce but does not eliminate the risk.

It is possible to make ‘1 out of n’ arbiters from

combinations of basic gates, but unless care is taken,

these can exhibit various additional forms of

metastability, because of their increased number of

unstable equilibrium states.

The ‘1 out of 3’ arbiter of Fig. 3 [12, 13] suffers from

several problems (such as modes where fairness in

responding to input requests is not guaranteed). Fig. 4a

shows output transients from a large number of initial

states for the condition that all inputs are held high (for

which there are three stable states). As well as normal

transients, some delayed (metastable) responses can be

seen. Fig. 4b shows trajectories initiated from a number

of closely adjacent states in the vicinity of the state-space

origin, leading near to a metastable point and

terminating on one of the three stable states [14].

IV. DIGITAL-TO-ANALOGUE CONVERTORS

The standard successive-approximation analogue to

digital convertor is liable to occasional conversion errors,

which may not be small. It is well-known that such

errors can arise from not keeping the input analogue

signal constant over the duration of the conversion

process. Less often realised is the possibility that errors

can occur even if the input is held absolutely constant

[15]. The convertor uses a comparator to compare the

input signal level with an internally-generated level. The

comparator is, in effect, a very high gain amplifier which

is supposed to be always driven into saturation in one

direction or the other (so generating a digital output).

However, if the two input signal levels are very close the

output may be at some intermediate value within the

‘linear’ range of the comparator. As well as possibly

affecting any serial output from the convertor, this signal

will be ‘clocked’ into the register-system which drives

the internal digital-to-analogue convertor and can result

in uncertainty about the state latched into the register.

Of course, with a properly-designed convertor, using a

high-gain comparator, the error probability is extremely

small, but is not zero.

V. SYNCHRONISERS

A synchroniser is required to accept input transitions at

any time, and to generate corresponding output

transitions synchonised with the timing of a local clock.

Compared to a standard ‘latch’ which may be regarded

as ‘level-sensitive’ (e.g. capturing an input level), the

synchroniser is ‘transition-sensitive’. Failure modes

include a slow output transition or intermediate output-

signal levels. Metastability, oscillations and suggestions

of chaos have been reported [3,4,10]

VI. ATTEMPTS TO AVOID METASTABILITY

Various unsuccessful schemes have been proposed to

eliminate metastablity.

Claims that making flip-flops from Schmitt-trigger gates

(e.g. gates with hysteresis) can overcome metastabilty

have been shown to be false [4].

An ‘inertial delay’ is a filter which is supposed to ‘clean’

metastable transients (also called ‘glitches’ or ‘runt

pulses’). However problems of designing a perfect

inertial delay are essentially the same as problems of

making a perfect synchroniser [16], and so are doomed to

failure.

The Muller ‘C-element’ [17] is a popular replacement for

latches as a memory element in asynchronous circuits. It

detects and follows when two inputs are both high or

both low, and stores the corresponding level while the

inputs differ. Denoting the inputs by x, y and the output

by z, it may be represented by the assignment:

z := (x and y) or (z and (x or y))

Metastability is possible if the inputs change level in

opposite directions or do not remain at one level long

enough for the element to respond properly.

Fig. 5 shows metastable transients at the output z from a

simulation of a C-element made from ‘and’ and ‘or’

gates. These occur because the duration of the

simultaneous high level on x and y is too short for the C-

element to respond adequately.

The ‘asymmetric-C element’ [7] is a variation in which

one input is dominant: the output z goes high if x or y are

high, while if x is low, the output z goes low. This can

also exhibit metastablity.

VI. DISCUSSION AND CONCLUSIONS

Asynchronous designs offer scope for overcoming

limitations of synchronous designs which arise in the

context of faster and smaller systems and in low power

and hand held systems, but a reappraisal is required of

the risks of metastablity. The recent availablity of

excellent tools for non-linear dynamic systems analysis

and for visualisation offer the prospect of improved

insights and a more detailed evaluation.

The key problem lies in the mapping between discrete

and continuous domains. In the time domain, the

sequencing of events (which represents the behaviour at

the software level) has to be mapped to the continuous

time domain and in this process occasional unpredictable

delays from metastability may occur. In the amplitude

domain, latching a level by a clock and conversion of a

continuous-amplitude to one of a discrete number of

levels (by an analogue-to-digital convertor) leads to the

possiblity of occasional conversion errors. Uncertainty in

level may be exchanged with uncertainty in time but

complete elimination of these problems cannot be

achieved. Good design can reduce the probability and

consequences of these infrequent events to an acceptable

level but this requires system designers to be aware of

them.

Acknowledgements: Colleagues in the Department of

Computing Science, University of Newcastle-upon-Tyne,

and at Matra BAe Dynamics are thanked for discussions

about the material in this paper. The U.K. EPSRC is

thanked for financial support (Grant No. GR/L92471).

REFERENCES

[1] L.R. Marino “General Theory of Metastable

operation”, IEEE Trans., C-30, pp. 107-115, 1981

[2] A. C. Davies “Analysis of Metastable Dynamics of

Bistable Flip-flops”, Proc. 6

Int. Symp. on Networks,

Systems and Signal Processing, Zagreb, Yugoslavia,

pp. 379-382, June 1989

[3] I. Catt “Time loss through gating of asynchronous

logic signal pulses” IEEE Trans., EC-15, pp. 108-

111, 1966

[4] T.J. Chaney “Comments on “A note on synchroniser

or interlock maloperation” IEEE Trans., C-28, pp.

802-804, 1979

[5] www.eee.kcl.ac.uk/~comfort

[6] S. Nowick, M.B. Josephs, C.H. van Berkel (editors)

Proc. IEEE, Special Issue on ‘Asynchronous Circuits

and Systems’, Feb. 1999

[7] J. Kessels and P. Marston “Designing Asynchronous

Standby Circuits for a Low-Power Pager” Proc.

IEEE, pp. 257-267, 1999

[8] www.scism.sbu.ac.uk/ccsv/ACiD-WG

[9] H. van Gagaldonk et al “An asynchronous low-power

80c51 microcontroller” Proc. Async’98, San Diego,

pp96-107, 30 March - 2 April 1998

[10] T.J. Chaney and C.E. Molnar “Anomalous Behavior

of Synchronizer and Arbiter Circuits”, IEEE Trans.,

C-22, pp. 421-422, 1973

[11] W. Fleischhammer and O. Dortok “The anomalous

behavior of flip-flops in synchronizer circuits”, IEEE

Trans., C-28, pp. 273-276 1979

[12] C.H. van Berkel and C.E. Molnar “Beware the

three-way arbiter’ accepted for IEEE Trans. Solid-

state Circuits, June 1999

[13] A. Bystrov and A. Yakovlev “Revisiting the problem

of Fair Arbitration”, 5th U.K. Asynch. Forum,

Cambridge, England, 17-18th December 1988

[14] A.C. Davies “Multi-flops - a view of the dynamic

behaviour” accepted for NDES’99, Denmark, July

1999

[15] D.J Kinniment et al “Towards Asynchronous A-D

Conversion”, Proc. Async’98, pp206-215, 30 March

- 2 April 1998

[16] J.C. Barros and B.W. Johnson “Equivalence of

arbiter, synchronisers, latches, and inertial delays”

IEEE Trans., C-32, pp603-614, 1983

[17] C.L. Seitz “System Timing”, Chapter 7 of C. Mead

and L. Conway ‘Introduction to VLSI Systems’

Addison Wesley, pp. 218-262, 1980

Figure 3. ‘1 out of 3’ arbiter

Figure 4a. Transients from ‘1 out of 3’ arbiter

Figure 2. Flip-flop: flows in state-space

Figure 1. Metastable transients of flip-flop

Figure 4b. Trajectories of ‘1 out of 3’arbiter

Figure 5. Metastability of ‘C-element’

Download full-text PDF

... Significant insights were also gained into the design and operation of these algorithms, which allowed the production of design guidance hints for ACMs [16]. Metastable operating conditions were also investigated at the level of the dynamic behaviour of the circuits for standard latches with one feedback loop [35] and for latches involving more than one feedback loop, known as Tri-flops for a three-way arbiter, and, in general Multi-flops [18,32]. Other low level components investigated under metastable operation were arbiters [35], fair arbiters [15], synchronisers [19], and A-D converters [35] with significant findings. ...
... Metastable operating conditions were also investigated at the level of the dynamic behaviour of the circuits for standard latches with one feedback loop [35] and for latches involving more than one feedback loop, known as Tri-flops for a three-way arbiter, and, in general Multi-flops [18,32]. Other low level components investigated under metastable operation were arbiters [35], fair arbiters [15], synchronisers [19], and A-D converters [35] with significant findings. Complete self-timed ACM algorithms were also simulated at the low level in the analogue domain under metastable conditions. ...
... Metastable operating conditions were also investigated at the level of the dynamic behaviour of the circuits for standard latches with one feedback loop [35] and for latches involving more than one feedback loop, known as Tri-flops for a three-way arbiter, and, in general Multi-flops [18,32]. Other low level components investigated under metastable operation were arbiters [35], fair arbiters [15], synchronisers [19], and A-D converters [35] with significant findings. Complete self-timed ACM algorithms were also simulated at the low level in the analogue domain under metastable conditions. ...
Task I: Methods and tools for formal modelling and rigorous analysis of ACMs
Article
Full-text available
May 2005

A. Fraser

Sergio Velastin

Anthony C. Davies

Ian G. Clark
View
... To guarantee proper operation, data set up and hold times have to be complied with, but since clock and data are from independently-timed domains, it is inevitable that occasionally these timing requirements will be violated. Such violations can lead to metastability [2,3], a non-linear dynamical phenomenon which can result in occasional excessively-long settling times at gate and flip-flop outputs, during which the output level may remain at a mid-value (neither logic 1 or logic 0). Arbiters are used to control access by multiple processes to shared resources, and have been shown to be inherently subject to metastability, and Analogue-to- Digital conversion within a fixed conversion time also involves a risk of metastability [4,5]. ...
Communications protocols and mechanisms for distributed digital systems
Article
Full-text available

Anthony C. Davies
Synchronous clocking has continued to be the dominant digital design method despite the problems of clock distribution in integrated circuit chips of increasing complexity and speed. The continuing increases will soon force a change to asynchronous design methods, and the communication between large numbers of high-performance processors on a single chip will become a critical issue. An introduction to a classification scheme of mechanisms for interprocess communications via shared memory is described in the paper.
View

Towards Asynchronous A-D Conversion
Article
Jan 1998

Kinniment Gao
Analogue to digital (A-D) converters with a fixed conversion time are subject to errors due to metastability. These errors will occur in all converter designs with a bounded time for decisions, and are potentially severe. We estimate the frequency of these errors in a successive approximation converter, and compare the results with asynchronous designs using both a fully speed-independent, and a bundled data approach. It is shown that an asynchronous converter is more reliable than its synchronous counterpart, and that the bundled data design is also faster, on average, than the synchronous design. We also demonstrate tradeoffs involved in asynchronous converter designs, such as speed, robustness to delay variations, circuit size and design scalability. Keywords: analogue to digital conversion, arbitration, asynchronous circuits, metastability, signal transition graphs, synchronisers. 1 Introduction N-bit analogue to digital (A-D) converters are usually specified to have a fixed conv...
View
Introduction to VLSI systems
Article
Full-text available
Jan 1980

Carver Mead

Lynn Conway
The book is written to introduce all Electrical Engineering and Computer Science students to integrated system architecture and design. Combined with individual study in related research areas and participation in large system design projects, this text provides the basis for a graduate course-sequence in integrated systems. MOS devices and circuits are considered along with integrated system fabrication, data and control flow in systematic structures, the implementation of integrated system designs, the overview of an LSI computer system, the design of the OM2 data path chip, architecture and design of system controllers, the design of the OM2 controller chip, system timing, highly concurrent systems, and the physics of computational systems. Attention is given to alternative control structures, the stored-program machine, microprogrammed control, algorithms for VLSI processor arrays, and hierarchically organized machines.
View
Designing Asynchronous Standby Circuits for a Low-Power Pager.
Conference Paper
Jan 1997
P IEEE
ASYNC International Symposium on Advanced Research in Asynchronous Circuits and Systems

Paul Marston

Joep L. W. Kessels
We have designed asynchronous standby circuits for a pager decoder which dissipate four times less power and are 40% larger in size than synchronous designs. For the total pager unit this means a 37% reduction in power dissipation for nearly no additional area. The decoded chip, which apart from the standby circuits is completely synchronous, has been fabricated and was first-time-right. Two problems had to be solved to incorporate asynchronous subcircuits in a synchronous environment: synchronization and testing. A synchronization scheme is described that allows a free intermixing of asynchronous and synchronous modules and a test strategy is proposed in which the scan test facilities in the synchronous environment are used to test the asynchronous modules. One function is prevalent in the standby circuits, namely counting. In an appendix we present the asynchronous design of a so-called loadable counter whose power consumption does not depend on its size
View
General Theory of Metastable Operation.
Article
Feb 1981
IEEE T COMPUT

Leonard R. Marino
Metastable operation is a fundamental phenomenon of sequential networks that process asynchronous inputs. Nevertheless, because of its subtle nature and the relatively low probability of its occurrence in conventional systems, this phenomenon is neither well understood nor widely appreciated. With continuing advances in digital technology however, there is a growing interest in large-scale highly parallel systems. Such systems are likely to involve numerous high-frequency asynchronous interactions, which may result in frequent failures due to metastable operation unless the designers take specific measures to prevent such failures. In recent years, a number of researchers have been working with some success to develop techniques for dealing with this failure mode. The purpose of this paper is to present a comprehensive theory of metastable operation that may lead to a better understanding of this phenomenon and provide theoretical support for further work in this area.
View
An asynchronous low-power 80C51 microcontroller /
Article
Jan 1998

Johan Sebastiaan Henri van. Gageldonk
Proefschrift (doctoral)--Technische Universiteit Eindhoven, 1998. Includes bibliographical references (p. 163-167) and index.
View
Time Loss Through Gating of Asynchronous Logic Signal Pulses
Article
Mar 1966

Ivor Catt
The gating of asynchronous signals causes logical errors. It is possible to reduce the frequency of these errors, but the price paid is a severe loss of time and extra cost in hardware.
View
Equivalence of the Arbiter, the Synchronizer, the Latch, and the Inertial Delay
Article
Aug 1983
IEEE T COMPUT

José C. Barros

Brian W. Johnson
An axiomatic method for proving correctness properties about digital circuit implementations under the influence of asynchronous inputs is presented. This method, termed hardware correctness, is used to prove properties about a target digital circuit that is implemented in terms of constituent digital circuits. The proof consists of deducing theorems about properties of the target circuit from known properties of the constituent circuits. Three types of properties are considered, and they are expressed as axioms in first order predicate calculus. The axioms describe ideal behavior of the four most commonly studied asynchronous circuits, the inertial delay, the synchronizer, the time-bounded arbiter, and the latch. These axioms are derived from the less precise behavioral descriptions used by other investigators.
View
The Anomalous Behavior of Flip-Flops in Synchronizer Circuits
Article
Apr 1979
IEEE T COMPUT

Werner Fleischhammer

O. Dortok
Quantitative results of the observations of oscillatory and metastable behavior of common flip-flops in response to logically undefined input conditions, such as those that occur in synchronizers and arbiters, are presented. The results are obtained with the help of a circuit developed for this purpose which measures the failure rate for a certain flip-flop and frequency. It is found that the obtained results are in good correlation with observed failure rates of a synchronizer with both short and long flip-flop resolution times allowed.
View
Comments on A Note on Synchronizer or Interlock Maloperation
Article
Nov 1979
IEEE T COMPUT

Thomas J. Chaney
E. G. Wormald's note<sup>1</sup>proposes a way to prevent metastable action in synchronizers. Experimental results from testing his suggested circuits show that his solution does not work. A reference to a general proof that synchronizers must have a region of metastable action is given.
View
Anomalous Behavior of Synchronizer and Arbiter Circuits
Article
May 1973
IEEE T COMPUT

Thomas J. Chaney

Charles E. Molnar
Observations are shown of oscillatory and metastable behavior of flip-flops in response to logically undefined input conditions such as those that occur in synchronizers and arbiters. Significant systems failures have resulted from this fundamentally inescapable problem that is generally not appreciated by system designers and users.
View

Metastability in Latches, Arbiters and Data-convertors

Recommended publications

Dynamic properties of a multiway arbiter

Communications protocols and mechanisms for distributed digital systems

Anthony C. Davies

Measuring deep metastability

Metastability in Latches, Arbiters and Data-convertors

Discover the world's research