Abstract—This paper presents work on ultra-low-power circuits for brain–machine interfaces with applications for paralysis prosthetics, stroke, Parkinson’s disease, epilepsy, prosthetics for the blind, and experimental neuroscience systems. The circuits include a micropower neural amplifier with adaptive power biasing for use in multi-electrode arrays; an analog linear decoding and learning architecture for data compression; low-power radio-frequency (RF) impedance-modulation circuits for data telemetry that minimize power consumption of implanted systems in the body; a wireless link for efficient power transfer; mixed-signal system integration for efficiency, robustness, and programmability; and circuits for wireless stimulation of neurons with power-conserving sleep modes and awake modes. Experimental results from chips that have stimulated and recorded from neurons in the zebra finch brain and results from RF power-link, RF data-link, electrode-recording and electrode-stimulating systems are presented. Simulations of analog learning circuits that have successfully decoded prerecorded neural signals from a monkey brain are also presented.

Index Terms—Brain–machine interfaces, low-power, prosthetics, wireless neuroscience.

I. INTRODUCTION

ARGE-SCALE chronic multi-electrode neural recording and stimulating systems have emerged as an important experimental paradigm for investigating brain function. Experiments using such brain–machine interfaces (BMIs) have shown that it is possible to predict intended limb movements by analyzing simultaneous recordings from many neurons (see [1] for a report of the first human trials of such devices, and see [2], [3] for recent reviews of this field). These findings have suggested a potential approach for treating paralysis and other disorders and disabilities in humans. Other BMIs, such as deep brain stimulators for Parkinson’s disease and visual prostheses for the blind, function primarily via neural-stimulation circuitry rather than via neural-recording circuitry. BMIs for epilepsy will need both recording and stimulating circuitry, and progress toward systems with such dual capabilities has recently been demonstrated in the context of general-purpose multi-electrode arrays for experimental neuroscience [4], [5]. Chronic use of BMIs with large numbers of electrodes requires ultra-low-power operation so that the systems are miniature and implantable, heat dissipated in the brain is minimized, and frequent battery replacement and repeat surgeries associated with implanted systems are unnecessary. In this paper we describe low-power circuits that can be applied to many BMIs, focusing first on those for recording applications and then on those for stimulating applications.

Low-power neural amplifiers are extremely important in recording BMIs since one such amplifier is needed for each electrode. In this work, we first describe and present data from a micropower neural amplifier that is the most power-efficient and lowest-power differential neural amplifier reported to date, achieving an energy efficiency near the limits set by theory [6]. Although single-ended amplifiers have been shown to be capable of even better efficiency, they are significantly less effective at rejecting power-supply and common-mode noise and are thus considerably less practically suited for neural recording [7]. We then describe a novel scheme for adapting the noise floor of a neural amplifier to the noise-floor requirements at each recording site, which potentially enables multi-electrode arrays to reduce recording power by an order of magnitude.

RF data telemetry is necessary to communicate information wirelessly to and from neurons in the brain through the skull and skin. Due to the relatively high power costs of transcutaneous data communication (for example, transmitting 12-bit neural signals sampled at 20 kHz from 100 electrodes yields a data rate of 24 Mbits/s and power consumption on the order of 10 mW), some form of data compression is needed to reduce the bandwidth of information transmitted from the brain. Adaptive, learnable, multi-input-multi-output neural decoding techniques that project firing-rate neuronal data onto several motor output parameters have successfully been used to decode movement intentions from neural signals (a variety of approaches are reviewed in [2], [3]). Simple linear decoding filters have proven useful for interpreting population codes of neurons in various brain regions, and these decoders perform comparably to adaptive Kalman filters and other probabilistic decoding techniques; the Appendix to [8] demonstrates that probabilistic decoders...
with linear priors are equivalent to linear-filter-based decoders. In this work, we discuss how we may use a novel analog decoding and learning architecture to compute such a filter in a power-efficient analog fashion, thereby allowing high data compression (an output rate of $3 \times 100$ Hz $\times 8$ bits $= 24$ kbs$^{-1}$ is more than sufficient for achieving neural control over three motor degrees of freedom). We present circuit simulations of the architecture that successfully learn and decode a monkey’s intention to move from its prerecorded neural spiking signals.

Then we describe and present experimental results from a novel low-power impedance-modulation technique for BMIs that can be used to transmit RF data from the brain flexibly and efficiently so that the power costs of communication are almost solely borne by external RF circuitry outside the skin and skull, rather than by the implanted RF circuitry within the cranial cavity as in prior designs [9]–[11]. Thus, heat and power dissipation in implanted hardware within the brain can be minimized. The link achieves 0.65 nJ bit$^{-1}$ communication efficiency for forward and reverse telemetry at 2 cm distances, among the most efficient reported [12]. The forward uplink bandwidth from the brain enables 5.8 Mbs$^{-1}$ data rates and the reverse downlink bandwidth to the brain enables 300 kbs$^{-1}$ data rates. The external unit consumes at most 2.5 mW of power, while the internal unit consumes only 140 $\mu$W of power in the worst case.

Efficient wireless links that transmit RF power through the skin are necessary to power implanted chips in the interface directly via rectification and possibly also to recharge implanted batteries capable of a finite number of recharges. In this work we present data from an RF power link that achieves efficiencies near that set by theory for links in the 1–10 mm range (74% and 54% efficiencies) and summarize the tradeoffs needed to optimize such links [13].

In addition to being small and extremely power efficient, practical brain–machine interfaces also need to be programmable and capable of reporting high-bandwidth analog or lower-bandwidth spiking information from a few neurons if needed, sorting spikes from multiple neurons, and operating robustly in RF and mixed-signal environments. In this work we propose a novel low-power mixed-signal architecture for such interfaces that combines the power efficiency of an implanted programmable analog system with the flexibility of an external digital processor such that efficiency and flexibility are simultaneously achieved by combining the best of the analog and digital worlds.

Certain BMIs, such as visual prostheses for the blind, systems for treatment of epilepsy or Parkinson’s disease, or experimental systems for neurophysiology, may require neural stimulation rather than (or in addition to) neural recording. We present experimental results from a chip used for wireless stimulation in a zebra finch brain and discuss how simple wake-up circuitry can be used to reduce power consumption in such systems.

This paper is organized as follows: In Section II we discuss the adaptive micropower neural amplifier and its use in multi-electrode systems. In Section III we discuss the analog linear decoding and learning architecture. In Sections IV and V we discuss RF data and power links, respectively. In Section VI we discuss a mixed-signal architecture for BMIs that can enable efficiency and flexibility. In Section VII we discuss wireless neural-stimulation circuits. In Section VIII we conclude by summarizing our contributions.

Brief descriptions of preliminary results of this work were presented at a conference from which some papers in this special-issue journal (including the present article) were selected [14]. Since that conference, details on the RF power link and the micropower neural amplifier have appeared in more specialized journal articles [13], [6]. Therefore, the description of these portions of the work will be brief in this broader paper. However, the brief descriptions will enable this paper to be self-contained and will provide context for the other work in the paper.

II. MICROPWR NEURAL AMPLIFIER AND ADAPTIVE POWER BIASING SCHEME

A. Micropower Neural Amplifier

Fig. 1(a) shows the architecture of our adaptive micropower amplifier. The first gain stage is similar to that reported in [15] except that it is implemented with the use of an all-subthreshold and folded-cascode architecture shown in Fig. 1(b), allowing 2.8-V operation rather than 5-V operation. In [15], a 5-V power supply is necessary for maintaining large overdrive voltages in some above-threshold transistors to minimize their noise contributions. We add a bandwidth-limiting stage to keep the overall bandwidth constant as we vary the bias current of the gain stage to adapt its noise per unit bandwidth. The additional power of the bandwidth-limiting stage is negligible because the 100-fold gain provided by the gain stage alleviates its noise floor requirements. Fig. 1(c) shows an input-referred signal recorded in vivo from the RA region of the brain of an anesthetized zebra finch bird using a Carbostar 800 k$\Omega$ impedance electrode and our amplifier. The trace exhibits no discernible difference when compared with a recording made using a commercial neural amplifier.

We measured an input-referred noise of 3.06 $\mu$V rms over a $-3 \mathrm{~dB}$ bandwidth of 45.0 Hz–5.32 kHz with a power consumption of 7.56 $\mu$W for 40.8 dB of mid-band gain. The noise contributions of our amplifier are minimized to be almost those of only its two input transistors, due to the use of cascoded resistive loading rather than current-source loads. Thus, the measured Noise Efficiency Factor (NEF) is 2.67, very near the theoretical limit of 2.02 for a differential amplifier, representing the most power efficient and lowest-power differential design to date. Further details of the amplifier are described in [6] including use of this amplifier for Local Field Potential (LFP) recording.

B. Adaptive Power Biasing of Neural Amplifiers in Multi-Electrode Arrays

The power required to build an amplifier with constant bandwidth, constant power-supply voltage, and an input-referred noise $v_n$ scales as $1/v_n^2$ if the amplifier’s minimum detectable signal is limited by thermal noise. This relation clearly shows the steep power cost of achieving low-noise performance in thermal-noise-limited amplifiers. Most neural amplifiers are carefully designed to be thermal-noise limited rather than...
Neural amplifiers have been designed to handle the worst-case range of signal strengths that may be expected in any recording situation. In practice, there is considerable variance in the noise and action potential strengths of typical recordings. The steep cost of achieving low-noise performance in an amplifier suggests that rather than designing amplifier arrays with the lowest noise at all locations, significant power savings can be achieved if each amplifier can adapt its input-referred noise to the local noise floor. This adaptability enables the overall power in a multi-electrode system to be determined by a typical electrode rather than by the worst-case electrode. The control loop for setting each amplifier’s bias current in a multi-electrode array may easily be implemented with little power overhead per recording site: One very-low-noise neural amplifier is used infrequently to evaluate the noise floor at each recording site in a sequential and multiplexed fashion, so that its power overhead is shared amongst all recording sites and it is only active during calibration. This scheme is shown in Fig. 2(a). Fig. 2(b) shows a typical probability distribution that we obtained from neural data recorded using a chronically-implanted 64-electrode array in a rhesus monkey. For this probability distribution, adaptive power biasing yields a 12-fold reduction in neural recording power for an entire system of 100 electrodes. Note that the technique of adaptive power biasing may be applied to any neural amplifier, not just our amplifier. For maximum efficacy, techniques must be used to keep the bandwidth of the amplifier invariant as its bias current is changed. In our amplifier, since the first stage determines the noise per unit bandwidth and the second stage determines the bandwidth, the total output noise is controlled simply by varying the bias current of the first stage while that of the second stage is kept constant. In other amplifiers, the output capacitance and bias current can be increased in proportion with one another to reduce the total output noise while keeping the bandwidth invariant, as shown in Fig. 2(a). In our amplifier, the value of resistances in Fig. 1(b) can be scaled with a scaling resistance $R_s$ such that $I_B R_s$ is constant as $I_B$ is changed. Consequently, the NEF is constant with bias current level.

Fig. 3(a) illustrates the feedback loop of a circuit used to implement one instantiation of adaptive power biasing in a neural amplifier described in [14]. This neural amplifier’s noise performance is slightly inferior to the one shown in Fig. 1(b). A ‘command current’ that is proportional to the desired noise amplitude is determined by a prior stored measurement with a low-noise neural amplifier and a wide-dynamic-range $V$-to-$I$ envelope detector described in [16]. This current is input to the adaptive
The key idea behind adaptive power biasing is not the exact control algorithm or circuit that is used to bias the amplifier but the fact that biasing can be determined by knowledge of the statistics of the array rather than by worst-case assumptions about its statistics.

III. ANALOG DECODING AND LEARNING CIRCUITS FOR DATA COMPRESSION

In some of our prior work on a bionic-ear (cochlear implant) processor, we experimentally demonstrated that analog pre-processing and delayed digitization enable order-of-magnitude power reductions over traditional A-to-D-then-DSP implementations [19], [20]. Such implementations can also preserve programmability, as well as robustness to offset, power-supply-noise, and temperature variations. In this work, we investigated whether it may be possible to achieve similar power reductions in implementing digital linear decoding and learning algorithms with analog architectures operating on analog neuronal firing rates.

Our novel analog decoding architecture uses a continuous-time, adaptive linear filtering algorithm to map neural signal inputs onto motor command outputs. The system is an $m$ to $n$ convolutional decoder that accepts $m$ preprocessed neural signals on a capacitor between calibrations. In other instantiations, current-DACs and digital bits could store the bias-current value as in Fig. 2(a).

Analysis of the digitized output of a neural amplifier followed by digitally controlled setting of the amplifier’s bias current can implement more sophisticated adaptive biasing: For example, such schemes could set the amplifier’s power at a low value if there are large action potentials on a particular electrode even though its noise floor is low. In such a case, needlessly low-noise amplifiers are not required and we can exploit this knowledge to save power. Thus, the key idea behind adaptive power biasing is not the exact control algorithm or circuit that is used to bias the amplifier but the fact that biasing can be determined by knowledge of the statistics of the array rather than by worst-case assumptions about its statistics. In arrays with large numbers of electrodes, the savings in power with adaptive biasing exceed an order of magnitude because outlier or low-probability points are prevented from scaling power quadratically according to worst-case needs in all amplifiers.

Authorized licensed use limited to: MIT Libraries. Downloaded on October 28, 2008 at 22:16 from IEEE Xplore. Restrictions apply.
exhibited accuracy comparable to that of the Bayesian (probabilistic) decoder described in [21] operating on the same signals (real neural recordings), but was far less computationally intensive.

Using simple continuous-time transconductor-capacitor (\(G_mC\)) filters along with multiplier, adder, and subtractor circuits in subthreshold technology as shown in Fig. 5(b), we were able to show via SPICE circuit simulations in a 0.18 \(\mu\)m process that our decoding architecture is capable of learning and then decoding a monkey’s intention to move its arm in particular directions on the basis of data recorded from ten neurons in its posterior parietal cortex. Spike-time data from the monkey cortex recorded during short intervals just preceding reach movements were converted into analog firing rates using fourth-order wavelet-like analog matched filters on 1-ms-wide spiking inputs, followed by thresholding, followed by third-order analog interpolation filters. The architecture is able first to learn how to decode the intended direction of motion of a monkey’s arm from input spike-time data and reduce the mean-squared error between the intended output (the executed reach) and the prediction made by the circuit. This is accomplished by gradient-descent-based tuning of parameters defining the convolution kernels of the adaptive decoding filter in real time. After the end of the learning period, the parameters of the decoding filter are fixed and the decoder is able to predict the intended direction of motion of the monkey’s arm on the basis of new neuronal control signals. This process is demonstrated in Fig. 5(c). The analog filters were trained during the interval labeled “Learning” using the neural data observed just before the monkey reached in direction D1. Once training was completed, all the learned filter parameters were stored. After training, therefore, the filters were least-squares optimized and configured for mapping the neural signal that the monkey used for moving in the D1 direction to an output voltage of 0.5 V.

We demonstrate a test of the accuracy of the learned mapping during the interval labeled “Decoding” in which the neural signals for moving in directions D1 and D2 were applied to the analog decoder. The plot shows that when the neural signal for moving in the D1 direction is applied the output voltage settles at 0.5 V, whereas it settles elsewhere when the neural signal for D2 is applied. This result indicates that the analog decoder has learned the appropriate mapping from neural signals to the intended reach direction.

The power consumption of a single channel is near 54 nW and a complete 100-channel system with 3 motor outputs is implementable with 17 \(\mu\)W using a 1-V power supply on a modest-sized chip. Thus, our analog architecture is extremely power efficient. It can enable a dramatic reduction in communication power due to its data-compression properties: 24 Mbs
\(^{-1}\) of uncompressed data in (100 channels sampled at 20 kbs
\(^{-1}\) with 12-bit resolution) to 2.4 kbs
\(^{-1}\) of compressed data out (3 motor parameters at 100 bs
\(^{-1}\) with 8-bit resolution). An actual chip implementation would need overhead for offset and mismatch compensation via DAC calibration, and temperature- and power-supply-immune biasing, as our prior work has shown [19], [20]. Much of the power savings results from the fact that the computational tasks can be mapped very efficiently to a slow-and-parallel analog architecture. The use of analog circuits

\[N(t)\] as inputs and produces \(m\) motor control signals (reach directions), \(M(t)\) as outputs. Fig. 5(a) shows the architecture for a single motor output signal. The \(m \times n\) array of adaptive filters \(W_{ij}(t)\) forms a set of convolution kernels between the inputs and outputs that is analogous to the matrices of ‘synaptic weights’ used in artificial neural networks (\(W_{ij}(t)\) is convolved with the \(j\)th neural input to form the \(j\)th component of the sum that yields the \(i\)th motor output). The parameters \(p_{ij}\) of these filters are set during a supervised learning phase by using a continuous-time analog gradient descent algorithm. Further mathematical detail lies beyond the scope of this paper, but we note here that the algorithm results in a modified version of the ‘delta’ learning rule well known in machine learning. Our algorithm

![Block diagram illustrating the learning process used in real time to optimize the parameters of the adaptive filters that decode neural signals.](image)

![Example performance of the analog learning architecture in one learning and two decoding trials.](image)

![Performance of the continuous-time analog learning architecture](image)
to perform compression saves power in data telemetry circuits due to the reduced bandwidth needs for communication. It also saves power in analog-to-digital conversion circuits that can be operated at significantly reduced bandwidths.

Our architecture represents the ultimate form of compression in the sense that the decoding computation is performed on site and only the results are transmitted. It is only viable because analog implementations of decoding computations needed in motor prosthetics can be done so efficiently. Other forms of data compression that have been proposed before, such as adaptive thresholding followed by transmission only of spike-time information [9], [22], will need to be added to an architecture like ours to preserve general-purpose flexibility in a brain–machine interface. Our architecture reveals that, for the special-purpose needs of a paralysis prosthesis, for example, extremely power-efficient analog implementations are possible. As we discuss in Section VI, the combination of general-purpose flexible architectures that are relatively inefficient with special-purpose architectures that are efficient can be used to create a system that is both flexible and efficient.

IV. RF DATA LINK

Fig. 6 shows a low-power half-duplex RF data-telemetry link implemented with inductively coupled external (primary) and implanted (secondary) resonators for bidirectional communication through the body. Communication from inside the body to outside the body is termed the uplink and communication from outside the body to inside the body is termed the downlink. We designed our RF link for the case of a recording BMI in which the uplink reports information from the brain and can have a bandwidth of several Mbs\(^{-1}\), while the downlink sends information to the brain and can have a bandwidth of several hundred kbs\(^{-1}\) in the worst case. The downlink in a recording BMI typically transmits relatively low-bandwidth control, programming, and supervisory learning information.

We shall first discuss the design of the high-bandwidth uplink. The primary external unit broadcasts a 25 MHz continuous-wave carrier using an LC oscillator with positive feedback. For the high-bandwidth uplink, the secondary implanted unit modulates its impedance by open-circuiting or short-circuiting a parallel resonator, thus modulating the reflected impedance seen at the primary. The impedance seen in the primary is modulated by a factor that we call the modulation depth \(m\). This quantity is given by [13], [23]

\[
m \approx k^2 Q_1 Q_2
\]  

(1)

where \(k\) is the coupling factor between the coils, set by their geometry and separation; \(Q_1\) and \(Q_2\) are the quality factors of the coils; and the approximation is valid if \(m \ll 1\), which is usually the case. The strong dependence of \(m\) on \(k\) makes this configuration unsuitable for long-range links since \(k\) varies with the cube of the distance between the coils. For short-range links (such as those used in neural prosthetic devices, which rarely require coil separations greater than 2 cm), however, impedance modulation has the great advantage of requiring almost no power dissipation on the secondary (internal) side, since shorting or opening a switch costs far less energy than operating a transmitter. The load modulation results in amplitude modulation of the oscillator voltage in the primary. This amplitude modulation in the primary is detected by an envelope detector built with a rectifier and lowpass filter as shown in Fig. 6. The output of the lowpass filter is thresholded by a hysteretic comparator and fed to a phase-locked loop for clock and data recovery.

For the lower-bandwidth downlink, on-off keying of the primary oscillator results in pulse-width amplitude modulation in the secondary, which is then detected by rectification, thresholding, and pulse-width demodulation circuits.
Fig. 7. Primary (external, left) and secondary (implanted, right) RF data telemetry test circuit boards.

TABLE I

<table>
<thead>
<tr>
<th>Performance Parameter</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Link Distance</td>
<td>tested up to ≤ 5 cm</td>
</tr>
<tr>
<td>Center Frequency</td>
<td>25 MHz</td>
</tr>
<tr>
<td>Uplink Data Rate</td>
<td>1 Mbs⁻¹–5.8 Mbs⁻¹ (at 2 cm)</td>
</tr>
<tr>
<td>Uplink Encoding</td>
<td>Non-Return (NR)</td>
</tr>
<tr>
<td>Downlink Data Rate</td>
<td>15 kbs⁻¹–300 kbs⁻¹ (at 2 cm)</td>
</tr>
<tr>
<td>Downlink Encoding</td>
<td>Pulse-Width Modulation (PWM)</td>
</tr>
<tr>
<td>Power supply voltages</td>
<td>V_{DDH} = 2.8 V / V_{DDL} = 1.4 V</td>
</tr>
<tr>
<td>External Power Consumption</td>
<td>2.5 mW (Uplink) / 1.5 mW (Downlink)</td>
</tr>
<tr>
<td>Internal Power Consumption</td>
<td>100 µW (Uplink) / 140 µW (Downlink)</td>
</tr>
<tr>
<td>Fabrication Process</td>
<td>AMI 0.5 µm CMOS</td>
</tr>
<tr>
<td>Chip Size</td>
<td>1.5 mm × 1.5 mm (Each Transceiver)</td>
</tr>
</tbody>
</table>

Fig. 8 shows primary and secondary test boards that were used for making link measurements. Identical transmission and receiving coils were printed on the boards. Each coil was square, 3.5 cm on a side, and had two turns. The designed inductance was 500 nH with a simulated quality factor of 30 at 25 MHz. Packaged chips were surface mounted on the boards and aligned parallel to each other at various separations for testing. Uplink and downlink data are shown in Fig. 6 for a link separation of 2 cm. The bit error rate was < 10⁻⁵ for the uplink at data rates < 4 Mbs⁻¹, and < 10⁻⁶ for the downlink at all tested data rates [23]. We also tested the link in the presence of a 2 cm thick layer of 0.9% saline solution between the coils. As in prior reports such as [10], the goal was to verify that the wireless link would operate normally in the presence of body tissue. No significant differences in performance were noted.

Table I summarizes the performance of the overall telemetry link including bandwidth and power consumption obtained from measurements on the link. We note that the implanted power is minimal for both the uplink and downlink (100 µW and 140 µW, respectively), while the external unit consumes 2.5 mW and 1.5 mW in the uplink and downlink modes, respectively. The uplink data rates can be as high as 5.8 Mbs⁻¹ and the downlink data rates can be as high as 300 kbs⁻¹, yielding energy efficiencies of 0.65 nJ bit⁻¹. The good energy efficiency of the link results from the use of simple and energy-efficient transmitter and receiver modulation and demodulation strategies. Since the link was not found to be thermal-noise limited, further improvements in energy efficiency with future designs are possible.

V. RF POWER LINK

An RF power link test setup is shown in Fig. 8(a). The system uses a custom Class-E 4.5 MHz driver built on a chip, a primary resonator circuit, a secondary resonator circuit, and a Schottky diode rectifier as shown in Fig. 8(b). The rectifier loads the secondary resonator with an effective AC resistance of R_L/2 if there is little ripple on the load R_L, a necessary condition for a good power supply. Ignoring rectifier and driver losses, a theoretical analysis shows that the maximum possible power efficiency in the link occurs when the load R_L is chosen so that

\[
Q_L = \frac{\omega R_L C_2}{2} = \frac{1}{k} \sqrt{\frac{Q_1}{Q_2}}
\]
and is given by
\[ \eta_{\text{match}} = \frac{k^2 Q_1 Q_2}{(k^2 Q_1 + 1)(k^2 Q_2 + 1)} \]

where \( k \) is a geometry-dependent coupling factor and \( Q_1 \) and \( Q_2 \) are the quality factors of the primary and secondary resonators, respectively [13]. Fig. 8(c) shows that theoretical and experimental measurements are in good accord (the results shown here were obtained for \( Q_1 \approx 40 \) and \( Q_2 \approx 70 \)). RF power links can thus be quite power efficient when operated near their optima.

In our work and in the present discussion different RF coils and RF frequencies have been used for experimental convenience, to characterize the power and data links and to explore the bounds on energy efficiency in power and data transmission systems separately. However, it must be realized that in a practical system, space constraints may require that the same coils be used for data and power telemetry, although the operating frequencies may differ for these two modes of operation. Several systems of this kind designed to transmit power and data over the same link are described in [24]–[27]. Power transfer is more efficient with high-\( Q \) coils and lower carrier frequencies where switching losses are minimized. However, high-bandwidth data transfer is easier with low-\( Q \) coils and higher carrier frequencies. Thus, operation at different carrier frequencies for power and data links is often advantageous. Hence, other work has used separate coupled resonators for the power link and data link [12], [28]. High-\( Q \) systems need periodic auto-calibration and auto-tuning feedback loops to maintain resonance.

In the low-power BMIs that we envisage in the future, the use of ultra-low-power electronics on the secondary side will in some systems enable an implanted battery with a finite number of wireless recharges, typically 1000 or fewer, to function for at least a decade: A secondary with 1-mW power consumption, for example, could function on an implanted 100-mAh battery for 30 years with 1000 wireless recharges, performing neural signal conditioning and analog decoding. In such systems data and power transmission requirements need not conflict even if only one coupled resonator is used for both the data link and the power link as power transfer only occurs during infrequent recharging. Large interfering RF voltages necessary for transmitting power to the secondary then do not corrupt sensitive low-power secondary circuits either. Implanted batteries do, however, require careful hermetic sealing, short-circuit protection, and battery-management circuits.

VI. HYBRID ANALOG–DIGITAL SYSTEM DESIGN

Fig. 9 shows an overall architecture that consists of low-power implanted DAC-programmable analog circuits that are configured by a low-power external DSP or FPGA. Depending on the user’s choice, the system can be configured to report raw neural data from a selected set of electrodes, single and multiunit spikes extracted via spike sorting, local field potential (LFP) data, or decoded motor parameters via a data telemetry uplink as we have previously described. Such flexibility is possible because the parameter values for the analog wavelet-like matched filters and other analog parameters are determined by detailed digital analysis of raw uplinked analog neural data, then downloaded into the implant via the low-bandwidth telemetry downlink. Since the relatively power-hungry digital analysis need only be done occasionally to keep the system up-to-date as signal properties evolve (due to effects such as relative movement between neurons and electrodes), the overall power consumption of the system remains low but the flexibility of a digital system can be leveraged. Low-power implanted analog architectures for decoding that are highly efficient could have their parameters determined by flexible digital processors in an external unit and then downloaded into the implanted unit. Thus, the combination of external digital programmability combined with internal analog efficiency can enable an overall system to incorporate the best of the analog and digital worlds.

VII. WIRELESS NEURAL STIMULATION

Thus far we have focused on circuits and systems capable only of recording from neurons in the brain. Fig. 10(a) shows a system that we have used for wirelessly stimulating neurons in the zebra finch brain to study the neural basis of birdsong as shown in Fig. 10(c). For example, stimulation in the zebra finch brain while the bird is singing can yield insight into the mechanisms of song generation [29]. The system consists of an external transmitter (not shown) controllable through a computer interface, and a miniature, implantable wireless receiver-and-stimulator. The miniature printed circuit board contains a receiver coil and electrodes (reverse side), battery (not shown), and a custom integrated circuit for data demodulation and neural stimulation. The chip, fabricated in a standard 0.5 \( \mu \)m CMOS process, occupies 2.25 mm\(^2\) and is capable of delivering biphasic current pulses to 4 addressable electrode sites at 32 selectable current levels ranging from 10 \( \mu \)A to 1 mA, as shown in Fig. 10(b). The entire implant weighs 0.6 g (including the battery) and occupies a footprint smaller than 1.5 cm\(^2\). Fig. 10(c) shows simultaneous recording from one area of the bird brain, HVC, while stimulation was applied to another area, Area X. The observed action potentials in HVC (after the stimulus artifacts due to stimulation of Area X have subsided) prove that the wireless neural stimulation system is working. Fig. 11(a) illustrates the Colpitts oscillator transmitter.
A wake-up system on the chip permits us to extend the life of the battery over a full 30 days and thus facilitates extended neuroscience experiments. During periods of birdsong inactivity, the receiver chip enters a sleep mode in which it consumes only 7 μA of quiescent current. When an external microphone detects song, a computer activates the wireless transmitter, which generates an RF signal. A detector on the chip receives the RF signal and ‘wakes up’ the data demodulation and output-driver circuitry. A rectifier on the chip converts the amplitude-modulated RF signal to a full-scale digital signal, whose edge is detected via XOR circuits and used to wake up other circuits. The rectifier’s time constant is determined by the system data rate, typically 25 kbps, such that the system is capable of waking up in just a single bit period (about 40 μs). Fig. 12 shows experimental waveforms illustrating the wake-up operation of the system. After the system is assumed to have awakened, a complete stimulation command is issued. The entire time from when the bird first begins to vocalize until the device can deliver stimulation is only about 1 ms. This delay is insignificant compared to the duration of a typical songbird vocalization [30]. When the bird is not singing, the RF signal is turned off and an internal timer puts the chip to sleep in about 1 second. When the chip is awake, static power consumption in the core is about 16 μA. However, the power consumption in the output stage can be significantly greater, reaching as much as 100 μA for the reference current when full-scale stimulation currents of 1 mA are required. The use of the wake-up system thus significantly improves battery lifetime. Such feed-forward signal-triggered stimulation could be used to save power in other brain–machine interfaces; the stimulation circuits of devices used to treat epilepsy, for example, could be activated only when triggered by seizure-detection signals. Of course, such power savings are only practical if detection is much cheaper than stimulation, as is often the case. Algorithmic strategies [31], [32], switched-capacitor strategies [33], and better electrode design can also lower stimulation power. Techniques to create highly accurate charge-balancing circuits that obviate the need for large dc blocking capacitors and that consequently lower implanted-system size have also been described [34].

Fig. 10. (a) Photograph of the chip-on-board wireless neural stimulation system. (b) Measured biphasic current pulses demonstrating 32 programmable stimulation levels. (c) HVC response to neural stimulation in Area X of the zebra finch brain.

Fig. 11. (a) Colpitts oscillator circuit used in our wireless neural stimulation system. (b) Receiver circuit used in our wireless neural stimulation system.

Fig. 12. Experimental waveforms illustrating the wake-up operation of the neural stimulation system.
VIII. Conclusion

We have presented several circuits and architectures for low-power recording, processing, stimulation, and wireless transmission of neural signals in brain–machine interfaces. These include a state-of-the-art micropower differential neural amplifier; adaptive power biasing of amplifier arrays in multi-electrode systems; analog architectures for neural signal decoding, learning and data compression; 0.65 nJ bit⁻¹ impedance-modulation-based bidirectional wireless links that minimize implanted-unit power consumption; RF links that achieve theoretically optimal power-transfer efficiencies; hybrid analog-digital architectures that combine flexibility and efficiency; and wireless neural stimulation circuits that exploit sleep modes to save power while allowing quick wake-up. Together, such circuits and systems could enable highly power-efficient brain–machine interfaces to be developed, thus bringing them a step closer to universal accessibility in experimental neuroscience and widespread clinical use.

REFERENCES

Rahul Sarpeshkar (M’97) received the B.S. degrees in electrical engineering and physics from the Massachusetts Institute of Technology (MIT), Cambridge, in 1990 and the Ph.D. degree from the California Institute of Technology, Pasadena, in 1997.

He was with Bell Labs as a Member of the Technical Staff in 1997. Since 1999, he has been on the faculty of the Electrical Engineering and Computer Science Department, MIT, where he heads a research group on Analog VLSI and Biological Systems and is currently an Associate Professor. He holds over 20 patents and has authored more than 70 publications, including one that was featured on the cover of *Nature*. His research interests include analog and mixed-signal VLSI, biomedical systems, ultra-low-power circuits and systems, biologically inspired circuits and systems, molecular biology, neuroscience, and control theory.

Dr. Sarpeshkar has received the Packard Fellow Award given to outstanding young faculty, the Office of Naval Research Young Investigator Award, the National Science Foundation Career Award, and the Indus Technovator Award. He has also received the Junior Bose Award and the Ruth and Joel Spira Award, both for excellence in teaching at MIT. He is currently an Associate Editor of *IEEE Transactions on Biomedical Circuits and Systems*.

Woradorn Wattanapanitch (S’07) received the B.S. degree (*summa cum laude*) in electrical and computer engineering from Cornell University, Ithaca, NY, in 2005, and the M.S. degree in electrical engineering from the Massachusetts Institute of Technology (MIT), Cambridge, in 2007. He is currently working toward the Ph.D. degree at MIT in the Analog VLSI and Biological Systems Group at the MIT Research Laboratory of Electronics. His research interests include low-power analog and mixed-signal VLSI for biomedical applications, and control theory.

Mr. Wattanapanitch was awarded a Royal Thai Government Fellowship in 2000.

Scott K. Arfin (S’00) received the B.S. degree (*summa cum laude*) from Columbia University, New York, in 2004 and the M.S. degree from the Massachusetts Institute of Technology (MIT), Cambridge, in 2006, both in electrical engineering. He is currently pursuing the Ph.D. degree in the Analog VLSI and Biological Systems Group at the MIT Research Laboratory of Electronics, in the field of low-power electronics for biomedical applications.

Benjamin I. Rapoport (S’07) received the B.A. degree in physics and mathematics and the M.A. degree in physics from Harvard University, Cambridge, MA, in 2003, the M.Sc. degree in mathematics from Oxford University, Oxford, U.K., in 2004, and the M.S. degree in physics from the Massachusetts Institute of Technology (MIT), Cambridge, in 2007.

He is currently a student in the M.D.-Ph.D. Program at Harvard Medical School and is jointly pursuing the M.D. at Harvard Medical School and the Ph.D. in electrical engineering at MIT. His research and professional interests include bioimplantable electronic interfaces with the brain and nervous system, biological and computational neuroscience, and clinical neurosurgery.

Soumyajit Mandal (S’01) received the B.Tech. degree from the Indian Institute of Technology, Kharagpur, India, in 2002, and the M.S. degree in electrical engineering from the Massachusetts Institute of Technology (MIT), Cambridge, in 2004, where he is currently working toward the Ph.D. degree. His research interests include nonlinear dynamics, low-power analog and RF circuit design, antennas and biochemical networks.

Mr. Mandal was awarded the President of India Gold Medal in 2002.

Michael W. Baker (S’04) received the B.S., M.Eng., and Ph.D. degrees in electrical engineering and computer science in 2000, 2002, and 2007, respectively, all from the Massachusetts Institute of Technology (MIT), Cambridge.

His research interests include neural and bionic implants, low-power integrated analog design, and integrated radio-frequency circuits.

W. Michele S. Fee received the B.S. degree in engineering physics from the University of Michigan, Ann Arbor, and the Ph.D. in applied physics from Stanford University, Palo Alto, CA.

He joined Bell Laboratories as a Postdoctoral Fellow in the Biological Computation Research Department, and continued there as a Member of Technical Staff until 2003, when he joined the faculty at the Massachusetts Institute of Technology (MIT), Cambridge. He is presently an Investigator at the McGovern Institute for Brain Research as well as an Associate Professor in the Department of Brain and Cognitive Sciences, both at MIT. His laboratory focuses on the biophysical and neural circuit mechanisms underlying the learning and generation of sequences in the brain, with a particular interest in vocal sequences of the songbird. The Fee Laboratory also develops new technologies for measuring and manipulating neural activity in behaving animals.

Sam Musallam is an Assistant Professor in the Department of Electrical and Computer Engineering and an Associate Member of the Department of Physiology, both at McGill University, Montreal, QC, Canada.

He leads the Neural Prosthetics Laboratory at McGill University, which investigates the neuro-scientific aspects of developing optimal neural prosthetic devices, and also develops implantable devices for measurement of biological signals in the brain and body.

Richard A. Andersen is the James G. Boswell Professor of Neuroscience at the California Institute of Technology (Caltech), Pasadena.

Since 1994, he has also been the Director of the Sloan Center for Theoretical Neurobiology at Caltech. He studies the neurobiological underpinnings to such brain processes as sight, hearing, and action planning. His laboratory is also studying neural prosthetics, electronic interfaces with the brain that may eventually enable disabled humans to achieve thought-based control of external devices.

Professor Andersen is a Fellow of the American Association for the Advancement of Science and a Member of the American Academy of Arts and Sciences.