Demystifying Beamformers
Lately there has been a resurgence of interest in beamformers following the success of the Amazon Echo. This talk presents beamformers in a simple-to-understand and straightforward light. You'll understand how the number of microphones and geometry impacts performance. The importance of microphone matching and SNR, and even how to create steerable arrays. The talk has a minimum of math and an emphasis on practical applications. Although the focus is on audio, the techniques presented extend to other applications such as RF.
What this presentation is about and why it matters
This talk, "Demystifying Beamformers," breaks down how microphone arrays become spatial filters that emphasize sound from some directions and reject it from others. Paul Beckmann focuses on practical design choices that determine how well a beamformer works in real products: microphone type and datasheet numbers, spacing and geometry, gain and time matching, frequency-dependent behavior, and how to tune for real devices. The examples and measurements (including Amazon Echo and Echo Show) connect signal‑processing theory to real-world tradeoffs engineers face when building far‑field voice systems, sound bars, TVs, and small form‑factor devices.
Who will benefit the most from this presentation
- DSP engineers and firmware developers implementing voice capture and beamforming algorithms.
- System and hardware engineers choosing microphones and laying out arrays on enclosures.
- Product engineers evaluating microphone datasheets, sensitivity, noise floor, and AOP in system context.
- Students and practitioners who want intuition (not heavy math) about how geometry, sampling, and noise limit beamformer performance.
What you need to know
The talk is accessible, but you'll get more if you come prepared with a few fundamentals:
- Basic digital audio: sampling rate, Nyquist limit, and the difference between acoustic levels (dB SPL) and digital levels (dBFS). A useful mapping to keep in mind is that 0 dBFS is typically aligned to the microphone acoustic overload point (AOP), so you can think of digital level roughly as $\text{dBFS} = \text{SPL} - \text{AOP}$ when AOP is used as the 0 dBFS reference.
- FIR filters and convolution: delay‑and‑subtract or delay‑and‑sum beamformers are fundamentally small FIR filters across microphones. Understanding impulse response and frequency response of short FIRs helps you interpret the two‑mic examples in the talk.
- Array basics: how inter‑microphone spacing maps to directional sensitivity (wider spacing improves low‑frequency reach but causes spatial aliasing at high frequency), and what end‑fire vs broadside orientations mean.
- SNR and noise sources: difference between ambient/diffuse noise and microphone self‑noise; how SNR and microphone noise floors limit achievable beamformer gain.
- Optimization concepts: familiarity with the idea of frequency‑by‑frequency complex weights (steering vectors) and constrained optimizations like MVDR or MaxSNR helps when the speaker describes N‑mic design and tradeoffs.
- Practical system issues: clock sync, sample drops, gain matching, and acoustic porting matter as much as the algorithm—Paul includes a concrete checklist for these.
Glossary
- Beamformer — A spatial filter that forms directional sensitivity by combining multiple microphone signals with delays and weights.
- Delay‑and‑sum / Delay‑and‑subtract — Simple beamforming topologies implemented as time delays and summation (or subtraction) of microphone channels; basis of many two‑mic designs.
- Cardioid / Supercardioid / Hypercardioid — Common polar patterns produced by two‑mic combinations; each has different front‑to‑back rejection and directivity index.
- Steering vector — The set of transfer functions (or delays/phase shifts) from a target direction to each microphone; used to compute frequency‑dependent weights.
- SNR (Signal‑to‑Noise Ratio) — Ratio of signal power to noise power; beamforming aims to improve SNR for the target direction given diffuse and self‑noise.
- AOP (Acoustic Overload Point) — The SPL at which a microphone output clips; often aligned to 0 dBFS in digital systems.
- dBFS / dB SPL — dBFS is decibels relative to digital full scale; dB SPL is an acoustic sound‑pressure level. Correct mapping between them is needed to reason about real signals.
- Spatial aliasing — When microphone spacing is large relative to wavelength, multiple angular directions produce the same phase delays at high frequency, creating unwanted grating lobes.
- Diffuse‑field noise — Acoustic noise arriving uniformly from all directions (e.g., babble); used as a benchmark for directivity metrics.
- MaxSNR / MVDR — Optimization approaches that compute frequency‑dependent complex weights to maximize SNR (MaxSNR) or minimize variance for a constraint (MVDR); used for N‑mic design.
Final notes — why this talk is worth your time
Paul combines practical product examples, measurements, and simple intuition with minimal math. The talk is especially valuable because it bridges datasheet numbers to system behavior (sensitivity, noise floor, AOP), presents hands‑on tuning and a real‑time demo, and gives concrete design rules (recommended spacings, gain‑matching tolerances, porting guidance). If you build voice‑enabled products or are responsible for microphone selection, layout, or beamformer tuning, you will come away with actionable rules and a clear checklist to avoid common pitfalls.
Thanks for the presentation. completely new topic for me but I got the basic ideas, thanks for that. Do you have challenges when you add/sub the 2 micro phone signals because of the mismatches in the microphones, and do you do something special to mitigate them ?
You need matched microphones in order for a beamformer to work well. We recommend that they match +/- 1 dB. See slides 63 to 65 of my presentation for details and actual performance numbers.
Really enjoyed the presentation and learned a lot.
Thank you!
What an enjoyable presentation. I think you did a great job of showing a simple example of the microphones separated in space, and one signal delayed by one sample. It gave me a solid intuition as to how beamforming works. It set the tone for the rest of the presentation. Thank you so much.
very informative! certainly demystifyed beamformers for me. Thank you, paul!
Very, very interesting presentation, nice demos
Thanks Christophe! Putting the presentation together helped me crystalize a lot of ideas.
Welcome everyone and thanks for attending my presentation. I'll be monitoring the chat window for any questions you may have.

Regarding gain-matching of mics - can you correct the individual mic response curves to a common response level using measured response data for each mic before applying the beamformer?