Chapter 02

Doppler Effect & De-Doppler Search

Why SETI candidates appear as slanted lines in spectrograms, and how integration along those lines pulls weak signals out of noise.

Saman Tabatabaeian — Deep Field Labs MitraSETI Tutorial Series

Prerequisite: Foundations (spectrograms, noise, basic signal concepts)

This chapter explains why narrowband SETI candidates appear as slanted lines in spectrograms, and how a de-Doppler search integrates power along those lines to pull weak signals out of noise. You already know what a spectrogram is: power as a function of time (rows or columns) and frequency (the other axis). Here we connect motion, frequency shift, and the brute-force algorithm that MitraSETI-style pipelines refine later with faster structures.

1. The Doppler Effect — Everyday Examples

Sound: the ambulance siren

When an ambulance approaches you, the siren sounds higher in pitch; after it passes, it sounds lower. The siren's mechanical vibration frequency at the vehicle is unchanged. What changes is how often the compressed air peaks reach your ear: motion along the line of sight stretches or squeezes the spacing between wave crests that actually arrive.

That is the Doppler effect: observed frequency depends on relative radial velocity (motion toward or away from you), not only on what the source emits.

Figure 2.1 — The Doppler effect with sound: compressed wavefronts (higher frequency) toward the approaching observer, stretched wavefronts (lower frequency) behind.

A simple formula for small speeds

For speeds much less than the wave speed c (sound in air, or the speed of light for radio), a good linear approximation is:

✦

Key Concept — The Doppler Formula

f_observed = f_source × (1 + v / c)

Here v is positive if the source is receding and negative if approaching (sign conventions vary; what matters is the idea). Approaching motion increases observed frequency (blueshift for light); receding motion decreases it (redshift).

Light and radio

Electromagnetic waves obey the same idea. A transmitter on a spacecraft, a radar echo, or a hypothetical beacon on an exoplanet all have a rest-frame frequency. The antenna on Earth measures a different frequency when source and observer move relative to each other along the line of sight.

Redshift: observed frequency is lower than emitted (source receding or equivalent).
Blueshift: observed frequency is higher (source approaching).

SETI often imagines a narrow carrier or comb of lines at some f_source. Everything that changes the radial velocity between that source and Earth changes f_observed over time.

For software engineers, a useful mental model is resampling: radial motion is like a continuous time-varying mix between the source clock and the receiver clock in the line-of-sight direction. You do not need general relativity for night-scale SETI intuition—classical Doppler plus known ephemerides gets you most of the way—but remember that precision work (comparing candidates across days) eventually pulls in barycentric corrections: you express frequencies in a frame tied to the solar system's center of mass so Earth's orbital reflex motion does not masquerade as an intrinsic source drift.

2. Why Does a SETI Signal Drift?

A fixed frequency in the source's frame is not fixed at the telescope unless every relative motion is constant—and it never is for long.

Earth's rotation

A point on the equator moves at roughly 465 m/s due to Earth's spin. That velocity vector projects onto the line of sight to a star or galaxy and changes as Earth turns. So the radial velocity toward a distant source drifts throughout a night.

Earth's orbit

Earth orbits the Sun at about 30 km/s. Over weeks and months this dominates many drift signatures compared to rotation alone for a given pointing.

The source's motion

If the "beacon" sits on a planet, that planet orbits its star with an unknown orbital speed and phase. The star may have its own motion. All of these add vectorially to what the telescope sees.

Combined effect during one observation

Over a single recording (seconds to minutes), the cleanest first-order model is often:

f(t) = f₀ + drift_rate × t

f₀ is the frequency at some reference time (e.g. start of the file).
drift_rate is how fast the observed frequency changes: Hz/s (hertz per second).

Typical magnitudes discussed for habitable-zone contexts are often in the ballpark of ±0.01 Hz/s to ±4 Hz/s, depending on band, duration, geometry, and whether you include only Earth rotation or full barycentric corrections. The exact number matters less for intuition than the fact that drift is normal for a celestial narrowband line.

Over short snippets (a few seconds), drift_rate × duration may be smaller than one channel width, so the line looks almost vertical until you zoom out or use finer resolution. Over tens of seconds to minutes, the same Hz/s accumulates into many channels of walk—exactly the regime where de-Doppler integration pays off.

✦

Key Concept — Why Zero Drift Is Suspicious

If the transmitter and receiver share the same rotating, orbiting frame—like a terrestrial interferer fixed to Earth's surface—there is no differential Doppler between source and dish. The line stays at one frequency (aside from equipment drift, which is usually slow).

A candidate that shows zero drift (a vertical ridge in the spectrogram) is therefore more consistent with RFI than with a geometrically distant beacon, though it is not a proof by itself.

3. What the Drift Looks Like on a Spectrogram

Think of the spectrogram as a time–frequency plane: time runs horizontally, frequency runs vertically (or the axes may be swapped in software—the geometry is the same).

A constant drift rate draws a straight line through that plane.

Figure 2.2 — Three drift signatures in a spectrogram: positive drift (frequency increases), negative drift (frequency decreases), and zero drift (often a sign of terrestrial RFI).

ASCII sketch: frequency vs time

Higher frequency is "up"; time advances to the right.

Positive drift (frequency increases with time): line slopes up and right.

freq ^ | / | / | / | / +--------------------> time

Negative drift (frequency decreases with time): line slopes down and right.

freq ^ | \ | \ | \ | \ +--------------------> time

Zero drift: vertical line (same channel over time)—often a flag for terrestrial RFI in SETI-style reasoning.

Buried in noise at each instant

In any single time step, a weak carrier spreads over a few bins and sits near the noise floor. You might not see a convincing peak. The eye catches structure only when many time steps are viewed together—and even then, the diagonal smear can be faint.

Integration along the diagonal

If you sum (or average coherently in more advanced setups) power along the correct trajectory, energy from the signal adds constructively along that path, while noise tends to average down. That is the core idea of de-Doppler search: integrate along candidate lines in the time–frequency plane.

Figure 2.3 — De-Doppler integration: summing power along the correct diagonal trajectory. The signal adds coherently while noise averages down, producing a high SNR detection.

Discrete grid intuition

Real pipelines work on bins, not continuous lines. Here is a toy spectrogram: rows are frequency channels (0 at bottom), columns are time; · is noise-dominated and * marks where a weak signal passes through.

Figure 2.4 — Discrete grid view: the signal drifts through integer channel bins at each time step. The integrator visits these (time, channel) pairs for a trial drift rate d.

The trajectory is the set of (time, channel) pairs the integrator visits for a trial d starting at channel 1 at t = 0. Interpolation (linear, sinc, or nearest-neighbor) decides how to read values between exact bin centers when d does not land on integers. Nearest-neighbor is fast but can bias scores; better pipelines use sub-bin interpolation so drift is not artificially quantized worse than the instrument already is.

4. The De-Doppler Search — Brute Force

Goal: try a grid of trial drift rates d and starting frequency channels f, integrate power along each corresponding line, and mark high signal-to-noise trajectories as detections.

Step 0: normalize the spectrogram

Raw dynamic range is harsh. A common robust preprocessing step:

Subtract a per-channel or global median (or running baseline) to center the background.
Scale by a robust spread, e.g. MAD × 1.4826, so that Gaussian-ish noise ends up with unit-ish variance in many pipelines.

MAD is the median absolute deviation; the factor 1.4826 makes it comparable to standard deviation for normal noise. Exact choices vary by implementation, but the intent is the same: stabilize comparisons before summing along paths.

Step 1: trial drift rates

Choose a list of drift rates d spanning the physically plausible range (symmetric positive and negative).

Step 2: for each d and each starting channel f

For a discrete spectrogram:

Time steps t = 0 … N_t − 1.
Channel index at time t for drift d is something proportional to f + d × t × (time per step) / (Hz per channel), i.e. you convert Hz/s into channels per time step using tsamp (seconds per sample) and channel width (Hz per bin, sometimes called foff).

Concretely, if k(t) is the (possibly fractional) channel index at time step t:

k(t) = f + d × t × tsamp / foff

(Adjust signs if your convention defines positive drift as decreasing frequency.) At each t, sample the spectrogram at k(t)—e.g. linear interpolation between floor(k) and ceil(k)—to get a value x(t). Accumulate:

S = Σ_t x(t)

then form a detection statistic such as S / √N_t or a variant that also accounts for per-step weights.

If the integrated score exceeds a threshold, record (f, d, t_span, SNR, …) as a candidate.

💡

Implementation Note

The inner loop is embarrassingly parallel over f and d, but memory bandwidth and cache locality dominate at scale: you sweep the spectrogram many times unless you restructure access (again: Taylor tree).

Step 3: post-process

Nearby detections in (frequency, drift) space are clustered or non-max suppressed so one physical line does not yield hundreds of duplicate hits.

A single bright RFI burst can also create aliases at wrong drifts if the model is imperfect; conservative pipelines therefore combine de-Doppler scores with RFI masks, kurtosis gates, or multiple observations of the same sky location. This chapter stays focused on the geometry of drift; later chapters cover how MitraSETI decides which high-SNR blobs survive scrutiny.

Complexity

Rough operation count:

✦

Key Concept — Brute Force Complexity

O(N_d × N_t × N_f)

N_d = number of trial drift rates
N_t = time steps
N_f = frequency channels

For Breakthrough Listen–style resolutions, orders of magnitude can look like N_f = 1,048,576, N_t = 16, N_d ≈ 300, which is on the order of 5 × 10⁹ inner-loop contributions per file for the naive triply nested structure. That is slow at scale, which motivates algorithmic acceleration (the next tutorial).

5. Visual Example: Finding Voyager 1

Voyager 1 transmits near ~8.4 GHz (band-dependent; exact channel matters for plotting). From Earth, the dominant smooth drift from Earth's rotation is often quoted in the ~0.287 Hz/s class for such geometry (illustrative; always compute for your epoch and pointing).

On a spectrogram, Voyager does not look like a bright vertical stripe. It is a faint diagonal whose slope encodes that drift.

Without aligning integration to that slope: any single frequency bin across time sees only a weak, inconsistent bump—SNR too low to claim confidently.
With de-Doppler integration at the correct drift rate: power stacks along the true path. In worked examples, integrated SNR values on the order of 47 (e.g. 47.18) can appear—an unambiguous detection for a well-calibrated pipeline.

You can reproduce the lesson on any stable narrowband transmitter whose line-of-sight velocity changes smoothly—satellites, planetary spacecraft, or calibrated lab sources—provided your spectrogram cadence and resolution resolve the drift across N_t.

★

Fun Fact

The pedagogical point: the signal is a line in 2D, not a point in 1D. Searching only per-channel FFTs without drift matching leaves most astrophysical or deep-space narrowband energy under-integrated.

6. Channel Math

Let:

foff = channel bandwidth (Hz per bin), e.g. ~2.79 Hz per channel in some setups
tsamp = time resolution (seconds per step), e.g. ~18.25 s per spectrum in some setups
N_t = number of time steps in the observation

A natural drift step in Hz/s that moves the signal by one channel over the full duration is:

drift_step = foff / (N_t × tsamp)

That sets the grid spacing in drift rate: finer steps cost more N_d and more compute.

The maximum drift you care about in Hz/s, call it max_drift_rate, maps to a span in channels of roughly:

max_drift_in_channels ≈ max_drift_rate / drift_step

(Up to factors of order unity depending on whether you use half-steps or full channel quantization.) This is how you connect physical Hz/s limits to how many drift trials you need and how wide a frequency walk you must allow per trajectory so you do not "lose" the line off the edge of the band.

7. Normalization: Why Divide by √N_t

Assume per-time-step noise contributions are roughly independent with zero mean and similar variance. If you sum N_t of them:

The expected magnitude of the random walk grows like √N_t (central limit intuition: variance of the sum is N_t times per-step variance).
A deterministic signal component that adds with the same sign along the path grows like N_t (linear accumulation of coherent energy).

So define something proportional to:

score = (sum along path) / √N_t

Then noise-only paths have scores of order 1, while a real aligned narrowband ridge produces a larger positive outlier. That is why brute-force de-Doppler works: it separates coherent accumulation along the correct line from incoherent wandering of noise.

Equivalently, you may see implementations that use a matched filter or normalized cross-correlation viewpoint; the √(N_t) factor is the same statistical normalization in disguise.

If steps are not perfectly independent (spectra overlap in time, or preprocessing introduces correlation), the effective scaling deviates slightly from √(N_t). Pipeline designers calibrate thresholds on real data or simulations so false-alarm rates stay under control. The concept—coherent vs incoherent growth—remains the reason line integration is a principled detector for drifting narrowband energy.

8. Preview: Why Brute Force Is Not Good Enough

The nested loops recompute trajectories that overlap heavily. Two adjacent trial drift rates d and d + ε visit almost the same set of pixels, shifted slightly. A naive implementation re-sums from scratch every time, throwing away shared work.

The Taylor tree (next chapter) exploits that redundancy by organizing partial sums so that families of drifts share intermediate results, driving complexity down toward O(N log N)-style scaling in the number of drift hypotheses instead of a flat O(N_d × N_t × N_f) blow-up.

Picture two nearby drifts d and d′ = d + δ. For early time steps, the two paths through the grid share the same integer cells or differ by at most one bin. Brute force re-reads those cells for every d; a tree-based method stores partial sums along time and reuses them when only the tail of the trajectory diverges. That is the same algorithmic story as dynamic programming or prefix structures: pay once, query many related hypotheses.

You should leave this chapter with three anchors:

Drift is expected for celestial narrowband; zero drift is a caution sign for RFI.
Spectrogram lines are diagonal; integration along the correct diagonal is the detection primitive.
Brute-force de-Doppler is correct but expensive; smarter data structures are mandatory at Breakthrough Listen scale.

When you are ready, open 03 – Taylor Tree Algorithm for the efficient engine.

Doppler Effect & De-Doppler Search

1. The Doppler Effect — Everyday Examples

Sound: the ambulance siren

A simple formula for small speeds

Key Concept — The Doppler Formula

Light and radio

2. Why Does a SETI Signal Drift?

Earth's rotation

Earth's orbit

The source's motion

Combined effect during one observation

Key Concept — Why Zero Drift Is Suspicious

3. What the Drift Looks Like on a Spectrogram

ASCII sketch: frequency vs time

Buried in noise at each instant

Integration along the diagonal

Discrete grid intuition

4. The De-Doppler Search — Brute Force

Step 0: normalize the spectrogram

Step 1: trial drift rates

Step 2: for each d and each starting channel f

Implementation Note

Step 3: post-process

Complexity

Key Concept — Brute Force Complexity

5. Visual Example: Finding Voyager 1

Fun Fact

6. Channel Math

7. Normalization: Why Divide by √N_t

8. Preview: Why Brute Force Is Not Good Enough

Further Reading (Conceptual)

Try it in the Cloud

1. The Doppler Effect — Everyday Examples

Sound: the ambulance siren

A simple formula for small speeds

Key Concept — The Doppler Formula

Light and radio

2. Why Does a SETI Signal Drift?

Earth's rotation

Earth's orbit

The source's motion

Combined effect during one observation

Key Concept — Why Zero Drift Is Suspicious

3. What the Drift Looks Like on a Spectrogram

ASCII sketch: frequency vs time

Buried in noise at each instant

Integration along the diagonal

Discrete grid intuition

4. The De-Doppler Search — Brute Force

Step 0: normalize the spectrogram

Step 1: trial drift rates

Step 2: for each d and each starting channel f

Implementation Note

Step 3: post-process

Complexity

Key Concept — Brute Force Complexity

5. Visual Example: Finding Voyager 1

Fun Fact

6. Channel Math

7. Normalization: Why Divide by √Nt

8. Preview: Why Brute Force Is Not Good Enough

Further Reading (Conceptual)

Try it in the Cloud

7. Normalization: Why Divide by √N_t