Pipeline Details

Pipeline Details#

This document explains how the spatial neural activity analysis pipeline works.

Overview#

Data Files#

Neural inputs (in neural_path directory):

{trace_name}.zarr: calcium traces (frames × units). Name set by trace_name in config (e.g. C.zarr, C_lp.zarr)
A.zarr: spatial footprints for cell contour overlay (optional)
max_proj.zarr: max projection image for visualization background (optional)

Behavior inputs:

neural_timestamp.csv: neural frame timestamps (frame, timestamp_first, timestamp_last)
behavior_position.csv: animal position per frame (DeepLabCut format with bodypart columns)
behavior_timestamp.csv: behavior frame timestamps

Intermediate DataFrames (generated during pipeline):

event_index: deconvolved neural events (frame, unit_id, amplitude s)
event_place: events matched to position with speed (unit_id, frame, s, x, y, speed) — only events above speed_threshold

Processing Steps#

`ds.load()`#

Load calcium traces from {trace_name}.zarr
Load behavior position and timestamps → compute raw speed (px/s)
Load visualization assets (max_proj, footprints, behavior video frame)

`ds.preprocess_behavior()`#

Saves a copy of the raw trajectory in trajectory_raw, then applies corrections and speed filtering.

When arena_bounds is configured (full pipeline):

Jump removal: interpolate frame-to-frame displacements exceeding jump_threshold_mm
Perspective correction: correct for camera angle using camera_height_mm and tracking_height_mm
Boundary clipping: clip positions to arena_bounds
Recompute speed: recalculate speed in mm/s from corrected positions
Speed filter: apply speed_threshold → trajectory_filtered

When arena_bounds is not configured (fallback with warnings):

Spatial corrections are skipped (jump removal, perspective correction, boundary clipping)
Speed and position remain in pixels
Speed filter is still applied → trajectory_filtered

`ds.deconvolve()`#

Run OASIS AR2 deconvolution on each unit’s calcium trace
Parameters: g (AR coefficients), baseline, penalty, s_min
Output: good_unit_ids, spike trains (S_list), event_index

`ds.match_events()`#

For each neural event, find the closest behavior frame by timestamp
Discard matches where timestamp difference exceeds 0.5 / behavior_fps
Filter out events where animal speed was below speed_threshold
Output: event_place

`ds.compute_occupancy()`#

Compute 2D occupancy histogram from trajectory_filtered
Smooth with occupancy_sigma, mask bins below min_occupancy
Output: occupancy_time, valid_mask, bin edges

`ds.analyze_units()` — per unit#

Four independent computations from the same inputs (events, filtered trajectory, occupancy):

Rate map: event weights / occupancy time, smoothed with activity_sigma (for display)
Spatial information + shuffle test: Skaggs SI with circular-shift shuffle → SI p-value
Shuffled rate percentile: per-bin percentile of shuffled rate maps → used for place field seed detection
Split-half stability + shuffle test: correlation between first/second half rate maps with circular-shift shuffle → stability p-value

Place cell classification: units with SI p-value < p_value_threshold AND stability p-value < p_value_threshold.

Place field detection (Guo et al. 2023):

Seed detection: bins where rate exceeds the shuffled rate percentile. Only contiguous seed regions with ≥ place_field_min_bins bins are kept
Extension: each seed region extends to contiguous bins with rate ≥ place_field_threshold × (seed’s peak rate)

Results#

Coverage map: sum of place field masks across place cells
Coverage curve: cells sorted by field size (largest first), cumulative fraction of environment covered
Interactive browser: max projection overlay, trajectory with events, rate map with place field contour, SI histogram, stability maps, trace view

Key Parameters#

speed_threshold: minimum speed to include data (mm/s)
min_occupancy: minimum time per bin to be valid (seconds)
bins: spatial resolution (number of bins per axis)
occupancy_sigma: Gaussian smoothing sigma for occupancy map (in bins)
activity_sigma: Gaussian smoothing sigma for rate map (in bins)
n_shuffles: number of circular-shift shuffle iterations
min_shift_seconds: minimum circular shift for shuffle test (seconds)
p_value_threshold: p-value threshold for both SI and stability significance
si_weight_mode: "amplitude" (event amplitudes) or "binary" (event counts)
place_field_threshold: fraction of peak rate for place field extension
place_field_min_bins: minimum contiguous bins for a place field seed
place_field_seed_percentile: percentile of shuffled rates for seed detection

Pipeline Details

On this page

Pipeline Details#

Overview#

Data Files#

Processing Steps#

`ds.load()`#

`ds.preprocess_behavior()`#

`ds.deconvolve()`#

`ds.match_events()`#

`ds.compute_occupancy()`#

`ds.analyze_units()` — per unit#

Results#

Key Parameters#

Configuration Reference#

Data Paths Config#

Analysis Config#

Pipeline Details

On this page

Pipeline Details#

Overview#

Data Files#

Processing Steps#

ds.load()#

ds.preprocess_behavior()#

ds.deconvolve()#

ds.match_events()#

ds.compute_occupancy()#

ds.analyze_units() — per unit#

Results#

Key Parameters#

Configuration Reference#

Data Paths Config#

Analysis Config#

`ds.load()`#

`ds.preprocess_behavior()`#

`ds.deconvolve()`#

`ds.match_events()`#

`ds.compute_occupancy()`#

`ds.analyze_units()` — per unit#