This repository contains raw and post-processed replication data for the publication "Robustly optimal dynamics for active matter reservoir computing" (Gaimann and Klopotek, 2025).
The datasets contain physical observables recorded during non-equilibrium simulations of active matter systems (swarms) driven by an external force. These simulations serve as information processors in a reservoir computing setup.
We provide replication data for all figures and supplementary videos shown in our publication:
speed-controller
speed-controller, with an integration time step of 2e-3
speed-controller, with a maximum correlation delay time of 75 integration time steps
speed-controller, without external driving (undriven)
speed-controller, with a single agent
speed-controller, with two agents
speed-controller, with 500 agents (overdamped phenomenology)
speed-controller, with initial transient (burn-in phase)
speed-controller, with 12 integration time steps predicted ahead
speed-controller, with 50 integration time steps predicted ahead
speed-controller, with 100 integration time steps predicted ahead
speed-controller, with Hénon-Heiles driving protocol
speed-controller, with Hénon-Heiles driving protocol, Lyapunov time-adjusted prediction (638 integration time steps), and maximum correlation delay time of 75 integration time steps
speed-controller, with Rössler driving protocol
speed-controller, with Rössler driving protocol, Lyapunov time-adjusted prediction (150 integration time steps), and maximum correlation delay time of 75 integration time steps
speed-controller, with Chua driving protocol
speed-controller, with Chua driving protocol, Lyapunov time-adjusted prediction (18 integration time steps), and maximum correlation delay time of 75 integration time steps
speed-controller, with Lorenz-96 driving protocol
speed-controller, with Lorenz-96 driving protocol, Lyapunov time-adjusted prediction (17 integration time steps), and maximum correlation delay time of 75 integration time steps
speed-controller, with a larger correlation recording delay time of 500 integration time steps
damping analysis, with non-interacting agents
damping analysis, with interacting agents
damping analysis, with non-interacting agents and an integration time step of 2e-3
damping analysis, with interacting agents and an integration time step of 2e-3
alignment force, with speed-controller settings of Lymburn et al. (2021)
alignment force, with overdamped speed-controller settings
homing force strength vs. speed-controller strength, with speed-controller settings of Lymburn et al. (2021)
homing force strength vs. speed-controller strength, with overdamped speed-controller settings
homing force strength vs. speed-controller target agent speed
number of agents vs. Gaussian white noise strength, with speed-controller settings of Lymburn et al. (2021)
number of agents vs. Gaussian white noise strength, with overdamped speed-controller settings
reproduction of the dynamical regimes analyzed in Lymburn et al. (2021) (Fig. 7)
reproduction of the (driven) alignment strength vs. repulsion strength parameter scans in Lymburn et al. (2021) (Fig. 6B, Fig. 8A)
reproduction of the undriven alignment strength vs. repulsion strength parameter scan in Lymburn et al. (2021) (Fig. 6A)
The Lorenz-63 driving protocol was generated on the fly during the simulation. We also provide the raw chaotic time series used as benchmark driving protocols:
Hénon-Heiles
Rössler
Chua
Lorenz-96
Each dataset typically contains 400 parameter combinations. Each parameter combination contains four files:
config.yaml: controlled variables
simulation_output_train.h5: physical simulation observables in first (training) run
simulation_output_test.h5: physical simulation observables in second (testing) run
reservoir_computer_output.h5: observables related to reservoir computing and time series prediction
The second run has a different chaotic driving protocol, using the same underlying dynamical system (chaotic attractor) but different initial conditions.
Only the first file is generated if the dataset contains a simulation without an external driving force (undriven). By default, for all driven simulations, physical observables are only recorded for the test run for a full reservoir computing train/test cycle. Each simulation typically consists of 1,000.00 time units (50,000 integration time steps of 0.02 time units by default). A burn-in phase of 20.0 simulation time units (1,000 integration time steps of 0.02 time units by default) takes place at the beginning of each simulation, which is not recorded by default (only recorded in the "speed-controller, with initial transient" dataset). Controlled variables are stored as HDF5 attributes. At each step, we predict by default 25 integration time steps ahead (=0.45283 L63-Lyapunov times). For Lyapunov times adjusted attractor predictions, we predict n integration time steps ahead that equal 0.45283 Lyapunov times of the corresponding attractor.
The simulation output files contain:
agent_observables: positions, velocities, total forces, velocity fluctuations for all agents; for the first 20.0 simulation time units
frame_observables: driver position (external driving trajectory / input time series), center of mass (taking periodic boundary conditions into account), agent-averaged observables, scalar polarity, scalar rotation; for the full simulation
histograms: binned agent observables and derived quantities; for the full simulation
radially_binned: radial distribution function (agent count), connected velocity correlation, cumulative velocity correlation
time_lags: auto-correlations of agent observables and derived quantities, two-time correlations of agent observables and derived quantities
reference_frame_steps: reference frames (measured in integration steps) for the recording of delay-based quantities in time_lags
The reservoir computer output files contain:
linear_regression_model: the weights of the linear model (readout layer)
observer_kernel_params: placement positions and widths of the Gaussian observation kernels
predictions_train: n-steps-ahead prediction using the trained linear model, on training data
predictions_test: n-steps-ahead prediction using the trained linear model, on testing data
Aggregates of physical observables across all parameter combinations in a single dataset are stored as CSV files for convenience, the relevant observable is indicated by the file name. Files that carry the "time_avg" tag are averaged over all simulation time steps, for the "ensemble_avg" averaged over all seeds (only one seed is used here), and for the "array_avg" averaged over all recorded entries (typically samples at different time steps). We provide the following aggregated observables that were processed to generate figures in our associated publication:
lymburn_correlation_coefficient: Correlation coefficient, predictive performance
agent_avg_msd_at_lyapunov_time_step=55: Agent-averaged mean squared displacement at the Lyapunov integration time step of the Lorenz-63 attractor (after 55 integration time steps of 0.02 each)
first_local_min.array_avg.h5?connected_velocity_correlation: First local minimum of the connected velocity correlation function, averaged over all recorded samples
mean_speed: Agent-averaged speed
scalar_polarity: Scalar polarity
scalar_rotation: Scalar rotation
attanasi_susceptibility: Dynamical susceptibility
The supplementary videos generated using this raw data are published as: Gaimann, M. U., & Klopotek, M. (2025). Supplementary Videos for: Robustly optimal dynamics for active matter reservoir computing (Gaimann and Klopotek, 2025). DaRUS. doi:10.18419/DARUS-4619.
ResoBee, 0.10.0 - 0.14.0