Replication Data for: Robustly optimal dynamics for active matter reservoir computing (Gaimann and Klopotek, 2025)

Dataset

DOI

This repository contains raw and post-processed replication data for the publication "Robustly optimal dynamics for active matter reservoir computing" (Gaimann and Klopotek, 2025).

The datasets contain physical observables recorded during non-equilibrium simulations of active matter systems (swarms) driven by an external force. These simulations serve as information processors in a reservoir computing setup.

We provide replication data for all figures and supplementary videos shown in our publication:

speed-controller speed-controller, with an integration time step of 2e-3 speed-controller, with a maximum correlation delay time of 75 integration time steps speed-controller, without external driving (undriven) speed-controller, with a single agent speed-controller, with two agents speed-controller, with 500 agents (overdamped phenomenology) speed-controller, with initial transient (burn-in phase) speed-controller, with 12 integration time steps predicted ahead speed-controller, with 50 integration time steps predicted ahead speed-controller, with 100 integration time steps predicted ahead speed-controller, with Hénon-Heiles driving protocol speed-controller, with Hénon-Heiles driving protocol, Lyapunov time-adjusted prediction (638 integration time steps), and maximum correlation delay time of 75 integration time steps speed-controller, with Rössler driving protocol speed-controller, with Rössler driving protocol, Lyapunov time-adjusted prediction (150 integration time steps), and maximum correlation delay time of 75 integration time steps speed-controller, with Chua driving protocol speed-controller, with Chua driving protocol, Lyapunov time-adjusted prediction (18 integration time steps), and maximum correlation delay time of 75 integration time steps speed-controller, with Lorenz-96 driving protocol speed-controller, with Lorenz-96 driving protocol, Lyapunov time-adjusted prediction (17 integration time steps), and maximum correlation delay time of 75 integration time steps speed-controller, with a larger correlation recording delay time of 500 integration time steps damping analysis, with non-interacting agents damping analysis, with interacting agents damping analysis, with non-interacting agents and an integration time step of 2e-3 damping analysis, with interacting agents and an integration time step of 2e-3 alignment force, with speed-controller settings of Lymburn et al. (2021) alignment force, with overdamped speed-controller settings homing force strength vs. speed-controller strength, with speed-controller settings of Lymburn et al. (2021) homing force strength vs. speed-controller strength, with overdamped speed-controller settings homing force strength vs. speed-controller target agent speed number of agents vs. Gaussian white noise strength, with speed-controller settings of Lymburn et al. (2021) number of agents vs. Gaussian white noise strength, with overdamped speed-controller settings reproduction of the dynamical regimes analyzed in Lymburn et al. (2021) (Fig. 7) reproduction of the (driven) alignment strength vs. repulsion strength parameter scans in Lymburn et al. (2021) (Fig. 6B, Fig. 8A) reproduction of the undriven alignment strength vs. repulsion strength parameter scan in Lymburn et al. (2021) (Fig. 6A)

The Lorenz-63 driving protocol was generated on the fly during the simulation. We also provide the raw chaotic time series used as benchmark driving protocols:

Hénon-Heiles Rössler Chua Lorenz-96

Each dataset typically contains 400 parameter combinations. Each parameter combination contains four files:

config.yaml: controlled variables simulation_output_train.h5: physical simulation observables in first (training) run simulation_output_test.h5: physical simulation observables in second (testing) run reservoir_computer_output.h5: observables related to reservoir computing and time series prediction

The second run has a different chaotic driving protocol, using the same underlying dynamical system (chaotic attractor) but different initial conditions. Only the first file is generated if the dataset contains a simulation without an external driving force (undriven). By default, for all driven simulations, physical observables are only recorded for the test run for a full reservoir computing train/test cycle. Each simulation typically consists of 1,000.00 time units (50,000 integration time steps of 0.02 time units by default). A burn-in phase of 20.0 simulation time units (1,000 integration time steps of 0.02 time units by default) takes place at the beginning of each simulation, which is not recorded by default (only recorded in the "speed-controller, with initial transient" dataset). Controlled variables are stored as HDF5 attributes. At each step, we predict by default 25 integration time steps ahead (=0.45283 L63-Lyapunov times). For Lyapunov times adjusted attractor predictions, we predict n integration time steps ahead that equal 0.45283 Lyapunov times of the corresponding attractor.

The simulation output files contain:

agent_observables: positions, velocities, total forces, velocity fluctuations for all agents; for the first 20.0 simulation time units frame_observables: driver position (external driving trajectory / input time series), center of mass (taking periodic boundary conditions into account), agent-averaged observables, scalar polarity, scalar rotation; for the full simulation histograms: binned agent observables and derived quantities; for the full simulation radially_binned: radial distribution function (agent count), connected velocity correlation, cumulative velocity correlation time_lags: auto-correlations of agent observables and derived quantities, two-time correlations of agent observables and derived quantities reference_frame_steps: reference frames (measured in integration steps) for the recording of delay-based quantities in time_lags

The reservoir computer output files contain:

linear_regression_model: the weights of the linear model (readout layer) observer_kernel_params: placement positions and widths of the Gaussian observation kernels predictions_train: n-steps-ahead prediction using the trained linear model, on training data predictions_test: n-steps-ahead prediction using the trained linear model, on testing data

Aggregates of physical observables across all parameter combinations in a single dataset are stored as CSV files for convenience, the relevant observable is indicated by the file name. Files that carry the "time_avg" tag are averaged over all simulation time steps, for the "ensemble_avg" averaged over all seeds (only one seed is used here), and for the "array_avg" averaged over all recorded entries (typically samples at different time steps). We provide the following aggregated observables that were processed to generate figures in our associated publication:

lymburn_correlation_coefficient: Correlation coefficient, predictive performance agent_avg_msd_at_lyapunov_time_step=55: Agent-averaged mean squared displacement at the Lyapunov integration time step of the Lorenz-63 attractor (after 55 integration time steps of 0.02 each) first_local_min.array_avg.h5?connected_velocity_correlation: First local minimum of the connected velocity correlation function, averaged over all recorded samples mean_speed: Agent-averaged speed scalar_polarity: Scalar polarity scalar_rotation: Scalar rotation attanasi_susceptibility: Dynamical susceptibility

The supplementary videos generated using this raw data are published as: Gaimann, M. U., & Klopotek, M. (2025). Supplementary Videos for: Robustly optimal dynamics for active matter reservoir computing (Gaimann and Klopotek, 2025). DaRUS. doi:10.18419/DARUS-4619.

ResoBee, 0.10.0 - 0.14.0

Identifier
DOI	https://doi.org/10.18419/DARUS-4620
Metadata Access	https://darus.uni-stuttgart.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18419/DARUS-4620

Provenance
Creator	Gaimann, Mario U. (ORCID: 0000-0002-2789-090X); Klopotek, Miriam
Publisher	DaRUS
Contributor	Gaimann, Mario U.; Klopotek, Miriam
Publication Year	2025
Funding Reference	DFG EXC 2075 - 390740016
Rights	CC BY 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by/4.0
OpenAccess	true
Contact	Gaimann, Mario U. (University of Stuttgart); Klopotek, Miriam (University of Stuttgart)

Representation
Resource Type	Dataset
Format	application/x-xz; text/tab-separated-values
Size	15487238648; 16819004192; 61100424108; 5875266112; 59132699880; 5826381032; 2856000; 14609877440; 12849491856; 15653190564; 13216209904; 13504436020; 28044401256; 160408132; 15772236604; 10955682684; 6915797484; 15088151564; 6917151784; 15083348300; 6912666652; 15104328780; 6911493100; 14987912064; 980934792; 6449098828; 6453247916; 6447490096; 3477286732; 149518099000; 2145011504; 15081470076; 956938572; 2815385952; 10728325560
Version	1.0
Discipline	Natural Sciences; Physics
Spatial Coverage	Stuttgart, Germany