src.config module

class src.config.ConfigParameters(num_workers: int = 5, staleness: int = 50, lr: float = 0.01, local_steps: int = 1, batch_size: int = 10, device: str = 'cpu', log_level: int = 20, tol: float = 1e-08, Amplitude: float = 1)

Bases: object

Configuration for Stale Synchronous Parallel Asynchronous SGD (SSP-ASGD) training.

Parameters:
  • num_workers (int) – Number of worker processes.

  • staleness (int) – Staleness bound allowed for the workers during training. Represents the maximum number of versions a worker can be behind the latest version.

  • lr (float) – Learning rate for the model. Represents the step size for updating the model parameters.

  • local_steps (int) – Number of steps/updates each worker locally computes before pushing gradients to the server.

  • batch_size (int) – Batch size for each training step and the data loader.

  • device (str) – Device to use for training (e.g., “cuda” or “cpu”).

  • log_level (int) – Logging verbosity level. The default value, 20, equals logging.INFO in Python's standard logging module.
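
The staleness bound described above can be illustrated with a small check. This is a minimal sketch, not the module's actual code: the function name within_staleness_bound and the arguments worker_version and server_version are hypothetical, and it assumes versions are plain integer counters where "behind" means the difference between the server's version and the worker's.

```python
def within_staleness_bound(worker_version: int,
                           server_version: int,
                           staleness: int = 50) -> bool:
    """Return True if the worker lags the server by at most `staleness` versions.

    Hypothetical helper: assumes integer version counters and that a worker
    exactly `staleness` versions behind is still allowed to proceed.
    """
    lag = server_version - worker_version
    return lag <= staleness


# A worker at version 0 may proceed while the server is at version 50,
# but would have to wait once the server reaches version 51.
can_proceed = within_staleness_bound(worker_version=0, server_version=50)
must_wait = not within_staleness_bound(worker_version=0, server_version=51)
```

Whether the real implementation treats the bound as inclusive is not stated in this page; the sketch picks the inclusive reading of "maximum number of versions a worker can be behind".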

Amplitude: float = 1
batch_size: int = 10
device: str = 'cpu'
local_steps: int = 1
log_level: int = 20
lr: float = 0.01
num_workers: int = 5
staleness: int = 50
tol: float = 1e-08
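
The signature and attribute defaults listed above suggest a dataclass-style container, although this page does not confirm how the class is implemented. The sketch below uses a hypothetical stand-in, ConfigParametersSketch, that mirrors the documented fields and defaults, and shows how individual values might be overridden. The tol and Amplitude fields carry no comments here because the page does not describe them.

```python
import logging
from dataclasses import dataclass, replace


@dataclass
class ConfigParametersSketch:
    """Hypothetical stand-in mirroring the documented fields and defaults."""
    num_workers: int = 5      # number of worker processes
    staleness: int = 50       # max versions a worker may be behind
    lr: float = 0.01          # learning rate (step size)
    local_steps: int = 1      # local updates before pushing gradients
    batch_size: int = 10      # batch size per training step / data loader
    device: str = "cpu"       # e.g. "cuda" or "cpu"
    log_level: int = logging.INFO  # numeric value 20
    tol: float = 1e-08
    Amplitude: float = 1


# All defaults, as listed in the attribute summary.
default_cfg = ConfigParametersSketch()

# Override a few fields and keep the rest at their defaults.
custom_cfg = replace(default_cfg, num_workers=8, staleness=10, device="cuda")
```

With the real class, the equivalent call would presumably be ConfigParameters(num_workers=8, staleness=10, device="cuda"), since every parameter in the documented signature has a default.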