Existing approaches can be categorized into three major families based on how the information regarding previous task data is stored and used to mitigate forgetting and potentially support the learning of new tasks. These include replay-based [8, 30] methods which store prior samples, dynamic architectures [34, 37] which add and remove components and prior-focused [18, 39, 9, 7] methods that rely on regularization.