Chimera
Understanding Chimera: A Pipeline Model Parallelism Scheme Chimera is a model parallelism scheme designed to train large-scale models efficiently. Its unique feature is the combination of bidirectional pipelines, namely down and up pipelines, to accomplish the task. The aim is to execute a large number of micro-batches by each worker within a training iteration with the minimum of four pipeline stages. How Chimera Pipeline Works? Chimera pipeline, as shown in the figure, consists of four pip