State of the art AI requires orchestrating large clusters to perform a single synchronised calculation. How does this orchestration work? And how can it be done without incurring expensive communication overheads?
Can you boost when you don't have labels?