An initial implementation keeps a separate VM which holds onto the external data for nodes to download from. If each node is able to mount the external data from a volume, this would remove the need for the nodes to do any downloading at all.