To see whether this is a good idea I can have a look into the implementation of Interpolations.
The most important counter argument is the speed for the use of EMD on cubes. There we don't need the metadata on every pixel, but rather on the whole cube.