Quality-controlled audio-visual depth in stereoscopic 3D media