Is it possible, in Node, to combine several detection requests on one image into a single call,
instead of calling methods like these separately and handling each output myself?
vision.faceDetection()
vision.documentTextDetection()
vision.safeSearchDetection()
...
The goal is to get one combined JSON result, like the one shown in the try-out section at https://cloud.google.com/vision/
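
For reference, here is a rough sketch of what I imagine a combined call could look like. It assumes the `@google-cloud/vision` package and its `ImageAnnotatorClient.annotateImage()` method, which takes a list of feature types in one request. The helper names `buildRequest` and `annotateAll` and the path `./image.jpg` are made up for illustration, and I have not verified this against every client version.

```javascript
// Feature type names as listed in the Vision API's Feature.Type enum.
const FEATURE_TYPES = [
  'FACE_DETECTION',
  'DOCUMENT_TEXT_DETECTION',
  'SAFE_SEARCH_DETECTION',
];

// Hypothetical helper: build one request object that asks for several features.
// Assumes the Node client accepts a local file via image.source.filename.
function buildRequest(imagePath, featureTypes = FEATURE_TYPES) {
  return {
    image: { source: { filename: imagePath } },
    features: featureTypes.map((type) => ({ type })),
  };
}

// Hypothetical helper: `client` is expected to be an ImageAnnotatorClient.
// annotateImage() resolves to an array whose first element is the response.
async function annotateAll(client, imagePath) {
  const [result] = await client.annotateImage(buildRequest(imagePath));
  return result; // e.g. faceAnnotations, fullTextAnnotation, safeSearchAnnotation
}

// Usage (needs the package installed and credentials configured):
//   const vision = require('@google-cloud/vision');
//   const client = new vision.ImageAnnotatorClient();
//   annotateAll(client, './image.jpg')
//     .then((result) => console.log(JSON.stringify(result, null, 2)));
```

If this works as assumed, the single response object would carry one field per requested feature, which is roughly the shape of the JSON in the try-out section.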