I’d use the first part of the tutorial to match simple 3d geometry to your image. You don’t have to animate the camera or distort the image if you don’t need to.
Once you have 3d geometry matched from your camera’s perspective, it should be easier and more accurate to place objects in your scene. Having the correct focal length here should help, especially on extremely wide lenses.
The reason I think there is no one click solution for this is that a 3d match moving program requires some motion in order to solve a camera using optical flow. An image simply doesn’t provide this information.