Text this: Towards vertical urban geometry extraction: occlusion-reduced estimation from street view images using diffusion models