ZeoD_K: Unexpected Max Depth Values #139

HernandezEduin · 2025-01-22T06:29:33Z

Hi, I'm testing the ZoeDepth pretrained models on images captured by highway-side cameras. These images have ground truth depth values that span beyond 300 meters, which I understand exceeds the model's training range (max depth: 80.0 meters).

However, when running the pretrained KITTI model, I consistently get a maximum estimated depth of approximately 5 meters. Below is the code snippet I used:

import torch

zoe = torch.hub.load("isl-org/ZoeDepth", "ZoeD_K", pretrained=True)
predicted_depth = zoe.infer_pil(image, pad_input=False)  # Better 'metric' accuracy

For comparison, I also tested the NK model. It provides a more reasonable maximum depth estimate, ranging between 30–50 meters, which aligns better with the expected values from the model.Additionally, I tried using the KITTI model weights from Hugging Face
, but the results were similar.

Upon inspecting the model configurations, I noticed potential discrepancies in the uploaded weights:

"bin_configurations": [
{
"max_depth": 10.0,
"min_depth": 0.001,
"n_bins": 64,
"name": "nyu"
}

This configuration suggests a maximum depth of 10.0 meters, which might explain the observed behavior.

Questions/Concerns:

Is the issue related to incorrect configurations in the uploaded weights?
Could you reupload the weights of the KITTI model with a max depth of 80 meters?
Are there any suggestions for datasets that are beyond 100 meters?

The text was updated successfully, but these errors were encountered:

HernandezEduin · 2025-01-22T06:50:37Z

It seems to be a similar issue to #28 and #45, but hasn't been properly addressed.

zhangjy328 · 2025-03-27T06:35:56Z

Hello, I have also encountered the same problem. Can we understand that the maximum depth is a conversion ratio scale from relative distance to absolute distance? If I know the maximum depth of the actual image and input it into the model, does it mean that I can obtain an accurate absolute depth?

HernandezEduin · 2025-03-27T08:50:44Z

Hello, I have also encountered the same problem. Can we understand that the maximum depth is a conversion ratio scale from relative distance to absolute distance? If I know the maximum depth of the actual image and input it into the model, does it mean that I can obtain an accurate absolute depth?

From my understanding, the model returns the estimated absolute depth distance. You can try rescaling it accordingly, but it won't be perform that well. I'd recommend using NYU-KITTI weights instead. It performs better and more consistent, but still not good enough for outdoor (You'll notice this if you do a point cloud reconstruction of the scene with textures).

HernandezEduin · 2025-03-27T09:02:49Z

Additionally, you can refer to the response in their respective hugging face website link

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ZeoD_K: Unexpected Max Depth Values #139

ZeoD_K: Unexpected Max Depth Values #139

HernandezEduin commented Jan 22, 2025

HernandezEduin commented Jan 22, 2025

zhangjy328 commented Mar 27, 2025

HernandezEduin commented Mar 27, 2025

HernandezEduin commented Mar 27, 2025

ZeoD_K: Unexpected Max Depth Values #139

ZeoD_K: Unexpected Max Depth Values #139

Comments

HernandezEduin commented Jan 22, 2025

HernandezEduin commented Jan 22, 2025

zhangjy328 commented Mar 27, 2025

HernandezEduin commented Mar 27, 2025

HernandezEduin commented Mar 27, 2025