Two questions about inferring the lightstereo model

Hi, I have two questions while running inference with the lightstereomodel. 
### 1. Is Image Normalization missing in the config file? 
It seems the current configuration file for KITTI 2015 evaluation is missing the image normalization step:
{ NAME: NormalizeImage, MEAN: [ 0.485, 0.456, 0.406 ], STD: [ 0.229, 0.224, 0.225 ] }
Reference: https://github.com/XiandaGuo/OpenStereo/blob/5f0134cd7297e5ebc353d1f4c3f17aecdd6bfa84/cfgs/kitti15_eval.yaml#L12-L14
Without normalization, the results are poor:
<img width="1089" height="76" alt="Image" src="https://github.com/user-attachments/assets/d10a5b70-8ba5-4c0a-bf86-38ac9d3c1c99" />

After adding normalization, the results significantly improve and match the LightStereo-S (Ours) metrics reported in Table VII of the paper:
<img width="1101" height="82" alt="Image" src="https://github.com/user-attachments/assets/67485492-7013-4597-9234-036739a466d6" />

### 2. Performance discrepancy: 
Result of LightStereo-S-KITTI.ckpt is better than the paper.

My inference command:
<img width="838" height="96" alt="Image" src="https://github.com/user-attachments/assets/78cf4938-2600-406d-8844-f3c3739da152" />

My inference result:
<img width="1095" height="126" alt="Image" src="https://github.com/user-attachments/assets/3b428805-113e-421d-b6ae-3365c2bb42b0" />

The output shows D1_all = 1.3604, while the paper reports 2.30 for the same metric:
<img width="636" height="481" alt="Image" src="https://github.com/user-attachments/assets/85ee83aa-0672-42b6-8f10-65dd98ead341" />

Could you please clarify if I have misinterpreted the evaluation setup or if there is a specific reason for this performance gap?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Two questions about inferring the lightstereo model #293

1. Is Image Normalization missing in the config file?

2. Performance discrepancy:

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

	- { NAME: RightTopPad, SIZE: [ 384, 1248 ]}
	- { NAME: TransposeImage }
	- { NAME: ToTensor }

Two questions about inferring the lightstereo model #293

Description

1. Is Image Normalization missing in the config file?

2. Performance discrepancy:

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions