I am currently fine-tuning the Metric3D-small model on a drone perspective dataset and have several technical questions about the training process.
Questions
1. Learning Rate Configuration
Is the initial learning rate of 1e-6 (mentioned in the paper) applied equally to both encoder and decoder?
Do I need to scale the learning rate based on the batch size?
Are there recommended parameter settings specific to the small model?
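To make the batch-size question concrete, the rule I have in mind is the standard linear scaling rule; a minimal sketch (the reference batch size of 8 is my own placeholder, not a value from the paper):

```python
import math

# Linear LR scaling rule sketch: lr = base_lr * (batch_size / base_batch_size).
# base_batch=8 is an assumed reference, not a confirmed Metric3D setting.
def scaled_lr(base_lr: float, base_batch: int, batch: int) -> float:
    return base_lr * batch / base_batch

# e.g. the paper's 1e-6 at an assumed reference batch of 8, training at 32:
print(scaled_lr(1e-6, 8, 32))  # 4e-06
```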
2. Normal Branch Loss
In my training dataset, some data includes normal annotations. Should I enable NormalBranchLoss?
When I enable NormalBranchLoss, the loss becomes negative after training for some time. What could be causing this?
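To illustrate what I mean by a negative loss: my assumption (not verified against the repo) is that the normal loss contains an unshifted cosine term, which is bounded below by -1 rather than 0. A toy sketch of the two variants:

```python
import math

# Hypothetical sketch: an angular loss written as -cos(angle) goes negative as
# predictions align with ground truth; the shifted form 1 - cos stays >= 0.
def cos_loss(pred, gt, shifted=False):
    dot = sum(p * g for p, g in zip(pred, gt))
    norm = math.sqrt(sum(p * p for p in pred)) * math.sqrt(sum(g * g for g in gt))
    cos = dot / norm
    return 1.0 - cos if shifted else -cos

# Perfectly aligned normals:
print(cos_loss([0.0, 0.0, 1.0], [0.0, 0.0, 1.0]))                # -1.0
print(cos_loss([0.0, 0.0, 1.0], [0.0, 0.0, 1.0], shifted=True))  # 0.0
```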
3. Depth Scaling and Max Value
Because absolute depth values from a drone perspective are relatively large, max_val in RAFTDepthNormalDPT5 needs to be set to a larger value. Does this mean regress_scale needs a corresponding adjustment? What is the relationship between these two settings?
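My mental model of the relationship, which may well be wrong and is the thing I'd like confirmed: the head regresses a value on the order of regress_scale, and max_val maps it to metric depth, so enlarging max_val alone stretches each regressed unit over more metres. A toy sketch of that assumed mapping:

```python
# Toy sketch of my *assumed* relationship (not verified against the repo):
# the head outputs r roughly in [0, regress_scale]; metric depth is then
# r / regress_scale * max_val. Growing max_val (drone altitudes) with a fixed
# regress_scale means each unit of regressed output spans more metres.
def to_metric(r: float, regress_scale: float = 1.0, max_val: float = 200.0) -> float:
    return r / regress_scale * max_val

print(to_metric(0.5, regress_scale=1.0, max_val=200.0))  # 100.0
```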
4. Depth Normalization Challenges
After applying the LabelScaleCononical and RandomResize transforms, depth labels can exceed the original depth values, so max_val (depth_normalize) cannot be set from the original dataset's depth range. Is the only solution to pre-calculate max_val based on the transform pipeline?
I've tried adaptive max_val, but it leads to NaN gradients. Is an adaptive approach feasible?
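The pre-calculation I'm considering looks like the following; the scale-factor values are purely illustrative of my own config, since the actual factor from LabelScaleCononical depends on the canonical camera setup:

```python
# Illustrative worst-case max_val pre-calculation over the augmentation
# pipeline. The factor values are assumptions for my config, not repo defaults.
def pipeline_max_val(raw_max_depth: float,
                     canonical_scale_max: float,
                     resize_scale_max: float) -> float:
    # Worst case: both transforms scale the depth labels up simultaneously.
    return raw_max_depth * canonical_scale_max * resize_scale_max

# e.g. raw depths up to 300 m, canonical scaling up to 1.5x, resize up to 1.2x:
print(pipeline_max_val(300.0, 1.5, 1.2))  # 540.0
```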
Additional Context
Model: Metric3D-small
Dataset: Drone perspective datasets such as WildUAV and Mid-Air.
I would appreciate insights into these technical challenges. If my understanding is incorrect, please provide guidance. Thank you in advance!