Summary
Dataset: 1000 pairs of GLB files, all processed by blender-script.py; each processed object contains rgba, pose, and intrinsics.npy.
Trained on: a Runpod instance with 8x A100 SXM GPUs (80 GB VRAM each).
The purpose of the fine-tuning was to overfit, since there is not enough data for now.
Problem: the resolution of the resulting GLB output differs greatly between the front and back sides.
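Before looking at the config, it may be worth ruling out incomplete data. The sketch below checks that every object directory has the three entries mentioned above. The layout `<root>/<uid>/rgba/*.png`, `<root>/<uid>/pose/*.npy`, `<root>/<uid>/intrinsics.npy` and the helper name `find_incomplete_objects` are assumptions for illustration, not something confirmed by blender-script.py.

```python
from pathlib import Path


def find_incomplete_objects(root, uids):
    """Return {uid: [missing entries]} for objects lacking expected files.

    Assumed (hypothetical) layout per object:
      <root>/<uid>/rgba/*.png, <root>/<uid>/pose/*.npy, <root>/<uid>/intrinsics.npy
    """
    problems = {}
    for uid in uids:
        obj_dir = Path(root) / uid
        missing = []
        # glob() on a missing directory yields nothing, so any() is False
        if not any((obj_dir / "rgba").glob("*.png")):
            missing.append("rgba/*.png")
        if not any((obj_dir / "pose").glob("*.npy")):
            missing.append("pose/*.npy")
        if not (obj_dir / "intrinsics.npy").is_file():
            missing.append("intrinsics.npy")
        if missing:
            problems[uid] = missing
    return problems
```

It could be called with the uids loaded from train_uids.json, for example `find_incomplete_objects("/root/OpenLRM/views", json.load(open("/root/OpenLRM/train_uids.json")))`, assuming that file is a flat JSON list of uid strings.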
Configuration
train-sample.yaml
experiment:
    type: lrm
    seed: 42
    parent: lrm-objaverse
    child: small-dummyrun

model:
    camera_embed_dim: 1024  # modified for the openlrm-mix-large-1.1's config.json
    rendering_samples_per_ray: 128  # modified for the openlrm-mix-large-1.1's config.json
    transformer_dim: 1024  # modified for the openlrm-mix-large-1.1's config.json
    transformer_layers: 16  # modified for the openlrm-mix-large-1.1's config.json
    transformer_heads: 16  # modified for the openlrm-mix-large-1.1's config.json
    triplane_low_res: 32
    triplane_high_res: 64
    triplane_dim: 80  # modified for the openlrm-mix-large-1.1's config.json
    encoder_type: dinov2
    encoder_model_name: dinov2_vitb14_reg  # modified for the openlrm-mix-large-1.1's config.json
    encoder_feat_dim: 768  # modified for the openlrm-mix-large-1.1's config.json
    encoder_freeze: false

dataset:
    subsets:
        -   name: objaverse
            root_dirs:
                - "/root/OpenLRM/views"  # my processed data directory
            meta_path:
                train: "/root/OpenLRM/train_uids.json"
                val: "/root/OpenLRM/val_uids.json"
            sample_rate: 1.0
    sample_side_views: 3
    source_image_res: 448  # modified for the higher resolution
    render_image:
        low: 128  # modified for the higher resolution
        high: 384  # modified for the higher resolution
        region: 128  # modified for the higher resolution
    normalize_camera: true
    normed_dist_to_center: auto
    num_train_workers: 4
    num_val_workers: 2
    pin_mem: true

train:
    mixed_precision: bf16
    find_unused_parameters: false
    loss:
        pixel_weight: 1.0
        perceptual_weight: 1.0
        tv_weight: 5e-4
    optim:
        lr: 4e-4
        weight_decay: 0.05
        beta1: 0.9
        beta2: 0.95
        clip_grad_norm: 1.0
    scheduler:
        type: cosine
        warmup_real_iters: 3000
    batch_size: 2  # modified since using higher resolution
    accum_steps: 8  # modified since using higher resolution
    epochs: 1000  # modified from 60 to 1000, for overfitting the insufficient data
    debug_global_steps: null

val:
    batch_size: 2  # modified since using higher resolution
    global_step_period: 1000
    debug_batches: null

saver:
    auto_resume: true
    load_model: "/root/OpenLRM/model.safetensors"  # this refers to openlrm-mix-large-1.1
    checkpoint_root: ./exps/checkpoints
    checkpoint_global_steps: 1000
    checkpoint_keep_level: 5

logger:
    stream_level: WARNING
    log_level: INFO
    log_root: ./exps/logs
    tracker_root: ./exps/trackers
    enable_profiler: false
    trackers:
        - tensorboard
    image_monitor:
        train_global_steps: 100
        samples_per_log: 4

compile:
    suppress_errors: true
    print_specializations: true
    disable: true
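To put the training settings in perspective, here is a rough back-of-the-envelope sketch of the schedule they imply. It assumes all 1000 objects are in the training split, that all 8 GPUs are used with data parallelism, and that one optimizer step consumes `num_gpus * batch_size * accum_steps` samples; how the trainer actually counts steps (and how it interprets `warmup_real_iters`) is not verified here.

```python
import math


def training_schedule(num_train_samples, num_gpus, batch_size, accum_steps, epochs):
    """Estimate samples per optimizer step, steps per epoch, and total steps."""
    samples_per_step = num_gpus * batch_size * accum_steps
    # Assumption: a partial final batch still counts as one step
    steps_per_epoch = math.ceil(num_train_samples / samples_per_step)
    total_steps = steps_per_epoch * epochs
    return samples_per_step, steps_per_epoch, total_steps


# Values taken from the summary and the train config; 1000 train samples is an assumption
samples_per_step, steps_per_epoch, total_steps = training_schedule(
    num_train_samples=1000, num_gpus=8, batch_size=2, accum_steps=8, epochs=1000
)
warmup_fraction = 3000 / total_steps  # warmup_real_iters from the config

print(samples_per_step, steps_per_epoch, total_steps, warmup_fraction)
```

Under these assumptions the run has about 8 optimizer steps per epoch and 8000 in total, so a 3000-step warmup would cover more than a third of training.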
result
training result
0.112
infer-l.yaml
source_size: 448  # modified to fit the fine-tuned model's source_image_res
source_cam_dist: 2.0
render_size: 384  # modified to fit the fine-tuned model's render_image high
render_views: 160
render_fps: 40
frame_size: 2
mesh_size: 384  # modified to fit the fine-tuned model's render_image high
mesh_thres: 3.0
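Since the inference settings are meant to mirror the training resolutions, a small consistency check can confirm that nothing drifted between the two files. This is only a sketch: the values are copied by hand from the two configs above into plain dicts, and the helper name `check_resolution_match` and the flattened key `render_image_high` are made up for illustration.

```python
# Values copied by hand from train-sample.yaml (dataset section)
train_cfg = {"source_image_res": 448, "render_image_high": 384}
# Values copied by hand from the inference config
infer_cfg = {"source_size": 448, "render_size": 384, "mesh_size": 384}


def check_resolution_match(train_cfg, infer_cfg):
    """Return a list of human-readable mismatches between the two configs."""
    mismatches = []
    if infer_cfg["source_size"] != train_cfg["source_image_res"]:
        mismatches.append("source_size != source_image_res")
    if infer_cfg["render_size"] != train_cfg["render_image_high"]:
        mismatches.append("render_size != render_image.high")
    return mismatches


print(check_resolution_match(train_cfg, infer_cfg))
```

With the values in this issue the list comes back empty, so the front/back difference is unlikely to be explained by a simple mismatch between these particular keys.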
inference result
input image
result video
result mesh (front)
result mesh (back)
As you can see above, the generated videos show little difference in resolution between views. However, when the mesh is imported into Blender, as shown in the images, there is a clear resolution difference between the front and back sides: the front side comes out at relatively low resolution, while the back side comes out at noticeably higher resolution.
Hi @ZexinHe, I’ve tagged you since you're the owner. Sorry for the inconvenience.
I would greatly appreciate it if you could let me know what I might be doing wrong and how I can fix this issue.