
Multi-view to Novel view:
Synthesizing Novel Views with Self-Learned Confidence

Descriptions

This project is a TensorFlow implementation of Multi-view to Novel view: Synthesizing Novel Views with Self-Learned Confidence, which was published in ECCV 2018. We provide the code, datasets, and checkpoints.

In this work, we address the task of multi-view novel view synthesis, where we are interested in synthesizing a target image with an arbitrary camera pose from given source images.

illustration.png

We propose an end-to-end trainable framework that learns to exploit multiple viewpoints to synthesize a novel view without any 3D supervision. Specifically, our model consists of a flow prediction module (flow predictor) and a pixel generation module (recurrent pixel generator) to directly leverage information presented in source views as well as hallucinate missing pixels from statistical priors. To merge the predictions produced by the two modules given multi-view source images, we introduce a self-learned confidence aggregation mechanism. An illustration of the proposed framework is as follows.

model.jpg
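
As a rough illustration of the aggregation idea only (not the code in this repository), the sketch below blends the per-source warped predictions and the hallucinated pixels using softmax-normalized confidence maps; all tensor names, shapes, and the NumPy formulation are our own assumptions.

```python
import numpy as np

def aggregate_predictions(flow_preds, pixel_pred, flow_confs, pixel_conf):
    """Confidence-weighted blending of the two modules' outputs (illustrative only).

    flow_preds:  (N, H, W, 3) target views warped from each of the N source views.
    pixel_pred:  (H, W, 3) view hallucinated by the recurrent pixel generator.
    flow_confs:  (N, H, W, 1) self-learned confidence map for each warped view.
    pixel_conf:  (H, W, 1) self-learned confidence map for the generated view.
    """
    preds = np.concatenate([flow_preds, pixel_pred[None]], axis=0)   # (N+1, H, W, 3)
    confs = np.concatenate([flow_confs, pixel_conf[None]], axis=0)   # (N+1, H, W, 1)
    # Softmax over the N+1 candidate predictions at every pixel.
    exps = np.exp(confs - confs.max(axis=0, keepdims=True))
    weights = exps / exps.sum(axis=0, keepdims=True)
    return (weights * preds).sum(axis=0)                              # (H, W, 3) final view
```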

We evaluate our model on images rendered from 3D object models (ShapeNet) as well as real and synthesized scenes (KITTI and Synthia). We demonstrate that our model is able to achieve state-of-the-art results as well as progressively improve its predictions when more source images are available.

*This code is still being developed and subject to change.

Prerequisites

Datasets

All datasets are stored as HDF5 files, and the links are as follows. Each data point (HDF5 group) contains an image and its camera pose.
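
If you want to peek inside a downloaded file, a minimal h5py sketch such as the one below lists the contents of the first data point; the file path and internal key names are placeholders, since the exact layout is not documented here.

```python
import h5py

# Placeholder path; point this at the HDF5 file you placed under ./datasets/.
with h5py.File('./datasets/shapenet/data.hdf5', 'r') as f:
    first_key = next(iter(f.keys()))      # one data point = one HDF5 group
    group = f[first_key]
    for name, dset in group.items():      # e.g. the image and its camera pose
        print(name, dset.shape, dset.dtype)
```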

ShapeNet

shapenet_example.gif

  • Download from
  • Put the file in the ./datasets/shapenet directory.

KITTI

kitti_example.gif

  • Download from here (4.3GB)
  • Put the file in the ./datasets/kitti directory.

Synthia

synthia_example.gif

  • Download from here (3.3GB)
  • Put the file in the ./datasets/synthia directory.

Usage

After downloading the datasets, we can start to train models with the following command:

Train

$ python trainer.py  --batch_size 8 --dataset car --num_input 4
  • Selected arguments (see the trainer.py for more details)
    • --prefix: a nickname for the training
    • --dataset: choose among car, chair, kitti, and synthia. You can also add your own datasets.
    • Checkpoints: specify the path to a pre-trained checkpoint
      • --checkpoint: load all the parameters including the flow and pixel modules and the discriminator.
    • Logging
      • --log_step: the frequency of outputting log info ([train step 681] Loss: 0.51319 (1.896 sec/batch, 16.878 instances/sec))
      • --ckpt_save_step: the frequency of saving a checkpoint
      • --test_sample_step: the frequency of performing testing inference during training (default 100)
      • --write_summary_step: the frequency of writing TensorBoard summaries (default 100)
    • Hyperparameters
      • --num_input: the number of source images
      • --batch_size: the mini-batch size (default 8)
      • --max_steps: the max training iterations
    • GAN
      • --gan_type: the type of GAN losses such as LS-GAN, WGAN, etc
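
For example, a KITTI run that combines the arguments above might look like the following; the flag values are illustrative rather than recommended settings.

$ python trainer.py --prefix kitti_run --dataset kitti --num_input 2 --batch_size 8 --max_steps 200000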

Interpret TensorBoard

Launch TensorBoard and go to the specified port; you can see the different losses in the scalars tab and the plotted images in the images tab. The plotted images can be interpreted as follows.

TB.jpg
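
TensorBoard can be launched with the standard command below; the log directory is a placeholder for wherever the trainer writes its summaries.

$ tensorboard --logdir ./train_dir --port 6006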

Test

We can also evaluate trained models or the checkpoints provided by the authors with the following command:

$ python evaler.py --dataset car --data_id_list ./testing_tuple_lists/id_car_test.txt --loss [--train_dir /path/to/the/training/dir/ OR --checkpoint /path/to/the/trained/model] --write_summary --summary_file log_car.txt --plot_image --output_dir img_car
  • Selected arguments (see the evaler.py for more details)
    • Id list
      • --data_id_list: specify a list of data points that you want to evaluate
    • Task
      • --loss: report the loss
      • --plot_image: render the predicted images
    • Output
      • --quiet: only display the final report
      • --write_summary: write the summary of this evaluation as a text file
      • --summary_file: the path to the summary file
      • --output_dir: the output dir of plotted images
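
As an example, evaluating a downloaded checkpoint on ShapeNet chairs and writing only a summary file might look like the following; the id-list filename and checkpoint path are placeholders that follow the car example's naming.

$ python evaler.py --dataset chair --data_id_list ./testing_tuple_lists/id_chair_test.txt --loss --checkpoint /path/to/the/trained/model --write_summary --summary_file log_chair.txt --quiet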

Result

ShapeNet Cars

car.jpg

More results for ShapeNet cars (1k results randomly sampled from all 10k testing data)

ShapeNet Chairs

chair.jpg

More results for ShapeNet chairs (1k results randomly sampled from all 10k testing data)

Scenes: KITTI and Synthia

scene.jpg

Checkpoints

We provide checkpoints and evaluation report files of our models for all experiments.

Related work

Cite the paper

If you find this useful, please cite

@inproceedings{sun2018multiview,
  title={Multi-view to Novel view: Synthesizing Novel Views with Self-Learned Confidence},
  author={Sun, Shao-Hua and Huh, Minyoung and Liao, Yuan-Hong and Zhang, Ning and Lim, Joseph J},
  booktitle={European conference on computer vision},
  year={2018},
}

Authors

Shao-Hua Sun, Minyoung Huh, Yuan-Hong Liao, Ning Zhang, and Joseph J. Lim

