TransferSeg

Caffe implementation of our method for transferring knowledge from seen objects in images to unseen objects in videos.
Contact: Yi-Wen Chen (chenyiwena at gmail dot com)

Paper

Unseen Object Segmentation in Videos via Transferable Representations
Yi-Wen Chen, Yi-Hsuan Tsai, Chu-Ya Yang, Yen-Yu Lin and Ming-Hsuan Yang
Asian Conference on Computer Vision (ACCV), 2018 (oral)
Best Student Paper Award Honorable Mention

Please cite our paper if you find it useful for your research.

@inproceedings{Chen_TransferSeg_2018,
  author = {Y.-W. Chen and Y.-H. Tsai and C.-Y. Yang and Y.-Y. Lin and M.-H. Yang},
  booktitle = {Asian Conference on Computer Vision (ACCV)},
  title = {Unseen Object Segmentation in Videos via Transferable Representations},
  year = {2018}
}

Installation

Install Caffe (http://caffe.berkeleyvision.org/).
Install MATLAB.
Clone this repository:

git clone https://github.com/wenz116/TransferSeg.git
cd TransferSeg

Prepare for MBS

Go to the folder utils/MBS/mex.
Modify the OpenCV include and library paths in compile.m (Linux) or compile_win.m (Windows).
Run compile (Linux) or compile_win (Windows) in MATLAB.

Dataset

Download the PASCAL VOC Dataset as the source image dataset, and put it in the data/PASCAL/VOC2011 folder.
Download the DAVIS Dataset as the target video dataset, and put it in the data/DAVIS folder.
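
Before moving on to training, it can help to confirm that the datasets sit where the scripts expect them. The following is a minimal sketch, not part of the repository: the helper name `missing_dirs` is hypothetical, and it checks only the two folders named above, relative to the repository root.

```python
from pathlib import Path

# Directories named in this README, relative to the repository root.
EXPECTED_DIRS = [
    "data/PASCAL/VOC2011",  # source image dataset
    "data/DAVIS",           # target video dataset
]


def missing_dirs(repo_root="."):
    """Return the expected directories that do not exist under repo_root."""
    root = Path(repo_root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    missing = missing_dirs()
    if missing:
        print("Missing dataset folders:")
        for d in missing:
            print("  " + d)
    else:
        print("Dataset folders found.")
```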

Training

Download the FCN model pre-trained on PASCAL VOC, and put it in the nets folder.
Go to the folder scripts.

Compute the optical flow of the input video. Run compute_optical_flow('<VIDEO_NAME>') in MATLAB. The optical flow images will be saved to data/DAVIS/Motion/480p/<VIDEO_NAME>/.
Compute the motion prior of the input video via minimum barrier distance. Run get_prior('<VIDEO_NAME>') in MATLAB. The motion prior images will be saved to data/DAVIS/Prior/480p/<VIDEO_NAME>/.
Extract features of each category in PASCAL VOC. The extracted features will be saved to cache/features/, named features_PASCAL_<CLASS_NAME>_fc7.p.

python get_feature_PASCAL.py <GPU_ID>
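
For intuition about the motion prior step above, here is a minimal pure-Python sketch of the minimum barrier distance, approximated by alternating raster scans. It is an illustration only and not the MEX code in utils/MBS: the function name, the `passes` parameter, and the choice of input (any 2-D map of values, such as a flow magnitude) are assumptions made for this example.

```python
def minimum_barrier_distance(image, seeds, passes=3):
    """Approximate the minimum barrier distance (MBD) by raster scanning.

    image: 2-D list of numbers (e.g. one value per pixel).
    seeds: 2-D list of booleans; True marks seed pixels (distance 0).
    The barrier cost of a path is max(values) - min(values) along it;
    the MBD of a pixel is the smallest such cost over paths to any seed.
    """
    h, w = len(image), len(image[0])
    inf = float("inf")
    dist = [[0.0 if seeds[y][x] else inf for x in range(w)] for y in range(h)]
    high = [row[:] for row in image]  # max value on the current best path
    low = [row[:] for row in image]   # min value on the current best path

    def relax(y, x, ny, nx):
        # Try to extend the best path of neighbour (ny, nx) to pixel (y, x).
        if not (0 <= ny < h and 0 <= nx < w) or dist[ny][nx] == inf:
            return
        new_high = max(high[ny][nx], image[y][x])
        new_low = min(low[ny][nx], image[y][x])
        if new_high - new_low < dist[y][x]:
            dist[y][x] = new_high - new_low
            high[y][x], low[y][x] = new_high, new_low

    for i in range(passes):
        forward = i % 2 == 0
        ys = range(h) if forward else range(h - 1, -1, -1)
        xs = range(w) if forward else range(w - 1, -1, -1)
        step = -1 if forward else 1  # look at already-visited neighbours
        for y in ys:
            for x in xs:
                relax(y, x, y + step, x)  # upper (or lower) neighbour
                relax(y, x, y, x + step)  # left (or right) neighbour
    return dist
```

Marking the image border as seeds is one common choice for this kind of prior: pixels that can only be reached from the border by crossing a large change in value receive a large distance. Whether get_prior uses exactly this seeding is not stated here.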

Extract features of the input video. The extracted features will be saved to cache/features/, named features_DAVIS_<VIDEO_NAME>_fc7.p.

python get_feature_DAVIS.py <GPU_ID> <VIDEO_NAME>
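
To inspect the extracted features, a sketch along these lines may be useful. It assumes the .p files are Python pickles (suggested by the extension but not confirmed here) and that the working directory is the one containing cache/; the helper names `feature_path` and `load_features` are hypothetical, and the structure of the loaded object is not documented in this README.

```python
import pickle
from pathlib import Path

FEATURE_DIR = Path("cache/features")  # output folder named in this README


def feature_path(dataset, name, layer="fc7"):
    """Build the feature file path, e.g. features_DAVIS_<VIDEO_NAME>_fc7.p."""
    return FEATURE_DIR / "features_{}_{}_{}.p".format(dataset, name, layer)


def load_features(path):
    """Load one feature file, assuming it is a pickle (suggested by '.p')."""
    with open(path, "rb") as f:
        # encoding="latin1" lets Python 3 read pickles written by Python 2.
        return pickle.load(f, encoding="latin1")
```

For example, `load_features(feature_path("DAVIS", "bear"))` would try to read cache/features/features_DAVIS_bear_fc7.p, where "bear" stands in for whichever video name was passed to get_feature_DAVIS.py.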

Segment mining. The selected segments will be saved to data/DAVIS/Train/480p/<VIDEO_NAME>/.

python get_score.py <GPU_ID> <VIDEO_NAME>

Self learning. The trained models will be saved to output/snapshot/.

./train.sh <GPU_ID> <VIDEO_NAME>
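
The Python and shell steps above can be chained for one video with a small driver such as the sketch below. It is not part of the repository: the function names are hypothetical, and it assumes it is run from the scripts folder after the two MATLAB steps (optical flow and motion prior) have already finished.

```python
import subprocess


def training_commands(gpu_id, video_name):
    """Return the Python and shell steps from this README, in order."""
    gpu = str(gpu_id)
    return [
        ["python", "get_feature_PASCAL.py", gpu],
        ["python", "get_feature_DAVIS.py", gpu, video_name],
        ["python", "get_score.py", gpu, video_name],
        ["./train.sh", gpu, video_name],
    ]


def run_training(gpu_id, video_name, dry_run=True):
    """Print each step; execute it too when dry_run is False."""
    for cmd in training_commands(gpu_id, video_name):
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing step.
            subprocess.run(cmd, check=True)
```

Calling `run_training(0, "bear")` only prints the four commands; pass `dry_run=False` to execute them. Since the PASCAL features do not depend on the video, that first step could be skipped on later runs if its output already exists in cache/features/.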

Note

The model and code are available for non-commercial research purposes only.

12/2018: code released
