Pull Command
docker pull intel/object-detection:tf-1.15.2-preprocess-coco-val
Description
The Common Objects in Context (COCO) dataset validation images are used for inference with object detection models.
The preprocess_coco_val.sh script calls the create_coco_tf_record.py script from the TensorFlow Model Garden to convert the raw images and annotations to TF records. The version of the conversion script that you need depends on which model you're running. The table below has Git commit IDs for the TensorFlow Model Garden that have been tested with each model.
Model | Git Commit ID |
---|---|
Faster R-CNN | 7a9934df2afdf95be9405b4e9f1f2480d748dc40 |
R-FCN | 1efe98bb8e8d98bbffc703a90d88df15fc2ce906 |
SSD-MobileNet | 7a9934df2afdf95be9405b4e9f1f2480d748dc40 |
SSD-ResNet34 | 1efe98bb8e8d98bbffc703a90d88df15fc2ce906 |
Prerequisites
Prior to running the script, you must download and extract the COCO validation images and annotations from the COCO website.
export DATASET_DIR=<directory where raw images/annotations will be downloaded>
mkdir -p $DATASET_DIR
cd $DATASET_DIR
wget http://images.cocodataset.org/zips/val2017.zip
unzip val2017.zip
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip
unzip annotations_trainval2017.zip
Set following environment variables are expected by the script:
DATASET_DIR
: Parent directory of the val2017 raw images and annotations filesOUTPUT_DIR
: Directory where the TF records file will be written
Follow the instructions below to run the script in the docker container, if the model that you are running requires the dataset to be in the TF records format.
How to Use the Docker* Container
-
This container includes the prerequisites needed to run the dataset preprocessing script. You will need to mount volumes for the dataset (raw images and annotations) and the output directory (the location where the TF records file will be written), and set the
TF_MODELS_BRANCH
environment variable to the git commit id for the TensorFlow Model Garden.export DATASET_DIR=<Parent directory of the val2017 raw images and annotations files> export OUTPUT_DIR=<directory where TF records will be written> export TF_MODELS_BRANCH=<git commit id> docker run \ --env VAL_IMAGE_DIR=${DATASET_DIR}/val2017 \ --env ANNOTATIONS_DIR=${DATASET_DIR}/annotations \ --env TF_MODELS_BRANCH=${TF_MODELS_BRANCH} \ --env OUTPUT_DIR=${OUTPUT_DIR} \ -v ${DATASET_DIR}:${DATASET_DIR} \ -v ${OUTPUT_DIR}:${OUTPUT_DIR} \ -t intel/object-detection:tf-1.15.2-preprocess-coco-val
After the script completes, the
OUTPUT_DIR
will have a TF records file for the coco validation dataset:$ ls $OUTPUT_DIR coco_val.record
Documentation and Sources
Get Started
Docker* Repository
Main GitHub*
Readme
Release Notes
Get Started Guide
Code Sources
Dockerfile
Report Issue
License Agreement
LEGAL NOTICE: By accessing, downloading or using this software and any required dependent software (the “Software Package”), you agree to the terms and conditions of the software license agreements for the Software Package, which may also include notices, disclaimers, or license terms for third party software included with the Software Package. Please refer to the license file for additional details.