GitHub - avasid/Image-Object-Localization: Image Object Localization by ResNet-18 using tensorflow and keras

Description

Given a picture with a bird, we are supposed to box the bird.

In src/data directory, images.txt is the index of all images, bouding_boxex.txt is the label box of all images and images contains all images. Box data make up of 4 data: the top left corner coordinate of box, width of box and height of box.

Neural Network

For traditional CNN and FC, it will meet degeneration problems when layers go deep.

In paper Deep Residual Learning for Image Recognition, they try to solve this problem by using a Residual Block:

These blocks compose ResNet:

I use ResNet-18 in this project by adding a 4-dimension layer after ResNet-18 to predict box's x, y ,w and h.

Loss: smooth l1 loss

Metric: IoU of groound truth and prediction, threshold=0.75

Train

Resize all images to square dimensions (224*224*3 recommended)

Then normalize and standardize all pixel channel.

Split all data into 0.75 training data and 0.25 tesing data. Train network on training data using batch size=128, epoch=100 and validation split ratio=0.1

Training result:

Testing result:

Examples

Red box represents ground truth and green box is the prediction of network.

Failed example:

Attention

You should keep the directory structure.

Dependency

python 3.6

Run

Run pip install -r requirements.txt

In git root directory:

python object_localization.py to run master script.

Follow the instructed options.

Reference

Deep Residual Learning for Image Recognition: https://arxiv.org/pdf/1512.03385.pdf

Author

CKCZZJ

Licence

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
img		img
prediction		prediction
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
object_localization.py		object_localization.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Description

Neural Network

Train

Examples

Attention

Dependency

Run

Reference

Author

Licence

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

avasid/Image-Object-Localization

Folders and files

Latest commit

History

Repository files navigation

Description

Neural Network

Train

Examples

Attention

Dependency

Run

Reference

Author

Licence

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages