Skip to content
This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

Latest commit

 

History

History
187 lines (122 loc) · 10.8 KB

README.md

File metadata and controls

187 lines (122 loc) · 10.8 KB

Image recognition training with TensorFlow Inception and transfer learning

Read this in other languages: 한국어.

Transfer learning is the process of taking a pre-trained model (the weights and parameters of a network that has been trained on a large dataset by somebody else) and “fine-tuning” the model with your own dataset. The idea is that this pre-trained model will act as a feature extractor. You will remove the last layer of the network and replace it with your own classifier (depending on what your problem space is). You then freeze the weights of all the other layers and train the network normally (Freezing the layers means not changing the weights during gradient descent/optimization). For this experiment we used Google's Inception-V3 pretrained model for Image Classification. This model consists of two parts:

  • Feature extraction part with a convolutional neural network.
  • Classification part with fully-connected and softmax layers.

The pre-trained Inception-v3 model achieves state-of-the-art accuracy for recognizing general objects with 1000 classes. The model extracts general features from input images in the first part and classifies them based on those features in the second part. We will use this pre-trained model and re-train it it to classify houses with or without swimming pools.

Flow

  1. The developer loads the provided notebook, which is run on a PowerAI system.
  2. A provided example dataset demonstrates a use case of recognizing images of houses with pools vs. houses without pools.
  3. The notebook uses the TensorFlow Inception model and retraining example to leverage the existing model and build a custom classifier.
  4. The notebook compares results using the original model and the new model.

Included components

  • IBM Power AI: A software platform that includes the most popular machine learning frameworks with IBM Power Systems.
  • IBM Power Systems: IBM Power Systems is IBM's Power Architecture-based server line, built with open technologies and designed for mission-critical applications.
  • Nimbix Cloud Computing Platform: An HPC & Cloud Supercomputing platform enabling engineers, scientists & developers, to build, compute, analyze, and scale simulations in the cloud

Featured technologies

  • Jupyter Notebooks: An open-source web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text.
  • Tensorflow: An open source software library for numerical computation using data flow graphs.

Watch the Video

Steps

Follow these steps to setup and run this Code Pattern. The steps are described in detail below.

  1. Get 24-hours of free access to the PowerAI platform
  2. Access and start the Jupyter notebook
  3. Run the notebook
  4. Analyze the results
  5. Save and share
  6. End your trial

1. Get 24-hours of free access to the PowerAI platform

IBM has partnered with Nimbix to provide cognitive developers a trial account that provides 24-hours of free processing time on the PowerAI platform. Follow these steps to register for access to Nimbix to try the PowerAI code patterns and explore the platform.

  • Go here and follow the instructions to register for your free trial.

  • Use the welcome page (or confirmation email) to determine when your container is "ACTIVE" and collect the following information:

    • IP Address (might be fully qualified domain name)
    • User Id
    • Password
  • Take the IP Address (FQDN) and use your local browser to go to https://<IP Address>.

  • Login with the User Id and Password.

    welcome

2. Access and start the Jupyter notebook

  • Get a new terminal window by clicking on the New pull-down and selecting Terminal.

    powerai-notebook-terminal

  • When using the free trial, a 4-hour timeout will cause you to lose data that is not in the /data directory. Create a patterns directory under /data and a symbolic link for that directory under /usr/local/samples/ as follows:

    mkdir /data/patterns
    ln -s /data/patterns /usr/local/samples/patterns
    
  • Use git clone to download the example notebook, dataset, and retraining library into /data/patterns:

    cd /data/patterns
    git clone https://github.com/IBM/powerai-transfer-learning
    

    powerai-notebook-clone

  • Once done, you can exit the terminal and return to the notebook browser. Use the Files tab. From the root folder, click on patterns, powerai-transfer-learning, notebooks, and then Classifying-House-And-Pool-Images.ipynb to open the notebook.

  • If your container is paused (after 4 hours) and you resume it, your data will still be under /data. Recreate the symbolic link for it to show up in the Jupyter files tree.

    ln -s /data/patterns /usr/local/samples/patterns
    

3. Run the notebook

When a notebook is executed, what is actually happening is that each code cell in the notebook is executed, in order, from top to bottom.

Each code cell is selectable and is preceded by a tag in the left margin. The tag format is In [x]:. Depending on the state of the notebook, the x can be:

  • A blank, this indicates that the cell has never been executed.
  • A number, this number represents the relative order this code step was executed.
  • A *, this indicates that the cell is currently executing.

There are several ways to execute the code cells in your notebook:

  • One cell at a time.
    • Select the cell, and then press the Play button in the toolbar.
  • Batch mode, in sequential order.
    • From the Cell menu bar, there are several options available. For example, you can Run All cells in your notebook, or you can Run All Below, that will start executing from the first cell under the currently selected cell, and then continue executing all cells that follow.

4. Analyze the results

When you run the "Main" code cell you can watch the training as the accuracy quickly improves. At the end, the final test accuracy is shown. We usually see somewhere around 85% accuracy with these images.

We captured the model before and after the training. Look at our example images at the bottom of the notebook and see our before and after results.

The results should go from no recognition ability at all to a pretty good success rate. You might find it interesting to try different images and see if you can identify why it has more difficulty classifying some images.

5. Save and share

How to save your work:

Because this notebook is running temporarily on a Nimbix Cloud server, use the following options to save your work:

Under the File menu, there are options to:

  • Download as... will download the notebook to your local system.
  • Print Preview will allow you to print the current state of the notebook.

6. End your trial

When you are done with your work, please cancel your subscription by visiting the Manage link on the My Products and Services page.

Sample Output

You can see a copy of the notebook including output here

Troubleshooting

See DEBUGGING.md.

Links

Learn more

  • Artificial Intelligence Code Patterns: Enjoyed this Code Pattern? Check out our other AI Code Patterns.
  • Data Analytics Code Patterns: Enjoyed this Code Pattern? Check out our other Data Analytics Code Patterns
  • AI and Data Code Pattern Playlist: Bookmark our playlist with all of our Code Pattern videos
  • Data Science Experience: Master the art of data science with IBM's Data Science Experience
  • PowerAI: Get started or get scaling, faster, with a software distribution for machine learning running on the Enterprise Platform for AI: IBM Power Systems

License

This code pattern is licensed under the Apache Software License, Version 2. Separate third party code objects invoked within this code pattern are licensed by their respective providers pursuant to their own separate licenses. Contributions are subject to the Developer Certificate of Origin, Version 1.1 (DCO) and the Apache Software License, Version 2.

Apache Software License (ASL) FAQ