In this hands-on tutorial, we'll go through how to host a pre-trained Tensorflow model and the associated data on Algorithmia, learn how to deploy that model into production, and then call the algorithm once it's been deployed to make inferences.
This trained model is a basic shallow network for the purpose of demoing image classiciation of handwritten digits trained on the MNIST dataset. We'll be deploying the serialized pre-trained model into production and run the model using test data that the model wasn't trained on to receive the predicted handwritten digits of new images.
Clone this repository so you have access to the code and data files. While we will use the Algorithmia Web IDE in this demo, note that you can use the CLI to deploy your model instead once you've created your algorithm and cloned your repository.
Alternatively you can simply download the repository as a .zip and get the data and code from sample-apps/algo-dev-demo/tensorflow-mnist-cpu
In this demo, we are going to host our data on the Algorithmia platform in Data Collections.
You'll want to create a data collection to host your saved model and your test data:
-
Log in to your Algorithmia account and click your avatar which will show a dropdown of choices. Click "Manage Data"
-
Then in the left panel on the page of data collection options, go ahead and click "My Hosted Data"
-
Click on “New Collection” under the “My Hosted Data” section on your data collections page. Let's name ours "tensorflow_mnist_data"
-
After you create your collection you can set the read and write access on your data collection. We are going to select "Private" since only you will be calling your algorithm in this instance.
-
Now, let's put some data into your newly created data collection. You can either drag and drop the files in the folder: tensorflow-mnist-cpu/data or you can click "Drop files here to upload" from where you stored the repo on your computer.
Now we are ready to deploy our model.
- Click the "Plus" icon at the top right of the navbar
- Let's go through the form together to create our algorithm
- Click on the purple "Create Algorithm"
- Give it a name, pick "Python 3", and be sure to select "Standard Execution Env" (all other settings are optional)
Now that you have created your algorithm, you'll get a modal with information about using the CLI and Git. Every algorithm has a Git repo behind it so you can experiment with different I/O in development mode by calling the hash version.
- Click on the tab "Source" and you'll notice boilerplate code for Hello World.
- Let's delete that code, and copy and paste the code from the file demo.py
- Recall our data collection is called "tensorflow_mnist_data" and you'll need
to change "YOUR_USERNAME" to your own username:
file_path = 'data://YOUR_USERNAME/tensorflow_mnist_data/model.zip'
on line 41.
Then note that there is another file in the repo called loadmnistdata.py.
- Create a file in the directory structure to the left of your code in the Web IDE by clicking on "+ New File" and call it "loadmnistdata.py" then copy and paste from loadmnistdata.py.
- Click the "Dependencies" button in the grey navbar.
- Add Dependencies to the requirements.txt file under the ones that already exist, adding:
tensorflow
This will be a brief description of the code example including where to load the model.
Note you always want to initialize the model outside of the apply() function. This way, after the model is initially loaded, subsequent calls will be much faster within that session.
- Click the "Compile" button in the top right of the grey navbar
- Now test your code in the console by passing in the data file we stored in our data collection.
While the first commit and compile is occuring, this is a good opportunity to introduce where to find the documentation. Welcome to the Developer Center where you can find documentation on Python algorithm development and the full documentation on Tensorflow.
In the web console, paste in: {"mnist_images": "data://YOUR_USERNAME/tensorflow_mnist_data/t10k-images-idx3-ubyte.gz", "mnist_labels": "data://YOUR_USERNAME/tensorflow_mnist_data/t10k-labels-idx1-ubyte.gz"}
We'll cover adding your sample I/O, versioning, release notes, and best practices of creating your algorithms.