The Quickstart is found in the module test_quickstarts:
This file can be run as a test or as a standalone application.
The entrypoint of the Quickstart is Quickstart.run().
- Define the datasource.
- Define the model.
- Create the Context.
- The Context object is the orchestrator of all tasks in the library. It is also the central point for all calls to ML tasks. This centralizes logging, intrumentation, and error handling.
- Prepare the data.
- High dimensional vectors cannot be plotted with standard 2D and 3D plots. If you would like to graph the data, use the "features" parameter to Context.prepare_data(features) to limit the number of features.
- Train the model.
- Evaluate the model.
- Look at the returns Score object. The R Squared value is 0.98.
- You'll notice that if you are using only 2 features for better plotting the accuracy of the logistic regression is on 86%. But if you use all features, the model is better fitted to the data and achieves 97% accuracy.
- Make predictions.
- Compare the predictions, returned in the array Prediction.values, with the actual labels in DataContainer.y_test.
Notice that linear regression quickstart has an optional parameter to select the ML framework that the model will use. If you pass 'tensorflow' in this parameter, the model will be built to use a TensorFlow implementation.
quickstart.run_linear_regression(plot=True, framework='tensorflow')
- Evaluating the model.
- Tuning the model.
- These are often repeated to find the optimum model configuration.