Real-time interfaces
Here are the key steps to deploy real-time interfaces in SageMaker:
Prepare Model
Package your trained model with inference code
Create model.tar.gz containing the model artifacts and inference.py script
Create SageMaker Model
Use CreateModel API/console to register your model
Specify container image and model artifacts location in S3
Create Endpoint Configuration
Define instance type (e.g. ml.t2.medium)
Set number of instances for scaling
Configure production variants if A/B testing
Create Endpoint
Deploy using the endpoint configuration
Wait for status to become "InService"
Get endpoint name/URL for making predictions
Invoke Endpoint
Use InvokeEndpoint API for real-time predictions
Send properly formatted input data
Get predictions in response
The minimum required components are:
Last updated
Was this helpful?