31. Model Deployment#
Once a model is trained, it can then be deployed. Deployment strategies vary based on use cases and needs. Some examples:
In Applications β Models can be integrated into mobile, desktop, or web apps for real-time predictions.
On Edge (on device) Deployment β Running models on local devices without needing internet connectivity.
API Deployment β Hosted on a server, allowing external applications and users to send requests and receive predictions via an API.
Via Batch Processing Pipelines β Running large datasets through the model at scheduled intervals.
In Active Learning & Model-Assisted Labeling Systems β Humans review model-generated predictions, verifying or correcting them to quickly build more ground truth data. This feedback loop helps retrain and improve the model over time.
In a Semi-Supervised Learning Pipeline A mix of labeled and unlabeled data is used to improve performance, reducing the need for fully annotated datasets.
Self-Supervised Learning Pipeline The model learns patterns from data without human-labeled annotations, useful for feature extraction and pretraining.
Each deployment approach has trade-offs in terms of latency, cost, scalability, and human oversight.