AWS SageMaker JumpStart offers ready-to-use solutions for various machine learning use cases, allowing users to quickly get started with machine learning. The platform provides pretrained, open-source models that can be customized and fine-tuned before deployment. Additionally, SageMaker JumpStart offers solution templates and example notebooks to help users set up infrastructure and implement machine learning projects with ease.
Accessing JumpStart
To access JumpStart in Amazon SageMaker Studio, users can navigate to the JumpStart landing page from either the Home page or the Home menu on the left-side panel. From the Home page, users can choose JumpStart in the Prebuilt and automated solutions pane. Alternatively, they can select a specific model in the JumpStart pane or choose Browse all JumpStart to access the SageMaker JumpStart landing page.
![](https://sonne.technology/media/02082023/1.png)
Utilizing JumpStart
On the SageMaker JumpStart landing page, users can explore a wide range of solutions, models, notebooks, and other resources.
Solutions:
The Solutions section allows users to launch comprehensive machine learning solutions that integrate with other AWS services. By selecting Explore All Solutions, users can view all available solutions.
![](https://sonne.technology/media/02082023/2.png)
For example, if you choose Demand Forecasting which uses historical time series data in order to make future estimations in relation to customer demand over a specific period and streamline the supply-demand decision-making process across businesses, just select Demand Forecasting with Deep Learning and launch it by pressing “Launch” button.
![](https://sonne.technology/media/02082023/3.png)
It takes a little time then your solution will be ready to use.
![](https://sonne.technology/media/02082023/4.png)
Foundation Models:
For users looking for foundation models, SageMaker JumpStart offers state-of-the-art models for various use cases such as content writing, code generation, question answering, copywriting, summarization, classification, and information retrieval.
![](https://sonne.technology/media/02082023/5.png)
These foundation models can be used as a starting point to build generative AI solutions tailored to specific business needs.
![](https://sonne.technology/media/02082023/6.png)
Machine Learning tasks:
To find a model based on a specific problem type, such as Image Classification, Image Embedding, Object Detection, or Text Generation, users can select Explore All Models on the JumpStart landing page. This allows them to search and browse through the available models. When a model is selected, users can access the model detail page, which provides in-depth information about the model. This includes details on what the model can do, the expected input and output types, and the data requirements for fine-tuning the model.
Problem types | Supports inference with pre-trained models | Trainable on a custom dataset | Supported frameworks |
---|---|---|---|
Image classification | Yes | Yes | PyTorch, TensorFlow |
Object detection | Yes | Yes | PyTorch, TensorFlow, MXNet |
Semantic segmentation | Yes | Yes | MXNet |
Instance segmentation | Yes | Yes | MXNet |
Image embedding | Yes | No | TensorFlow,MXNet |
Text classification | Yes | Yes | TensorFlow |
Sentence pair classification | Yes | Yes | TensorFlow, HuggingFace |
Question answering | Yes | Yes | PyTorch, HuggingFace |
Named entity recognition | Yes | No | HuggingFace |
Text summarization | Yes | No | HuggingFace |
Text generation | Yes | No | HuggingFace |
Machine Translation | Yes | No | HuggingFace |
Text embedding | Yes | No | TensorFlow,MXNet |
Tabular classification | Yes | Yes | LightGBM, CatBoost, XGBoost, AutoGluon-Tabular, TabTransformer, Linear Learner |
Tabular regression | Yes | Yes | LightGBM, CatBoost, XGBoost, AutoGluon-Tabular, TabTransformer, Linear Learner |
Model deployment configuration
After you choose a model, the model's tab opens. In the `Deploy Model` pane, choose`Deployment Configuration` to configure your model deployment.
![](https://sonne.technology/media/02082023/7.png)
You can also change the endpoint name, add `key;value` resource tags, activate or deactive the `jumpstart-` prefix for any JumpStart resources related to the model, and specify an Amazon S3 bucket for storing model artifacts used by your SageMaker endpoint.
![](https://sonne.technology/media/02082023/8.png)
Fine-Tuning data source
When you fine-tune a model, you can use the default dataset or choose your own data, which is located in an Amazon S3 bucket.
To browse the buckets available to you, choose `Find S3 bucket`. These buckets are limited by the permissions used to set up your Studio account. You can also specify an Amazon S3 URI by choosing `Enter Amazon S3 bucket location`.
![](https://sonne.technology/media/02082023/9.png)
For text models:
- The bucket must have a data.csv file.
- The first column must be a unique integer for the class label. For example: 1, 2, 3, 4, n
- The second column must be a string.
- The second column should have the corresponding text that matches the type and language for the model.
For vision models:
- The bucket must have as many subdirectories as the number of classes.
- Each subdirectory should contain images that belong to that class in `.jpg` format.
Conclusion
Overall, AWS SageMaker JumpStart streamlines the process of getting started with machine learning by offering pretrained models, solution templates, and example notebooks. This allows users to quickly implement machine learning projects and customize models to suit their business needs, ultimately accelerating the development and deployment of AI solutions.