Tips & Tricks
02 August 2023
Accelerate Your ML Projects with AWS SageMaker JumpStart: Pretrained Models at Your Fingertips
Blog Hero

AWS SageMaker JumpStart offers ready-to-use solutions for various machine learning use cases, allowing users to quickly get started with machine learning. The platform provides pretrained, open-source models that can be customized and fine-tuned before deployment. Additionally, SageMaker JumpStart offers solution templates and example notebooks to help users set up infrastructure and implement machine learning projects with ease.

Accessing JumpStart

To access JumpStart in Amazon SageMaker Studio, users can navigate to the JumpStart landing page from either the Home page or the Home menu on the left-side panel. From the Home page, users can choose JumpStart in the Prebuilt and automated solutions pane. Alternatively, they can select a specific model in the JumpStart pane or choose Browse all JumpStart to access the SageMaker JumpStart landing page.

Utilizing JumpStart

On the SageMaker JumpStart landing page, users can explore a wide range of solutions, models, notebooks, and other resources.

Solutions:

The Solutions section allows users to launch comprehensive machine learning solutions that integrate with other AWS services. By selecting Explore All Solutions, users can view all available solutions.

For example, if you choose Demand Forecasting which uses historical time series data in order to make future estimations in relation to customer demand over a specific period and streamline the supply-demand decision-making process across businesses, just select Demand Forecasting with Deep Learning and launch it by pressing “Launch” button.

It takes a little time then your solution will be ready to use.

Foundation Models:

For users looking for foundation models, SageMaker JumpStart offers state-of-the-art models for various use cases such as content writing, code generation, question answering, copywriting, summarization, classification, and information retrieval.

These foundation models can be used as a starting point to build generative AI solutions tailored to specific business needs.

Machine Learning tasks:

To find a model based on a specific problem type, such as Image Classification, Image Embedding, Object Detection, or Text Generation, users can select Explore All Models on the JumpStart landing page. This allows them to search and browse through the available models. When a model is selected, users can access the model detail page, which provides in-depth information about the model. This includes details on what the model can do, the expected input and output types, and the data requirements for fine-tuning the model.

Problem types Supports inference with pre-trained models Trainable on a custom dataset Supported frameworks
Image classification Yes Yes PyTorch, TensorFlow
Object detection Yes Yes PyTorch, TensorFlow, MXNet
Semantic segmentation Yes Yes MXNet
Instance segmentation Yes Yes MXNet
Image embedding Yes No TensorFlow,MXNet
Text classification Yes Yes TensorFlow
Sentence pair classification Yes Yes TensorFlow, HuggingFace
Question answering Yes Yes PyTorch, HuggingFace
Named entity recognition Yes No HuggingFace
Text summarization Yes No HuggingFace
Text generation Yes No HuggingFace
Machine Translation Yes No HuggingFace
Text embedding Yes No TensorFlow,MXNet
Tabular classification Yes Yes LightGBM, CatBoost, XGBoost, AutoGluon-Tabular, TabTransformer, Linear Learner
Tabular regression Yes Yes LightGBM, CatBoost, XGBoost, AutoGluon-Tabular, TabTransformer, Linear Learner

Model deployment configuration

After you choose a model, the model's tab opens. In the `Deploy Model` pane, choose`Deployment Configuration` to configure your model deployment.

You can also change the endpoint name, add `key;value` resource tags, activate or deactive the `jumpstart-` prefix for any JumpStart resources related to the model, and specify an Amazon S3 bucket for storing model artifacts used by your SageMaker endpoint.

Fine-Tuning data source

When you fine-tune a model, you can use the default dataset or choose your own data, which is located in an Amazon S3 bucket.

To browse the buckets available to you, choose `Find S3 bucket`. These buckets are limited by the permissions used to set up your Studio account. You can also specify an Amazon S3 URI by choosing `Enter Amazon S3 bucket location`.

For text models:

For vision models:

Conclusion

Overall, AWS SageMaker JumpStart streamlines the process of getting started with machine learning by offering pretrained models, solution templates, and example notebooks. This allows users to quickly implement machine learning projects and customize models to suit their business needs, ultimately accelerating the development and deployment of AI solutions.

Continue Reading