From the course: Rust LLMOps
VS Code SSH to AWS accelerated - Rust Tutorial
- [Instructor] Rust is a great place to start for high-performance, modern LLMOps: it is one of the fastest languages in the world, it doesn't carry the legacy issues of a scripting language, and it can run inference with just one command using cargo. One problem, though, is where do you run it? If you need a very powerful GPU, that can be an issue, because not everybody has one. One way around this is remote development with VS Code. In this scenario, you first go to AWS and set up an EC2 instance, picking the right instance type. Then install VS Code and the Remote - SSH extension. Next, SSH into the EC2 instance. Then install Rust via the rustup command, which is just a one-liner. Then do a git clone of the Candle repo, and you're ready to run inference. Really, it's that easy, and you can do all of this from VS Code locally. So let's go ahead and do a demo. So one of the things that's a little bit…
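The steps listed in the transcript can be sketched in a few lines of Rust. This is only an illustration, not the instructor's demo: the `Ec2Host` struct, the `ssh_config_entry` and `remote_setup_commands` helpers, and every connection value (alias, host name, user, key path) are hypothetical placeholders. The sketch prints an OpenSSH client config stanza of the kind the VS Code Remote - SSH extension reads from `~/.ssh/config`, followed by the two commands the transcript mentions running on the instance: the rustup one-liner and the git clone of the Candle repo.

```rust
/// Hypothetical connection details for the EC2 instance.
/// None of these values come from the course; substitute your own.
struct Ec2Host<'a> {
    alias: &'a str,         // short name shown in the VS Code host picker
    host_name: &'a str,     // public DNS name or IP of the instance
    user: &'a str,          // login user (depends on the AMI you picked)
    identity_file: &'a str, // path to the private key for the instance
}

/// Render an OpenSSH client config stanza for the host.
/// Appending this to ~/.ssh/config lets Remote - SSH offer the alias.
fn ssh_config_entry(h: &Ec2Host) -> String {
    format!(
        "Host {}\n    HostName {}\n    User {}\n    IdentityFile {}\n",
        h.alias, h.host_name, h.user, h.identity_file
    )
}

/// Commands to run in the remote terminal once connected:
/// install Rust with rustup, then clone the Candle repository.
fn remote_setup_commands() -> Vec<&'static str> {
    vec![
        "curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh",
        "git clone https://github.com/huggingface/candle.git",
    ]
}

fn main() {
    // Placeholder values (assumptions), using a documentation-range IP.
    let host = Ec2Host {
        alias: "aws-gpu",
        host_name: "203.0.113.10",
        user: "ubuntu",
        identity_file: "~/.ssh/my-key.pem",
    };

    println!("# Add to ~/.ssh/config on your local machine:");
    print!("{}", ssh_config_entry(&host));

    println!("# Then run on the instance:");
    for cmd in remote_setup_commands() {
        println!("{cmd}");
    }
}
```

With the stanza in place, the alias can be selected from the Remote - SSH host list in VS Code, and the setup commands can be pasted into the integrated terminal of the remote window.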
Contents
- Function: The essence of programming (6m 48s)
- Operationalizing microservices (1m 57s)
- Continuous integration for microservices (6m 54s)
- What is a Makefile and how do you use it? (2m 41s)
- What is DevOps? (2m 29s)
- Kaizen methodology (4m 6s)
- Infrastructure as code for continuous delivery (2m 50s)
- Responding to compromised resources and workloads (4m 16s)
- Monitoring and logging (1m 47s)
- Auditing networks (1m 6s)
- Rust: Secure by design (4m 52s)
- Preventing data races with the Rust compiler (3m 29s)
- AWS config for security (4m 26s)
- Demo: AWS Security Hub (3m 39s)
- Securing accounts with 2FA (3m 11s)
- Access permissions overview (4m 4s)
- Repository permission levels (2m 37s)
- Repository privacy settings (2m 52s)
- Key concepts in the GitHub ecosystem (3m 43s)
- Demo: GitHub Actions (3m 50s)
- Demo: Codespaces (6m 8s)
- Demo: Copilot (8m 9s)
- Candle framework in Rust (2m 58s)
- GitHub Codespaces with GPU (5m 55s)
- VS Code SSH to AWS accelerated (5m 14s)
- Candle hello world (2m 56s)
- Exploring StarCoder in Rust (5m 54s)
- Whisper Candle transcriber (5m 51s)
- Exploring remote development on AWS (2m 10s)
- Rust for large language models (LLMs) (1m 56s)
- Serverless inference (1m 52s)
- Rust CLI inference (2m 2s)
- Rust chat inference (1m 59s)
- The continuous build binary (2m 6s)
- The chat loop with StarCoder (2m 4s)
- Invoke an LLM on an AWS G5 instance, part 1 (4m 36s)
- Invoke an LLM on an AWS G5 instance, part 2 (3m 1s)
- Rust-BERT introduction (1m 51s)
- Installing and setting up Rust-BERT (5m 38s)
- Basic syntax and model loading in Rust-BERT (2m 4s)
- Rust sentiment analysis in the CLI (4m 13s)
- Rust-PyTorch introduction (1m 54s)
- Rust-PyTorch hello world (2m 28s)
- PyTorch pretrained models (3m 39s)
- Running pretrained PyTorch models in Rust (6m 41s)
- Introduction to ONNX (1m 25s)
- ONNX conversions (2m 5s)
- Extending Google Bard (4m 22s)
- Exploring Google Colab with Bard (4m 22s)
- Exploring the Colab AI (4m 56s)
- Exploring the Google Cloud Generative AI App Builder (2m 29s)
- AWS Bedrock for responsible AI (4m 39s)
- AWS Bedrock with Claude (7m 20s)
- Summarizing text with Claude (5m 28s)
- Using the AWS Bedrock API (1m 39s)