Explore AI Model Architectures
Get hands-on experience with AI model formats, sizes, and resource requirements using Ollama.
Lab Overview
In this lab, you'll explore the practical aspects of AI models from an infrastructure perspective.
You'll learn to:
- Understand different model formats (GGUF, SafeTensors) and when to use each
- Pull and inspect model metadata to understand resource requirements
- Compare inference speeds across different model sizes
- Make informed decisions about model selection for your infrastructure
Prerequisites:
- Basic Linux command line skills
- Understanding of system resources (memory, CPU)
Key Concepts:
- Model quantization levels (FP16, INT8, INT4)
- VRAM and RAM requirements
- Inference latency vs model capability tradeoffs
What You'll Learn
Identify different model formats and their use cases
Inspect model metadata to determine resource requirements
Compare inference performance across model sizes
Select appropriate models based on infrastructure constraints
Prerequisites
basic-linux-commands
understanding-system-resources
Technologies Covered
Part of a Course
This lab is part of the AI Foundations for Infrastructure Engineers course
View All CoursesChoose your plan
Simple, Transparent Pricing
One price, everything included
Monthly Plan
Access all content
Quarterly Plan
Save 16% with quarterly billing
Everything Included in Your Subscription
Content & Learning
- Access to all courses and bootcamps
- Video lessons with closed captions
- Interactive quizzes and assessments
- Course completion certificates
Hands-On Labs
- Browser-based cloud labs
- Pre-configured VMs ready to use
- Playgrounds for experiments
- Multi-VM realistic scenarios
AWS Integration
- Managed AWS Account included
- Pre-configured environments
- Real-world cloud scenarios
Support & Community
- Priority support
- Active community forum
No Setup Required
- Everything runs in your browser
- No software installation needed
- Automatic environment provisioning
- Works on any device
Ready to Get Started?
Start this hands-on lab and build real-world Platform Engineering skills
Get Access Now