This three-day instructor-led course teaches participants techniques for monitoring, troubleshooting, and improving infrastructure and application performance in Google Cloud. Guided by the principles of Site Reliability Engineering (SRE), and using a combination of presentations, demos, hands-on labs, and real-world case studies, attendees gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, tracing application performance bottlenecks, and profiling CPU and memory usage.

Audience

This class is intended for the following participants:

Cloud architects, administrators, and SysOps personnel
Cloud developers and DevOps personnel

We can organize this training at your preferred date and location. Contact Us!

Prerequisites

To get the most out of this course, participants should have:

Google Cloud Platform Fundamentals: Core Infrastructure or equivalent experience
Basic scripting or coding familiarity
Proficiency with command-line tools and Linux operating system environments

What You Will Learn

This course teaches participants the following skills:

Plan and implement a well-architected logging and monitoring infrastructure
Define Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
Create effective monitoring dashboards and alerts
Monitor, troubleshoot, and improve Google Cloud infrastructure
Analyze and export Google Cloud audit logs
Find production code defects, identify bottlenecks, and improve performance
Optimize monitoring costs

Training Outline

Module 1
Introduction to Google Cloud Monitoring Tools

Understand the purpose and capabilities of Google Cloud operations-focused components: Logging, Monitoring, Error Reporting, and Service Monitoring
Understand the purpose and capabilities of Google Cloud application performance management focused components: Debugger, Trace, and Profiler

Module 2
Avoiding Customer Pain

Construct a monitoring base on the four golden signals: latency, traffic, errors, and saturation
Measure customer pain with SLIs
Define critical performance measures
Create and use SLOs and SLAs
Achieve developer and operation harmony with error budgets

Module 3
Alerting Policies

Develop alerting strategies
Define alerting policies
Add notification channels
Identify types of alerts and common uses for each
Construct and alert on resource groups
Manage alerting policies programmatically

Module 4
Monitoring Critical Systems

Choose best practice monitoring project architectures
Differentiate Cloud IAM roles for monitoring
Use the default dashboards appropriately
Build custom dashboards to show resource consumption and application load
Define uptime checks to track aliveness and latency

Module 5
Configuring Google Cloud Services for Observability

Integrate logging and monitoring agents into Compute Engine VMs and images
Enable and utilize Kubernetes Monitoring
Extend and clarify Kubernetes monitoring with Prometheus
Expose custom metrics through code, and with the help of OpenCensus

Module 6
Advanced Logging and Analysis

Identify and choose among resource tagging approaches
Define log sinks (inclusion filters) and exclusion filters
Create metrics based on logs
Define custom metrics
Link application errors to Logging using Error Reporting
Export logs to BigQuery

Module 7
Monitoring Network Security and Audit Logs

Collect and analyze VPC Flow logs and Firewall Rules logs
Enable and monitor Packet Mirroring
Explain the capabilities of Network Intelligence Center
Use Admin Activity audit logs to track changes to the configuration or metadata of resources
Use Data Access audit logs to track accesses or changes to user-provided resource data
Use System Event audit logs to track GCP administrative actions

Module 8
Managing Incidents

Define incident management roles and communication channels
Mitigate incident impact
Troubleshoot root causes
Resolve incidents
Document incidents in a post-mortem process

Module 9
Investigating Application Performance Issues

Debug production code to correct code defects
Trace latency through layers of service interaction to eliminate performance bottlenecks
Profile and identify resource-intensive functions in an application

Module 10
Optimizing the Costs of Monitoring

Analyze resource utilization cust for monitoring related components within Google Cloud
Implement best practices for controlling the cost of monitoring within Google Cloud

Why Choose Us

Experience Logging, Monitoring, and Observability in Google Cloud through Bilginç IT Academy's live and interactive virtual classroom environment, accessible from your home, office, or any location. Connect with expert trainers in real time and bring the energy of classroom learning into the digital experience.

Live Instructor-Led Sessions: Join scheduled training sessions with your instructor and fellow delegates in real time.
Interactive Learning Experience: Take part in discussions, practical exercises, group activities, and Q&A sessions throughout the course.
Expert Trainer Network: Learn from experienced trainers with strong industry backgrounds and practical field expertise.
Over 30 Years of Training Expertise: Benefit from Bilginç IT Academy's long-standing experience in delivering professional training since 1995.
Flexible and Scalable Delivery: Access live virtual classrooms worldwide with flexible planning options for individual and corporate training needs.

Experience Logging, Monitoring, and Observability in Google Cloud in a focused classroom environment designed for high engagement and effective learning. Bilginç IT Academy's carefully selected training venues provide a professional setting where delegates can interact directly with expert trainers and peers.

Experienced Trainers: Learn from specialists with extensive field experience and real-world knowledge.
Professional Training Venues: Attend courses in comfortable, well-equipped classrooms designed to support effective learning.
Focused Classroom Experience: Benefit from limited class sizes that encourage discussion, interaction, and personalized support.
Quality-Driven Learning: Develop practical skills through structured, up-to-date, and professionally designed training content.

Meet your team's training needs with Bilginç IT Academy's onsite Logging, Monitoring, and Observability in Google Cloud solution, delivered at your office or preferred location. Align your team's development with your business goals through a training experience tailored to your organization.

Tailored Course Content: Adapt the training program to your organization's projects, team structure, and specific business requirements.
Time and Cost Efficiency: Reduce travel, accommodation, and operational costs while maximizing the value of your training investment.
Team-Focused Learning: Help your employees develop around the same knowledge base and strengthen collaboration across your organization.
Simplified Planning and Tracking: Manage the training process, participant development, and organizational requirements with greater control.

Why have you chosen us?

I have attended a training from Bilginc IT Academy before and I was satisfied.

I have attended a training from a different provider and it was not helpful.

Other

How many employees do you have in your IT department?

0 – 50

50 – 250

250 – 1000

1000+

Logging, Monitoring, and Observability in Google Cloud Training

Audience

Prerequisites

What You Will Learn

Training Outline

Why Choose Us