Session: The Invisible Engine: Optimizing AI for the Cloud Without Breaking the Bank
AI workloads in the cloud can be costly, inefficient, and energy-intensive if not optimized properly. This session explores strategies for optimizing AI models for cloud environments, ensuring efficiency without sacrificing performance. Key topics include:
Cloud-aware algorithm optimization for reduced compute costs.
Auto-scaling AI workloads to dynamically adapt to demand.
Trade-offs between accuracy, speed, and resource utilization.
Real-world insights from NeurIPS research on benchmarking AI creativity in cloud-based environments.
Attendees will leave with practical, immediately applicable techniques to make their AI models faster, more affordable, and sustainable in the cloud.
Bio
Kaustubha is a Solution Architect at Microsoft, a multi-cloud expert (Azure, AWS, GCP), and an AI researcher. She has 46+ cloud certifications and has presented research at NeurIPS, CODS-COMAD, and international AI conferences. Passionate about AI efficiency, cloud optimization, and responsible AI, she works on benchmarking AI performance in cloud environments and enabling cost-effective, scalable AI solutions