Overview
Objectives
This course provides practical experience in managing Generative AI systems and develops the skills needed to assess large language models (LLMs). It focuses on the deployment, monitoring, optimization, and scaling of Generative AI applications in production using Generative AI Operations (GenAIOps).
A light touch on Machine Learning Operations (MLOps) ensures foundational understanding before diving into LLM-specific challenges such as performance tuning, cost efficiency, observability, retrieval augmented generation (RAG), LLM Agents, and prompt management.
Participants will gain experience with common (commercial and open-source) cloud-based LLM tooling and platforms, ensuring they can build scalable, efficient, and cost-effective LLM applications.
Audience
Learning outcomes
This course equips participants with key competencies in deploying, monitoring, optimizing, and scaling large language models (LLMs) in operational environments:
- Foundational Knowledge of MLOps and LLMOps: Understanding the lifecycle of LLMs including pre-training, fine-tuning, inference, and monitoring; distinguishing between MLOps and LLMOps.
- Deployment Strategies for LLMs: Learning various hosting options, tackling scaling issues like model parallelism, and exploring cost optimization strategies such as caching.
- Prompt Engineering and Management: Gaining skills in prompt engineering for enhanced efficiency, accuracy, and cost-effectiveness; managing and versioning prompts to ensure consistency and maintainability.
- Performance and Cost Optimization: Techniques to optimize inference speed and reduce operational costs.
- Observability and Monitoring: Setting up systems to track LLM performance metrics such as latency and token usage, detecting errors, and implementing feedback loop mechanisms.
Programme
Day 1. GenAIOps/LLMOps Fundamentals
1. Introduction to MLOps and LLMOps
- MLOps vs. GenAIOps/LLMOps: Key differences and challenges
- LLM lifecycle: Pre-training, fine-tuning, inference, and monitoring
- Challenges in deploying LLMs: Scalability, latency, cost, observability
2. GenAIOps/LLMOps Lifecycle
- Foundation models
- Fine-tuning
- LLM Deployment strategies and infrastructure
- Scaling challenges: Model parallelism, quantization, and distillation
- Cost optimization strategies: Caching, batch inference, serverless deployments, optimizing inference speed (see the caching sketch below)
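To make the caching idea concrete, here is a minimal, illustrative Python sketch. It is not tied to any particular provider: call_llm is a hypothetical stand-in for whatever SDK call a deployment actually uses; only the cache-key pattern is the point.

```python
import hashlib
import json

def call_llm(prompt: str, temperature: float = 0.0) -> str:
    """Hypothetical stand-in for a real provider SDK call."""
    return f"answer to: {prompt}"

_cache: dict[str, str] = {}  # in-memory store; a real deployment would use Redis or similar

def _cache_key(prompt: str, temperature: float) -> str:
    # Identical prompt + generation parameters -> identical key -> the stored answer is reused
    payload = json.dumps({"prompt": prompt, "temperature": temperature}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()

def cached_completion(prompt: str, temperature: float = 0.0) -> str:
    key = _cache_key(prompt, temperature)
    if key not in _cache:                      # cache miss: pay for one model call
        _cache[key] = call_llm(prompt, temperature)
    return _cache[key]                         # cache hit: no tokens billed, near-zero latency
```

Repeated identical requests then cost a single model call, which is where most of the savings in FAQ-style workloads come from.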
3. Hands-on Session: LLM Fine-tuning
- Fine-tune a small LLM
4. Prompt Engineering
- Prompt Engineering Basics
- Optimizing prompts for efficiency, accuracy, and cost
- Structuring prompts for different use cases (see the template sketch below)
- Fine-tuning vs. prompt engineering: When to use which
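As an illustration of structuring prompts for a specific use case, the following minimal sketch separates instructions, context, and question in a fixed template; the template wording itself is invented for this example.

```python
# A fixed template keeps prompts consistent across calls and easy to review.
PROMPT_TEMPLATE = """\
You are a support assistant for an internal knowledge base.

### Instructions
- Answer only from the provided context.
- If the context is insufficient, say so explicitly.

### Context
{context}

### Question
{question}
"""

def build_prompt(context: str, question: str) -> str:
    return PROMPT_TEMPLATE.format(context=context, question=question)

print(build_prompt("Refunds are processed within 14 days.", "How long do refunds take?"))
```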
5. Prompt Management
- Why prompt management matters: Consistency, scalability, and maintainability
- Versioning and tracking prompts: Best practices (see the registry sketch below)
- Using prompt management tools
- A/B testing prompts: Measuring effectiveness and iterating
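A minimal sketch of what prompt versioning can look like in code, assuming a simple in-memory registry; real deployments would back this with a database, version control, or a dedicated prompt-management tool.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass(frozen=True)
class PromptVersion:
    name: str          # logical prompt id, e.g. "summarise"
    version: int       # monotonically increasing version number
    template: str
    created_at: str = field(default_factory=lambda: datetime.now(timezone.utc).isoformat())

class PromptRegistry:
    """Keeps every version of every prompt so changes stay traceable and reversible."""

    def __init__(self) -> None:
        self._store: dict[str, list[PromptVersion]] = {}

    def register(self, name: str, template: str) -> PromptVersion:
        versions = self._store.setdefault(name, [])
        pv = PromptVersion(name=name, version=len(versions) + 1, template=template)
        versions.append(pv)
        return pv

    def get(self, name: str, version: int | None = None) -> PromptVersion:
        versions = self._store[name]
        return versions[-1] if version is None else versions[version - 1]

registry = PromptRegistry()
registry.register("summarise", "Summarise the text below in 3 bullet points:\n{text}")
registry.register("summarise", "Summarise the text below in exactly 3 short bullet points:\n{text}")
print(registry.get("summarise").version)  # -> 2 (latest); get("summarise", 1) returns the first version
```

Pinning a production application to an explicit version, rather than "whatever the prompt says today", is what makes A/B tests and rollbacks practical.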
6. Hands-on Session: Prompt Engineering & Management
- Experimenting with different prompt engineering techniques and tools
Day 2. Advanced LLMOps
1. Retrieval-augmented generation (RAG)
- What is RAG and how does it work? (see the minimal sketch below)
- Exploring the different types of RAG
- Common issues in RAG solutions
- When to use fine-tuning vs. prompt engineering vs. RAG
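To ground the idea, here is a deliberately tiny RAG sketch: retrieval is reduced to lexical overlap instead of a real vector search, and call_llm is again a hypothetical stand-in for a provider call.

```python
def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real provider SDK call."""
    return f"[model answer grounded in]\n{prompt}"

def score(query: str, document: str) -> float:
    # Toy lexical overlap used in place of an embedding-based similarity search
    q, d = set(query.lower().split()), set(document.lower().split())
    return len(q & d) / (len(q) or 1)

def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    return sorted(documents, key=lambda doc: score(query, doc), reverse=True)[:k]

def answer_with_rag(query: str, documents: list[str]) -> str:
    context = "\n".join(retrieve(query, documents))                          # 1. retrieve relevant passages
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"  # 2. augment the prompt
    return call_llm(prompt)                                                  # 3. generate

docs = ["Refunds are processed within 14 days.", "Support is available on weekdays."]
print(answer_with_rag("How long do refunds take?", docs))
```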
2. Agentic workflows
- What are LLM Agents?
- Key components of an LLM agent, with an overview of open-source frameworks such as LangChain and LlamaIndex (see the agent-loop sketch below)
- Use cases
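The following sketch shows the core loop of a tool-using agent in a few lines, without any framework; fake_planner is a hypothetical stand-in for the LLM planning step that LangChain or LlamaIndex would normally drive.

```python
def calculator(expression: str) -> str:
    # Toy tool; never eval untrusted input in a real system
    return str(eval(expression, {"__builtins__": {}}))

TOOLS = {"calculator": calculator}

def fake_planner(task: str) -> tuple[str, str]:
    """Hypothetical stand-in for the LLM planning step: returns (tool_name, tool_input)."""
    return "calculator", "2 + 2 * 10"

def run_agent(task: str) -> str:
    tool_name, tool_input = fake_planner(task)   # 1. the model chooses a tool and its arguments
    observation = TOOLS[tool_name](tool_input)   # 2. the loop executes the chosen tool
    return f"{task} -> {tool_name}({tool_input}) = {observation}"  # 3. the observation is returned or fed back

print(run_agent("What is 2 + 2 * 10?"))
```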
3. Hands-on: Building a GenAI Application
- Building a RAG and Agent application
- Implementing monitoring & logging for performance tracking
- Optimizing deployment for cost and scalability
4. Observability, Monitoring, and Feedback Loops
- Tracking LLM performance: Latency, token usage, response quality (see the monitoring sketch below)
- Detecting hallucinations and errors
- Implementing feedback loop mechanisms
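A minimal sketch of the metrics a feedback loop typically starts from, assuming a stubbed model call; real setups export these to an observability backend and use the provider's own token counts.

```python
import time

def call_llm(prompt: str) -> str:
    """Stubbed model call so the sketch runs end to end."""
    return "stub answer"

def log_metrics(metrics: dict) -> None:
    print(metrics)  # a production setup would export to a metrics/observability backend instead

def observed_call(prompt: str) -> dict:
    start = time.perf_counter()
    response = call_llm(prompt)
    latency_s = time.perf_counter() - start
    metrics = {
        "latency_s": round(latency_s, 3),
        "prompt_tokens": len(prompt.split()),       # crude estimate; real systems use the provider's token counts
        "completion_tokens": len(response.split()),
        "empty_response": not response.strip(),     # trivial error signal; quality and hallucination checks plug in here
    }
    log_metrics(metrics)
    return {"response": response, "metrics": metrics}

observed_call("Summarise our refund policy.")
```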
5. Security, Compliance, and Responsible AI
- Data privacy in LLMOps: PII redaction, secure API handling (see the redaction sketch below)
- Regulatory compliance
- Bias detection and mitigation strategies
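To illustrate PII redaction before data leaves the organization, here is a deliberately small, regex-based sketch; real pipelines use much broader detectors and named-entity recognition.

```python
import re

# Two illustrative patterns only; production redaction covers many more PII categories.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")
PHONE = re.compile(r"\+?\d[\d\s.-]{7,}\d")

def redact_pii(text: str) -> str:
    """Masks obvious PII before the text is sent to an external LLM API."""
    text = EMAIL.sub("[EMAIL]", text)
    text = PHONE.sub("[PHONE]", text)
    return text

print(redact_pii("Contact Jane at jane.doe@example.org or +41 22 379 11 11."))
# -> Contact Jane at [EMAIL] or [PHONE].
```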
Registration
Registration deadline
Fees:
- CHF 1300.- for the full course
- CHF 650.- (reduced rate)
- An additional CHF 200.- for the Micro-credential
Speakers
Hisham MOHAMED, PhD, University of Geneva
Hisham is an AI and machine learning expert with over 10 years of experience in machine learning, software engineering, and big data. With a PhD in Computer Science from the University of Geneva, he has led high-impact projects and built and managed diverse teams.
Hisham has deep experience in deploying and scaling AI systems. In this session, he will focus on GenAIOps/LLMOps, sharing insights on managing, optimizing, and operationalizing large language models in real-world applications.
Director(s)
Prof. Giovanna DI MARZO SERUGENDO, Centre universitaire d'informatique (CUI), University of Geneva