Abstract
This study investigates Vision-Language Models (VLMs) for fire detection, using contextual prompts to assess performance across several models. Notably, the Bunny model achieved a 76% F1-score, highlighting its effectiveness. These findings emphasize the impact of prompt engineering on detection performance and raise key questions about automating prompt optimization and selecting the most suitable VLM given task complexity, resource constraints, and real-world applicability.
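As a rough illustration of the contextual prompting the abstract describes, the sketch below contrasts a bare prompt with a context-enriched one and scores binary fire-detection predictions with an F1 metric. This is a minimal sketch under stated assumptions: `query_vlm` is a hypothetical stand-in for whatever VLM inference backend is used (e.g., a Bunny model wrapper), not an API from the study.

```python
from sklearn.metrics import f1_score


def query_vlm(image_path: str, prompt: str) -> str:
    """Hypothetical wrapper around a VLM inference call (e.g., Bunny).

    Assumed to return a free-text answer such as 'yes' or 'no' to the
    fire-detection question; replace with your actual model backend.
    """
    raise NotImplementedError("Plug in a concrete VLM backend here.")


# A bare prompt versus a contextual prompt that primes the model with
# scene cues, in the spirit of the prompt engineering the study evaluates.
BARE_PROMPT = "Is there a fire in this image? Answer yes or no."
CONTEXT_PROMPT = (
    "You are a wildfire-monitoring assistant. Smoke plumes, orange glow, "
    "and charred vegetation are typical fire indicators. "
    "Is there a fire in this image? Answer yes or no."
)


def evaluate(image_paths: list[str], labels: list[int], prompt: str) -> float:
    """Run the VLM over a labeled image set and return the F1-score.

    Labels are 1 for fire and 0 for no fire; a prediction counts as
    positive when the model's answer starts with 'yes'.
    """
    preds = [
        1 if query_vlm(path, prompt).strip().lower().startswith("yes") else 0
        for path in image_paths
    ]
    return f1_score(labels, preds)
```

Comparing `evaluate(images, labels, BARE_PROMPT)` against `evaluate(images, labels, CONTEXT_PROMPT)` would reproduce, in miniature, the kind of prompt-sensitivity comparison the abstract refers to.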