Review of large vision models and visual prompt engineering

166Citations
Citations of this article
192Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Visual prompt engineering is a fundamental methodology in the field of visual and image artificial general intelligence. As the development of large vision models progresses, the importance of prompt engineering becomes increasingly evident. Designing suitable prompts for specific visual tasks has emerged as a meaningful research direction. This review aims to summarize the methods employed in the computer vision domain for large vision models and visual prompt engineering, exploring the latest advancements in visual prompt engineering. We present influential large models in the visual domain and a range of prompt engineering methods employed on these models. It is our hope that this review provides a comprehensive and systematic description of prompt engineering methods based on large visual models, offering valuable insights for future researchers in their exploration of this field.

Cite

CITATION STYLE

APA

Wang, J., Liu, Z., Zhao, L., Wu, Z., Ma, C., Yu, S., … Zhang, S. (2023, November 1). Review of large vision models and visual prompt engineering. Meta-Radiology. KeAi Publishing Communications Ltd. https://doi.org/10.1016/j.metrad.2023.100047

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free