Approximate domain unlearning: Enabling safer and more controllable vision-language models
Vision-language models (VLMs) are a core technology of modern artificial intelligence (AI). They can handle images in many forms of expression, such as photographs, illustrations, and sketches. Their high generalization ability allows them to accurately recognize objects in images across these domains. However, this same generalization ability can itself pose a risk. For example, a VLM recognizes both real cars and illustrated cars as “cars.” If such a model is deployed in a system, there is a risk that a car illustrated in a roadside advertisement ...