LLM in Practice by Accumulation Point

LLM blog posts and term-definitions: a practical collection

Back to term definitions

Multimodal model

A multimodal language model or multimodal model is capable of processing and generating text, images, and other modalities simultaneously. It integrates multiple sources of information, such as text and visual data, to enhance understanding and generation of content.

Last updated 5th March 2023.