OpenAI's ChatGPT, a popular large language model (LLM), is set to gain multimodal capabilities in a major upcoming update. This will enable ChatGPT to process and understand images and videos, generate text tailored to the specific context of an image or video, and respond to voice commands and generate spoken responses. The update is significant because it will make ChatGPT...[ read more ]