File:Gemini multimodal AI.png

Original file ‎(1,341 × 572 pixels, file size: 76 KB, MIME type: image/png)

Captions

English

From the study "Gemini: A Family of Highly Capable Multimodal Models"

DescriptionGemini multimodal AI.png	English: "Gemini supports interleaved sequences of text, image, audio, and video as inputs (illustrated by tokens of different colors in the input sequence). It can output responses with interleaved image and text."
Date	19 December 2023
Source	https://arxiv.org/abs/2312.11805
Author	Authors of the preprint: Gemini Team Google: Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, et al.

You are free:

Under the following conditions:

attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

Click on a date/time to view the file as it appeared at that time.

	Date/Time	Thumbnail	Dimensions	User	Comment
current	22:14, 4 March 2024		1,341 × 572 (76 KB)	Prototyperspective (talk \| contribs)	Uploaded a work by Authors of the preprint: Gemini Team Google: Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, et al. from https://arxiv.org/abs/2312.11805 with UploadWizard

You cannot overwrite this file.

There are no pages that use this file.