TechGoogle Elevates AI Game with Gemini 1.5 Pro: A Leap in Multimodal AI Tech

Google Elevates AI Game with Gemini 1.5 Pro: A Leap in Multimodal AI Tech

During the Google Next event, the company unveiled a significant upgrade to its AI model, Gemini 1.5 Pro, which greatly enhanced the tool’s functionality.

Google Elevates AI Game with Gemini 1.5 Pro: A Leap in Multimodal AI Tech
Images source: © Pixabay

9:04 AM EDT, May 7, 2024

One of the standout features of Gemini - an advanced AI solution developed by Google - lies in its multimodal capabilities. Designed to simultaneously and seamlessly process various data types such as text, images, audio, video, and code, it's adept at performing intricate reasoning and problem-solving tasks across a wide range of disciplines.
The Verge reports that the latest update introduces the ability for the model to "listen," enabling it to analyze audio files and extract information without requiring text transcripts. This innovation expands its proficiency in handling audio data, offering potential applications in areas like business meeting recordings or film soundtracks.

Gemini 1.5 Pro becomes more accessible

Furthermore, Google announced plans to broaden the availability of Gemini 1.5 Pro, making it accessible to a larger audience through its AI application development platform, Vertex AI. Despite being a mid-range option within the Gemini model lineup, it surpasses the performance of the series' largest and most sophisticated model, Gemini Ultra. Notably, the updated model simplifies use by eliminating the need for intricate customization. Google views this as a leap forward in making AI technology more accessible and user-friendly.

Updates to other Google AI models

Google also revealed enhancements to another major AI model, Imagen 2, which specializes in generating images from text. Innovations such as inpainting and outpainting will enable users to modify images by adding or removing elements. Furthermore, the introduction of SynthID digital watermarking technology is designed to subtly mark images generated by Imagen models, facilitating the identification of their origins.

Integrating AI with Google's search engine

According to The Verge, Google is pioneering efforts to integrate AI more closely with its search engine, aiming to boost the relevance of AI-generated responses by leveraging the most current information. This initiative represents a significant advancement over previous language model limitations, which notably skirted around topics like the upcoming US elections in 2024 and were criticized for creating historically inaccurate images.
Related content
© essanews.com
·

Downloading, reproduction, storage, or any other use of content available on this website—regardless of its nature and form of expression (in particular, but not limited to verbal, verbal-musical, musical, audiovisual, audio, textual, graphic, and the data and information contained therein, databases and the data contained therein) and its form (e.g., literary, journalistic, scientific, cartographic, computer programs, visual arts, photographic)—requires prior and explicit consent from Wirtualna Polska Media Spółka Akcyjna, headquartered in Warsaw, the owner of this website, regardless of the method of exploration and the technique used (manual or automated, including the use of machine learning or artificial intelligence programs). The above restriction does not apply solely to facilitate their search by internet search engines and uses within contractual relations or permitted use as specified by applicable law.Detailed information regarding this notice can be found  here.