klaudiadev.com

We had some exiting announcement this February. Google announce Gemini and Open AI bring attention with Sora AI. Let's dive into this two new features.

Google Gemini

Couple months ago, Google team start talking about their new creation which supposed to beat GPT-4. A little bit later we got the video showing as pretty amazing showcase of the model. But.. this turns out to be mostly created for video purposes so the model was prepare for handling the conversation with the human.

So what Google tell us ?

Gemini is the result of large-scale collaborative efforts by teams across Google, including our colleagues at Google Research. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across and combine different types of information including text, code, audio, image and video.

Google offers two models. Gemini and Gemini Advanced. Gemini and Gemini Advanced are both large language models from Google, but they differ in their capabilities and accessibility. Let's explore the diversity.

Gemini:

Free to use - Available without any additional cost.
Handles general tasks well: Can answer your questions, complete basic requests, and generate different creative text formats.
Limited in complexity: May struggle with highly complex tasks like coding, intricate reasoning, or nuanced instructions.

Gemini Advanced:

Part of a paid Google One plan: Requires a subscription for access.
Significantly more powerful: Utilizes Google's 1.0 Ultra model, making it adept at complex tasks like:
- Coding: Can assist with writing and debugging code.
- Logical reasoning: Solves problems requiring logical thinking.
- Following nuanced instructions: Understands subtle details and executes tasks accordingly.
- Creative collaboration: Acts as a partner for brainstorming and creative projects.
Currently limited in language support: Optimised for English only, although it can respond to queries in other languages Gemini supports.

_____

There is a possibility to test Gemini Advanced - Google

offers 2 months, at no charge

Learn more

Launch app

OpenAI Sora

OpenAI hit back Google announcement with his new model Sora. Sora is an AI modal that can create realistic and imaginative scenes from text instructions.

Introducing Sora, our text-to-video model. Sora can generate videos up to a minute long while maintaining visual quality and adherence to the user’s prompt.

Let's look at an example video generated by Sora model.

Prompt: A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colourful lights. Many pedestrians walk about.

I think we all agree that it looks incredible. For now Sora is only available for a small group of users to assemble first feedbacks. For now we do not have a date when it will be available for everyone to use and under what conditions. Subscribe to my Newsletter to get updates on that and other topics!

______
Thanks for reading 🤍

Exploring the Future of Browsing: A Dive into Google Gemini, and OpenAI Sora

Google Gemini

Gemini:

Gemini Advanced:

OpenAI Sora