Google Gemini: Another ChatGPT competitor in the market

Sudeep Singh Rawat
2 min readDec 16, 2023

--

Google is making itself advance and better every day. In a recent race among all the top companies to develop a flawless generative AI tool, Google has launched its Gemini generative AI model — which has positioned itself as ChatGPT’s key competitor. However, ChatGPT is also working on specialised AI projects.

Google recently shared a blog post introducing MedLM for the healthcare industry. According to Google, MedLM is the future of its generative AI in healthcare, focusing on enabling users for safe and responsible use of Artificial Intelligence.

The LLM model is the first to obtain over 60 per cent passing score on a US medical licensing exam-style question paper published in Nature. Now, it has scored expert-level marks, above 86.5 per cent, applying itself in a real-world scenario through a measured approach.

Gemini’s Two Models

Image source: Google

A healthcare organisation is exploring the use of AI for a range of applications, from basic to complex workflows, as there are two models under MedLM, which is built on Med-PaLM 2, offering flexibility to healthcare organisations. The first model is designed for complex tasks, while the second model can fine-tune and be best for scaling across tasks.

The two models are informed by specific healthcare and life sciences customer needs, answering complex medical questions and drafting summaries. Google is also planning to bring gemini based model into the MedM suite to provide more capabilities.

The latest AI model will be used across the healthcare industry, like hospitals, drug development, patient-facing chatbots and more. The search engine company mentioned that several of the organisations it tapped to test the MedLM suite also added that they are moving Gemini into production in their solutions or broadening their testing.

What makes Gemini different from previous AI models?

Gemini is a “multi-modal model,” which means it works directly with multiple modes of input and output; supporting text input and output. The Gemini model supports images, audio and video. With gemini, a new term also comes into existence, i.e., LMM (large multimodal model) and not to be confused with LLM.

OpenAI announced GPT-4Vision in September, which can work with images, audio, and text as well. However, it is not a fully multimodal model in the way Gemini promises to be.

Best books to learn about Generative AI

If you are keen to learn more about Generative AI, then I have some recommendations for you:

  • Impromptu: Amplifying Our Humanity Through AI by Reid Hoffman
  • The Master Algorithm by Pedro Domingos
  • The Age of AI: And Our Human Future by Henry Kissinger, Eric Schmidt, and Daniel Huttenlocher
  • Power and Prediction: The Disruptive Economics of Artificial Intelligence by Ajay Agrawal, Joshua Gans, and Avi Goldfarb

--

--

Sudeep Singh Rawat

Technical Content Writer | READ × WRITE = WISDOM | BIBLIOPHILE | Life, Philosophy, and Technology | Writing for adding value to the life of others.