• ThePrompt
  • Posts
  • This AI outperforms GPT3.5 and humans in most tests🧪

This AI outperforms GPT3.5 and humans in most tests🧪

Hi folks! 👋🏻 This is the Prompt, we deliver the juiciest AI news straight to your inbox.

We're like DoorDash for AI.

Here's what the dasher brought today:

This AI outperforms GPT3.5 and humans in most tests🧪

The key challenge with LLMs is that models under 100 billion parameters tend to generate hallucinations that generate wrong answers.

To mitigate this challenge, Multimodal-COT, combines both text and images to improve performance.

The model has only 738M parameters and outperforms GPT3.5 by 16%.

For comparison: GPT3.5 has 175B parameters. This kind of model could be run on a home computer, with about 4GB of RAM.

Aside from being better than GPT3.5, the model beats humans in every test except social sciences. Which is incredible!

This is where it gets interesting folks. Get ready for much smarter AIs.

💥 The Latest

-

OpenAI is quietly launching a new developer platform that lets customers run the company’s newer machine learning models, like GPT-3.5, on dedicated capacity. Foundry will provide some level of version control as well. Running a lightweight version of GPT-3.5 will cost $78,000 for a three-month commitment. Really, lightweight?

-

Tome is a storytelling app, that lets you create powerful visual presentations with the help of AI. They have massive adoption — 500m+ MAU, 30M+ daily decks across many products - but they still haven't made any money with it. Is this the AI bubble?

-

DJ is a personalized AI that picks music for users based on taste and past listening history. Or, as Spotify says, it’s putting an “AI DJ in your pocket.” Some Steve Jobs vibes right there.

📚 Learning lounge

🛼 Toolbox

  • EzMail.ai: Reply to emails 10X faster with pre-filled drafts by AI

  • AnyAPI: Add AI power to your product in minutes

  • CassetteAI: The AI-powered music producer, create your dream music

  • Sync: Music search AI engine from video, photo, game, music, text

🤓 Latest AI papers

  • Disco: Portrait Distortion Correction with Perspective-Aware 3D GANs

  • MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes

📸 AI Photo of the day

Chonky Animals

❤️ If you like The Prompt, and want to support my work:

  • Share The Prompt with a friend, and invite them to subscribe here.

  • Book an ad in The Prompt (reply to this email if you’re interested)

  • Check out my latest tool: VoicePen.ai (audio to blog post)

What'd you think of today's edition?

Login or Subscribe to participate in polls.

Thank you for reading! ✌🏼