Visual ChatGPT
+ GPT-4 launch next week: Hype or Real?

Hi folks! This is The Prompt! Get your thinking caps on because we're here to spill the AI tea!
Multimodality and GPT-4 are all the rage this week, so let's see what's new.
FEATURED
Visual ChatGPT

Visual ChatGPT connects ChatGPT to a set of visual models so that you can send and edit images in chat. (demo above)
It uses some of the models that we've already covered, like:
ControlNet: Control AI results
Stable Diffusion: text-to-image generation
BLIP: image-to-chat
InstructPix2Pix: photo editing with prompts
ClipSeg: image segmentation
Seems like text is becoming the universal interface, and connecting to smaller models like these can be a good path to achieve multimodality.
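To make the "text as the universal interface" idea a bit more concrete, here is a minimal sketch of how a chat model's text output could be routed to smaller specialist models. This is not Visual ChatGPT's actual code: the `Tool` class, the `dispatch` function, the `tool_name: argument` command format, and the two fake model functions are all hypothetical stand-ins made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Tool:
    name: str
    description: str  # plain-text description a chat model could read
    run: Callable[[str], str]  # text in, text out (e.g. a path to an image)


# Hypothetical stand-ins: a real system would call the actual models here.
def fake_text_to_image(prompt: str) -> str:
    return f"image/generated_from[{prompt}].png"


def fake_caption(image_path: str) -> str:
    return f"a caption describing {image_path}"


# Registry of available tools, keyed by name.
TOOLS: Dict[str, Tool] = {
    tool.name: tool
    for tool in [
        Tool("generate_image", "Create an image from a text prompt", fake_text_to_image),
        Tool("describe_image", "Describe the contents of an image file", fake_caption),
    ]
}


def dispatch(command: str) -> str:
    """Route an assumed 'tool_name: argument' string to the matching tool."""
    name, sep, argument = command.partition(":")
    tool = TOOLS.get(name.strip())
    if not sep or tool is None:
        # No recognizable tool call, so the chat model would just answer in text.
        return "no tool selected; answer in plain text"
    return tool.run(argument.strip())
```

Because every tool takes text and returns text (here, a file path or a caption), adding a new capability in this sketch only means registering one more `Tool`; the chat side does not need to change.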
Multimodality is all the rage these days, and we also got a bit of a "tea spill" yesterday about GPT-4.
HYPE OR REAL?
GPT-4 launch next week - What do we know?

According to Heise, Microsoft Germany's CTO, Andreas Braun, said that GPT-4 will arrive next week in a multimodal format.
This means that GPT-4 might be good at many tasks other than text generation, such as understanding language, talking to people, and describing pictures... and videos.
Microsoft's multimodality work is no secret. We already know that their newest model, called Kosmos, can understand images and follow in-context instructions.
However, it's a bit weird that the news is coming only from this source and hasn't been reported elsewhere. So, take this news with a grain of salt.
WHAT ELSE IS GOING ON
Discord announced the launch of its new AI features. Some of the features include AI Chat, AI-powered Mods, summaries, and avatars.
DuckDuckGo launched DuckAssist. It summarizes answers from Wikipedia and displays them at the top of DuckDuckGo's search results.
ChatGPT is now available in Azure OpenAI Service. With this update, developers can integrate custom AI-powered experiences directly into their own apps.
RESOURCES
The best resources we came across lately that will help you become better at writing prompts & building AI apps.
Unlocking creativity with prompt engineering [a16z podcast]
Make 3D Models using ChatGPT [quick video tutorial]
ControlNet Full Tutorial [text & video tutorials]
TOOLBOX
The latest AI tools to use or get inspiration from.
Rework: AI-driven recruitment
OpenCommit: GPT CLI to auto-generate commits
FlexClip: text-to-speech video maker
Second: Developer bots that create new features or apps
AI testimonial: Free AI testimonial generator
WonderStudio: a mind-blowing tool that automatically animates, lights, and composes CG characters into a live-action scene
PROMPT OF THE DAY
TOOL
Midjourney
PROMPT
Sony A1 camera with Sony Logo, intricate demonic black-paint biomechanical Sony A1 camera with Sony logo, scary, beautiful, circuits platemail armor. sleek iridescent details. domed toned glass, octane, volumetric lighting. sharpen, details + professional lighting, photography lighting
RESULT
LATEST PAPERS
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Scaling up GANs for Text-to-Image Synthesis
Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision
3D Video Loops from Asynchronous Input
Cherry-Picking with Reinforcement Learning
MathPrompter: Mathematical Reasoning using Large Language Models
Cones: Concept Neurons in Diffusion Models for Customized Generation
