![OpenAI’s GPT-4o Launch: The Advanced AI Chatbot for Text, Vision, and Audio Interactions | Science & Technology Update 2 skynews chatgpt openai 6364467](https://i0.wp.com/theubj.com/uae/wp-content/uploads/2024/05/skynews-chatgpt-openai_6364467.jpg?w=1170)
OpenAI has unveiled the latest iteration of its ChatGPT AI chatbot – GPT-4o. This new version promises near-instantaneous interaction capabilities across text, vision, and audio modalities.
The company has highlighted that GPT-4o’s capabilities in comprehending visual and auditory information have seen significant enhancements compared to its predecessors.
GPT-4o is pushing the boundaries by offering real-time communication with the chatbot, including the ability to cut in during its responses.
Conversational interactions can now seamlessly incorporate any mix of text, audio, and images both as input and output, as stated by OpenAI.
The release of GPT-4o is part of a phased rollout scheduled for the upcoming weeks, marking another significant moment in the tech industry’s race to refine artificial intelligence tools.
Demonstrations on Monday showcased GPT-4o’s diverse applications, such as on-the-fly language translation, using its visual abilities to solve mathematical problems off paper, and even assisting visually impaired individuals navigate through London.
OpenAI reports that GPT-4o can reply to audio queries within 232 milliseconds on average, achieving a mean response time of 320 milliseconds, comparable to that of a human.
To address potential issues around bias, equity, and disinformation, OpenAI, backed by Microsoft, claims that it has conducted in-depth testing of the new model with over 70 third-party experts.
This follows Google’s earlier public relations incident involving the AI-generated images from its Gemini system.
While the base GPT-4o service will be available at no cost, the enhanced ‘Plus’ subscription will offer users higher message capacity limits.
Earlier renditions of the chatbot have stirred controversy in educational settings as instances of cheating through the use of its sophisticated essay-writing capabilities were reported.
The popularity of ChatGPT was evident when, two years prior, it was declared as the fastest application to surpass 100 million monthly active users.
The recent announcement by OpenAI precedes Google’s expected unveiling of new AI features at its forthcoming annual developers’ conference.
FAQs About OpenAI’s GPT-4o
- What is GPT-4o capable of? – GPT-4o integrates text, vision, and audio to provide real-time interaction with users, excelling at tasks such as language translation, solving math problems through visual input, and assisting the visually impaired.
- How quick is GPT-4o’s response time? – It responds to audio prompts within an average of 320 milliseconds, on par with human reaction times.
- Will there be any cost to using GPT-4o? – The base service of GPT-4o will be free. However, there is a premium ‘Plus’ subscription offering higher capacity limits for messages.
- What has OpenAI done to address AI bias and misinformation concerns? – The company has conducted extensive testing with more than 70 external experts to mitigate the potential for bias and misinformation in GPT-4o.
- When was the ChatGPT original version released? – The original ChatGPT was launched two years ago and was noted for its rapid user growth.
Conclusion
The release of OpenAI’s GPT-4o signals a significant leap forward in the domain of artificial intelligence. Offering versatility in handling text, vision, and audio inputs and outputs, it is set to redefine real-time interaction with AI chatbots. With impressive response times and the ability for nuanced conversational exchanges, GPT-4o’s deployment is poised to leave a substantial impact on various sectors, notwithstanding the challenges and concerns surrounding such technologies.