OpenAI, the company behind the popular generative AI assistant ChatGPT, is expanding its capabilities by introducing voice and image-based search features to the chatbot.
This development marks a significant evolution in the generative AI landscape, making ChatGPT a more interactive and versatile tool for users. OpenAI CEO Sam Altman announced the update in a tweet that read, "voice mode and vision for chatgpt! really worth a try."
Users can now hold voice conversations with ChatGPT, asking questions aloud and receiving spoken responses from the AI assistant.
For example, users can ask ChatGPT to create a bedtime story or seek answers to their queries by voice.
OpenAI has collaborated with established voice actors to develop five distinct voices for this feature, and it uses its open-source Whisper speech recognition system to transcribe users' spoken words into text. However, OpenAI is cautious about potential misuse, acknowledging the risk of malicious actors impersonating public figures or committing fraud.
In addition to voice, ChatGPT users can now utilise image-based search and analysis. They can upload images and ask ChatGPT to provide explanations or instructions related to the content of the image. This feature adds a new dimension to ChatGPT's capabilities, making it a valuable tool for tasks such as troubleshooting, meal planning, or data analysis.
The voice and image features will be available to ChatGPT
Read more on economictimes.indiatimes.com