Status: Starting webcam...
ATTENBOT first analyses and summarises the photo with OpenAI's GPT-4 Vision to generate a script in the style of a BBC earth nature documentary.
Then, this script is fed to an ElevenLabs model trained to synthesise an audio transcript in a lovingly familiar tone.