CIT UPC - Audiovisual

Design and validation of highly realistic virtual, augmented, and mixed reality applications
Advanced real-time rendering and visualization
2D/3D modeling and animation (people, crowds, physical elements)
Immersive interaction with multisensory feedback (haptic, auditory, proprioceptive)
Usability design and analysis of applications, video games, and immersive interfaces
Serious Games and gamification
Simulation of complex environments and scenarios
Creation of visual Digital Twins
Immersive narrative experiences (interactive documentary, procedural storytelling)
Photography and digital heritage: optimized processes for dissemination, digital museums, and scientific analysis

Multilingual automatic speech recognition
Automatic transcription and subtitling
Natural language processing and multimodal models
Speech-to-sign-language translation
Analysis, detection, and localization of acoustic events
Speech enhancement, adaptation, and synthesis (Text-to-Speech, TTS)
Affective computing and emotion analysis
Speech processing in complex environments (microphone arrays, multiple speakers)

uPlayer: enhancing the video playback experience
The AgroTech research group at UPC, in collaboration with its spin-off Ugiat Technologies, has developed uPlayer, a new multimedia player concept that enables more intuitive video navigation and viewing and intelligently enhances the user experience. It integrates as a plugin or advanced player, especially on YouTube and other platforms.
DoblAI: AI for easy and fast dubbing of multimedia content
The AgroTech research group at the Universitat Politècnica de Catalunya – BarcelonaTech (UPC), together with its spin-off Ugiat Technologies, has driven the development of DoblAI, an AI platform that integrates transcription, translation, subtitling, and video dubbing into a single workflow. The solution, which uses deep learning technology and cloned or default voice models, is specifically designed for the journalism and communications sector.
AI and Music Festival (S+T+ARTS): musical creativity with AI
The Image and Video Processing Group (GPI), part of the IDEAI-UPC research group, and the Digital Culture and Creative Technologies Research Group (DiCode) from the Image Processing and Multimedia Technology Center (CITM) at the Universitat Politècnica de Catalunya – BarcelonaTech (UPC) have co-organised the AI and Music Festival (S+T+ARTS) together with Sónar+D and Betevé to explore the creative use of artificial intelligence in music.
