
Human–Machine Interaction and Immersive Experiences
- Design and validation of highly realistic virtual, augmented, and mixed reality applications
- Advanced real-time rendering and visualization
- 2D/3D modeling and animation (people, crowds, physical elements)
- Immersive interaction with multisensory feedback (haptic, auditory, proprioceptive)
- Usability design and analysis of applications, video games, and immersive interfaces
- Serious Games and gamification
- Simulation of complex environments and scenarios
- Creation of visual Digital Twins
- Immersive narrative experiences (interactive documentary, procedural storytelling)
- Photography and digital heritage: optimized processes for dissemination, digital museums, and scientific analysis

Communications, Streaming, and Digital Infrastructure
- Advanced mobile networks (5G/6G) for multimedia transmission
- Real-time streaming and broadcasting technologies
- Audio and video integration in intelligent multimedia systems
- IoT and Edge Computing for audiovisual applications
- Protocols and networks for efficient transmission
- Cybersecurity in audiovisual environments
- Optimization of audiovisual communications and advanced encoding
- Systems for smart buildings and interactive spaces
- Recording and georeferencing of virtual objects in the real world
- Optical systems for virtual, augmented, and mixed reality

Language, Speech, and Accessibility Technologies
- Multilingual automatic speech recognition
- Automatic transcription and subtitling
- Natural language processing and multimodal models
- Speech-to-sign-language translation
- Analysis, detection, and localization of acoustic events
- Speech enhancement, adaptation, and synthesis (Text-to-Speech, TTS)
- Affective computing and emotion analysis
- Speech processing in complex environments (microphone arrays, multiple speakers)

Intelligent Image, Video, and Multimedia Content Processing
- Image and video processing using AI and deep learning
- Action analysis and pattern recognition
- Real-time video analytics
- Automatic generation of clips and audiovisual content
- Geometric processing and procedural modeling
- Automatic content annotation and indexing

Data, AI, and Predictive Systems
- Big Data applied to multimedia
- Audience and behavior prediction
- Machine learning and complex data mining
- Multimodal machine learning
- Audiovisual Business Intelligence
- AI-based optimization
- Computational efficiency algorithms

Environmental Acoustics and Sound Engineering
- Acoustic characterization of outdoor and indoor spaces
- Development of noise maps and capacity maps
- Design of acoustic action plans
- Design and operation of low-cost acoustic sensor networks
- Acoustic impact analysis of leisure infrastructures

Specialised Infrastructures
- Cave Automatic Virtual Environment (CAVE): full virtual immersion environment
- Stereowall and large-format displays
- High-speed 3D scanner
- Human–Computer Interaction Laboratory with eye tracking
Related Projects
- The AgroTech research group at UPC, in collaboration with its spin-off Ugiat Technologies, has developed uPlayer, a new multimedia player concept that enables more intuitive video navigation and viewing, intelligently enhancing the user experience, especially on YouTube and other platforms, by integrating as a plugin or advanced player.
- The AgroTech research group at the Universitat Politècnica de Catalunya – BarcelonaTech (UPC), together with its spin-off Ugiat Technologies, have driven DoblAI, an AI platform that integrates transcription, translation, subtitling and video dubbing into a single workflow. The solution, which uses deep learning technology and cloned or default voice models, is specifically designed for the journalism and communications sector.
- The Image and Video Processing Group (GPI), part of the IDEAI-UPC research group, and the Digital Culture and Creative Technologies Research Group (DiCode) from the Image Processing and Multimedia Technology Center (CITM) at the Universitat Politècnica de Catalunya – BarcelonaTech (UPC), have co-organised the AI and Music Festival (S+T+ARTS) together with Sónar+D and Betevé, to explore the creative use of artificial intelligence in music.



