Summary
Dolby’s research division is looking for a talented, self-motivated researcher to push the state of the art in multimedia technologies. The ideal candidate has a strong background in HCI and deep learning, in both conceptual understanding and practical experience. A core aspect of this role is keeping up to date with the literature, implementing, and innovating. Consequently, knowledge of or experience in any or all of the following is helpful:
- Human-computer interaction techniques and the application of deep learning to this area.
- Diffusion, autoregressive, or other generative models.
- Self-supervised learning, contrastive learning, and autoencoders.
- Audio, image, or text applications: source separation, text-to-speech, music synthesis, image segmentation, image captioning, question answering, language models, etc.
- Latent space exploration, navigation, control, and alignment techniques.
The role will involve prototyping inspiring experiences that explore a complement of modalities. These technologies will be used to extend immersion and interaction, so the candidate should be willing to refine the user experience empirically. The ideal candidate has experience developing real-time applications that deliver multimodal experiences and/or human-computer interfaces involving generative AI.
Main responsibilities:
- Prototype and demonstrate multimodal, interactive, and immersive user experiences.
- Work closely with other domain experts to refine and execute Dolby’s technical strategy in artificial intelligence and machine learning.
- Use deep learning to create new solutions and enhance existing applications.
- Push the state of the art and develop intellectual property.
- Transfer technology to product groups and draft patent applications.
- Advise internal leaders on recent deep learning advancements in industry and academia to further influence research direction and business decisions.
Requirements:
- Ph.D. in computer science or a similar field, with a focus on deep learning. Knowledge of audio, video, 3D, or text processing is desirable.
- Strong publication record, with publications in top domain-specific or general machine learning conferences, e.g., ACM CHI, NeurIPS, ICLR, ICML, ACL, CVPR, ICASSP.
- Good knowledge of the current machine learning literature.
- Highly skilled in Python and one or more popular deep learning frameworks (TensorFlow or PyTorch).
- Creativity and the ability to envision new technologies and turn them into innovative products.
- Good communication skills.
All official communication regarding employment opportunities at Dolby will come from an official dolby.com email address. We will never request payment as part of the hiring process. If you receive a suspicious message, please verify its authenticity before responding.