PhD Research Intern - Multimodal Research (Summer 2025)
Full Time / San Francisco, CA / Dolby
Join the leader in entertainment innovation and help us design the future. The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby’s continued growth. As a valued member of the Dolby team, you’ll see and hear the results of your work everywhere, from movie theaters to smartphones. We continuously push the boundaries of audio, imaging, and cloud technology to create spectacular entertainment experiences.
As a diverse and dynamic group, our ATG researchers work on cutting-edge projects related to computer science and electrical engineering for audio, video, and cloud technologies, exploring exciting domains such as AI/ML, algorithms, digital signal processing, audio processing, image processing, computer vision, AR/VR, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.
What is the Research Internship Program?
As a Research Intern at Dolby, you will have the opportunity to define and lead your own pioneering research project that connects big-picture change with the meticulous detail of technological innovation. Our intern research projects are driven by a greater purpose: enhancing the human experience.
With the guidance of Dolby’s leading media technology experts, you will delve into your own innovative projects across exciting domains, which could include, but are not limited to: Bringing Human Perception into Digital Experiences, Personalizing Content, Enhancing Wellbeing and Healthcare, Nurturing Deeper Connections, and Enhancing Education and Learning Experiences.
This opportunity will be based out of our research offices in San Francisco or Sunnyvale, depending on the role.
What are we looking for in candidates?
We are seeking current PhD students with a diverse range of backgrounds and experiences. To be eligible, you should have completed at least one year of your doctoral program. Along with solid technical skills, candidates should demonstrate problem-solving and analytical abilities, good communication and collaboration skills, a curiosity for how and why things work as they do, and a passion for audio, video, movies, music, or game technology. You have a desire to bring in new ideas and are open to learning from others.
Summary of Position:
We are a key research team within Dolby’s Advanced Technology Group, focused on creating cutting-edge multimodal technologies that drive next-generation experiences. We are looking for strong candidates with an interest in one or more of the following areas (the list is not exhaustive):
Machine perception and reasoning
Human Perception
Computer vision, speech/audio processing and natural language processing
Multimodal/cross-modal representation learning
Multimedia content analysis, generation and enhancement (music, speech, audio, video, and/or text)
Speech, music, or video content understanding and information retrieval
Generative AI for media creation
Efficient deep learning
Explainable AI and trustworthy AI
Perceptual and interactive multimedia
Spatial computing for 6DoF A/V media
3D deep learning (including NeRF, Gaussian Splatting, etc.)
2D/3D object detection/segmentation/recognition/tracking
Computational photography
Human sensing for entertainment applications (human computer interfaces, affective computing, physiological data measurement and analysis, etc.)
Computational/experimental neuroscience and human perception
Immersive experiences and personalized experiential systems
Requirements:
PhD students in Artificial Intelligence, Electrical Engineering, Computer Science, Bioengineering, Neuroscience, or a related field.
Proven ability to pursue new areas of multimodal research for AI, data analysis, or neuroscience and demonstrate results through projects, prototypes, patent filings, and papers in peer-reviewed journals and conferences.
Experience as a researcher, whether through internships, full-time roles, or work in a lab.
Experience with one or more programming languages or game engines (e.g., Python, C/C++, MATLAB, Unreal, Unity, USD).
Highly Desired Experience in one of the following:
Creating demos and prototypes for research applications.
Working with frameworks like PyTorch or TensorFlow.
Working with head-mounted displays (HMDs).
Developing and training multimodal deep learning architectures.
Writing technical reports and/or publications.
First-author publications at peer-reviewed conferences or journals.
We will review applications on a rolling basis. For the best chance to have your resume reviewed and considered, we recommend submitting your application by September 25, 2024.
Eligibility:
Currently a PhD student in Artificial Intelligence, Electrical Engineering, Computer Science, Bioengineering, Neuroscience, or a related field. Must be available to work full-time, Monday through Friday, for 12 weeks between May/June 2025 and August/September 2025.
The start dates for this internship are as follows (please note these dates are not flexible):
May 19, 2025 or
June 16, 2025
The San Francisco/Bay Area base hourly range for this internship position is $50-57/hr and can vary if outside of this location. Our hourly ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific hourly range and perks and benefits for your location during the hiring process.
Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12.
Equal Employment Opportunity:
Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.