Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.
At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.
Note: At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content and build a generational company together. You should have start-up experience and be a self-starter that is driven to build impactful products that change the status quo. You must be willing to work in-person in either NYC or SF.
Overview:
We are seeking a highly motivated Research Scientist to join our team and drive innovation in video generation and AI agents using 3D Variational Auto-encoders (3DVAE) and video diffusion models. The ideal candidate will have a strong background in machine learning, particularly in generative models, and a passion for pushing the boundaries of what’s possible in AI.
Responsibilities:
-
Conduct cutting-edge research in video generation and AI agents, focusing on developing new architectures and algorithms for 3DVAE and video diffusion models.
-
Collaborate with the team to integrate research findings into practical applications, ensuring models are suitable for deployment.
-
Stay up-to-date with the latest developments in the field, such as advancements in diffusion models for video, and identify opportunities for innovation.
-
Present research findings and results to the team and stakeholders, potentially contributing to publications in top-tier conferences or journals.
Qualifications:
-
PhD in Computer Science, Machine Learning, or a related field, with a focus on generative models.
-
Extensive knowledge of deep learning, particularly VAEs and diffusion models, with experience in 3D data processing and representation,
-
Proficiency in programming languages like Python, and familiarity with deep learning frameworks such as PyTorch or TensorFlow, essential for model development.
-
Strong communication and collaboration skills, given the need to work with diverse teams and present complex research.
-
A track record of publications in top-tier conferences or journals is a plus, reflecting research impact.
This role is critical for advancing the state of the art, with a focus on developing new methods rather than applying existing ones, aligning with the startup’s innovative goals.
Benefits:
-
Competitive compensation and equity
-
401k (no match)
-
Healthcare (Silver PPO Medical, Vision, Dental)
-
Lunch and snacks at the office
We encourage you to apply even if you don't fully meet all the listed requirements; we value potential and diverse perspectives, and your unique skills could be a great asset to our team.
What We Do
Hedra's flagship foundation model, Character-2, allows users to turn any image into an expressive talking character video. Since launching in July, our platform has attracted over a million users and has created countless pieces of viral content. And we're backed by top investors at Index, A16Z, and Abstract Ventures.
Why Work With Us
Hedra released the first ever audio-conditioned video foundation model, which enables the generation of the most realistic and expressive character videos with AI. Looking forward, our team is creating a full platform for the next generation of media creation, built with AI and creative intelligence at its core.
Hedra Offices
OnSite Workspace
Hedra's main office is in San Francisco and secondary hub is in New York.