Manager I, Engineering - ML Observability

Posted 9 Days Ago
Be an Early Applicant
New York, NY
Hybrid
3-5 Years Experience
Artificial Intelligence • Cloud • Software • Cybersecurity
We are building the monitoring and security platform for developers, IT ops teams and business users in the cloud age.
The Role
As the Engineering Manager for ML Observability at Datadog, you will lead a team focused on enhancing and expanding the ML Observability product. Responsibilities include managing and mentoring a team of engineers, leveraging technical expertise in software engineering and applied science, exploring and implementing new techniques for model evaluation and monitoring, and staying current with industry trends in machine learning and observability.
Summary Generated by Built In

The ML Observability team is committed to empowering our customers with an advanced observability platform, specifically designed for applications that increasingly integrate machine learning components such as large language models and generative AI. We provide comprehensive monitoring and diagnostics for ML-based components, tracking model performance, drift, fairness, and system stability. Our platform also offers model prediction explainability and root-cause analysis, enhancing organizations' confidence in the reliability of their deployments. You can learn more about our ML/LLM Observability solution here .
As the Engineering Manager, you will lead a team focused on enhancing and expanding Datadog's ML Observability product. Positioned at the forefront of R&D, you will emphasize rigor and experimentation to design, refine, and implement advanced techniques for evaluating and monitoring AI components - LLMs in particular - in our customers' applications. Your leadership and expertise in both engineering and applied science will be pivotal in shaping the direction of our product, ensuring Datadog remains a key player in this rapidly evolving field.
At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.
What You'll Do:

  • Manage and mentor a team of engineers, fostering a collaborative and innovative work environment
  • Leverage your technical expertise in software engineering and applied science to guide the team in building robust and scalable solutions
  • Apply your experience with LLMs to enhance the product's capabilities in evaluating and monitoring LLM-based applications
  • Explore and implement new techniques and tools to provide deeper insights into model behavior, drift, fairness, and interpretability
  • Engage with senior management and executives, articulating complex technical concepts clearly and precisely
  • Stay current with industry trends and advancements in machine learning and observability, driving innovation within the team


Who You Are:

  • Proven experience in software engineering and applied science, with a focus on engineering LLM-based systems in production
  • Demonstrated experience managing small teams of software engineers and/or applied scientists, with a track record of delivering high-quality products
  • Strong software development skills and proficiency in Python and Go
  • Strong understanding of machine learning theory, statistics, and fundamentals
  • Excellent communication abilities to convey complex technical concepts clearly
  • A collaborative mindset and proven experience in working in cross-functional teams
  • A proactive approach with a passion for continuous learning and innovation


Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That's okay. If you're passionate about technology and want to grow your skills, we encourage you to apply.
Benefits & Growth:

  • New hire stock equity (RSUs) and employee stock purchase plan (ESPP)
  • Continuous professional development, product training, and career pathing
  • Intradepartmental mentor and buddy program for in-house networking
  • An inclusive company culture, ability to join our Community Guilds (Datadog employee resource groups)
  • Access to Inclusion Talks, our internal panel discussions
  • Free, global mental health benefits for employees and dependents age 6+
  • Competitive global benefits


Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.
Datadog offers a competitive salary and equity package, and may include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Datadog offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, a 401(k) plan and match, paid time off, fitness reimbursements, and a discounted employee stock purchase plan.
The reasonably estimated yearly salary for this role at Datadog is:
$187,000 - $240,000 USD
About Datadog:
Datadog (NASDAQ: DDOG) is a global SaaS business, delivering a rare combination of growth and profitability. We are on a mission to break down silos and solve complexity in the cloud age by enabling digital transformation, cloud migration, and infrastructure monitoring of our customers' entire technology stacks. Built by engineers, for engineers, Datadog is used by organizations of all sizes across a wide range of industries. Together, we champion professional development, diversity of thought, innovation, and work excellence to empower continuous growth. Join the pack and become part of a collaborative, pragmatic, and thoughtful people-first community where we solve tough problems, take smart risks, and celebrate one another. Learn more about #DatadogLife on Instagram , LinkedIn, and Datadog Learning Center.
Equal Opportunity at Datadog:
Datadog is an Affirmative Action and Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. Here are our Candidate Legal Notices for your reference.
Your Privacy:
Any information you submit to Datadog as part of your application will be processed in accordance with Datadog's Applicant and Candidate Privacy Notice .

Top Skills

Go
Python

What the Team is Saying

Darcy
Kyvaune
Mia
Zina
Cameron
LJ
Micah
Wissal
The Company
New York, NY
5,000 Employees
Hybrid Workplace
Year Founded: 2010

What We Do

Datadog (NASDAQ: DDOG) is a global SaaS business, delivering a rare combination of growth and profitability. We are on a mission to break down silos and solve complexity in the cloud age by enabling digital transformation, cloud migration, and infrastructure monitoring of our customers' entire technology stacks. Built by engineers, for engineers, Datadog is used by organizations of all sizes across a wide range of industries. Together, we champion professional development, diversity of thought, innovation, and work excellence to empower continuous growth. Join the pack and become part of a collaborative, pragmatic, and thoughtful people-first community where we solve tough problems, take smart risks, and celebrate one another.

Why Work With Us

At Datadog, we learn from and celebrate each other daily - each win is a team win. Datadogs solve tough problems, innovate pragmatically, and grow together. We promote from within, provide mentorship and opportunities for career development, and support our colleagues in the process. Best of all? We truly love what we do.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Datadog Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them and their team.

Typical time on-site: 3 days a week
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account