Sift

Staff Cloud Platform Engineer - Core Infra

Posted 12 Days Ago
Remote
Hiring Remotely in USA
Senior level
This role involves maintaining Sift's infrastructure, ensuring service availability, optimizing storage systems, and developing fault-tolerant applications. Responsibilities include system design, incident response, and collaborating to improve performance.
The summary above was generated by AI

The Core Platform team maintains and optimizes the data, infrastructure, messaging, and services platform that powers Sift’s online systems. We ensure these systems are always available, reliable, and performing at their best to meet customer needs. In the event of an outage or failure, we follow well-practiced recovery plans to restore services swiftly. Managing such complex, large-scale systems requires continuous monitoring and proactive maintenance to uphold these standards.

What you’ll do:

  • Own the availability, performance, and scalability of Sift’s primary online storage systems and infrastructure.

  • Design and build immutable infrastructure and fault-tolerant, multi-AZ/multi-region systems that are resilient and self-healing.

  • Design and implement multi-region deployments, such as BigTable clusters spanning multiple regions, with strategies to ensure specific customers are routed to designated regions (e.g., sticky sessions at the regional level).

  • Solve complex problems that arise from our unique data volume and request rate, which may involve digging deep into data store and messaging internals.

  • Optimize local development and testing workflows to be fast, efficient, and seamless.

  • Design and implement services and libraries for components to interact with the data stores, messaging layer, and services platform.

  • Develop tools for monitoring, detecting faults, and automatically repairing distributed systems.

  • Provide design support to internal engineering teams for optimal usage of data stores, data growth planning, production workload optimization, messaging, caching, and the services platform.

  • Participate in on-call support and incident response activities, providing 12/7 coverage for one calendar week approximately once every 3-4 weeks.

Technical stack: GCP, AWS, Airflow, Terraform, Kubernetes, Vault, Jenkins, Kafka, Snowflake, Spark, Java 11, Python 3, Ruby 2.7, Ruby on Rails.

What makes you a strong fit:

You have a deep understanding of large-scale computing and approach infrastructure as code. You're passionate about designing and building immutable infrastructure and resilient, multi-AZ/multi-region systems that can withstand failures. While you recognize the importance of monitoring and alerting, your ultimate goal is to design self-healing systems. Collaboration is key to you, and you strive to act as a force multiplier by making thoughtful trade-offs to drive success.

Key Qualifications:

  • 8+ years of experience as a Software Engineer focused on infrastructure/platform services or in a Site Reliability Engineering (SRE) role.

  • Strong programming skills in languages such as Java, Scala, or Python.

  • Experience designing and implementing distributed systems.

  • Experience building and managing cloud infrastructure on AWS or GCP.

  • Expertise in building infrastructure as code and automating provisioning processes using tools like CloudFormation or Terraform.

  • Proficiency in setting up and managing monitoring and alerting systems, both open-source and commercial.

  • Familiarity with Docker and container orchestration technologies like Kubernetes, GKE, or AWS ECS.

  • Strong experience troubleshooting and resolving production system issues, with a focus on building automated solutions to prevent future occurrences.

  • Proven expertise in automation and a solid understanding of configuration management tools.

Benefits and perks: 

  • Competitive total compensation package

  • 401k plan

  • Medical, dental, and vision coverage

  • Wellness reimbursement

  • Education reimbursement

  • Flexible time off

Our interview process:

  • Introduction interview: a 30-minute session with a recruiter to discuss your background and the role.

  • Hiring Manager interview: a 60-minute interview with the hiring manager to explore your fit for the position.

  • Virtual onsite loop with the team: four interviews lasting approximately 4 hours in total, covering system design, coding, a deep dive, and values and behavior-based conversations.

During these sessions, you will have the opportunity to learn about company culture, meet engineers or peers from your team, and discuss distributed system problems. You will also have time to ask questions and get a clear picture of your future responsibilities and the project.

A little about us:

Sift is the AI-powered fraud platform securing digital trust for leading global businesses. Our deep investments in machine learning and user identity, a data network scoring 1 trillion events per year, and a commitment to long-term customer success empower more than 700 customers to grow fearlessly. Brands including DoorDash, Yelp, and Poshmark rely on Sift to unlock growth and deliver seamless consumer experiences. Visit us at sift.com and follow us on LinkedIn.

Top Skills

Airflow
AWS
GCP
Java 11
Jenkins
Kafka
Kubernetes
Python 3
Ruby 2.7
Ruby On Rails
Snowflake
Spark
Terraform
Vault
