Cloud Site Reliability Engineer (SRE) Job at Promise, Oakland, CA

d2hJdldHVjdXZVQ0eEpRcDRiVGlCNGhl
  • Promise
  • Oakland, CA

Job Description

Company Overview

Promise empowers utilities and government agencies to create flexible, affordable solutions for individuals struggling with debt. Our innovative approach to payment plans and relief distribution significantly improves enrollment and recovery rates, helping individuals clear debts faster and reducing delinquencies for our partners.

We treat people facing financial difficulties with respect and dignity, providing the tools and resources they need to thrive. Our team includes experts from companies like Palantir, Google, Stripe , and esteemed government leaders.

Backed by over $50 million in funding from top investors such as 8VC, Kapor Capital, XYZ Ventures, and Howard Schultz, we've been recognized as one of Fast Company's "World's Most Innovative Companies of 2022."

Role Overview 

We’re looking for a Cloud Site Reliability Engineer (SRE) to build, operate, and optimize the infrastructure that powers our products. You’ll be responsible for ensuring high reliability, performance, and scalability of our cloud-based systems. The ideal candidate is self-sufficient, detail-oriented, and execution-driven, with a strong background in software development, site reliability engineering (SRE), and infrastructure-as-code (IaC).

You’ll collaborate closely with product and engineering teams to improve system architecture, troubleshoot issues, and automate operational processes. This role is ideal for someone who thrives in a hard-working, fast-moving environment, enjoys solving complex technical challenges, and takes personal responsibility for ensuring security outcomes are achieved and aligned to business goals.

What You’ll Do

  • Design, implement, and manage cloud infrastructure to ensure reliability, scalability, and security.

  • Automate infrastructure and operations using Terraform, scripting, and configuration management tools.

  • Develop strong relationships with engineering teams to define system reliability goals and best practices.

  • Troubleshoot and resolve complex network and system issues using observability tools, stack traces, and system logs.

  • Monitor and optimize system performance, implementing best practices for high availability and disaster recovery.

  • Formalize and liaise with the Engineering team to guide them through a security design review process

  • Ensure the security and stability of Linux-based production systems.

  • Provide essential support in aligning our technology projects with compliance requirements, navigating the complexities of state and federal regulations, while fostering an environment of innovation. 

  • Serve as a bridge between technical teams and non-technical stakeholders, translating security and compliance needs into actionable plans that support our broader business objectives.

What Will Enable You

  • 4+ years of experience in Linux system administration, managing large-scale production environments.

  • Strong debugging skills, with experience in performance tuning, observability, and system-level troubleshooting.

  • Hands-on experience with cloud platforms (AWS, Azure, or GCP).

  • Expertise in Infrastructure-as-Code (IaC) using Terraform or similar tools.

  • Proficiency in monitoring tools (e.g., Prometheus, Datadog) and health check implementation.

  • Experience with containerization (Docker, Podman, Kubernetes).

  • Scripting experience (Python, Bash, or equivalent) to automate infrastructure management.

  • Knowledge of networking and security best practices for cloud environments.

Promise is an equal opportunity employer and does not discriminate against any applicant or employee because of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, genetic information, age, or military or veteran status. Additionally, the Company complies with applicable state and local laws governing non-discrimination in employment in every jurisdiction in which it operates. Promise is committed to promoting diversity and inclusion in the workplace. We also provide reasonable accommodations to qualified individuals with disabilities, pregnant individuals, and those with sincerely held religious beliefs, in accordance with applicable laws.

Promise engages in US government contracts and restricts hiring to US persons, which includes US citizens and permanent residents (e.g., Green Card holders). Additionally, candidates must reside in the US.

Compensation

$149K – $195K

Job Tags

Remote job, Permanent employment, Relief, Local area, Flexible hours,

Similar Jobs

Orion Consortium

Hardware Technician 2 Job at Orion Consortium

~ The Hardware Technician 2 provides Tier 2 and 3 on-site and remote supports for computer workstations, servers, printers, peripherals, and teleconferencing equipment. ~ Conducts sites surveys; assesses and documents current site configuration and user requirements... 

Old World Industries

IT Finance Analyst I Job at Old World Industries

 ...POSITION PURPOSE : The IT Finance Analyst is responsible for processing and coordinating IT financial transactions while also serving as an entry-level support role for SAP Record-to-Report (R2R) functions. This position ensures financial accuracy, manages IT purchase... 

The Brothers that just do Gutters

High-End Gutter Sales Job at The Brothers that just do Gutters

 ...schedule Home office stipend New Job Description: Solutionist Take Everything You Ever Learned About Sales and Throw It in the Gutter! Our mission to "Reinvent Contractor Service", is best achieved by investing in our employees, always doing what's right, and... 

Nesco Resource

PC Technician Job at Nesco Resource

Nesco Resource is working with an IT-Infrastructure company in the Urbancrest area seeking a PC Technician with related experience to work in their warehouse! Hours: 8am-4:30pm Monday-Friday Pay: $18hr JOB DESCRIPTION: Test, troubleshoot, and diagnose computer... 

Sentry Insurance

Public Relations Specialist Job at Sentry Insurance

Responsible for operational implementation of corporate communications strategy by promoting a positive organizational image through newspapers, periodicals, television, radio, digital media, speeches, or personal contact. Prepares written press releases, distributes press...