SRE - Platform Engineer
<meta><p style="font-family:" basel="" grotesk",arial,sans-serif;font-size:9pt;font-weight:400;line-height:1.6;letter-spacing:0.25px;margin:4px="" 0px;padding:0px;"=""><b><strong style="font-size:16pt;white-space:pre-wrap;">About the role</strong></b></p><ul data-pattern="discCircleSquare" data-depth="1" style="font-family:" basel="" grotesk",arial,sans-serif;font-size:11pt;font-weight:400;margin:8px="" 0px;line-height:1.6;padding:0px="" 0px="" 32px;list-style-type:disc;"=""><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">DroneUp is seeking an SRE - Platform Engineer who will focus on ensuring the reliability, scalability, and performance of our internal and client-facing IT infrastructure and developer platform. This role combines strong operational expertise with platform engineering principles, emphasizing uptime, incident response, and observability. The ideal candidate will drive SRE best practices, including SLO/SLI management, monitoring, and proactive system improvements, while collaborating with the broader platform engineering team. 
Our principles include self-service, security by default, automation, and building resilient systems for software delivery at scale.</span></li></ul><p style="font-family:" basel="" grotesk",arial,sans-serif;font-size:11pt;font-weight:400;line-height:1.6;letter-spacing:0.25px;margin:4px="" 0px;padding:0px;"=""><b><strong style="font-size:16pt;white-space:pre-wrap;">What you'll do</strong></b></p><ul data-pattern="discCircleSquare" data-depth="1" style="font-family:" basel="" grotesk",arial,sans-serif;font-size:11pt;font-weight:400;margin:8px="" 0px;line-height:1.6;padding:0px="" 0px="" 32px;list-style-type:disc;"=""><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Broad domain architect for the internal developer platform and all cloud engineering</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Drive architecture for tooling or in-house software</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Mentor other platform engineers to drive strong engineering practices</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Enablement of platform engineering technical capabilities in our internal client teams in software engineering</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Peer with the senior architects and engineers in software engineering</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Architecture and engineering focused on GCP environment</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Architect and oversee GKE cluster operations 
and workload management</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Provide feedback to others and participate in peer reviews / pair programming</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Drive the broad adoption of Test-Driven Development by designing, developing, and debugging unit and integration tests for new and existing infrastructure and code</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Stay continuously curious about existing implementations and new technologies, and share findings with the team</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Practice continuous improvement across all job areas, both personally and professionally</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Clearly communicate with platform engineering teams and other stakeholders and provide technical direction while doing so</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Stay current with platform changes and third-party libraries. 
Proactively investigate better alternatives to current solutions</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Understand OpenTelemetry and true observability, and how observability differs from monitoring and logging</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Grow the engineering culture towards a high-performing team</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Practice the arts of self-service, least privilege, and security by default in all solutions</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Lead incident response, including on-call rotations, root cause analysis, and post-mortem reviews</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Implement and optimize monitoring, alerting, and observability systems for system reliability</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Collaborate on capacity planning and performance optimization to ensure high availability</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Other duties as assigned</span></li></ul><p style="font-family:" basel="" grotesk",arial,sans-serif;font-size:15pt;font-weight:400;line-height:1.6;letter-spacing:0.25px;margin:4px="" 0px;padding:0px;"=""><b><strong 
style="font-size:16pt;white-space:pre-wrap;">Our Tooling Stack Includes but is Not Limited to:</strong></b></p><ul data-pattern="discCircleSquare" data-depth="1" style="font-family:" basel="" grotesk",arial,sans-serif;font-size:11pt;font-weight:400;margin:8px="" 0px;line-height:1.6;padding:0px="" 0px="" 32px;list-style-type:disc;"=""><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">GitHub / GitHub Actions</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">GCP</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">Kubernetes (via GKE), Helm, Docker</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">Google Secret Manager (GSM, part of GCP)</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">Terraform</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">Honeycomb</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">Grafana stack</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="font-size:12pt;white-space:pre-wrap;">Prometheus</span></li></ul><p style="font-family:" basel="" grotesk",arial,sans-serif;font-size:12pt;font-weight:400;line-height:1.6;letter-spacing:0.25px;margin:4px="" 0px;padding:0px;"=""><b><strong style="font-size:16pt;white-space:pre-wrap;">Qualifications</strong></b></p><ul data-pattern="discCircleSquare" data-depth="1" style="font-family:" basel="" 
grotesk",arial,sans-serif;font-size:11pt;font-weight:400;margin:8px="" 0px;line-height:1.6;padding:0px="" 0px="" 32px;list-style-type:disc;"=""><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Bachelor's degree in Computer Science, Computer Engineering, or a related field, or 8+ years of experience as a software engineer</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Proficiency in Kubernetes. Optional: CKA, CKAD</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Extensive experience in Unix / Linux</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Polyglot, with proficiency in multiple languages (ideally Golang, Node.js, Python, HCL, and more)</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Knowledge of multi-cloud environments, including GCP, AWS, and Azure (familiar with at least two of these environments)</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experienced in using Git in trunk-based development models</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience using feature flagging in infrastructure and runtime (k8s)</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience with backend database technology is a plus, including support and performance enhancements</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Advanced experience working with and creating public 
cloud resources in Terraform or other infrastructure as code tools</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience participating in a 24/7 on-call schedule without supervision and successfully resolving issues without escalation</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience using OpenTelemetry for observability as well as other monitoring tools such as Datadog, New Relic, and others</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Good understanding of networking and routing principles</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience in dockerizing applications and orchestrating them with Kubernetes</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Familiarity with security configuration for web/API services (SSL, access control)</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience with Jira or other work tracking systems. 
Ability to resolve tickets in priority order and collaborate with the Technical Product Manager to adjust priorities</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Excellent documentation skills, using Confluence or similar tooling – this could include support notes, runbooks, ADRs, etc.</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Familiarity with creating an end-to-end CI/CD pipeline using various tools with artifact storage</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Familiarity with use of macOS as a desktop and predominantly CLI interfaces</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience applying a “product mindset” by understanding stakeholder needs, priorities, and business value</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience with security compliance frameworks including FedRAMP, NIST, and SOC 2</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Proven experience in SRE practices, including incident management and reliability engineering</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Familiarity with monitoring tools like Prometheus, Grafana, or Honeycomb for observability</span></li><li style="font-size:12pt;margin:3px 0px;letter-spacing:0.25px;line-height:1.6;"><span style="white-space:pre-wrap;">Experience with chaos engineering, load testing, or reliability testing frameworks</span></li></ul><p style="font-family:" basel="" 
grotesk",arial,sans-serif;font-size:11pt;font-weight:400;line-height:1.6;letter-spacing:0.25px;margin:4px="" 0px;padding:0px;"=""><i><b><strong style="color:rgb(0,0,0);font-size:8pt;font-style:italic;white-space:pre-wrap;">Security Responsibility Statement: </strong></b></i><span style="color:rgb(0,0,0);font-size:8pt;white-space:pre-wrap;">Employees are expected to provide a high level of security to any personal or private information accessed as part of their work, whether at a DroneUp facility or remotely. This includes participating in security training, remaining sensitive to individual rights to personal privacy, and complying with company policies. Employees who have access to sensitive data that is protected by regulation, such as HIPAA, or by contract, such as credit card data, must comply with any additional requirements dictated by the governing regulations or associated contracts.</span></p>