Description
Position Title: Cloud Monitoring (Observability) Engineer
Country: US
Type: Regular Full-Time
# of Openings: 1
Company Name: DOT NHTSA
Overview:
Tantus Technologies, Inc. (Tantus), recognized by the Washington Post as a Top Workplace, is seeking a skilled Observability Engineer with deep expertise in New Relic and other observability tools to help design, implement, and manage our monitoring and observability infrastructure. You will work closely with engineering, DevOps, and site reliability teams to ensure application performance and reliability through actionable telemetry and insights.
Clearance: This position supports a federal contract and requires U.S. citizenship or lawful permanent resident (Green Card holder) status, as well as the ability to obtain a Public Trust clearance.
What You'll Do:
- Design and implement observability solutions using New Relic and other observability tools, including custom dashboards, alerts, and instrumentation.
- Collaborate with development and DevSecOps teams to define monitoring requirements and SLIs/SLOs.
- Optimize performance and troubleshoot issues using logs, metrics, and traces.
- Integrate New Relic with CI/CD pipelines and other monitoring tools.
- Lead efforts to mature the observability practice across the organization (standardization, training, documentation).
- Work with federal stakeholders to set up and manage an Observability Center of Excellence.
- Participate in on-call rotation and incident response to provide insights from observability data.
- Stay up to date with New Relic feature releases and observability trends.
Required knowledge and skills
- Degree in Computer Science, Mathematics, Engineering, or equivalent professional experience.
- 5+ years of experience in observability, site reliability engineering, DevOps, or infrastructure engineering roles.
- 2-3+ years specifically focused on observability platforms.
- A working knowledge of AI/ML-driven monitoring solutions.
- At least 7 years of overall IT SDLC and cloud experience.
Tooling & Platform Proficiency
- Expertise in New Relic: Advanced knowledge of NRQL (New Relic Query Language), APM, Infrastructure monitoring, Synthetics, and Workloads.
- OpenTelemetry (OTel): Experience implementing vendor-neutral instrumentation for traces, metrics, and logs.
- Log Management: Proficiency in ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, or New Relic Logs.
- Dashboarding: Ability to create high-level executive "single pane of glass" views as well as granular technical dashboards.
- Exposure to AIOps tools: New Relic AI, Moogsoft, BigPanda, or similar.
Engineering & Infrastructure
- Programming/Scripting: Proficiency in languages like Python, Go, or Java for custom instrumentation and automation.
- Infrastructure as Code (IaC): Experience using Terraform or CloudFormation to deploy monitoring configurations at scale.
- CI/CD Integration: Skills in integrating observability gates into pipelines (e.g., Jenkins, GitLab CI, GitHub Actions) to automate performance testing.
- Cloud Platforms: Strong understanding of AWS, Azure, or GCP, particularly how to monitor serverless (Lambda) and containerized (Kubernetes/EKS) environments.
SRE & Governance
- SLI/SLO/SLA Mastery: Ability to translate business requirements into technical Service Level Indicators and Objectives.
- Center of Excellence (CoE) Management: Experience defining standards, best practices, and governance models for multi-tenant environments.
Abilities
- Ability to design and implement observability solutions using New Relic (required).
- SLI/SLO/SLA Mastery: Ability to translate business requirements into technical Service Level Indicators and Objectives.
- Dashboarding: Ability to create high-level executive "single pane of glass" views as well as granular technical dashboards.
Nice to haves
- Certifications in New Relic or AI/ML frameworks.
- Experience with other observability tools (Datadog, Site24x7, and WhatsUp Gold).
- Experience working with federal stakeholders to align technical monitoring with mission-critical goals.
- A track record of conducting workshops or writing documentation to upskill development teams on observability.
- Strong communication and documentation skills.
Salary Range:
- Salary range is $140,000-$160,000/year. The salary range for this position reflects a variety of factors that influence compensation decisions, including skills, experience, training, certifications, and organizational needs.
