Jobgether-posted 5 days ago
$160,000 - $190,000/Yr
Full-time • Mid Level

This role offers the opportunity to take ownership of the reliability and health of a large-scale IoT device fleet. You will work at the intersection of embedded systems, backend infrastructure, and site reliability to ensure devices remain online, performant, and resilient. This position requires hands-on problem-solving, proactive monitoring, and collaboration with cross-functional teams to maintain operational excellence. You will contribute to building observability tools, automating device management, and improving system reliability. Success in this role will directly impact operational efficiency and the user experience for a nationwide technology platform. Occasional travel for on-site deployments and debugging adds variety and engagement to the work. This is an ideal role for engineers who thrive on ownership, scalability challenges, and innovative problem-solving.

  • Design, implement, and maintain systems to monitor and improve IoT device health at scale.
  • Develop internal tools and scripts for device setup, QA automation, and fleet observability.
  • Collaborate with backend and hardware teams to support device integration, calibration, and reliability.
  • Investigate and resolve fleet-wide issues using logs, telemetry, and metrics.
  • Test and optimize hardware to ensure peak performance.
  • Conduct periodic health assessments and recommend firmware or process improvements.
  • Serve as the primary point of contact for hardware health and report findings to operations teams.
  • Author and maintain troubleshooting guides, playbooks, and documentation for internal use.
  • Travel occasionally to support on-site deployments or device debugging.
  • 5+ years of professional software engineering experience.
  • Experience managing distributed Linux-based hardware appliances or IoT fleets.
  • Proficiency with observability and monitoring tools (e.g., DataDog, OpenTelemetry, Prometheus, Grafana).
  • Strong coding skills in Python and SQL, with experience delivering production-quality software.
  • Experience building internal tools, monitoring platforms, or reliability systems.
  • Hands-on experience with Linux systems administration and troubleshooting (e.g., dmesg, journalctl, systemd).
  • Background in wireless connectivity technologies (e.g., cellular, WiFi).
  • Excellent communication skills to convey complex technical findings clearly.
  • Self-starter mentality with the ability to thrive in a fast-paced, ownership-driven environment.
  • Competitive salary: $160,000 - $190,000 USD per year.
  • Equity stake in the business to share in its growth.
  • Fully remote role with flexibility to work anywhere in North America.
  • Unlimited PTO, with a minimum of 10 days per year.
  • Health insurance options and 401(k) plan.
  • Home office setup reimbursement.
  • Collaborative, mission-driven work environment with growth opportunities.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service