DataBricksposted 18 days ago
Senior
San Francisco, CA
Professional, Scientific, and Technical Services

About the position

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems - from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. Founded by engineers - and customer obsessed - we leap at every opportunity to tackle technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started. As a production engineer with a backend focus, you will ensure stable and efficient operation of production environments of your service by proactively monitoring systems, automating routine tasks, optimizing performance, responding to incidents, and managing deployment pipelines. This implies, among others, to write software in Scala/Java and to work closely with other engineering teams to maintain high availability and ensure the integrity and security of live systems.

Responsibilities

  • Proactively monitor systems to ensure stable and efficient operation of production environments.
  • Automate routine tasks and deployment processes to enhance operational efficiency.
  • Optimize performance and address performance bottlenecks in backend services and infrastructure.
  • Respond to incidents and manage deployment pipelines.
  • Work closely with other engineering teams to maintain high availability and ensure the integrity and security of live systems.

Requirements

  • BS/MS/PhD in Computer Science, or a related field.
  • 10+ years of production level experience in one of: Java, Scala, C++, or similar language.
  • Comfortable working towards a multi-year vision with incremental deliverables.
  • Experience in architecting, deploying and operating large scale distributed systems with high availability, scalability and durability.
  • Experience in performance and cost optimization, disaster recovery mechanisms, incident management and troubleshooting.
  • Good knowledge of SQL and operational experience in distributed and single node database engines.
  • Experience with software security and systems that handle sensitive data.
  • Experience with cloud technologies, e.g. AWS, Azure, GCP, Docker, Kubernetes.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service