Staff Software Engineer - Production Engineering

DataBricksposted 18 days ago

Senior

San Francisco, CA

Professional, Scientific, and Technical Services

Upload and Match ResumeTrack Jobs with Teal

About the position

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems - from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. Founded by engineers - and customer obsessed - we leap at every opportunity to tackle technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started. As a production engineer with a backend focus, you will ensure stable and efficient operation of production environments of your service by proactively monitoring systems, automating routine tasks, optimizing performance, responding to incidents, and managing deployment pipelines. This implies, among others, to write software in Scala/Java and to work closely with other engineering teams to maintain high availability and ensure the integrity and security of live systems.

Responsibilities

Proactively monitor systems to ensure stable and efficient operation of production environments.
Automate routine tasks and deployment processes to enhance operational efficiency.
Optimize performance and address performance bottlenecks in backend services and infrastructure.
Respond to incidents and manage deployment pipelines.
Work closely with other engineering teams to maintain high availability and ensure the integrity and security of live systems.

Requirements

BS/MS/PhD in Computer Science, or a related field.
10+ years of production level experience in one of: Java, Scala, C++, or similar language.
Comfortable working towards a multi-year vision with incremental deliverables.
Experience in architecting, deploying and operating large scale distributed systems with high availability, scalability and durability.
Experience in performance and cost optimization, disaster recovery mechanisms, incident management and troubleshooting.
Good knowledge of SQL and operational experience in distributed and single node database engines.
Experience with software security and systems that handle sensitive data.
Experience with cloud technologies, e.g. AWS, Azure, GCP, Docker, Kubernetes.

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder

Staff Software Engineer - Production Engineering

About the position

Responsibilities

Requirements

Tools

Career Hubs

Guides

Company