Find Jobs
Site Reliability Engineer
Eden Prairie, Minnesota  |  Onsite
Contract to Hire Position
It appears that you have already applied to this job.
Applied on November 27, 2021
Job Id #52126 Posted November 23, 2021


Description:

  • The primary focus of this Site Reliability Engineer will be on the new mission Data Forge in the Cloud platform (Azure, Google) involving data mesh architecture.
  • This role ensure that Data Forge (DF) operates with high reliability, availability, and performance at scale for our customers.
  • The Data Forge team is looking for a Site Reliability Engineer with Cloud experience and a diverse set of experiences and skillsets to help maintain exciting new initiative running on Google Cloud infrastructure.
  • As Cloud SRE/Cloud engineer you will provide service reliability and availability via following SRE practices.
  • Your job function comes with SRE mindset and a set of engineering practices to run reliable production systems.
  • You will help to implement SRE principles through opensource tooling, and automation and prevent toil. Automate end to end, from writing code to running services in production. Leverage SRE principles developed and proven to work at scale.
  • You will partner with Engineering teams to understand their challenges, work through their issues, and provide solutions that can be adopted widely.
  • The ideal candidate is someone with a proven track record, sound technical knowledge and skills in delivering large scale complex software solutions deployed.
  • This role will be responsible for designing, building, running, and monitoring public cloud infrastructure to support a variety of mission critical services.
  • This is a highly technical, hands-on role that requires expertise supporting systems at enterprise level.

Primary Responsibilities:

  • 50% Software Engineering and 50% Systems Engineering
  • Improve availability and reliability of services
  • Ensure compliance with appropriate security standards
  • Identifying the Service Level Objects (SLOs) and Service Level Indicators and maintaining those metrics in a good standard for smooth operations
  • Engineering - Continuously optimize secure, scalable and performant security tools and service.
  • Reliability - Drive fault detection and correction, performance, and uptime at scale
  • Monitoring - Instrument systems to gain visibility and understanding of how they are performing at any time
  • Accelerated infrastructure, application, and software configuration deployment
  • Automated response to alerts or indicators of performance issues
  • Infrastructure as code
  • Programming in one or more of these languages – Java spring boot, GO, Python for building automation
  • Experience with common formats such as JSON, YAML
  • Expertise with monitoring or log aggregation tools (Prometheus, Grafana)
  • Expertise in key SRE Skills (Scalability, Reliability and Observability)
  • Familiarity with CI/CD tools and deployment processes
  • Solid understanding and experience with Incident / Change management tool like ServiceNow
  • Conduct blameless post-mortems to analyze failures and prevent recurrence
  • Provide service support by participating in regular on-call shifts responding to service issues
  • Systematic problem-solving approach coupled with a strong sense of ownership and independence
  • Experience operating, troubleshooting, and scaling online services in cloud-based environment
  • Operational experience with networking and an understanding of networking principles
  • Experience reviewing security scans and remediating vulnerabilities
  • Experience with modern container orchestration systems like Kubernetes
  • Familiarity with security issues in the cloud such as intrusion, penetration, and vulnerability scanning
  • Experience with various data management technologies including relational and non-relational databases and message queues
  • Stay up to date on relevant technologies, plug into user groups, understand trends and opportunities to ensure we are using the best possible techniques and tools
  • Facilitation/presentation experience and ability to properly communicate with Business and Technical audience
  • Define and document standards and guidelines
  • Develop and automate repeatable tasks
  • Consult with development users; determine requirements and recommend solutions
  • Participate in product evaluations, design review session, data requirement meetings and consulting with application development products

Required Qualifications:

  • 5+ years’ software engineering background covering the entire software lifecycle in a team-oriented environment.
  • 2+ years Azure, or Google Cloud Platform. Experience supporting infrastructure and services in public and private cloud environments
  • 3+ years of software development experience such as Java Spring boot, Python or Golang
  • The candidate must have working knowledge Terraform, Ansible, Helm.
  • The candidate must have working knowledge in container and container management technologies (Docker, Kubernetes).
  • Willingness to participate in on-call support rotation

Preferred Qualifications:

  • Experience in analysis of healthcare data and management of healthcare information systems
  • Passion for automated CI and CD; record of doing considerable work in this area
  • Ability to use a wide variety of open-source technologies and cloud services
  • Experience with Infrastructure as Code (Terraform)
  • Hand on experience with Open Shift and Google Cloud and Azure cloud platforms
  • Understanding of Hadoop Distribution technologies and any Cloud Experiences

Horizontal is proud to be an Equal Opportunity and Affirmative Action Employer. We seek to provide employment opportunities to talented, qualified candidates regardless of race, color, sex/gender including gender identity and/or expression, national origin, religion, sexual orientation, disability, marital status, citizen status, veteran status, or any other protected classification under federal, state or local law.

In addition, Horizontal will provide reasonable accommodations for qualified individuals with disabilities. If you need to request a reasonable accommodation in order to complete the application or interview process, please contact hr@horizontal.com.

All applicants applying must be legally authorized to work in the country of employment.

EQUAL OPPORTUNITY EMPLOYMENT SURVEY

What is your gender?

What is your ethnicity?

What is your Veteran / U.S. Military Status?

Do you identify with one or more of the classifications of protected veterans below?

If yes, please indicate by checking the appropriate box below

Do you have a disability?

You are considered to have a disability if you have a physical or mental impairment or medical condition that substantially limits a major life activity, or if you have a history or record of such an impairment or medical condition.

Horizontal is proud to be an Equal Employment Opportunity/Affirmative Action Employer providing a drug-free workplace.

Success!

You have saved your first job! To see all your Saved Jobs, click here. Or continue scrolling through jobs and bookmark openings that catch your eye and apply for those jobs later.

Return to Job Search
Close

We’re sorry!

There are currently no open positions in your location or accepting applications from out of the country

Return to Home
Close
X
Cookies help us improve your website experience.
By using our website, you agree to our use of cookies.
Confirm