Senior Specialty Systems Operations Engineer

  • Wells Fargo
  • Columbus, Ohio
  • Full Time

About this role:

Wells Fargo is seeking a highly motivated and experienced Systems Engineer to join our Systems Operations team. In this role, you will be a key contributor to ensuring the stability, reliability, and performance of our critical systems and infrastructure. You will leverage your technical expertise to lead and participate in application upgrades, vulnerability remediation, and automation initiatives, while collaborating with cross-functional teams to resolve technical issues and drive continuous improvement. The ideal candidate will possess a strong understanding of application monitoring, cloud deployment, CI/CD pipelines, and automation tools, with a passion for applying Site Reliability Engineering (SRE) principles to optimize system performance and availability.

In this role, you will:

  • Lead or participate in managing all installed systems and infrastructure, moderately complex application upgrades and vulnerability remediation efforts, within the Systems Operations functional area

  • Lead team to meet moderately complex technical deliverables while leveraging solid understanding of technical process controls or standards

  • Act independently as a liaison for the line of business in support of daily inquiries, problem and incident management, project delivery, and escalations by following established guidebooks

  • Liaise with functional or operational managers to understand their current and future information needs and develop plans and schedules for integrating these needs into existing operations

  • Recommend solutions to resolve technical issues and achieve highest levels of systems and infrastructure availability by automating platform activities to lower human intervention time on related tasks

  • Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability

  • Facilitate discussions on preventative action, root cause analysis and resolutions with service management

  • Collaborate and consult with peers, mid-level managers, vendors and other technical personnel to resolve technical issues and achieve highest levels of systems and infrastructure availability and reliability

  • Provide training and mentoring to lower experienced team members on guidebook changes and lead team to meet technical deliverables, while leveraging solid understanding of technical process controls or standards

  • Collaborate with development teams to update business continuity plans and provide input on development of new and existing support guidebooks

Required Qualifications:

  • 4+ years of systems engineering or technology architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education

Desired Qualifications:

  • Advance understanding of application monitoring stack (Logs, Metrics, Events, Traces, Alerts) and ability to visualize and setup end to end observability (Infra and App components)

  • Strong experience in using industry standard monitoring tools (AppDynamics, Splunk, ELK, APM, Grafana, Prometheus

  • Experience in deploying the application to cloud platforms

  • Experience in using CI/CD tools like Jenkins, uDeploy, Gradle, Groovy and Maven

  • Experience in CM tools like Ansible and Puppet

  • Proficient in one of the programming Languages (Java and Python)

  • Knowledge of Web services

  • Experience in working Agile methodology

  • Proficient in multiple infrastructure technologies

  • Linux experience

  • Database experience - hands on in Oracle SQL, Pl/SQL, Mongo DB

  • DevOps experience

  • Very good experience on Autosys

  • Knowledge of Abinitio

  • Exposure to tools like Service Now, JIRA

  • Hands on experience in Ansible, Python, Shell Scripting

  • Familiarity with NDM, SFTP

  • Basics of Networking

Job Expectations:

  • Design, code , test and deliver software to automate manual operation work

  • Partner with different application teams throughout the life cycle to understand their application infrastructure monitoring and apply site reliability principles to baseline and set up SLOs for critical components

  • Identify application patterns and analytics in support of better service level objectives

  • Design automated software and product upgrades, change management and release management solutions

  • Other responsibilities extend to application deployment, change management, incident management, capacity upgrades, reporting, system integrations and essentially ensuring the availability of a stable and performing platform used by development and technology across the firms

  • Design self-healing and resiliency patterns. Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents

  • Experience in running the chaos experiments

  • Experience in measuring the reliability stack using SLI, SLO and Error budget

Posting End Date:

29 Apr 2025

*Job posting may come down early due to volume of applicants.

We Value Equal Opportunity

Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business units risk appetite and all risk and compliance program requirements.

Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.

Applicants with Disabilities

To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo .

Drug and Alcohol Policy

Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.

Wells Fargo Recruitment and Hiring Requirements:

a. Third-Party recordings are prohibited unless authorized by Wells Fargo.

b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.

Job ID: 474967983
Originally Posted on: 4/28/2025

Want to find more Engineering opportunities?

Check out the 119,876 verified Engineering jobs on iHireEngineering