Disabled Vets
close

Inspira Financial Trust, LLC

Apply for this job

Sr. Cloud Operations Engineer (Remote) (Finance)



The Sr. Cloud Operations Engineer (SCOE) is a key member of the Cloud Operations team, responsible for maintaining the health, performance, and reliability of Inspira Financials enterprise infrastructure. This role demands a high level of autonomy and technical expertise across system monitoring, NOC operations, and cloud platforms.

The SCOE will lead efforts in documenting and maintaining runbooks, training junior team members, and managing escalations from the help desk and call center. The role also includes hands-on support for Azure Virtual Desktop environments and requires the ability to work independently while collaborating effectively with cross-functional teams.

This position is ideal for a self-driven, solutions-oriented professional who thrives in a dynamic environment and is passionate about operational excellence and continuous improvement.

ESSENTIAL JOB FUNCTIONS:

  • Perform regular system health checks and review monitoring reports to proactively identify and resolve issues.
  • Ensure infrastructure performance and availability through continuous monitoring and alert management.
  • Provide direct support in the day-to-day operations of hardware and operating systems, including Cloud Services:
  • Evaluate system utilization, monitor response time and provide primary support for detection and correction of operational and infrastructure problems
  • Coordinate and perform changes to servers, network, operating systems and attached devices, including investigation, analysis, recommendation, configuration, installation and testing of new network hardware and software
  • Ensure servers, operating systems and network components are implemented and adhere to the information security policies and infrastructure standards
  • Utilize metrics and cloud native consumption-based services to improve cost efficiencies
  • Oversee real-time monitoring of infrastructure, applications, and services.
  • Respond to alerts and incidents, ensuring timely resolution and escalation when necessary.
  • Maintain SLAs and ensure proper documentation of incidents and resolutions.
  • Support and Administer the Disaster Recovery and Business Resumption Plan as it relates to the backup and restoration of the technology infrastructure.
  • Ensure run books are updated on a regular basis
  • Participate in tabletop and real DR exercises
  • Maintain the VMware and Multi Cloud virtual environments
  • Write and maintain runbooks for common tasks, collaborating with Offshore and Onshore shifts to ensure runbooks are usable and up to date.
  • Experience supporting Containerization Platforms such as K8s and Docker
  • Maintain multiple Microsoft Active Directory Domains and Entra ID hybrid tenant
  • Provide infrastructure problem resolution for various applications throughout the organization
  • Experience working with Automation tools such as ADO, Jenkins, and Chef
  • Utilize automation to design and develop programs or scripts for various repetitive functions
  • Performs all duties with a focus on goals of Inspira Financial, which includes risk mitigation
  • Good Exposure to Infrastructure-as-code using terraform
  • Provide technical subject matter expertise and work with wider team to ensure development activities are aligned with scope, schedule, priority and business objective
  • Experience with IaC tools such as Terraform, bash scripting, etc
  • Experience integrating Sonar cloud for code scanning
  • Support inbound calls/emails, maintaining tickets within the issue tracking application related to Infrastructure Support
  • Monitoring of Platform and Environment with tools such as Datadog, Azure Monitor, Logic Monitor, etc
  • Utilize programming skills to design and develop programs or scripts for various repetitive functions
  • Performs all duties with a focus on goals of Inspira Financial, which includes risk mitigation
  • Configure and Support firewalls and security appliances
  • Cross-train other team members in order to facilitate coverage
  • Participate in rotating on-call schedule
  • Ability to respond to pages after hours to resolve critical issues
  • Maintain and update in house Documentation
  • Facilitate Release Management in multiple environments
  • Other duties as assigned
Apply
Apply Here done

© 2025 Disabled Vets