Job Summary:
The role holder at this level is responsible for the effective design, delivery, and operation of a resilient hybrid cloud computing environment. The role holder must be proficient in anticipating, planning, and implementing IT solutions and initiatives that enable business value. The candidate should have relevant experience working with network, storage, and database teams to implement application infrastructure in virtualized environments, as well as experience supporting data center operational tasks and maintenance activities.
Key Accountabilities:
- Oversee the migration and implementation of cloud workflows, maintenance activities and reporting functions.
- Plan and develop roadmaps to advance the migration of on-premises systems and applications to the cloud.
- Manage the deployment of virtual machines in the cloud and configure VMs for optimum performance and security. Put in place a recovery plan for virtual machine failures and leverage log analytics to gain operational insights.
- Conduct regular performance and capacity evaluations for hosted applications and ensure new workloads are evenly distributed across the virtual environment.
- Upgrade VMware Tools to the latest compatible version to improve fault tolerance and enhance the management of virtual resources in the Production, Development, and DMZ environments.
- Liaise with relevant IT sections to streamline infrastructure maintenance activities and virtual machine deployment, and to decrease the average ticket response time.
- Provide installation, support, and administration of both physical and virtualized (Windows and non-Windows) server environments with VMware to ensure high availability and optimum performance.
- Maintain a working knowledge of Windows Server OS, Red Hat Linux OS, Storage Area Networks, Server Virtualization, Office 365, Directory Services, Backup Technologies, and PowerShell Scripting to support essential QAFCO applications.
- Implement an enterprise-class monitoring tool that tracks the health and utilization of all system resources in both the cloud and on-premises environments.
- Monitor and analyze server and storage components across multiple operating systems, and provide recommendations to ensure the integrity and availability of server resources.
- Perform advanced problem identification and resolution, performance monitoring and capacity planning functions.
- Create and maintain a comprehensive inventory of virtual and physical hardware, software, and services.
- Proactively support capacity planning and the development of long-term strategic goals for systems and software in conjunction with business priorities and strategies.
- Recommend, schedule, and implement software and hardware improvements, upgrades, patches, reconfigurations, and refreshes to support current OLAs.
- Perform disaster recovery tests using documented walkthrough testing procedures for specified business applications to identify inconsistencies and process gaps.
- Troubleshoot issues and contribute knowledge base material to the service desk ticketing system to increase efficiency.
- Identify areas for process and efficiency improvement within datacenter operations, including the refresh of current hardware equipment and monitoring tools.
- Ensure all applications hosted in the datacenter and cloud are maintained in accordance with license agreements and regulatory requirements.
- Develop and maintain relevant system-level documentation and ensure all technical documentation is current.
- Ensure all necessary datacenter operational processes and procedures are carried out with a high level of attention to detail, expediency, and on-time delivery.
- Support the development of Qataris and assist qualified Qatari nationals to ensure that they develop the necessary skills to independently carry out assigned activities.