Infrastructure and Application Monitoring Specialist
Plus Ti Guatemala, Guatemala, Guatemala (Hybrid)
At Plus Ti, we are more than just a technology company; we are a team committed to innovation, ethics, and social impact. Our vision of "a safer financial world" guides every project, decision, and person on our team.
We are hiring for two 24/7 Infrastructure and Application Monitoring positions:
Intermediate Specialist and Senior Specialist
Intermediate Specialist
Purpose of the Position
Ensure uninterrupted monitoring of the customer's technology infrastructure—including Windows servers, IIS, .NET applications, and SQL Server—by detecting and documenting anomalies, responding to alerts, performing preventive checks, and escalating incidents in a timely manner, thus ensuring service availability 24 hours a day, 365 days a year.
Primary Responsibilities
- Windows Server Monitoring:
- Validate server availability and resources (CPU, memory, disk).
- Analyze Event Viewer for critical errors and warnings.
- Identify files and logs whose size is growing abnormally.
- Verify that critical services are operational.
- Review of IIS and .NET Applications:
- Check the status and stability of App Pools.
- Review IIS logs to identify HTTP errors (403, 500, 503, 504).
- Monitor health alerts (MDM).
- Validate basic accessibility of applications.
- SQL Server Review:
- Check the availability of the SQL service.
- Review resource consumption and performance.
- Identify slow queries (depending on permissions).
- Analyze error logs, crashes, and the status of jobs/backups.
- Incident Management and General Monitoring:
- Run network and connectivity tests (ping, telnet, Test-NetConnection).
- Review failed authentication attempts and possible blocks.
- Record findings in the ticket system.
- Escalate incidents to higher levels in a timely manner.
Senior Infrastructure and Application Monitoring Specialist 24/7
Continuous monitoring of servers, applications, and databases to ensure operational availability (Infrastructure, IIS, .NET Applications, and SQL Server)
Primary Responsibilities
Leadership and Team Management
- Ability to guide and mentor junior members in technical problem solving, process compliance, and best practices.
- Delegation and management of workloads, ensuring proper task assignment, escalation handling, and prioritization.
- Experience establishing operating procedures, runbooks, and work standards for the team.
Customer Communication and Relationship Management
- Ability to interact professionally with customers, especially during incidents or complex situations.
- Strong customer-oriented communication skills, including:
- Explaining technical problems in non-technical terms
- Providing status updates and remediation plans
- Managing expectations related to SLAs and resolution times
- Conflict management and ability to conduct difficult conversations constructively.
- Competence in gathering requirements, understanding customer needs, and translating them into technical actions.
- Experience in service review meetings, post-incident analysis, and operational reports for clients.
Strategic and Process Skills
- Experience in improving ITSM processes (incident, problem, and change management).
- Ability to analyze data and metrics in order to:
- Identify recurring problems
- Recommend systemic solutions
- Improve reliability and customer satisfaction
- Contribution to capacity planning, availability strategies, and risk management.
Project Organization and Management
- Ability to manage small or medium-sized operational projects:
- Patch cycles
- System updates
- Migration tasks
- Improvements in monitoring or tools
- Understanding of project schedules, dependencies, and resource planning.
- Familiarity with agile practices for operational work.
To apply for either position, candidates must meet the following minimum requirements:
- Education: Technician, advanced student, or graduate in Systems Engineering, Computer Science, or a related field.
- Experience: Minimum 2 years in technical support, infrastructure monitoring, or NOC.
Technical Knowledge:
- Basic Windows Server administration.
- Management of App Pools, logs, and HTTP status codes in IIS.
- Core SQL Server concepts: logs, jobs, and performance (essential).
- Use of network tools such as ping, tracert, and telnet.
- Ticket system management.
Competencies:
- Attention to detail.
- Ability to work shifts and under pressure.
- Clear communication and accurate documentation.
- Analytical and problem-solving skills.
Working Conditions
- Rotating 8-hour shifts to cover 24/7/365 operation.
- Night work, weekends, and holidays.
- Continuous, real-time operations.