Position Details: SRE (Site Reliability Engineer)
Description:
Must Have Technical/Functional Skills
- Minimum 8+ years of relevant experience in enterprise network environment/customers. Strong in Linux Administration – OnPrem.
- Proficiency in designing, implementing, and maintaining enterprise-level infrastructure solutions across Linux environments.
- Awareness on IP, DNS, DHCP, Ports, Authentications, Authorizations & Privileges, Router, Switches & Firewall operations.
- Understanding of network segmentation.
- Troubleshoot overall infrastructure, Applications, IoT Systems & network infrastructure.
- Diagnosing and resolving complex technical issues across infrastructure and applications. Good to have IOT System Experience.
Key Responsibilities
Infrastructure Management
- Physical & Virtual Servers Management.
- Administer and optimize Linux servers for high availability and performance.
- Implement security best practices across enterprise systems, including IoT environments.
- Provide support on existing IoT systems, manage upgrades and troubleshoot problems.
- Manage and maintain SQL Server and Sybase databases, ensuring data integrity and performance.
- Develop scripts using Perl/Python/PowerShell for automation and system monitoring.
Application Management on the Servers
- Configure and maintain applications and services hosting applications.
- Perform packaging and deployment, updates, patching, and upgrades.
- Monitor application performance and proactively troubleshoot issues.
- Stay well versed with End-to-end request flow with network integration.
- Manage user accounts, permissions, and enforce security policies
Troubleshooting & Analysis
- Diagnosing and resolving complex technical issues across infrastructure and applications.
Networking & Security
- Awareness on IP, DNS, DHCP, Ports, Authentications, Authorizations & Privileges.
- Awareness on Router, Switches & Firewall operations.
- Understanding of network segmentation.