Hybrid: 2-3 days onsite in McKinney, TX 75070.
Seeking a Principal Network Engineer to own the design, implementation, and maintenance of reliable, scalable, secure, performant, and cost-effective networking solutions across multiple data centers and public cloud environments, in support of the Cloud Engineering team.
Responsibilities include, but are not limited to:
- Data center evaluation, onboarding, and management
- ISP evaluation, onboarding, and management
- Networking design, configuration, and monitoring
- Hardware evaluation, qualification, onboarding, and management
- Capacity planning
- Use Infrastructure as Code (IaC), configuration management, and centralized secrets storage to automate the provisioning and management of infrastructure, ensuring consistent and repeatable environments.
- Implement robust observability solutions to monitor system performance (logs, metrics, traces) using tools like Prometheus and Grafana. Configure alerting for proactive issue resolution.
- Oversee installation, configuration, patching, and updating of network equipment to ensure reliable, secure, performant, and cost-effective operation.
- Optimize resource allocation to ensure efficient utilization of hardware capabilities.
Competencies:
Non-Technical
- Leads from behind, coaching and developing their peers
- Balances tactical and strategic needs to address both short- and long-term organizational priorities based on articulated team and company goals
- Demonstrates intrinsic motivation
- Writes clear, concise, and meaningful documentation
- Develops and leverages collaborative, empathetic relationships across the organization
- Makes and explains thoughtful decisions based on sound, logical, data-driven reasoning
Technical
- Expert knowledge of Linux operating system internals and architecture, with a focus on networking stability and performance tuning.
- In-depth understanding of networking layers, protocols, services, and security practices.
- Familiarity with configuring and troubleshooting network hardware and software, as well as analyzing packet captures.
- Expertise designing and managing network infrastructure across multiple data centers and public cloud environments.
- Expertise with virtualization technologies, with a focus on networking stability and performance tuning.
- Experience in evaluating, qualifying, and managing networking hardware and capacity planning.
- Expertise with container management (Kubernetes, ECS, Docker, Helm)
- Expertise with configuration management (Ansible, Chef, Puppet)
- Expertise with infrastructure as code (Terraform, OpenTofu, Pulumi)
- Expertise with monitoring and alerting systems (CloudWatch, Datadog, New Relic, Site24x7, Dynatrace)
- Experience with version control systems and providers (Git, Mercurial, GitHub, SourceHut)
- Experience with CI/CD systems (GitHub Actions, CircleCI, Argo)
- Experience with ticket management systems (Jira, Shortcut, Azure DevOps)