Site Reliability Engineer


Our SRE team is a small, close-knit group of experts spread across the world, and we are looking to expand in Europe and North America. Our role is to keep business and development operations flowing smoothly, and to build on our existing processes and platforms to reduce costs, increase stability, and give the rest of the business as friction-free an experience as possible.


  • Troubleshooting issues alongside developers, providing systems-level and architectural insight into the issue at hand
  • Assisting and consulting with developers, network engineers, etc. to ensure our software and platforms adhere to best practices and provide the best outcomes
  • Working with our configuration management systems to improve automation workflows and support requested features
  • Maintaining our compute infrastructure, comprising both cloud resources and bare metal devices
  • Solving complex and/or unintuitive system stability issues
  • Researching, investigating, and providing justification for new technologies that would benefit the organisation


You are a well-rounded systems engineer and automation enjoyer with a diverse set of skills, which makes you one of the very best people to troubleshoot, monitor the platform, and stay on top of releases. You need:

  • A high level of Linux systems knowledge and experience
  • The ability to identify risk and manage it appropriately
  • The ability to work autonomously as well as with colleagues as required
  • The ability and willingness to learn new things, which may require figuring things out without documentation, and then writing the documentation afterwards!
  • A high degree of drive to improve and automate your environment with minimal guidance
  • Experience automating the management of cloud platforms (GCP, AWS, etc.)
  • Experience working with Python
  • Experience with message queue systems such as RabbitMQ, ZeroMQ, Kafka, Pub/Sub, etc.
  • Experience with configuration management/IaC tools such as Ansible, Salt, Terraform, Chef, Puppet, CloudFormation, etc.
  • Experience with CI/CD pipelines using Jenkins, GitHub Actions, Bitbucket, Bamboo, etc.
  • Experience with Docker and Kubernetes as well as other forms of containerisation and virtualisation
  • Solid understanding of TCP/IP, including knowledge of common protocols such as HTTP, TLS, DNS, DHCP, NTP, SSH, SMTP, etc.
  • Solid understanding of nginx and SSL/TLS


  • Familiarity with network platforms (Juniper, Cisco, Arista, Ciena, Nokia, etc.)
  • Familiarity with networking concepts such as VLAN, VXLAN, MPLS, BGP, etc.
  • Experience with large-scale network management and/or monitoring
  • Hands-on experience making applications work at scale
  • RDBMS experience, preferably PostgreSQL
  • Experience with time-series data stores
  • Experience with PXE-based deployments
  • Experience working across multiple time zones in an environment that relies on remote communication and collaboration tools such as Slack and Zoom
  • Knowledge of server hardware
  • Larger-scale software development experience, ideally with Python
  • Experience with multiple programming languages


PacketFabric is the connectivity cloud. We built a global, 50+ Tbps carrier-class optical network that is completely automated and consumable on demand like SaaS, so enterprises can connect the core of their hybrid and multi-cloud architectures and grow their digital business.

We offer private and secure point-to-point, hybrid cloud, multi-cloud, and custom connectivity services that you can provision in minutes via our self-service portal or programmable API. We offer flexible consumption of our services, with month-to-month or longer terms, or even usage-based for bursting and disaster recovery.

PacketFabric was recognized with the “2020 Fierce Telecom Innovation Award for Cloud Services,” and was named one of the “10 Hottest Networking Startups of 2020” by CRN, a Futuriom 40 Top Private Company, and a “2020 Cool Vendor in Enhanced Internet Services and Cloud Connectivity” by Gartner.

PacketFabric as an organisation is 100% remote. There is no dress code, webcams remain off, and we have a strong culture of sharing, not hoarding, knowledge, and of using mistakes and failures as positive learning experiences. Other perks include unlimited PTO and flexible working hours, although 0800-1200 Pacific (1600-2000 GMT) are core hours for larger meetings.


  • Remote first, globally distributed team
  • The chance to disrupt the entrenched telecommunications infrastructure industry
  • A supportive and optimistic team that likes to learn from each other
  • A product development pipeline that’s constantly pushing new features and enhancing the quality of existing products
  • The opportunity to work with many different industries and customer types
  • A small company culture
  • Great health, dental, and 401(k) for US residents

Here at PacketFabric, we want all of our employees to feel valued, appreciated, and free to be who they are. We provide equal opportunities to all employees and applicants for employment and follow employment lifecycle processes designed to prevent discrimination against our people, regardless of gender identity or expression, intersex status, sexual orientation, religion, spiritual beliefs, ethnicity, age, neurodiversity, disability status, national origin, citizenship, generation, culture, or any protected category under federal, state, and local law.

PacketFabric is not accepting resumes from unsolicited headhunters or agencies at this time.