Senior Platform Engineer (Python/AWS)
Alexi
Alexi is a Toronto-based, VC-funded ($15M Series A) AI LegalTech company building tools and technology to solve problems for lawyers and litigation professionals. We are on a mission to transform the legal services industry by building cutting-edge AI-powered solutions, and our team is made up of industry-leading software engineers, AI/ML engineers, scientists, lawyers, and designers.
We are hiring a Senior Platform Engineer with a strong background in Python development and cloud infrastructure to join our platform engineering team. In this role, you will be responsible for designing, building, and maintaining the foundational infrastructure that powers our platform. You will work closely with engineering and product teams to ensure that our platform is scalable, reliable, and secure.
As a Senior Platform Engineer, you will leverage your expertise in cloud infrastructure, automation, and infrastructure as code (IaC) to create efficient and robust systems. You’ll play a pivotal role in evolving our platform architecture, optimizing performance, and driving engineering best practices across the organization.
Key Responsibilities:
Platform & Infrastructure Development:
- Design, develop, and maintain platform infrastructure with a focus on scalability, security, and reliability.
- Develop and optimize platform tooling and services in Python to support continuous delivery, monitoring, and operations.
- Implement automation frameworks to manage cloud infrastructure and deployments at scale.
- Build and enhance microservices, APIs, and backend services to support platform functionality.
- Build and enhance systems that support artificial intelligence teams.
Cloud Infrastructure Management:
- Architect and manage cloud infrastructure (AWS, Azure, GCP) using Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
- Design, implement, and manage CI/CD pipelines for automated builds, testing, and deployment.
- Implement monitoring, alerting, and logging solutions to maintain platform health and performance.
- Lead efforts in optimizing cloud resource usage and cost management.
Platform Reliability & Performance:
- Establish best practices for platform observability, monitoring, and alerting.
- Ensure high availability and performance of the platform through proactive monitoring, incident management, and performance tuning.
- Troubleshoot and resolve infrastructure and platform issues in real time.
- Implement disaster recovery and failover strategies to ensure platform resilience.
Security & Compliance:
- Design and implement secure infrastructure solutions, enforcing security best practices such as identity and access management (IAM), encryption, and least privilege access controls.
- Collaborate with teams to ensure platform compliance with regulatory standards and best practices.
- Conduct regular security audits, vulnerability scans, and risk assessments.
Collaboration & Mentorship:
- Work closely with software engineers and product teams to deliver a seamless, efficient platform experience.
- Provide technical leadership and mentorship.
- Contribute to the evolution of platform engineering practices and processes, ensuring continuous improvement in scalability, security, and reliability.
Qualifications:
- Experience: 5+ years of experience in platform engineering, DevOps, or infrastructure engineering roles with a strong focus on Python.
- Cloud Expertise: 3+ years of hands-on experience with cloud infrastructure (AWS, GCP, or Azure).
- Infrastructure as Code (IaC): Proven experience with IaC tools such as Terraform, CloudFormation, or Ansible.
- Python Development: Strong proficiency in Python for infrastructure automation and service development.
- CI/CD Pipelines: Proven experience building and managing CI/CD pipelines with tools like Jenkins, GitLab CI, or AWS CodePipeline.
- Containerization & Orchestration: Hands-on experience with Docker and orchestration tools for containerized deployments.
- Monitoring & Logging: Experience with monitoring and logging tools like Prometheus, Grafana, CloudWatch, or Datadog.
- Security Best Practices: Solid understanding of security best practices in cloud environments, including IAM, network security, encryption, and vulnerability management.
Nice to Have:
- Experience with serverless computing (AWS Lambda, Azure Functions).
- Proficiency in setting up and managing distributed systems and microservices architectures.
- Experience with message queues and event-driven architecture.
- Knowledge of performance tuning and cost optimization in cloud environments.
- Exposure to Agile methodologies and experience working in fast-paced environments.
- Strong communication skills and ability to work across teams to achieve business objectives.
Culture:
We are a dynamic team of engineers bringing together iterative NLP model development, high-performance systems engineering, and effective software development. Much of our work is highly experimental. We are not perfectionists; we move fast and are not afraid to be on the learning curve. A willingness to learn and a can-do attitude are must-haves to survive and thrive on this team.
As a startup, we often have to find a balance between speed, quality, and cost of development. This often requires us to think creatively and even invent what doesn’t exist. We are ambitious, we take risks, and we embrace failure in a judgment-free but accountable environment.
We work in a trust-based, mission-driven, goal-oriented, fully remote work environment. We enable each other, but also consciously protect our time to deliver high-quality work and to help create a thriving innovation culture.
Benefits:
- Competitive Stock Options
- Competitive Pay
- Unlimited vacation days
- Additional paid time off for winter holidays (last week of December)
- Company-wide innovation days – dedicated to projects you’d like to innovate on within the context of the work the company is doing
- 16" MacBook Pro dedicated for work, and any other equipment you need to do your work
- Hybrid work environment (flexible). Easy public transit access to office in downtown Toronto
- Recruiting and growing your own team
- No red tape, no unnecessary processes, accelerated learning and growth.
- Health benefits
- Work for a high-growth startup, wear multiple hats, make a real impact. We are building something legit and impactful that can truly change the world for good.