P2P. org logo

SRE - Cosmos

P2P. org
Full-time
Remote
Spain
Remote Crypto

P2P.org is the largest institutional staking provider with a TVL of over $10B and a market share exceeding 20% in restaking.

We are continually focused on researching and improving our infrastructure to extract maximum APR while enhancing security. For instance, in ETH and SOL, our NRR is on average 10% higher than the market, and in DOT, it's 20% higher.

We also place significant focus and resources on launching new networks such as TON, Avail, Monad, Babylon, Story, Berachain, and others, along with yield products. From restaking, where we are the largest operator with a 20+% market share, to yield aggregators on stablecoins.

Our clients include BitGo, Copper, Crypto.com, Ledger, ByBit, Bitget, OKX, HTX, Bitvavo, SBI, and others, who choose us for our client-centric approach and extensive product line from unified API to widgets and custom dApps.

We are also actively expanding our product line, exploring RWA, data, yield, and service products for banks, exchanges, custodians, and wallets.

P2P.org unites talented individuals globally ❀️

Despite our distributed team, we share a passion for decentralized finance - a fairer system for all. We code, learn, create, and connect to shape finance's future πŸ’°

P2P.orgΒ boasts a strong reputation and network. We prioritize customer satisfaction and, as tech enthusiasts, develop innovative solutions that bolster our brand.


Project Overview

Joining the Cosmos team not only means ensuring the availability, security, and monitoring of our core staking infrastructure across numerous blockchains within the ecosystem - you will also engage in developing architectural transformations and drive enhancements in a dynamic, fast-paced environment whilst actively collaborating and contributing to the wider Cosmos community.

P2P.org currently runs validators in 20+ blockchains in the Cosmos ecosystem (Cosmos Hub, Celestia, Axelar, dYdX etc.), with many more in the launch pipeline. We are firm believers in this ecosystem and have a dedicated team that supports all Tendermint/CometBFT-based chains in our portfolio. We aim to expand our presence in the ecosystem as a validator and find new opportunities to support Cosmos and grow our business.


Who we are looking for

Our perfect match is an experienced Site Reliability Engineer with a strong experience with Cosmos ecosystems. In this role, you will design, build, and maintain multi-cloud infrastructure, ensuring that our services run smoothly, securely and with high performance. You’ll collaborate with world-class engineers, contribute to open-source projects, and directly influence the infrastructure behind some of the most innovative blockchain networks.


You will do:

  • Infrastructure Management:

    • Provision, maintain, and scale multi-cloud/multi-architecture infrastructure using Infrastructure as Code (IaC) tools and CI/CD pipelines.

    • Develop and manage Kubernetes workloads following GitOps best practices.

    • Create and maintain Ansible roles to deploy and manage various blockchain validators.

  • Observability & Reliability:

    • Implement and refine monitoring, alerting, and logging solutions using Prometheus, Grafana, Loki, and Opsgenie.

    • Continuously improve reliability through proactive security patches, system hardening, and performance tuning.

  • Automation & Tooling:

    • Build and maintain CI/CD workflows, ensuring seamless deployments to Kubernetes clusters.

    • Contribute to open-source tooling that supports the Bitcoin and Tendermint ecosystems.

  • Collaboration & Community:

    • Engage in architecture discussions and technical presentations, influencing the direction of core infrastructure.

    • Collaborate closely with passionate engineers, DevOps specialists, and community contributors across Web3.

    • Participate in a 24/7 on-call rotation, ensuring rapid response and resolution to critical infrastructure incidents.


You have:

  • Core Expertise:

    • Extensive hands-on experience administering Linux-based systems, including both hardware and software aspects.

    • Proven ability to implement IaC using tools like Terraform, Ansible, and Git.

    • Demonstrated success managing workloads on cloud providers such as Google Cloud Platform (GCP) and Oracle Cloud.

  • Containerization & CI/CD:

    • Experience deploying and managing applications on Kubernetes with tools like ArgoCD, Argo workflows, GitHub Actions, Helm, and HashiCorp Vault.

  • Scripting & Programming:

    • Proficiency in scripting in Shell, and at least one programming language (Python or Golang) to automate infrastructure tasks and reduce manual effort.

  • Monitoring & Alerting:

    • Skilled in configuring comprehensive observability solutions (Prometheus, Grafana, Loki, OTEL agent) to ensure prompt and accurate incident response.


Nice to Have:

  • Hands-on experience running and configuring blockchain nodes/validators, especially within the Cosmos / Tendermint ecosystems.

At P2P.org we have a team of experts with their own unique approach and ownership culture. Together we gain experience and make dreams come true! 🌟

  • Full-time Contractor (Indefinite-term Consultancy Agreement)

  • Competitive salary level in $ (we can also pay in Crypto)

  • Well-being program

  • Mental Health care program

  • Compensation for education, including Foreign Language & professional growth courses

  • Equipment & co-working reimbursement program

  • Overseas conferences, community immersion

  • Positive and friendly communication culture

P2P.orgΒ is committed to providing equal opportunities. All applicants will be considered without regard to race, color, national origin, religion, sex, sexual orientation, gender identity, veteran status, or disability.