Location

Pune

Business Area

Engineering and CTO

Ref #

10046170

Description & Requirements

About the Team

The Storage and Compute Stability Team is a trusted partner in ensuring the reliability, performance, and security of Bloomberg’s cloud storage and compute infrastructure. We operate at the intersection of infrastructure, software, and services, proactively identifying, solving, and preventing issues before they impact our users.

Our focus is on streamlining processes, driving automation, and serving as a bridge between product teams and stakeholders. This enables Bloomberg’s engineers to innovate rapidly, while maintaining stability at scale. We follow agile practices and thrive in a collaborative environment where code reviews, design discussions, and brainstorming are part of our daily rhythm. The team is driven by curiosity, creativity, and a shared passion for building efficient, resilient systems.

This isn’t just another operations role you’ll be embedded at the core of Bloomberg’s infrastructure. Our team spans infrastructure, software, and services, supporting both short-term needs and long-term strategic investments.

You’ll Have The Opportunity To

Work on critical infrastructure and help define how it evolves

Take on meaningful projects that balance immediate impact with sustainable improvements

Join a culture that values innovation, automation, and continuous improvement

We'll Trust You To

Ensure system reliability and performance by monitoring, troubleshooting, and optimizing compute and storage services

Proactively identify issues and trends to prevent outages, reduce mean time to recovery (MTTR), and improve overall service availability

Collaborate with product owners, developers, and infrastructure teams to deliver scalable, long-term solutions

Automate operational processes such as deployments, monitoring, maintenance, and capacity management

Develop and maintain runbooks, reproducers, and documentation to support knowledge-sharing and workflow efficiency

Participate in on-call rotations to support critical infrastructure and respond to incidents

Contribute to infrastructure lifecycle management, including capacity forecasting, proactive refresh planning, and upgrades

Continuously explore opportunities to improve team processes and system stability

What We Value

Our work is guided by key principles that define how we operate:

Expertise – We invest in deep technical knowledge to solve complex infrastructure challenges

Proactivity – We anticipate issues before they occur and design systems to withstand failure

Collaboration – We build strong relationships with product teams and stakeholders to deliver end-to-end solutions

Efficiency – We reduce manual work through thoughtful automation and streamlined processes

Documentation – We believe in capturing and sharing knowledge to make systems transparent and maintainable

What Makes You Successful

Strong communication and collaboration skills; the ability to explain technical concepts to diverse audiences

The ability to be self-motivated and autonomous; you take ownership of problems and drive them to resolution

Passion for continuous learning and working across a broad spectrum of systems and technologies

Being comfortable working in an agile environment, participating in daily standups, sprint planning, and code reviews

Curiosity, adaptability, and eagerness to work across the entire infrastructure stack

You'll Need To Have

5+ years of demonstrated experience working with object-oriented programming languages such as C/C++ and Python, and the willingness to work with Python as your primary language on the job

Experience with monitoring, logging, and observability tools

Understanding of containers and orchestration technologies

Solid knowledge of networking, operating systems, and distributed systems concepts

Experience participating in incident response and on-call support for production systems

We'd Love To See

Familiarity with cloud platforms (Ceph or OpenStack) and related compute/storage services

Experience with infrastructure-as-code tools (e.g., Terraform, Ansible)

If This Sounds Like You

Apply if you think we're a good match. We'll get in touch to let you know what the next steps are, but in the meantime feel free to have a look at this:

Tech at Bloomberg -

Discover what makes Bloomberg unique - watch our for an inside look at our culture, values, and the people behind our success.

Show more Show less

Requirements

No specific requirements listed.

Senior Software Engineer - Storage & Compute Stability, Pune

Requirements

Explore more jobs