Location
Pune
Business Area
Engineering and CTO
Ref #
10046170
Description & Requirements
About the Team
The Storage and Compute Stability Team is a trusted partner in ensuring the reliability, performance, and security of Bloomberg’s cloud storage and compute infrastructure. We operate at the intersection of infrastructure, software, and services, proactively identifying, solving, and preventing issues before they impact our users.
Our focus is on streamlining processes, driving automation, and serving as a bridge between product teams and stakeholders. This enables Bloomberg’s engineers to innovate rapidly, while maintaining stability at scale. We follow agile practices and thrive in a collaborative environment where code reviews, design discussions, and brainstorming are part of our daily rhythm. The team is driven by curiosity, creativity, and a shared passion for building efficient, resilient systems.
This isn’t just another operations role you’ll be embedded at the core of Bloomberg’s infrastructure. Our team spans infrastructure, software, and services, supporting both short-term needs and long-term strategic investments.
You’ll Have The Opportunity To
Work on critical infrastructure and help define how it evolves
Take on meaningful projects that balance immediate impact with sustainable improvements
Join a culture that values innovation, automation, and continuous improvement
We'll Trust You To
Ensure system reliability and performance by monitoring, troubleshooting, and optimizing compute and storage services
Proactively identify issues and trends to prevent outages, reduce mean time to recovery (MTTR), and improve overall service availability
Collaborate with product owners, developers, and infrastructure teams to deliver scalable, long-term solutions
Automate operational processes such as deployments, monitoring, maintenance, and capacity management
Develop and maintain runbooks, reproducers, and documentation to support knowledge-sharing and workflow efficiency
Participate in on-call rotations to support critical infrastructure and respond to incidents
Contribute to infrastructure lifecycle management, including capacity forecasting, proactive refresh planning, and upgrades
Continuously explore opportunities to improve team processes and system stability
What We Value
Our work is guided by key principles that define how we operate:
Expertise – We invest in deep technical knowledge to solve complex infrastructure challenges
Proactivity – We anticipate issues before they occur and design systems to withstand failure
Collaboration – We build strong relationships with product teams and stakeholders to deliver end-to-end solutions
Efficiency – We reduce manual work through thoughtful automation and streamlined processes
Documentation – We believe in capturing and sharing knowledge to make systems transparent and maintainable
What Makes You Successful
Strong communication and collaboration skills; the ability to explain technical concepts to diverse audiences
The ability to be self-motivated and autonomous; you take ownership of problems and drive them to resolution
Passion for continuous learning and working across a broad spectrum of systems and technologies
Being comfortable working in an agile environment, participating in daily standups, sprint planning, and code reviews
Curiosity, adaptability, and eagerness to work across the entire infrastructure stack
You'll Need To Have
5+ years of demonstrated experience working with object-oriented programming languages such as C/C++ and Python, and the willingness to work with Python as your primary language on the job
Experience with monitoring, logging, and observability tools
Understanding of containers and orchestration technologies
Solid knowledge of networking, operating systems, and distributed systems concepts
Experience participating in incident response and on-call support for production systems
We'd Love To See
Familiarity with cloud platforms (Ceph or OpenStack) and related compute/storage services
Experience with infrastructure-as-code tools (e.g., Terraform, Ansible)
If This Sounds Like You
Apply if you think we're a good match. We'll get in touch to let you know what the next steps are, but in the meantime feel free to have a look at this:
Tech at Bloomberg -
Discover what makes Bloomberg unique - watch our for an inside look at our culture, values, and the people behind our success.
Show more Show less
Requirements
No specific requirements listed.