10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, was recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences and mobile and software products to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and even high-growth start-ups. But those are just facts. What makes us unique is that we have a true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well while doing good, and we know how to balance the two.
Role
We are seeking a highly skilled Staff/Senior Software Consultant with deep expertise in observability, distributed systems, and cloud-native environments. In this role, you will work closely with engineering teams and stakeholders to design, implement, and optimize monitoring, logging, and tracing solutions that enhance system reliability, performance, and scalability.
Responsibilities
Collaborate with engineering, operations, and other stakeholders to understand enterprise architecture, monitoring requirements, and performance goals.
Identify and define key performance indicators (KPIs) and metrics, diagnose issues, and proactively identify areas for optimization.
Develop and implement observability frameworks, tools, and processes to enable comprehensive monitoring, logging, and tracing of systems and applications.
Ensure the availability, scalability, and reliability of infrastructure and deployment environments.
Implement and manage monitoring and observability tools (such as AppDynamics, Datadog, Splunk, ELK, or Sentry) to gain insights into system performance and health.
Provide timely and accurate reports on application performance, highlighting key insights and trends.
Collaborate with digital squads to implement performance improvements, including code optimizations and infrastructure adjustments.
Offer guidance and training to end-users and internal teams on best practices for application performance monitoring (APM) and for optimizing application performance.
Provide recommendations on monitoring systems, logging frameworks, and distributed tracing platforms.
Manage and deliver KPIs across the enterprise architecture and perform trend analysis.
Deliver a proactive monitoring framework across infrastructure and digital experience monitoring domains.
Provide expertise in problem detection, isolation, and root cause analysis during incident management, using relevant data and artifacts from observability tools and corresponding systems.
Required Qualifications
8+ years of experience with IT infrastructure and applications.
3-5 years of hands-on experience in observability and continuous integration.
2+ years of programming experience in Java or related technologies.
Knowledge of cloud infrastructure (Azure) and cluster management tools such as Kubernetes.
In-depth knowledge of application performance metrics, monitoring, and troubleshooting.
Strong communication skills with the ability to align the organization on complex technical decisions.
Bachelor's or Master's degree in Information Technology, Computer Science, or a related quantitative discipline.