Company Overview
10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named as winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences, mobile and software products, to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and even high-growth start-ups. But those are just facts. What makes us unique is that we have a true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well, while doing good and know how to balance the two.
Role
We are seeking a highly skilled Staff/Senior Software Consultant with deep expertise in observability, distributed systems, and cloud-native environments. In this role, you will work closely with engineering teams and stakeholders to design, implement, and optimize monitoring, logging, and tracing solutions that enhance system reliability, performance, and scalability.
Responsibilities
Lead the design and implementation of observability solutions across complex distributed systems
Architect and optimize logging, monitoring, and tracing frameworks for cloud-native applications
Diagnose and resolve performance bottlenecks and system failures in large-scale distributed environments
Collaborate with engineering, DevOps, and SRE teams to improve system visibility and reliability
Implement best practices for metrics collection, alerting, and incident response
Work with tools such as Datadog, Prometheus, Grafana, OpenTelemetry, and APM platforms like New Relic, Dynatrace, AWS X-ray and Azure Application Insights
Provide consulting guidance on cloud architectures (AWS, Azure, GCP) and observability strategies
Implement and manage Application Performance Monitoring (APM) solutions to gain deep visibility into application behavior and performance
Correlate APM data with logs, metrics, and traces for faster root cause analysis (RCA)
Develop dashboards, alerts, and automated workflows to ensure proactive system monitoring
Mentor junior engineers and contribute to technical strategy and decision-making
Stay up-to-date with industry trends and emerging technologies in observability and distributed systems
Required Qualifications
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
3+ years of experience in software engineering, DevOps, or SRE roles
Strong expertise in observability concepts: logging, monitoring, tracing
Hands-on experience with tools like Datadog, Prometheus, Grafana, ELK stack and APM tools such as Datadog, NewRelic, AWS X-ray, Application Insights
Deep understanding of distributed systems architecture and debugging techniques
Experience working in cloud environments (AWS, Azure, or GCP)
Proficiency in at least one programming language (e.g., Python, Java, Go)
Experience with containerization and orchestration (Docker, Kubernetes)
Strong understanding of application performance tuning, transaction tracing, and service dependency mapping
Strong analytical and problem-solving skills