Senior Data Engineer - Data (AVD Team)
Sonatype
Who We Are
- At Sonatype, we help organizations build better, more secure software by enabling them to understand and control their software supply chains. Our products are trusted by thousands of engineering teams globally, providing critical insights into dependency health, license risk, and software security. We’re passionate about empowering developers—and we back it with data.
The Opportunity
- We’re looking for a Senior Data Engineer to join our growing Data Platform team. You’ll play a key role in designing and scaling the infrastructure and pipelines that power analytics, machine learning, and business intelligence across Sonatype.
- You’ll work closely with stakeholders across product, engineering, and business teams to ensure data is reliable, accessible, and actionable. This role is ideal for someone who thrives on solving complex data challenges at scale and enjoys building high-quality, maintainable systems.
What You’ll Do
- Design, build, and maintain scalable data pipelines and ETL/ELT processes
- Architect and optimize data models and storage solutions for analytics and operational use
- Collaborate with data scientists, analysts, and engineers to deliver trusted, high-quality datasets
- Own and evolve parts of our data platform (e.g., Airflow, dbt, Spark, Redshift, or Snowflake)
- Implement observability, alerting, and data quality monitoring for critical pipelines
- Drive best practices in data engineering, including documentation, testing, and CI/CD
- Contribute to the design and evolution of our next-generation data lakehouse architecture
What We’re Looking For
Minimum Qualifications
- 5+ years of experience as a Data Engineer or in a similar backend engineering role
- Strong programming skills in Python, Scala, or Java
- Hands-on experience with HBase or similar wide-column NoSQL stores
- Hands-on experience with distributed data systems like Spark, Kafka, or Flink
- Proficient in writing complex SQL and optimizing queries for performance
- Experience building and maintaining robust ETL/ELT pipelines in production
- Familiarity with workflow orchestration tools (Airflow, Dagster, or similar)
- Understanding of data modeling techniques (star schema, dimensional modeling, etc.)
Bonus Points
- Experience working with Databricks, dbt, Terraform, or Kubernetes
- Familiarity with streaming data pipelines or real-time processing
- Exposure to data governance frameworks and tools
- Experience supporting data products or ML pipelines in production
- Strong understanding of data privacy, security, and compliance best practices
Why You’ll Love Working Here
- Data with purpose: Work on problems that directly impact how the world builds secure software
- Modern tooling: Leverage the best of open-source and cloud-native technologies
- Collaborative culture: Join a passionate team that values learning, autonomy, and impact