Banyan software

Senior Data / RAG Engineer

Chennai, Tamil Nadu, India Full Time

Banyan Software provides the best permanent home for successful enterprise software companies, their employees, and customers. We are on a mission to acquire, build and grow great enterprise software businesses all over the world that have dominant positions in niche vertical markets. In recent years, Banyan was named the #1 fastest-growing private software company in the US on the Inc. 5000 and amongst the top 10 fastest-growing companies by the Deloitte Technology Fast 500. Founded in 2016 with a permanent capital base setup to preserve the legacy of founders, Banyan focuses on a buy and hold for life strategy for growing software companies that serve specialized vertical markets.

Role Overview:
We’re looking for a Senior Data Engineer with deep expertise in RAG (Retrieval-Augmented Generation) and Vector Database design to build and manage the knowledge backbone for AI compliance and insights. This role focuses on modernizing archival data ingestion and enabling real-time contextual retrieval for AI-driven systems.

Key Responsibilities:

  • Design and implement RAG Vector Databases (e.g., OpenSearch, Pinecone) using archival data from S3 / Glacier and overall data management via MS SQL Server.
  • Modernize existing data ingestion pipelines, replacing legacy OCR-based processes with scalable ETL/ELT frameworks.
  • Ensure data synchronization and consistency between RDS (MS SQL Server) and Vector DB for real-time AI context.
  • Collaborate with AI, backend, and infrastructure teams to optimize retrieval performance and model access.
  • Drive data integrity, schema evolution, and compliance readiness across systems.

Required Skills & Experience:

  • Proven expertise in data engineering pipelines (Kafka / MSK, ETL / ELT).
  • Hands-on experience with Vector Databases and RAG implementations (OpenSearch, Pinecone, FAISS, Chroma).
  • Strong proficiency in SQL, data modeling, and Python / C# / Go.
  • Experience with AWS data ecosystem (S3, RDS, Glue, Lambda and related technologies).
  • 8–10 years of experience in data engineering or AI data platforms.

Diversity, Equity, Inclusion & Equal Employment Opportunity at Banyan: Banyan affirms that inequality is detrimental to our Global Teams, associates, our Operating Companies, and the communities we serve. As a collective, our goal is to impact lasting change through our actions. Together, we unite for equality and equity. Banyan is committed to equal employment opportunities regardless of any protected characteristic, including race, color, genetic information, creed, national origin, religion, sex, affectional or sexual orientation, gender identity or expression, lawful alien status, ancestry, age, marital status, or protected veteran status and will not discriminate against anyone on the basis of a disability. We support an inclusive workplace where associates excel based on personal merit, qualifications, experience, ability, and job performance.

 

Beware of Recruitment Scams

We have been made aware of individuals fraudulently posing as members of our Talent Acquisition team and extending fake job offers. These scams may involve requests for personal information or payment for equipment. 

Protect yourself by following these steps:

  • Verify that all communications from our recruiting team come from an @banyansoftware.com email address.
  • Remember, employers will never request payment or banking information during the hiring process.
  • If you receive a suspicious message, do not respond — instead, forward it to careers@banyansoftware.com and/or report it to the platform where you received it.

Your safety and security are important to us. Thank you for staying vigilant.