Do you have a passion for building the highperformance data backbone for cuttingedge AI Are you an expert in designing data architectures that leverage cuttingedge technologies like vector databases and infrastructure as code etc. If so please join our team and play a pivotal role in shaping the future of generative AI! Were seeking a forwardthinking Data Architect to collaborate closely with our Generative AI Engineers. Youll be responsible for designing developing and implementing the cuttingedge data infrastructure that fuels our innovative generative AI solution development with a focus on high performance and scalability.
Responsibilities:
- Partner with Generative AI Engineers and architects to understand their data requirements and design a highly scalable secure and efficient data architecture leveraging vector databases for efficient similarity search and retrieval tasks.
- Design and implement data pipelines for ingesting processing and storing massive datasets for training and running generative models utilizing Terraform (or similar technologies) for infrastructure as code (IaC) to ensure infrastructure automation and repeatability.
- Select and implement cuttingedge data storage solutions considering factors like scalability performance cost and suitability for vector data (e.g. specialized vector databases).
- Ensure data quality by implementing data cleansing transformation and validation processes.
- Develop data governance policies and procedures to ensure data security compliance and accessibility.
- Automate data pipelines and workflows using tools and techniques optimized for highperformance data processing.
- Monitor and optimize data infrastructure performance for efficiency and scalability focusing on optimizing vector database usage for generative AI workloads.
- Collaborate with Data Scientists and Machine Learning Engineers to understand broader data needs and ensure alignment.
- Stay uptodate on the latest big data technologies vector databases and best practices for data management in AI environments.
Qualifications:
- 10 years of experience in data architecture design and implementation with a focus on highperformance data solutions
- Strong understanding of data management principles data modeling techniques data governance practices and distributed systems
- Experience working with big data technologies (e.g. Kafka postgre mongo) and familiarity with vector databases (e.g. Pinecone Faiss Lance DB etc.)
- Proficiency in SQL and experience with data warehousing solutions (e.g. Snowflake Redshift) is added advantage.
- Experience with Azure and AWS cloud platforms and Terraform.
- Excellent communication and collaboration skills to effectively interact with technical and nontechnical stakeholders.
- Strong problemsolving and analytical skills with a datadriven approach
- Ability to work independently and manage multiple projects simultaneously
Qualifications :
BTech/BE/ME/MTech
Additional Information :
10 15 years of experience
Remote Work :
No
Employment Type :
Fulltime