Job Title: Data Engineer
Location: Columbia, SC (Onsite)
Duration: 12 Months
New Capgemini Onboarding Process Updates:
Due to additional onboarding requirements, a meet and greet is required for all new hires.
Selected candidates must be willing to travel to the closest Capgemini/Client office location, as indicated by the project team, for a meet and greet with a Capgemini team member prior to starting their assignment.
If the candidate is not local, Capgemini will pay the expenses.
Job Description:
8 years of expertise as a Backend/Data Engineer building graph systems and graph databases.
5 years of expertise with Machine Learning and/or Natural Language Processing.
Degree in Computer Science, Machine Learning, Data Science, or a related field, with expertise in knowledge representation.
Strong proficiency in Graph Theory, Graph Algorithms, and Graph Databases (e.g., Neo4j, TigerGraph), coupled with extensive knowledge of vector databases (OpenSearch, Milvus).
Proficient in Databricks, Python, PySpark, and Scala for developing and maintaining data engineering pipelines, with expertise in Apache Spark, Flink, and containerization.
Experienced in Cloud platforms (AWS, Azure, Google Cloud) and skilled in working with various databases and data warehouses for efficient data processing and storage.
Expertise working with graph data models, graph databases (Neo4j, TigerGraph), or graph query languages (Gremlin, SPARQL, Cypher).
Expertise in architecting, designing, and building data pipelines and acquiring the data needed to build and evaluate models using tools like Databricks, Dataflow, Apache Beam, or Spark.