Job Role Senior Data Engineer
Work Mode Remote
Experience 10 years
Location: WFH (Flexible with Europe Time Zone)
KEY RESPONSIBILITIES:
- Understand the factories manufacturing process data availability and avenues for
- improvement
- Brainstorm together with engineering manufacturing and quality problems that can be solved using the acquired data in the data lake platform.
- Define what data is required to create a solution and work with connectivity engineers users to collect the data
- Create and maintain optimal data pipeline architecture.
- Assemble large complex data sets that meet functional / nonfunctional business requirements.
- Identify design and implement internal process improvements: automating manual processes optimizing data delivery for greater scalability
- Work on data preparation data deep dive help engineering process and quality to understand the process/ machine behavior more closely using available data Deploy and monitor the solution
- Work with data and analytics experts to strive for greater functionality in our data systems.
- Work together with Data Architects and data modeling teams.
SKILLS /COMPETENCIES
- Good knowledge of the business vertical with prior experience in solving different use cases in the manufacturing or similar industry
- Ability to bring cross industry learning to benefit the use cases aimed at improving manufacturing process
Problem Scoping/definition Skills:
- Experience in problem scoping solving quantification
- Strong analytic skills related to working with unstructured datasets.
- Build processes supporting data transformation data structures metadata dependency and workload management.
- Working knowledge of message queuing stream processing and highly scalable big data data stores
- Ability to foresee and identify all right data required to solve the problem
Data Wrangling Skills:
- Strong skill in data mining data wrangling techniques for creating the required analytical dataset
- Experience building and optimizing big data data pipelines architectures and data sets
- Adaptive mindset to improvise on the data challenges and employ techniques to drive desired outcomes
Programming Skills:
- Experience with big data tools: Spark Delta CDC NiFi Kafka etc
- Experience with relational SQL NoSQL databases and query languages including oracle Hive sparkQL.
- Experience with objectoriented languages: Scala Java C etc.
Visualization Skills
- Know how of any visualization tools such as PowerBI Tableau
- Good storytelling skills to present the data in simple and meaningful manner.
Data Engineering Skills
- Strong skill in data analysis techniques to generate finding and insights by means of exploratory data analysis
- Good understanding of how to transform and connect the data of various types and form
- Great numerical and analytical skills
- Identify opportunities for data acquisition
- Explore ways to enhance data quality and reliability
- Build algorithms and prototypes
- Reformulating existing frameworks to optimize their functioning.
- Good understanding of optimization techniques to make the system performant for requirements.
About the company: Innova Solutions
Founded in 1998 and headquartered in Atlanta (Duluth) Georgia Innova Solutions along with its subsidiaries employs over 50000 professionals worldwide and reports an annual revenue approaching $3.0B. Through global delivery centers across North America Asia and Europe Innova Solutions delivers strategic technology and business transformation solutions to its clients enabling them to operate as leaders within their fields. Whether it is onboarding a new service embracing a new consumer device or rolling out a Business Innovation Innova Solutions will empower your Enterprise to transition to new technologies embrace new service delivery models and enhance the business value provided by IT. Innova provides a full spectrum of services to plan prep and execute a data center migration and the development of workloads that can be moved to or inbetween cloud service providers.
data visualization (powerbi, tableau),data analysis techniques,kafka,sparql,optimization techniques,object-oriented programming (scala, java, c++),cdc,problem scoping and definition,data pipeline architecture,spark,data wrangling,dwh,relational sql and nosql databases,big data tools (spark, delta, cdc, nifi, kafka),scala