About the role:
As a Data Scientist, you will help transform our consumer and advertising data into tangible business value by analyzing information, communicating outcomes, and collaborating on product development. You will work with best-in-class open-source and visual tools, along with the most flexible and scalable deployment options. Whether it's investigating patient trends or weather patterns, you will work to solve real-world problems for the industries transforming how we live.
The Data Scientist will support Data Science and BI efforts for the consumer business at our client's company. The main responsibility is analyzing consumer data from devices, app stores, analytics systems, data lakes, and the database to provide quality data for quality AI and BI solutions. A good candidate is proficient in Python/PySpark, able to analyze data through visualizations, tables, and pivot tables, and is organized and detail-oriented. Expert-level knowledge of SQL queries and database structure is also a must.
Responsibilities:
Work with Analytics and Business Intelligence teams to understand data fields, definitions, KPIs, the various business metrics that are tracked by stakeholders, and the workflows that transform the data.
Work with Data Engineers, Machine Learning Engineers, and Architects to perform efficient data exploration, leverage appropriate tools and functions for data processing, and build automation pipelines that perform machine learning at scale and output quality data for downstream use cases.
Work with Product/Business owners to understand the needs for data and AI solutions, define business rules and logic, and test solution performance. Be a data advisor when it comes to using data to solve a business problem.
Use Python and other open-source frameworks to visualize, analyze, and model data. Be proficient with statistical modeling and machine learning algorithms, and know when and how they should be applied in real-world applications.
Required skills:
Bachelor's degree in a STEM field such as Statistics, Math, Engineering, Information Systems, etc.
Work experience in data/business analysis, analytics, data visualization, reporting, or data engineering.
Experience deploying and maintaining ML pipelines in production.
Python for data extraction, transformation, and analysis.
Production-level Python coding, including best practices.
Docker and containerization.
Experience in querying data using SQL and BI tools.
Databases and pulling data with table joins.
Experience with cloud platforms such as AWS (SageMaker, S3, EKS, EMR, Athena, Glue, Lambda) or corresponding experience in other clouds such as Azure and GCP.
Good communication and presentation skills to explain technical solutions to both technical peers and non-technical stakeholders.
Nice-to-have skills:
Master's degree in a STEM field such as Statistics, Math, Engineering, Information Systems, etc.
Spark/PySpark for data extraction, transformation, and analysis.
Certification in cloud platforms such as AWS, GCP, and/or Azure.