drjobs Automated Feature Extraction and Clustering of Product Claims and Ingredients Using Machine Learning

Automated Feature Extraction and Clustering of Product Claims and Ingredients Using Machine Learning

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Solna - Sweden

Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

High Level Description

Consumer products often advertise features like protection against germs gentle on fabric natural ingredients only rich in vitamin C or high energy efficiency making it crucial to analyse and compare these claims and ingredient lists across a wide range of product categories. The aim is to develop analytics and market insights to identify trends across various consumer products including categories such as personal care food and beverages and household cleaning supplies. To achieve this automated clustering of product claims and ingredients is needed followed by feature extraction to identify key characteristics that define each group.

This thesis will involve developing and evaluating methods to first cluster similar product claims and ingredients into meaningful groups and then extract important features from each cluster to identify trends common themes and unique selling points. This will provide valuable market insights and help assess how products are positioned relative to one another.

Project Description

The project will begin with two datasets: a product claims dataset containing around 300000 claims from a variety of consumer products and an ingredients dataset containing lists of ingredients from these products.

The first step is to research and apply clustering techniques to group similar product claims. The focus will be on finding the most suitable clustering algorithms and optimizing them to ensure meaningful groupings. Various methods will be compared to determine which approach works best for the given data. Once the clustering is complete feature extraction methods will be applied to identify key characteristics from the clusters of product claims. The goal is to derive relevant insights that are specific to each group highlighting common keywords such as protection natural ingredients or efficiency.

The ingredients dataset will also be clustered to identify common groupings and standardize ingredient lists across different products (e.g. distilled water and aqua become water) allowing for clearer analysis and comparison.

Throughout the project different clustering and feature extraction methods will be compared using appropriate metrics to evaluate their performance. The research will involve identifying the best approaches optimizing their parameters and assessing their performance.

    Who are we looking for

    We are looking for a motivated student with great interest in machine learning natural language processing (NLP) and data science. Knowledge of clustering techniques and feature extraction is beneficial. This thesis is suitable for students of computer science data science or a related field with experience in Python and machine learning frameworks.

    Purpose

    The purpose of the thesis is to develop an automated system for clustering product claims and ingredient lists followed by extracting key features from the product claims. The ultimate goal is to provide analytics and market insights that can be used to identify trends compare product categories and understand key differentiators in the market.

    The thesis project can be published and used in your personal portfolio as well as in company marketing. Include Resum/CV and portfolio in your application.


      Employment Type

      Full Time

      Company Industry

      About Company

      Report This Job
      Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.