You are hired as a consultant to build an efficient model to predict whether a user will click on an ad or not, given the following features: Clickstream data/train data for duration: (2nd July 2017 – 7th July 2017) Test data for duration: (8th July 2017 – 9th July 2017) User features (demographics, user behaviour/activity, buying power etc.) Historical transactional data of the previous month with timestamp info (28th May 2017– 1st July 2017). This data contains actions (views/interest registered) taken by the user historically on the product page via an ad or other sources Ad features (product category, webpage, campaign for ad etc.) Date time features (exact timestamp of the user session)
"AdClickPredictor" is a repository exploring online ad engagement prediction using machine learning. It provides a dataset with user attributes and interaction outcomes, guiding users through preprocessing, modeling, and performance assessment via a comprehensive Deepnote notebook.
The main motivation behind the project is "Targeted Advertising." At its most basic, targeted advertising can just mean that ads are chosen for their relevance to site content, in the assumption that they will then be relevant to the site audience as well. Online advertisers can use different methods to target a particular advertisement on the user based on its traits. Most companies do this as part of social media platforms like Facebook, LinkedIn, etc. However, many times the process goes wrong, and the advertisement does not reach its target audience because it is sent out without actually understanding the probability of the occurring click.
Test data is further randomly divided into Public (30%) and Private (70%) data. Your initial responses will be checked and scored on the Public data. The final rankings would be based on your private score which will be published once the competition is over.American Express hosted a machine learning hackathon (https://datahack.analyticsvidhya.com/contest/american-express-amexpert-2018/). We have provided by data of clickstream of 6 days from 2 July 2017 to 7 July 2017. We have to predict depending upon the data wether a session will result in a click or not.