Data Science Challenge

One of the objectives of Team Data at Primer is to help improve authorization rates for our merchants - the Authorization Rate is defined as:

# of authorized payments / # payments attempted

As part of the challenge, I have generated a fake dataset of payments that we will be using. The dataset can be found in the repo under data.csv and it is described below:

created_at: Timestamp when the payment was created
amount_usd: Integer representing the amount in USD cents (if you see a value of 100 it is equal to $1). We have converted the values from their local currency to USD for simplicity
currency: String representing the original currency 
payment_instrument_type: String - one of Paypal, GoCardless, Klarna, PaymentCard, or Apple Pay
card_brand: String representing the brand of the card used for the payment 
issuing_country: String representing the country where the card was issued
authorized: Integer (0/1) indicating if the payment was authorized or not

The Challenge

The challenge has two parts:

Exploratory Data Analysis
Bayesian modeling

EDA

Perform exploratory data analysis to better understand the data and summarise your findings. Please use well-labeled charts and tables during the analysis. To get you started, these are some questions that could be interesting:

What is the overall authorization rate?
Which payment_instrument_type has the highest authorization rate?
What is the distribution of payment amounts?
Is there a relationship between any of the following features and whether or not a payment is authorized?
- amount_usd
- currency
- card_brand
- payment_instrument_type
- issuing_country

Model Building

Build a Bayesian logistic regression model that predicts if a payment will be authorized. Once complete, summarise your model, the approach (e.g. how did you select your priors), the results, and how well it performed.

Please complete the tasks using Python or R and you are free to use any and all third party libraries to help you.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
data.csv		data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Science Challenge

The Challenge

EDA

Model Building

About

Releases

Packages

primer-io/data-science-challenge

Folders and files

Latest commit

History

Repository files navigation

Data Science Challenge

The Challenge

EDA

Model Building

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages