Web analytics assignment 2 hotel clickstream analysis

 

– This assignment will analyze the data (HotelClickStream.xls) and interpret the results. This dataset includes clickstream data of online transactions for hotel booking in year 2011. Appendix includes the detailed description for the variables.
– Please follow the instructions very carefully to do this assignment! Please do the following analyses and answer the corresponding questions. Please copy/summarize your key results for each question to a word file along with your answers to produce the final report for submission.
1. Please first create the following 2 additional variables into your data
1) REF_D (create a dummy variable indicating whether the transaction was referenced from other website, if not, the final booking website was directly accessed. If no information provided for the variable REF_DOMAIN_NAME, REF_D = 0; otherwise REF_D = 1)
2) LOG_PRICE (take the log transformation of the variable PROD_TOTPRICE using the LOG function in excel)
a) Please provide a summary table showing the top 10 domain names (DOMAIN_NAME) that generated the most volume of transactions the report should look like the following Table (Hint: one way to do this is to use the COUNTIF function in excel). Please summarize briefly your observations from the results

Rank

Domain Names

# of Transactions

1

marriott

524

b) Please provide a summary table showing the top 10 reference domain names (REF_DOMAIN_NAME) that generated the most volume of transactions the report should look like the following Table. Please summarize briefly your observations from the results.

  Rank               

Reference Domain Names          

# of Transactions

1

google

620

c) Please provide summary statistics (N, Max, Min, Mean, and Std.) for variables: DIRECTP_D; REF_D; DURATION; PAGES_VIEWED; LOG_PRICE; and TRANS_FREQ. Please report your summary statistics table and provide short descriptions (a few bullet points) of your observations.
2. Please use the Binary Outcome (Logistic/Logit) regression technique to answer the question on “what are the factors that influence people’s decision on whether to book directly on a hotel website or from other third party website?” Please use DIRECT_D as your Dependent Variable (DV); and REF_D, LOG_PRICE, TRANS_FREQ, DURATION, HOUSEHOLD_SIZE, CHILDREN_D, and CONNECTIONSPEED_D as your Independent Variables (IV). Please report and interpret your regression results, which should include the interpretation of the regression coefficients.
3. a) Please use the Count Data (Poisson) regression model to answer the question on “what are the factors that influence people’s booking frequencies?” Please use TRANS_FREQ as your DV; and REF_D, LOG_PRICE, PAGES_VIEWED, HOUSEHOLD_SIZE, CHILDREN_D, and CONNECTIONSPEED_D as your IVs. Please report and interpret your regression results, which should include the interpretation of the regression coefficients.
b) Please repeat the analysis in question a) using the Negative Binomial Regression model. Please report and interpret your regression results and coefficients.
c) Please summarize your observations by comparing the results from a) and b).
4. a) Please use the linear regression technique to answer the question on “what are the factors that influence how much time people spend on a website?” Please use DURATION as your DV; and you may decide on the IVs by conducting the similar exercises in Assignment #1. Please ONLY report and interpret your final regression results.
b) Please use the linear regression technique to answer the question on “what are the factors that influence how many pages people views when visiting a website?” Please use PAGES_VIEWED as your DV; and you may decide on the IVs by conducting the similar exercises in Assignment #1. Please ONLY report and interpret your final regression results.
c) Alternatively, you can also use count data model (Poisson or Negarive Binomial) since PAGES_VIEWED is a variable with discrete and non-negative integers. Using the similar set of IVs, do you see significantly different results by using linear regression vs. count data models?
d) Please summarize your observations by comparing the results from a), b), and c).
Attachment:- HotelClickStream.rar

Calculate Your Essay Price
(550 words)

Approximate price: $22

Calculate the price of your order

550 words
We'll send you the first draft for approval by September 11, 2018 at 10:52 AM
Total price:
$26
The price is based on these factors:
Academic level
Number of pages
Urgency
Basic features
  • Free title page and bibliography
  • Unlimited revisions
  • Plagiarism-free guarantee
  • Money-back guarantee
  • 24/7 support
On-demand options
  • Writer’s samples
  • Part-by-part delivery
  • Overnight delivery
  • Copies of used sources
  • Expert Proofreading
Paper format
  • 275 words per page
  • 12 pt Arial/Times New Roman
  • Double line spacing
  • Any citation style (APA, MLA, Chicago/Turabian, Harvard)

Our guarantees

Delivering a high-quality product at a reasonable price is not enough anymore.
That’s why we have developed 5 beneficial guarantees that will make your experience with our service enjoyable, easy, and safe.

Money-back guarantee

You have to be 100% sure of the quality of your product to give a money-back guarantee. This describes us perfectly. Make sure that this guarantee is totally transparent.

Read more

Zero-plagiarism guarantee

Each paper is composed from scratch, according to your instructions. It is then checked by our plagiarism-detection software. There is no gap where plagiarism could squeeze in.

Read more

Free-revision policy

Thanks to our free revisions, there is no way for you to be unsatisfied. We will work on your paper until you are completely happy with the result.

Read more

Privacy policy

Your email is safe, as we store it according to international data protection rules. Your bank details are secure, as we use only reliable payment systems.

Read more

Fair-cooperation guarantee

By sending us your money, you buy the service we provide. Check out our terms and conditions if you prefer business talks to be laid out in official language.

Read more