Skip to content
Menu
Shark College
Shark College
E- Commerce Analytics

E- Commerce Analytics

December 27, 2021 by B3ln4iNmum

E- Commerce Analytics

AssignmentTutorOnline

Objective: Use Spark features for data analysis to derive the valuable insights.

Problem Statement:

You are working as a Big Data consultant for an E-commerce company. Your role is to analyze sales data. The company has multiple stores across the globe. They want you to do the analytics of their sales transaction data. You need to provide valuable insights to understand their sales across cities, state on a daily and weekly basis. Also, provide various other insights regarding the review of the products.

Domain: E-Commerce

Analysis to be done: Exploratory analysis, to determine actionable insights. 

Dataset File: olist_public_dataset.csv

Content: 

  1. Id
  2. order_status
  3. order_products_value
  4. order_freight_value
  5. order_items_qty
  6. order_purchase_timestamp
  7. order_aproved_at
  8. order_delivered_customer_date
  9. customer_city
  10. customer_state
  11. customer_zip_code_prefix
  12. product_name_lenght
  13. product_description_lenght
  14. product_photos_qty
  15. review_score

Insights on Historical Data

  1. Daily Insights
  2. SALESTotal sales.Total Sales in each Customer City.Total sales in each Customer State.
  3. ORDERSTotal number of orders sold.City wise order distribution.State wise order distribution.Average Review score per Order.Average Freight charges per order.Average time taken to approve the orders. (Order Approved – Order Purchased).Average order delivery time.
  4. Weekly Insights
    1. SALES
      1. Total sales.
      1. Total Sales in each Customer City.
      1. Total sales in each Customer State.
    1. ORDERS
      1. Total number of orders sold.
      1. City wise order distribution.
      1. State wise order distribution.
      1. Average Review score per Order.
      1. Average Freight charges per order.
      1. Average time taken to approve the orders. (Order Approved – Order Purchased).
      1. Average order delivery time.
    1. Total Freight charges.
    1. Freight charges distribution in each Customer City.

Approach

Tasks to perform:

Week 1: Approach Overview and Basic Configurations

  1. Install maven (3.6.2).
  2. Set environment variable of Maven

  a) Check if maven is setup properly using  “mvn -version” 

  • Install Java 1.8 and Scala 2.11.7
  • Use Intellij to validate or modify source code
  • Click “mvn clean install” to build jar file
  • Use README.md for details instructions and helper commands

Week 2: Data Ingestion

  1. Upload the entire data into Hive from CSV
  2. Copy the data from Hive into HDFS
  3. Check the data in HDFS path

Week 3 : Data Streaming 

  1. Create sample Maven Scala Project
  2. Add necessary spark dependencies
  3. Create Schema of CSV files
  4. Create Spark Session

  a) Add S3 details

  b) Add all variables to your environment as they have sensitive data

  • Read CSV file and convert into dataset
  • Create Map of City and Country
  • Convert Date to Hour, Month, Year, Daily, and Day Bucket using UDF
  • Iterate through all metrics for each column
  • For each type of segment, calculate stats of different cities. Stats include max, min, average, and total records

Week 4 : Data Analysis and Visualization

  1. Write the results into the HDFS
  2. Save final dataset into Amazon S3
  3. Create Amazon Document DB Cluster
  4. Save insights in Document DB and provide APIs to view aggregate data
  • Assignment status: Already Solved By Our Experts
  • (USA, AUS, UK & CA PhD. Writers)
  • CLICK HERE TO GET A PROFESSIONAL WRITER TO WORK ON THIS PAPER AND OTHER SIMILAR PAPERS, GET A NON PLAGIARIZED PAPER FROM OUR EXPERTS
QUALITY: 100% ORIGINAL PAPER – NO PLAGIARISM – CUSTOM PAPER

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recent Posts

  • International business management assignment on Hilton
  • 73861 – STUDENT- RESEARCH REPORT ASSESSMENT TASK 1Task Number
  • Assessment Task 1 – IndividualTask overviewAssessment
  • this is assigmnets of Diploma of hospitality managementDocument
  • Similarities And Differences Between Religious Buildings

Recent Comments

  • A WordPress Commenter on Hello world!

Archives

  • August 2022
  • July 2022
  • June 2022
  • May 2022
  • April 2022
  • March 2022
  • February 2022
  • January 2022
  • December 2021
  • November 2021
  • October 2021
  • September 2021

Categories

  • Uncategorized

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org
©2022 Shark College | Powered by WordPress and Superb Themes!