APJIS Asia Pacific Journal of Information Systems


The Journal for Information Professionals

Asia Pacific Journal of Information Systems (APJIS), a Scopus and ABDC indexed journal, is a
flagship journal of the information systems (IS) field in the Asia Pacific region.

ISSN 2288-5404 (Print) / ISSN 2288-6818 (Online)

Editor : Seung Hyun Kim

View full editorial board


Share this page

Current Issue

Date June 2021
Vol. No. Vol. 31 No. 2
DOI https://doi.org/10.14329/apjis.2021.31.2.185
Page 185~196
Title Iowa Liquor Sales Data Predictive Analysis Using Spark
Author Ankita Paul, Shuvadeep Kundu, Jongwook Woo
Keyword Machine learning, Big Data, Predictive analysis, PySpark, Regression
Abstract The paper aims to analyze and predict sales of liquor in the state of Iowa by applying machine learning algorithms to models built for prediction. We have taken recourse of Azure ML and Spark ML for our predictive analysis, which is legacy machine learning (ML) systems and Big Data ML, respectively. We have worked on the Iowa liquor sales dataset comprising of records from 2012 to 2019 in 24 columns and approximately 1.8 million rows. We have concluded by comparing the models with different algorithms applied and their accuracy in predicting the sales using both Azure ML and Spark ML. We find that the Linear Regression model has the highest precision and Decision Forest Regression has the fastest computing time with the sample data set using the legacy Azure ML systems. Decision Tree Regression model in Spark ML has the highest accuracy with the quickest computing time for the entire data set using the Big Data Spark systems.

Home     l      Site Map      l       Abstracting/Indexing      l      FAQ      l      Publisher      l       Contact Us     l       Admin Login

© 2013 The Korean Society of Management Information Systems. All rights reserved.