
I'm a graduate student in Information Systems at California State University, Los Angeles (CSU-LA).
I'm a data-driven continuous-improvement professional with a focus on Big Data analytics and data visualization.
Hotel Inspection Prediction System (Documentation: hotelinspection.info Code: GitHub)
• Analyzed TripAdvisor's dataset of customer reviews and ratings to predict which hotels have a higher likelihood of inspection, using Hadoop, MapReduce, Mahout, and MongoDB.
Implementing the Generalized Sequential Pattern algorithm with multiple minimum supports (MS-GSP) (Code: GitHub)
• Predicted customer purchasing behavior by mining sequential patterns and identified the most frequently purchased item in a particular store, using Java.
Python Data Analysis (Code: GitHub)
• 2012 election analysis, stock market analysis, and Titanic survival analysis using NumPy, pandas, Seaborn, and Matplotlib.
Text Mining, Web Scraping and Sentiment Analysis with Twitter Data (Code: GitHub)
• Cleaned the tweets, plotted and colored a word cloud, built a term-document matrix, listed the most frequent terms, and performed a comparative sentiment analysis of 4 pharma companies, using R.
Simulated E-commerce Portal (Documentation: PDF Code: GitHub)
• Built a system that efficiently delivers a product to a customer by identifying the nearest retailer with the product in stock. The system facilitates interaction between the distributed objects (customer, retailer, supplier, and warehouse) using Apache Axis, Java, Amazon EC2, the Google Maps API, and MongoDB.
Book Edu (Documentation: PDF Code: GitHub)
• Designed a web bookstore with modules for updating the user profile and password, searching for books and authors, and adding, updating, or removing books in the cart, using Java, HTML, CSS, JavaScript, and MySQL.
A rotated sorted array is the concatenation of two sorted arrays, so first find the pivot index where the array was rotated. Then split the input array into two sorted arrays at that index. If the search key falls within the range of the left array, run binary search on the left array; otherwise run it on the right array. … read more
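The steps above can be sketched in Python roughly as follows. This is a minimal illustration, not the post's own code: the function names (`find_rotation_index`, `binary_search`, `search_rotated`) are hypothetical, and it assumes the array was sorted in ascending order before rotation and contains no duplicate values.

```python
from bisect import bisect_left
from typing import Sequence


def find_rotation_index(nums: Sequence[int]) -> int:
    """Return the index of the smallest element, i.e. where the right run starts.

    Assumes distinct values; returns 0 when the array is not rotated.
    """
    lo, hi = 0, len(nums) - 1
    while lo < hi:
        mid = (lo + hi) // 2
        if nums[mid] > nums[hi]:
            # The smallest element lies strictly to the right of mid.
            lo = mid + 1
        else:
            # The smallest element is at mid or to its left.
            hi = mid
    return lo


def binary_search(nums: Sequence[int], key: int, lo: int, hi: int) -> int:
    """Binary search for key in the sorted slice nums[lo:hi]; return index or -1."""
    i = bisect_left(nums, key, lo, hi)
    return i if i < hi and nums[i] == key else -1


def search_rotated(nums: Sequence[int], key: int) -> int:
    """Search a rotated sorted array by splitting it at the rotation point."""
    if not nums:
        return -1
    pivot = find_rotation_index(nums)
    # Left run is nums[:pivot], right run is nums[pivot:]; each is sorted.
    if pivot > 0 and nums[0] <= key <= nums[pivot - 1]:
        return binary_search(nums, key, 0, pivot)
    return binary_search(nums, key, pivot, len(nums))
```

For example, `search_rotated([4, 5, 6, 7, 0, 1, 2], 0)` locates the pivot at index 4 and then searches only the right run. Both the pivot search and the binary search are logarithmic in the array length.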
SELECT "vw_bluetooth_failure_by_device_by_profile"."device_manufacturer" AS "Parent Category Label",
       "vw_bluetooth_failure_by_device_by_profile"."device_manufacturer" AS "Product Category",
       1 AS "Level",
       1 AS "Path",
       SUM("vw_bluetooth_failure_by_device_by_profile"."failure_count") AS "Failure Count"
FROM "public"."vw_bluetooth_failure_by_device_by_profile" "vw_bluetooth_failure_by_device_by_profile"
GROUP BY "vw_bluetooth_failure_by_device_by_profile"."device_manufacturer"
UNION
SELECT "vw_bluetooth_failure_by_device_by_profile"."device_model" AS [Parent Category Label],
       "vw_bluetooth_failure_by_device_by_profile"."device_manufacturer" & " > " & "vw_bluetooth_failure_by_device_by_profile"."device_model" AS [Product Category],
       2 AS [Level],
       1 AS [Path],
       SUM("vw_bluetooth_failure_by_device_by_profile"."failure_count") AS… read more
/***********
UUID   TIMESTAMP
uuid1  2016-01-01 21:10:05.123
uuid1  2016-01-01 21:12:05.123
uuid2  2016-01-01 21:14:12.433
uuid1  2016-01-01 21:12:25.123
uuid3  2016-01-01 21:14:12.433
uuid2  2016-01-01 21:18:12.433
uuid1  2016-01-01 22:22:25.123
Write an algorithm that generates a list of user sessions. e.g. the above input… read more
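One possible reading of the task can be sketched in Python as below. The excerpt is cut off before it defines what a session is, so this sketch makes its own assumption: a user's session ends when more than 30 minutes pass between consecutive events. The `SESSION_GAP` value, the `build_sessions` name, and the `(uuid, start, end)` output shape are all hypothetical and not taken from the original post.

```python
from collections import defaultdict
from datetime import datetime, timedelta
from typing import Iterable, List, Tuple

# Assumption: the original post's session rule is not visible in the excerpt,
# so a 30-minute inactivity gap is used purely for illustration.
SESSION_GAP = timedelta(minutes=30)
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

Session = Tuple[str, datetime, datetime]


def build_sessions(
    events: Iterable[Tuple[str, str]], gap: timedelta = SESSION_GAP
) -> List[Session]:
    """Group (uuid, timestamp) events into (uuid, start, end) sessions."""
    by_user = defaultdict(list)
    for uuid, raw_ts in events:
        by_user[uuid].append(datetime.strptime(raw_ts, TIMESTAMP_FORMAT))

    sessions: List[Session] = []
    for uuid, stamps in by_user.items():
        stamps.sort()  # the input is not guaranteed to be in time order
        start = prev = stamps[0]
        for ts in stamps[1:]:
            if ts - prev > gap:
                # Inactivity exceeded the gap: close the current session.
                sessions.append((uuid, start, prev))
                start = ts
            prev = ts
        sessions.append((uuid, start, prev))
    return sessions
```

Under this assumed 30-minute rule, the sample input splits uuid1 into two sessions (the 22:22 event arrives 70 minutes after the previous one), while uuid2 and uuid3 each get a single session. A different gap can be passed through the `gap` parameter.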