Introduction This PySpark notebook introduces Spark GraphFrames. The dataset SNAP: https://snap.stanford.edu/data/egonets-Facebook.html Dataset description: Nodes 4039 Edges 88234 Abstract from Stanford’s website: This dataset consists of ‘circles’ (or ‘friends lists’) from Facebook. Facebook data was collected from survey participants using a Facebook app. The dataset includes node features (profiles), circles, and ego networks. Facebook data has … Continue reading Facebook circles – A Gentle Introduction to Apache Spark GraphFrames

In this article we discuss how to upload and use Microsoft Windows client operating systems in Amazon Web Services.  Unfortunately, there are no pre-canned images with Microsoft Windows client operating systems in AWS.  However, this does not mean that one cannot use them in Amazon’s EC2.  Although it is trickier than just attaching an ISO image … Continue reading Adding new images to Amazon Web Services (AWS)

The steps outlined in this article outlines the steps required to setup PostgreSQL 9.5 on Ubuntu 16.04. The last two steps outline the required Java Spring configuration required to use JPA Hibernate to the PostgreSQL installed. This article is based on the work of DigitalOcean (How To Install and Use PostgreSQL on Ubuntu 16.04). 1. … Continue reading 13 (1-liner) Steps to Setting up PostgreSQL and connecting with JPA Hibernate