Introduction: This PySpark notebook introduces Spark GraphFrames. The dataset is from SNAP. Dataset description: 4,039 nodes and 88,234 edges. Abstract from Stanford’s website: This dataset consists of ‘circles’ (or ‘friends lists’) from Facebook. Facebook data was collected from survey participants using a Facebook app. The dataset includes node features (profiles), circles, and ego networks. Facebook data has … Continue reading Facebook circles – A Gentle Introduction to Apache Spark GraphFrames

This article outlines the steps required to set up PostgreSQL 9.5 on Ubuntu 16.04. The last two steps cover the Java Spring configuration needed to use JPA Hibernate with the installed PostgreSQL instance. This article is based on the work of DigitalOcean (How To Install and Use PostgreSQL on Ubuntu 16.04). 1. … Continue reading 13 (1-liner) Steps to Setting up PostgreSQL and connecting with JPA Hibernate