Introduction: This PySpark notebook introduces Spark GraphFrames. The dataset comes from SNAP (https://snap.stanford.edu/data/egonets-Facebook.html). Dataset description: 4,039 nodes and 88,234 edges. Abstract from Stanford’s website: This dataset consists of ‘circles’ (or ‘friends lists’) from Facebook. Facebook data was collected from survey participants using a Facebook app. The dataset includes node features (profiles), circles, and ego networks. Facebook data has … Continue reading Facebook circles – A Gentle Introduction to Apache Spark GraphFrames
Adding new images to Amazon Web Services (AWS)
In this article we discuss how to upload and use Microsoft Windows client operating systems in Amazon Web Services. Unfortunately, there are no pre-canned images with Microsoft Windows client operating systems in AWS. However, this does not mean that one cannot use them in Amazon’s EC2. Although it is trickier than just attaching an ISO image … Continue reading Adding new images to Amazon Web Services (AWS)
13 (1-liner) Steps to Setting up PostgreSQL and connecting with JPA Hibernate
This article outlines the steps required to set up PostgreSQL 9.5 on Ubuntu 16.04. The last two steps cover the Java Spring configuration required to use JPA Hibernate with the installed PostgreSQL instance. This article is based on the work of DigitalOcean (How To Install and Use PostgreSQL on Ubuntu 16.04). 1. … Continue reading 13 (1-liner) Steps to Setting up PostgreSQL and connecting with JPA Hibernate
Highlight a set of rows in Excel based on date range
Hi, in this blog post we look at how to highlight a set of rows in Microsoft Excel that fall between two dates. This is inspired by Rodel Balasabas's comment on my other post about highlighting a row in Excel. Thanks! The final product looks as follows: Adding multiple conditions to Excel’s conditional … Continue reading Highlight a set of rows in Excel based on date range
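The rule behind this kind of highlighting is a two-sided date comparison: a row qualifies when its date is on or after the start date and on or before the end date. The excerpt does not show the actual Excel formula, so the following is only a minimal Python sketch of that logic. The inclusive bounds, the `in_range` helper, and the sample rows are all assumptions made for illustration.

```python
from datetime import date

def in_range(row_date, start, end):
    """Return True when row_date lies between start and end (inclusive, by assumption)."""
    return start <= row_date <= end

# Hypothetical sample rows: (label, date)
rows = [
    ("Task A", date(2016, 1, 5)),
    ("Task B", date(2016, 2, 10)),
    ("Task C", date(2016, 3, 20)),
]

# Hypothetical date range to highlight
start, end = date(2016, 2, 1), date(2016, 3, 1)

# Labels of the rows that would be highlighted
highlighted = [label for label, d in rows if in_range(d, start, end)]
print(highlighted)
```

In a conditional formatting rule the same test would be written as a single condition that combines both comparisons, which is why the post talks about adding multiple conditions.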
Mapping molecule similarity using rdkit
One cool feature in rdkit is the ability to visually highlight the similarity between two given molecules. In another post (Tanimoto Molecular Similarity Experiment) we saw how to find similar molecules using Tanimoto similarity and Morgan fingerprints. At the end of the experiment we created an image file with the 20 molecules most similar to D-Aspartate … Continue reading Mapping molecule similarity using rdkit
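For readers unfamiliar with the Tanimoto measure mentioned above, it is the number of fingerprint bits two molecules share divided by the number of bits set in either one. The sketch below is plain Python and deliberately does not use rdkit. The `tanimoto` helper and the on-bit indices are hypothetical and stand in for real Morgan fingerprints, and treating two empty fingerprints as having similarity 0.0 is an assumption.

```python
def tanimoto(bits_a, bits_b):
    """Tanimoto coefficient of two fingerprints given as collections of on-bit indices."""
    a, b = set(bits_a), set(bits_b)
    union = a | b
    if not union:
        # Assumption: two empty fingerprints are treated as dissimilar.
        return 0.0
    return len(a & b) / len(union)

# Hypothetical on-bit indices, standing in for real fingerprints
fp_query = {1, 4, 7, 9}
fp_candidate = {1, 4, 8, 9, 12}

# 3 shared bits out of 6 distinct bits
print(tanimoto(fp_query, fp_candidate))  # 0.5
```

Ranking a library of molecules by this score against a query molecule is one way to arrive at a "most similar" list like the one described in the excerpt.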