Introduction This PySpark notebook introduces Spark GraphFrames. The dataset SNAP: Dataset description: Nodes 4039 Edges 88234 Abstract from Stanford’s website: This dataset consists of ‘circles’ (or ‘friends lists’) from Facebook. Facebook data was collected from survey participants using a Facebook app. The dataset includes node features (profiles), circles, and ego networks. Facebook data has … Continue reading Facebook circles – A Gentle Introduction to Apache Spark GraphFrames

In this article we discuss how to upload and use Microsoft Windows client operating systems in Amazon Web Services.  Unfortunately, there are no pre-canned images with Microsoft Windows client operating systems in AWS.  However, this does not mean that one cannot use them in Amazon’s EC2.  Although it is trickier than just attaching an ISO image … Continue reading Adding new images to Amazon Web Services (AWS)

A common requirement when working with molecules is to display their molecular structure as an image.  To do this in iPython Notebook requires some simple steps.  Below is a way how you can do it.  Enjoy! %matplotlib inline %pylab inline from IPython.display import Image from rdkit.Chem import AllChem as Chem from rdkit.Chem.Draw import IPythonConsole Example: … Continue reading Painting molecules using rdkit and iPython Notebook