Recently, Clounce was carrying out some experiments with the MNIST dataset and wanted a glimpse of what the digits in this dataset look like, so he decided to display them as ASCII art in a console window. What is the MNIST dataset? MNIST (http://yann.lecun.com/exdb/mnist/index.html) is a subset of NIST (https://www.nist.gov/srd/nist-special-database-19), a database of handwritten digits. … Continue reading Fun with MNIST dataset
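As a rough illustration of the ASCII art idea, here is a minimal sketch that maps grayscale pixel values to characters. It is not the code from the post: the function name `to_ascii`, the character ramp, and the tiny hand-made sample grid are all assumptions made for this example, and it assumes pixel values in the 0-255 range with higher values meaning darker ink.

```python
# Characters ordered from lightest to darkest. This ramp is an arbitrary
# choice for illustration, not taken from the original post.
ASCII_RAMP = " .:-=+*#%@"


def to_ascii(image, ramp=ASCII_RAMP):
    """Render a 2D grid of 0-255 grayscale values as lines of text."""
    lines = []
    for row in image:
        # Scale each pixel from 0..255 onto the ramp indices 0..len(ramp)-1.
        chars = [ramp[pixel * (len(ramp) - 1) // 255] for pixel in row]
        lines.append("".join(chars))
    return "\n".join(lines)


if __name__ == "__main__":
    # A hand-made 5x5 grid standing in for a 28x28 MNIST digit (hypothetical data).
    sample = [
        [0, 0, 255, 0, 0],
        [0, 128, 255, 0, 0],
        [0, 0, 255, 0, 0],
        [0, 0, 255, 0, 0],
        [0, 255, 255, 255, 0],
    ]
    print(to_ascii(sample))
```

The same function should work on a real digit once its 28x28 pixels are loaded into a list of rows; how the MNIST files are read is left out here on purpose.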

ACM has recently published new LaTeX templates (https://www.acm.org/publications/proceedings-template). We tweaked their sample to remove ACM-specific text and changed the paper size to A4. This is useful for students who want to use the ACM SIG Proceedings format for their university assignments. The sample paper is available at https://github.com/jod75/acm-like-text-paper. Changes we made: Paper … Continue reading ACM SIG Proceedings-Like paper

Introduction. This PySpark notebook introduces Spark GraphFrames. The dataset comes from SNAP: https://snap.stanford.edu/data/egonets-Facebook.html. Dataset description: 4039 nodes, 88234 edges. Abstract from Stanford’s website: This dataset consists of ‘circles’ (or ‘friends lists’) from Facebook. Facebook data was collected from survey participants using a Facebook app. The dataset includes node features (profiles), circles, and ego networks. Facebook data has … Continue reading Facebook circles – A Gentle Introduction to Apache Spark GraphFrames