snehosmani.comPaper Summary : The Google File SystemThe Google File System(GFS) is a distributed file system created by google for large distributed data-intensive applications. Unlike HDFS (Hadoop Distributed File System) which is open source, GFS is a proprietary system. Both these and other distrib...Oct 7, 2022·4 min read
snehosmani.comPaper Summary - Scaling Big Data Mining Infrastructure: The Twitter ExperienceThis case study talks about the experience and expectations of “Data Scientists/Engineers/Analysts” at Twitter. Jimmy lin and Dmitiry Ryaboy, the authors, share priceless information about the numerous challenges they faced in their journey to scale ...Sep 30, 2022·4 min read