**Pitfalls in pseudo-random number sampling at scale with Apache Spark** - Jun 27, 2017.

Large scale simulation of random number generation is possible with today’s high speed & scalable distributed computing frameworks. Let’s understand how it can be achieved using Apache Spark.

Tags: Apache Spark, GitHub, Random, RDD

**The Surprising Complexity of Randomness** - Jun 15, 2017.

The reason we have pseudorandom numbers is because generating true random numbers using a computer is difficult. Computers, by design, are excellent at taking a set of instructions and carrying them out in the exact same way, every single time.

Tags: Complexity, Probability, Random, Randomization

**Embrace the Random: A Case for Randomizing Acceptance of Borderline Papers** - May 16, 2016.

A case for using randomization in the selection of borderline academic papers, a particular use case which has parallels with many other possible scenarios.

Tags: Academics, ICML, NIPS, Random, Randomization

**Random vs Pseudo-random – How to Tell the Difference** - Oct 26, 2015.

Statistical know-how is an integral part of Data Science. Explore randomness vs. pseudo-randomness in this explanatory post with examples.

Tags: Correlation, Random

**Surprising Random Correlations** - May 14, 2015.

An interesting demo showing how easy it is to find surprising correlations in real data. Is German unemployment rate related to Apple Stock? Is 10-year Treasury rate related to price of Red Winter Wheat? You will be surprised.

Tags: Correlation, Overfitting, Quandl, Random

**Year in Review: Top KDnuggets tweets in September** - Dec 30, 2014.

One pattern is random, other is machine-generated. Can you guess which?; 14 Awesome (and Free) #DataScience Books; Dilbert 20 funniest cartoons on #BigData, data mining, privacy; Watch: Statistical, Machine learning with R, great 15 hour online course.

Tags: Data Mining Books, Dilbert, R, Random

**Top KDnuggets tweets, Sep 19-21: Dilbert funniest cartoons on #BigData, data mining; Guess which pattern is random** - Sep 22, 2014.

Guess which pattern is random, which machine-generated? Dilbert 20 funniest cartoons on #BigData, data mining, privacy; Data Scientist Cartoon; Neural Networks and Deep Learning, free online book (draft).

Tags: Cartoon, Deep Learning, Dilbert, Free ebook, Neural Networks, Random