KDnuggets Home » News » 2012 » Sep » Publications » BigData Borat speaks with KDnuggets about borshobytes, traditional Kazakhstan data measure  ( < Prev | 12:n22 | Next > )

BigData Borat speaks with KDnuggets about borshobytes, traditional Kazakhstan data measure


 
  
I manage to ask BigData Borat how big is his data, and learn when data gets restraining order against compute and what is the best tool for small data.


Here is my exchange with
BigData Borat

Big Data Borat,

@BigDataBorat




kdnuggets:
@BigDataBorat боpщ (Borscht) is a Big Ukrainian soup, so Borshobyte is for measuring Big Data streams, no? How many Borshobytes in Twitter?

BigDataBorat:
@kdnuggets Twitter data big but no is боpщоbyte can better measure as боpщкаbyte. BigDataBorat Lab have policy for never laugh at small data

kdnuggets:
@BigDataBorat so what is the best way to deal with Big Data which is bigger than 3 боpщоbytes?

BigDataBorat:

  • At BigDataBorat lab all rack is arrange in 3 dimensions cube with 3 stack on top each other. Best scaling of world wide.
  • In all case is important for solution scale linearly. In extreme case solution need scale cubically.
  • For best scale is important for bring compute close to data. But too close and data get restraining order against compute.
kdnuggets:
@BigDataBorat What tools do you use when you work with small data? And how do you Hadoop in Kazakhstan?

BigDataBorat:

  • Work with Hadoop hard problem is find Java and UNIX skill. BigDataBoratLab solve with novel way.
  • Outside BigDataBoratLab have box say "Free AbstractClassFactoryImpl inside." Anyone lift box trap in falling cage.
  • From there Java expert sent to dungeon for work on Map/Reduce code.
  • Small data like sand, if no careful can get every place. Best tool for work is microscope.
BigDataBorat:
You can pet a sheep but you no can petabyte. #bigdata

TGIF ! Stay tuned until next Friday !


KDnuggets Home » News » 2012 » Sep » Publications » BigData Borat speaks with KDnuggets about borshobytes, traditional Kazakhstan data measure  ( < Prev | 12:n22 | Next > )