Multiple outputs
Dumbo 0.21.20 adds support for multiple outputs by providing a -getpath option. Here’s an example: from dumbo import run, sumreducer, opt def mapper(key, value): for word in value.split(): yield word,...
View ArticleIntegration with Java code
Although Python has many advantages, you might still want to write some of your mappers or reducers in Java once in a while. Flexibility and speed are probably the most likely potential reasons. Thanks...
View ArticleOutputting Tokyo Cabinet or Constant DB files
Dumbo 0.21.30 got released this week. Apart from several bugfixes, it includes some cool new functionality that allows you to output Tokyo Cabinet or Constant DB files directly by using a special...
View Article
More Pages to Explore .....