Distributed phrase-based machine translation training tool based on Hadoop.
http://geek.kyloo.net/software
http://www.facebook.com/edwardgao
Pydoop is a Python MapReduce and HDFS API for Hadoop. Built as a wrapper around the C++ API, pydoop allows you to develop full-fledged MapReduce applications with HDFS access. Here is how you write a basic Python wordcount with pydoop:
http://sourceforge.net/apps/mediawiki/pydoop/index.php?title=Main_Page
High Performance Distributed File System and Parallel Data Processing Engine
http://sector.sourceforge.net/index.html