1.Hadoop数据类型如下图:
由上图的Writable层次结构图可以看到绝大多数的数据类型都实现了Writable、WritableComparable接口,在此先分析一下这两个接口情况。自顶下下逐步分析。
Writable接口的定义如下:
- <span style="font-family:SimSun;font-size:14px;">package org.apache.hadoop.io;
-
- import java.io.DataOutput;
- import java.io.DataInput;
- import java.io.IOException;
- public interface Writable {
-
-
-
-
-
- void write(DataOutput out) throws IOException;
-
-
-
-
-
-
- void readFields(DataInput in) throws IOException;
- }</span>
而DataInput、DataOutput是java.io.*中最基本的输入输出流接口,以此writable与流联系起来,其他输入输出流都需要实现DataInput与DataOutput这两个接口的方法。关于这两个接口,另外开篇分析解读。
WritableComparable接口定义如下:
- <span style="font-family:SimSun;font-size:14px;">package org.apache.hadoop.io;
- public interface WritableComparable<T> extends Writable, comparable<T> {
- }</span>
咋一看这个WritableComparable没有方法,其实它的方法全都是通过继承而来的,Writable接口上面已经分析了,所以WritableComparable以下两个方法。
- <span style="font-family:SimSun;font-size:14px;">void write(DataOutput out) throws IOException;
-
- void readFields(DataInput in) throws IOException;</span>
还有来自comparable的方法,comparable是属于java.lang.*中的一个接口,它只有一个方法。
- <span style="font-family:SimSun;font-size:14px;">int compareTo( T other);
-
-
-
-
-
-
- </span>
2.IntWritable类定义如下:
- <span style="font-family:SimSun;font-size:14px;">package org.apache.hadoop.io;
-
- import java.io.*;
-
-
- public class IntWritable implements WritableComparable {
- private int value;
-
- public IntWritable() {}
-
- public IntWritable(int value) { set(value); }
-
-
- public void set(int value) { this.value = value; }
-
-
- public int get() { return value; }
-
- public void readFields(DataInput in) throws IOException {
- value = in.readInt();
- }
-
- public void write(DataOutput out) throws IOException {
- out.writeInt(value);
- }
-
-
- public boolean equals(Object o) {
- if (!(o instanceof IntWritable))
- return false;
- IntWritable other = (IntWritable)o;
- return this.value == other.value;
- }
-
- public int hashCode() {
- return value;
- }
-
-
- public int compareTo(Object o) {
- int thisValue = this.value;
- int thatValue = ((IntWritable)o).value;
- return (thisValue<thatValue ? -1 : (thisValue==thatValue ? 0 : 1));
- }
-
- public String toString() {
- return Integer.toString(value);
- }
-
-
- public static class Comparator extends WritableComparator {
- public Comparator() {
- super(IntWritable.class);
- }
-
- public int compare(byte[] b1, int s1, int l1,
- byte[] b2, int s2, int l2) {
- int thisValue = readInt(b1, s1);
- int thatValue = readInt(b2, s2);
- return (thisValue<thatValue ? -1 : (thisValue==thatValue ? 0 : 1));
- }
- }
-
- static {
- WritableComparator.define(IntWritable.class, new Comparator());
- }
- }</span>
3.一般对于自定义数据类型要实现Writable接口,因为数据在网络传输或者进行永久性存储的时候,需要序列化和反序列化。如果该数据类型要作为主键使用或者要进行比较大小的操作,还要实现WritableComparable接口。
如:
- <span style="font-family:SimSun;font-size:14px;">public class Point3D implements WritableComparable<Point3D>
-
- {
-
- private float x,y,z;
-
- public float getX(){return x;}
-
- public float getY(){return y;}
-
- public float getZ(){return z;}
-
- public void readFields(DataInput in) throws IOException
-
- {
-
- x = in.readFloat();
-
- y = in.readFloat();
-
- z = in.readFloat();
-
- }
-
- public void write(DataOutput out) throws IOException
-
- {
-
- out.writeFloat(x);
-
- out.writeFloat(y);
-
- out.writeFloat(z);
-
- }
-
-
-
- public int CompareTo(Point3D p)
-
- {
-
-
-
-
-
- }
-
- }</span>
http://www.cnblogs.com/zhengyuhong/p/3952954.html