由于百度到的Hessian2.0协议规范都是老的,不信看Double的定义可以知道,所以自己上传一份官方的,方便需要的同学
Hessian 2.0 Serialization Protocol
hessian.txt
Status of this Memo
By submittingthis Internet-Draft, each author represents that any applicable patent or otherIPR claims of which he or she is aware have been or will be disclosed, and anyof which he or she becomes aware will be disclosed, in accordance with Section 6of BCP 79.
Internet-Draftsare working documents of the Internet Engineering Task Force (IETF), its areas,and its working groups. Note that other groups may also distribute workingdocuments as Internet-Drafts.
Internet-Draftsare draft documents valid for a maximum of six months and may be updated,replaced, or obsoleted by other documents at any time. It is inappropriate touse Internet-Drafts as reference material or to cite them other than as “workin progress.”
The list ofcurrent Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.
The list ofInternet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.
ThisInternet-Draft will expire on February 2, 2008.
Table of Contents
1. Introduction
2. Design Goals
3. Hessian Grammar
4. Serialization
4.1. binary data
4.1.1. Compact: short binary
4.1.2. Binary Examples
4.2. boolean
4.2.1. Boolean Examples
4.3. date
4.3.1. Compact: date in minutes
4.3.2. Date Examples
4.4. double
4.4.1. Compact: double zero
4.4.2. Compact: double one
4.4.3. Compact: double octet
4.4.4. Compact: double short
4.4.5. Compact: double float
4.4.6. Double Examples
4.5. int
4.5.1. Compact: single octet integers
4.5.2. Compact: two octet integers
4.5.3. Compact: three octet integers
4.5.4. Integer Examples
4.6. list
4.6.1. Compact: fixed length list
4.6.2. List examples
4.7. long
4.7.1. Compact: single octet longs
4.7.2. Compact: two octet longs
4.7.3. Compact: three octet longs
4.7.4. Compact: four octet longs
4.7.5. Long Examples
4.8. map
4.8.1. Map examples
4.9. null
4.10. object
4.10.1. Compact: class definition
4.10.2. Compact: object instantiation
4.10.3. Object examples
4.11. ref
4.11.1. Ref Examples
4.12. string
4.12.1. Compact: short strings
4.12.2. String Examples
4.13. type
4.14. Compact: type references
5. Reference Maps
5.1. value reference
5.2. class reference
5.3. type reference
6. Bytecode map
§ Authors' Addresses
§ Intellectual Property and CopyrightStatements
Hessian is adynamically-typed, binary serialization and Web Services protocol designed forobject-oriented transmission.
Hessian isdynamically-typed, compact, and portable across languages.
The Hessianprotocol has the following design goals:
· It must self-describe the serialized types, i.e. not requireexternal schema or interface definitions.
· It must be language-independent, including supporting scriptinglanguages.
· It must be readable or writable in a single pass.
· It must be as compact as possible.
· It must be simple so it can be effectively tested andimplemented.
· It must be as fast as possible.
· It must support Unicode strings.
· It must support 8-bit binary data without escaping or usingattachments.
· It must support encryption, compression, signature, andtransaction context envelopes.
# startingproduction
top ::= value
# 8-bit binarydata split into 64k chunks
binary ::= x41 b1 b0<binary-data> binary # non-final chunk
::= 'B' b1 b0<binary-data> # final chunk
::= [x20-x2f] <binary-data> # binary data of
# length 0-15
::= [x34-x37]<binary-data> # binary dataof
# length 0-1023
# boolean true/false
boolean ::= 'T'
::= 'F'
# definitionfor an object (compact map)
class-def ::= 'C' stringint string*
# time in UTCencoded as 64-bit long milliseconds since
# epoch
date ::= x4a b7 b6b5 b4 b3 b2 b1 b0
::= x4b b3 b2b1 b0 # minutes since epoch
# 64-bit IEEEdouble
double ::= 'D' b7 b6b5 b4 b3 b2 b1 b0
::= x5b # 0.0
::= x5c # 1.0
::= x5d b0 # byte cast to double
# (-128.0 to 127.0)
::= x5e b1b0 # short cast to double
::= x5f b3 b2b1 b0 # 32-bit float cast to double
# 32-bitsigned integer
int ::= 'I' b3 b2b1 b0
::=[x80-xbf] # -x10 to x3f
::= [xc0-xcf]b0 # -x800 to x7ff
::= [xd0-xd7]b1 b0 # -x40000 to x3ffff
# list/vector
list ::= x55 typevalue* 'Z' # variable-length list
::= 'V' type int value* # fixed-length list
::= x57 value*'Z' # variable-length untyped list
::= x58 intvalue* # fixed-length untyped list
::= [x70-77] type value* # fixed-length typed list
::= [x78-7f] value* # fixed-length untyped list
# 64-bitsigned long integer
long ::= 'L' b7 b6b5 b4 b3 b2 b1 b0
::=[xd8-xef] # -x08 to x0f
::= [xf0-xff]b0 # -x800 to x7ff
::= [x38-x3f]b1 b0 # -x40000 to x3ffff
::= x59 b3 b2b1 b0 # 32-bit integer cast to long
# map/object
map ::= 'M' type(value value)* 'Z' # key, value mappairs
::= 'H' (value value)* 'Z' # untyped key, value
# null value
null ::= 'N'
# Objectinstance
object ::= 'O' intvalue*
::= [x60-x6f] value*
# valuereference (e.g. circular trees and graphs)
ref ::= x51int # reference to nth map/list/object
# UTF-8encoded character string split into 64k chunks
string ::= x52 b1 b0<utf8-data> string # non-finalchunk
::= 'S' b1 b0<utf8-data> # string oflength
# 0-65535
::= [x00-x1f]<utf8-data> # string oflength
# 0-31
::= [x30-x34]<utf8-data> # string oflength
# 0-1023
# map/listtypes for OO languages
type ::=string # type name
::= int # type reference
# mainproduction
value ::= null
::= binary
::= boolean
::= class-defvalue
::= date
::= double
::= int
::= list
::= long
::= map
::= object
::= ref
::= string
Figure 1 |
Hessian'sobject serialization has 8 primitive types:
1. raw binary data
2. boolean
3. 64-bit millisecond date
4. 64-bit double
5. 32-bit int
6. 64-bit long
7. null
8. UTF8-encoded string
It has 3recursive types:
1. list forlists and arrays
2. map formaps and dictionaries
3. object forobjects
Finally, it hasone special contruct:
1. ref forshared and circular object references.
Hessian 2.0 has3 internal reference maps:
1. An object/listreference map.
2. An class definitionreference map.
3. A type (classname) reference map.
binary ::= b b1 b0 <binary-data> binary
::= B b1 b0<binary-data>
::= [x20-x2f]<binary-data>
Figure 2 |
Binary data isencoded in chunks. The octet x42 ('B') encodes the final chunk and x62 ('b')represents any non-final chunk. Each chunk has a 16-bit length value.
len = 256 * b1+ b0
Binary datawith length less than 15 may be encoded by a single octet length [x20-x2f].
len = code -0x20
x23 x01 x02 x03 # 3octet data
B x10 x00 .... # 4kfinal chunk of data
b x04 x00 .... # 1knon-final chunk of data
Figure 3 |
boolean ::= T
::= F
Figure 4 |
The octet 'F'represents false and the octet T represents true.
F # false
Figure 5 |
date ::= x4a b7 b6 b5 b4 b3 b2 b1 b0
::= x4b b4 b3 b2 b1b0
Figure 6 |
Daterepresented by a 64-bit long of milliseconds since Jan 1 1970 00:00H, UTC.
4.3.1. Compact: date in minutes
The second formcontains a 32-bit int of minutes since Jan 1 1970 00:00H, UTC.
x4a x00 x00 x00 xd0 x4bx92 x84 xb8 # 09:51:31 May 8, 1998 UTC
Figure 7 |
x4b x4b x92 x0b xa0 # 09:51:00 May 8, 1998 UTC
Figure 8 |
double ::= D b7 b6 b5 b4 b3 b2 b1 b0
::= x5b
::= x5c
::= x5d b0
::= x5e b1 b0
::= x5f b3 b2 b1b0
Figure 9 |
A 64-bit IEEEfloating pointer number.
The double 0.0can be represented by the octet x5b
The double 1.0can be represented by the octet x5c
Doubles between-128.0 and 127.0 with no fractional component can be represented in two octetsby casting the byte value to a double.
value =(double) b0
Doubles between-32768.0 and 32767.0 with no fractional component can be represented in threeoctets by casting the short value to a double.
value =(double) (256 * b1 + b0)
Doubles whichare equivalent to their 32-bit float representation can be represented as the4-octet float and then cast to double.
x5c # 1.0
x5d x00 # 0.0
x5d x80 # -128.0
x5d x7f # 127.0
x5e x00 x00 # 0.0
x5e x80 x00 # -32768.0
x5e x7f xff # 32767.0
D x40 x28 x80 x00 x00 x00 x00 x00 # 12.25
Figure 10 |
int ::= 'I' b3 b2 b1 b0
::= [x80-xbf]
::= [xc0-xcf] b0
::= [xd0-xd7] b1 b0
Figure 11 |
A 32-bit signedinteger. An integer is represented by the octet x49 ('I') followed by the 4octets of the integer in big-endian order.
value = (b3<< 24) + (b2 << 16) + (b1 << 8) + b0;
4.5.1. Compact: single octet integers
Integersbetween -16 and 47 can be encoded by a single octet in the range x80 to xbf.
value = code -0x90
4.5.2. Compact: two octet integers
Integersbetween -2048 and 2047 can be encoded in two octets with the leading byte inthe range xc0 to xcf.
value = ((code- 0xc8) << 8) + b0;
4.5.3. Compact: three octet integers
Integersbetween -262144 and 262143 can be encoded in three bytes with the leading bytein the range xd0 to xd7.
value = ((code- 0xd4) << 16) + (b1 << 8) + b0;
x80 # -16
xbf # 47
xc8 x00 # 0
xc0 x00 #-2048
xc7 x00 # -256
xcf xff # 2047
xd4 x00 x00 # 0
xd0 x00 x00 #-262144
xd7 xff xff #262143
I x00 x00 x00 x00 # 0
I x00 x00 x01 x2c # 300
Figure 12 |
list ::= x55 type value* 'Z' # variable-length list
::= 'V' type intvalue* # fixed-length list
::= x57 value*'Z' # variable-length untyped list
::= x58 intvalue* # fixed-length untyped list
::= [x70-77] typevalue* # fixed-length typed list
::= [x78-7f]value* # fixed-length untyped list
Figure 13 |
An orderedlist, like an array. The two list productions are a fixed-length list and avariable length list. Both lists have a type. The type string may be anarbitrary UTF-8 string understood by the service.
Each list itemis added to the reference list to handle shared and circular elements. See theref element.
Any parserexpecting a list must also accept a null or a shared ref.
The validvalues of type are not specified in this document and may depend on thespecific application. For example, a server implemented in a language withstatic typing which exposes an Hessian interface can use the type informationto instantiate the specific array type. On the other hand, a server written ina dynamicly-typed language would likely ignore the contents of type entirelyand create a generic array.
4.6.1. Compact: fixed length list
Hessian 2.0allows a compact form of the list for successive lists of the same type wherethe length is known beforehand. The type and length are encoded by integers,where the type is a reference to an earlier specified type.
Serialization of a typed int array: int[] = {0, 1}
V #fixed length, typed list
x04 [int # encoding of int[] type
x92 # length = 2
x90 # integer 0
x91 # integer 1
Figure 14 |
untyped variable-length list = {0, 1}
x57 #variable-length, untyped
x90 # integer 0
x91 # integer 1
Z
Figure 15 |
x72 #typed list length=2
x04 [int # type for int[] (save as type #0)
x90 # integer 0
x91 # integer 1
x73 #typed list length = 3
x90 # type reference to int[](integer #0)
x92 # integer 2
x93 # integer 3
x94 # integer 4
Figure 16 |
long ::= L b7 b6 b5 b4 b3 b2 b1 b0
::= [xd8-xef]
::= [xf0-xff] b0
::= [x38-x3f] b1 b0
::= x4c b3 b2 b1 b0
Figure 17 |
A 64-bit signedinteger. An long is represented by the octet x4c ('L' ) followed by the 8-bytesof the integer in big-endian order.
4.7.1. Compact: single octet longs
Longs between-8 and 15 are represented by a single octet in the range xd8 to xef.
value = (code -0xe0)
4.7.2. Compact: two octet longs
Longs between-2048 and 2047 are encoded in two octets with the leading byte in the range xf0to xff.
value = ((code- 0xf8) << 8) + b0
4.7.3. Compact: three octet longs
Longs between-262144 and 262143 are encoded in three octets with the leading byte in therange x38 to x3f.
value = ((code- 0x3c) << 16) + (b1 << 8) + b0
4.7.4. Compact: four octet longs
Longs betweenwhich fit into 32-bits are encoded in five octets with the leading byte x4c.
value = (b3<< 24) + (b2 << 16) + (b1 << 8) + b0
xd8 # -8
xef # 15
xf8 x00 # 0
xf0 x00 #-2048
xf7 x00 #-256
xff xff #2047
x3c x00 x00 # 0
x38 x00 x00 #-262144
x3f xff xff #262143
x4c x00 x00 x00 x00 # 0
x4c x00 x00 x01 x2c #300
L x00 x00 x00 x00 x00 x00 x01 x2c # 300
Figure 18 |
map ::= M type(value value)* Z
Figure 19 |
Representsserialized maps and can represent objects. The type element describes the typeof the map.
The type may beempty, i.e. a zero length. The parser is responsible for choosing a type if oneis not specified. For objects, unrecognized keys will be ignored.
Each map isadded to the reference list. Any time the parser expects a map, it must also beable to support a null or a ref.
The type ischosen by the service.
map = new HashMap();
map.put(new Integer(1), "fee");
map.put(new Integer(16), "fie");
map.put(new Integer(256), "foe");
---
H # untyped map(HashMap for Java)
x91 # 1
x03 fee # "fee"
xa0 # 16
x03 fie # "fie"
xc9 x00 # 256
x03 foe # "foe"
Z
Figure 20 |
Map Representation of a Java Object
public class Car implements Serializable {
String color = "aquamarine";
String model ="Beetle";
int mileage = 65536;
}
---
M
x13com.caucho.test.Car # type
x05 color # color field
x0a aquamarine
x05 model # model field
x06 Beetle
x07 mileage # mileage field
I x00 x01 x00 x00
Z
Figure 21 |
null ::= N
Figure 22 |
Null representsa null pointer.
The octet 'N'represents the null value.
class-def ::= 'C' stringint string*
object ::= 'O' intvalue*
::= [x60-x6f]value*
Figure 23 |
4.10.1. Compact: class definition
Hessian 2.0 hasa compact object form where the field names are only serialized once. Followingobjects only need to serialize their values.
The objectdefinition includes a mandatory type string, the number of fields, and thefield names. The object definition is stored in the object definition map andwill be referenced by object instances with an integer reference.
4.10.2. Compact: object instantiation
Hessian 2.0 hasa compact object form where the field names are only serialized once. Followingobjects only need to serialize their values.
The objectinstantiation creates a new object based on a previous definition. The integervalue refers to the object definition.
class Car {
String color;
String model;
}
out.writeObject(new Car("red","corvette"));
out.writeObject(new Car("green", "civic"));
---
C # object definition (#0)
x0b example.Car # type is example.Car
x92 # two fields
x05 color # color field name
x05 model # model field name
O # object def (long form)
x90 # object definition #0
x03 red # color field value
x08 corvette # model field value
x60 # object def #0 (short form)
x05 green # color field value
x05 civic # model field value
Figure 24 |
RED,
GREEN,
BLUE,
}
out.writeObject(Color.RED);
out.writeObject(Color.GREEN);
out.writeObject(Color.BLUE);
out.writeObject(Color.GREEN);
---
C # class definition #0
x0b example.Color # type is example.Color
x91 # one field
x04 name # enumeration field is"name"
x60 # object #0 (class def #0)
x03 RED # RED value
x60 # object #1 (class def #0)
x90 # object definition ref #0
x05 GREEN # GREEN value
x60 # object #2 (class def #0)
x04 BLUE # BLUE value
x51 x91 # object ref #1, i.e. Color.GREEN
Figure 25 |
ref ::= x51 int
Figure 26 |
An integerreferring to a previous list, map, or object instance. As each list, map orobject is read from the input stream, it is assigned the integer position inthe stream, i.e. the first list or map is '0', the next is '1', etc. A laterref can then use the previous object. Writers MAY generate refs. Parsers MUSTbe able to recognize them.
ref can referto incompletely-read items. For example, a circular linked-list will refer tothe first link before the entire list has been read.
A possibleimplementation would add each map, list, and object to an array as it is read.The ref will return the corresponding value from the array. To support circularstructures, the implementation would store the map, list or object immediately,before filling in the contents.
Each map orlist is stored into an array as it is parsed. ref selects one of the storedobjects. The first object is numbered '0'.
list = new LinkedList();
list.data = 1;
list.tail = list;
---
C
x0a LinkedList
x92
x04 head
x04 tail
o x90 # objectstores ref #0
x91 # data = 1
x51 x90 # next field refers to itself, i.e. ref #0
Figure 27 |
ref only refersto list, map and objects elements. Strings and binary data, in particular, willonly share references if they're wrapped in a list or map.
string ::= x52 b1 b0 <utf8-data> string
::= S b1 b0<utf8-data>
::= [x00-x1f]<utf8-data>
::= [x30-x33] b0<utf8-data>
Figure 28 |
A 16-bitunicode character string encoded in UTF-8. Strings are encoded in chunks. x53('S') represents the final chunk and x52 ('R') represents any non-final chunk.Each chunk has a 16-bit unsigned integer length value.
The length isthe number of 16-bit characters, which may be different than the number ofbytes.
String chunksmay not split surrogate pairs.
4.12.1. Compact: short strings
Strings withlength less than 32 may be encoded with a single octet length [x00-x1f].
value = code
x05 hello #"hello"
x01 xc3 x83 #"\u00c3"
S x00 x05 hello #"hello" in long form
x52 x00 x07 hello, #"hello, world" split into two chunks
x05 world
Figure 29 |
type ::= string
::= int
Figure 30 |
A map or list includesa type attribute indicating the type name of the map or list forobject-oriented languages.
Each type isadded to the type map forfuture reference.
4.14. Compact: type references
Repeated typestrings MAY use the type map torefer to a previously used type. The type reference is zero-based over all thetypes encountered during parsing.
Hessian 2.0 has3 internal reference maps:
1. An map/object/list reference map.
2. An class definition map.
3. A type (class name) map.
The valuereference map lets Hessian support arbitrary graphs, and recursive and circulardata structures.
The class andtype maps improve Hessian efficiency by avoiding repetition of common stringdata.
Hessiansupports arbitrary graphs by adding list, object,and map asit encounters them in the bytecode stream.
Parsers MUSTstore each list, object and map in the reference map as they are encountered.
The storedobjects can be used with a ref bytecode.
Each objectdefinition is automatically added to the class-map. ParsersMUST add a class definition to the class map as each is encountered. Followingobject instances will refer to the defined class.
The type stringsfor map and list valuesare stored in a type map for reference.
Parsers MUSTadd a type string to the type map as each is encountered.
Hessian isorganized as a bytecode protocol. A Hessian reader is essentially a switchstatement on the initial octet.
x00 - x1f # utf-8string length 0-32
x20 - x2f # binarydata length 0-16
x30 - x33 # utf-8string length 0-1023
x34 - x37 # binarydata length 0-1023
x38 - x3f #three-octet compact long (-x40000 to x3ffff)
x40 # reserved(expansion/escape)
x41 # 8-bitbinary data non-final chunk ('A')
x42 # 8-bitbinary data final chunk ('B')
x43 # objecttype definition ('C')
x44 # 64-bitIEEE encoded double ('D')
x45 # reserved
x46 # booleanfalse ('F')
x47 # reserved
x48 # untypedmap ('H')
x49 # 32-bitsigned integer ('I')
x4a # 64-bit UTCmillisecond date
x4b # 32-bit UTCminute date
x4c # 64-bitsigned long integer ('L')
x4d # map withtype ('M')
x4e # null ('N')
x4f # objectinstance ('O')
x50 # reserved
x51 # referenceto map/list/object - integer ('Q')
x52 # utf-8string non-final chunk ('R')
x53 # utf-8string final chunk ('S')
x54 # booleantrue ('T')
x55 #variable-length list/vector ('U')
x56 #fixed-length list/vector ('V')
x57 #variable-length untyped list/vector ('W')
x58 #fixed-length untyped list/vector ('X')
x59 # longencoded as 32-bit int ('Y')
x5a # list/mapterminator ('Z')
x5b # double 0.0
x5c # double 1.0
x5d # doublerepresented as byte (-128.0 to 127.0)
x5e # doublerepresented as short (-32768.0 to 327676.0)
x5f # doublerepresented as float
x60 - x6f # objectwith direct type
x70 - x77 # fixed listwith direct length
x78 - x7f # fixeduntyped list with direct length
x80 - xbf # one-octetcompact int (-x10 to x3f, x90 is 0)
xc0 - xcf # two-octetcompact int (-x800 to x7ff)
xd0 - xd7 #three-octet compact int (-x40000 to x3ffff)
xd8 - xef # one-octetcompact long (-x8 to xf, xe0 is 0)
xf0 - xff # two-octetcompact long (-x800 to x7ff, xf8 is 0)
Figure 31 |
Authors' Addresses
| Scott Ferguson |
| Caucho Technology Inc. |
| P.O. Box 9001 |
| La Jolla, CA 92038 |
| USA |
Email: | |
|
|
| Emil Ong |
| Caucho Technology Inc. |
| P.O. Box 9001 |
| La Jolla, CA 92038 |
| USA |
Email: |
Full Copyright Statement
Copyright © TheIETF Trust (2007).
This documentis subject to the rights, licenses and restrictions contained in BCP 78,and except as set forth therein, the authors retain all their rights.
This documentand the information contained herein are provided on an “AS IS” basis and THECONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS OR IS SPONSORED BY (IF ANY),THE INTERNET SOCIETY, THE IETF TRUST AND THE INTERNET ENGINEERING TASK FORCEDISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANYWARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS ORANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Intellectual Property
The IETF takesno position regarding the validity or scope of any Intellectual Property Rightsor other rights that might be claimed to pertain to the implementation or useof the technology described in this document or the extent to which any licenseunder such rights might or might not be available; nor does it represent thatit has made any independent effort to identify any such rights. Information onthe procedures with respect to rights in RFC documents can be found inBCP 78 and BCP 79.
Copies of IPRdisclosures made to the IETF Secretariat and any assurances of licenses to bemade available, or the result of an attempt made to obtain a general license orpermission for the use of such proprietary rights by implementers or users ofthis specification can be obtained from the IETF on-line IPR repository at http://www.ietf.org/ipr.
The IETFinvites any interested party to bring to its attention any copyrights, patentsor patent applications, or other proprietary rights that may cover technologythat may be required to implement this standard. Please address the informationto the IETF at ietf-ipr@ietf.org.
由于百度到的Hessian2.0协议规范都是老的,不信看Double的定义可以知道,所以自己上传一份官方的,方便需要的同学
Hessian 2.0 Serialization Protocol
hessian.txt
Status of this Memo
By submittingthis Internet-Draft, each author represents that any applicable patent or otherIPR claims of which he or she is aware have been or will be disclosed, and anyof which he or she becomes aware will be disclosed, in accordance with Section 6of BCP 79.
Internet-Draftsare working documents of the Internet Engineering Task Force (IETF), its areas,and its working groups. Note that other groups may also distribute workingdocuments as Internet-Drafts.
Internet-Draftsare draft documents valid for a maximum of six months and may be updated,replaced, or obsoleted by other documents at any time. It is inappropriate touse Internet-Drafts as reference material or to cite them other than as “workin progress.”
The list ofcurrent Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.
The list ofInternet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.
ThisInternet-Draft will expire on February 2, 2008.
Table of Contents
1. Introduction
2. Design Goals
3. Hessian Grammar
4. Serialization
4.1. binary data
4.1.1. Compact: short binary
4.1.2. Binary Examples
4.2. boolean
4.2.1. Boolean Examples
4.3. date
4.3.1. Compact: date in minutes
4.3.2. Date Examples
4.4. double
4.4.1. Compact: double zero
4.4.2. Compact: double one
4.4.3. Compact: double octet
4.4.4. Compact: double short
4.4.5. Compact: double float
4.4.6. Double Examples
4.5. int
4.5.1. Compact: single octet integers
4.5.2. Compact: two octet integers
4.5.3. Compact: three octet integers
4.5.4. Integer Examples
4.6. list
4.6.1. Compact: fixed length list
4.6.2. List examples
4.7. long
4.7.1. Compact: single octet longs
4.7.2. Compact: two octet longs
4.7.3. Compact: three octet longs
4.7.4. Compact: four octet longs
4.7.5. Long Examples
4.8. map
4.8.1. Map examples
4.9. null
4.10. object
4.10.1. Compact: class definition
4.10.2. Compact: object instantiation
4.10.3. Object examples
4.11. ref
4.11.1. Ref Examples
4.12. string
4.12.1. Compact: short strings
4.12.2. String Examples
4.13. type
4.14. Compact: type references
5. Reference Maps
5.1. value reference
5.2. class reference
5.3. type reference
6. Bytecode map
§ Authors' Addresses
§ Intellectual Property and CopyrightStatements
Hessian is adynamically-typed, binary serialization and Web Services protocol designed forobject-oriented transmission.
Hessian isdynamically-typed, compact, and portable across languages.
The Hessianprotocol has the following design goals:
· It must self-describe the serialized types, i.e. not requireexternal schema or interface definitions.
· It must be language-independent, including supporting scriptinglanguages.
· It must be readable or writable in a single pass.
· It must be as compact as possible.
· It must be simple so it can be effectively tested andimplemented.
· It must be as fast as possible.
· It must support Unicode strings.
· It must support 8-bit binary data without escaping or usingattachments.
· It must support encryption, compression, signature, andtransaction context envelopes.
# startingproduction
top ::= value
# 8-bit binarydata split into 64k chunks
binary ::= x41 b1 b0<binary-data> binary # non-final chunk
::= 'B' b1 b0<binary-data> # final chunk
::= [x20-x2f] <binary-data> # binary data of
# length 0-15
::= [x34-x37]<binary-data> # binary dataof
# length 0-1023
# boolean true/false
boolean ::= 'T'
::= 'F'
# definitionfor an object (compact map)
class-def ::= 'C' stringint string*
# time in UTCencoded as 64-bit long milliseconds since
# epoch
date ::= x4a b7 b6b5 b4 b3 b2 b1 b0
::= x4b b3 b2b1 b0 # minutes since epoch
# 64-bit IEEEdouble
double ::= 'D' b7 b6b5 b4 b3 b2 b1 b0
::= x5b # 0.0
::= x5c # 1.0
::= x5d b0 # byte cast to double
# (-128.0 to 127.0)
::= x5e b1b0 # short cast to double
::= x5f b3 b2b1 b0 # 32-bit float cast to double
# 32-bitsigned integer
int ::= 'I' b3 b2b1 b0
::=[x80-xbf] # -x10 to x3f
::= [xc0-xcf]b0 # -x800 to x7ff
::= [xd0-xd7]b1 b0 # -x40000 to x3ffff
# list/vector
list ::= x55 typevalue* 'Z' # variable-length list
::= 'V' type int value* # fixed-length list
::= x57 value*'Z' # variable-length untyped list
::= x58 intvalue* # fixed-length untyped list
::= [x70-77] type value* # fixed-length typed list
::= [x78-7f] value* # fixed-length untyped list
# 64-bitsigned long integer
long ::= 'L' b7 b6b5 b4 b3 b2 b1 b0
::=[xd8-xef] # -x08 to x0f
::= [xf0-xff]b0 # -x800 to x7ff
::= [x38-x3f]b1 b0 # -x40000 to x3ffff
::= x59 b3 b2b1 b0 # 32-bit integer cast to long
# map/object
map ::= 'M' type(value value)* 'Z' # key, value mappairs
::= 'H' (value value)* 'Z' # untyped key, value
# null value
null ::= 'N'
# Objectinstance
object ::= 'O' intvalue*
::= [x60-x6f] value*
# valuereference (e.g. circular trees and graphs)
ref ::= x51int # reference to nth map/list/object
# UTF-8encoded character string split into 64k chunks
string ::= x52 b1 b0<utf8-data> string # non-finalchunk
::= 'S' b1 b0<utf8-data> # string oflength
# 0-65535
::= [x00-x1f]<utf8-data> # string oflength
# 0-31
::= [x30-x34]<utf8-data> # string oflength
# 0-1023
# map/listtypes for OO languages
type ::=string # type name
::= int # type reference
# mainproduction
value ::= null
::= binary
::= boolean
::= class-defvalue
::= date
::= double
::= int
::= list
::= long
::= map
::= object
::= ref
::= string
Figure 1 |
Hessian'sobject serialization has 8 primitive types:
1. raw binary data
2. boolean
3. 64-bit millisecond date
4. 64-bit double
5. 32-bit int
6. 64-bit long
7. null
8. UTF8-encoded string
It has 3recursive types:
1. list forlists and arrays
2. map formaps and dictionaries
3. object forobjects
Finally, it hasone special contruct:
1. ref forshared and circular object references.
Hessian 2.0 has3 internal reference maps:
1. An object/listreference map.
2. An class definitionreference map.
3. A type (classname) reference map.
binary ::= b b1 b0 <binary-data> binary
::= B b1 b0<binary-data>
::= [x20-x2f]<binary-data>
Figure 2 |
Binary data isencoded in chunks. The octet x42 ('B') encodes the final chunk and x62 ('b')represents any non-final chunk. Each chunk has a 16-bit length value.
len = 256 * b1+ b0
Binary datawith length less than 15 may be encoded by a single octet length [x20-x2f].
len = code -0x20
x23 x01 x02 x03 # 3octet data
B x10 x00 .... # 4kfinal chunk of data
b x04 x00 .... # 1knon-final chunk of data
Figure 3 |
boolean ::= T
::= F
Figure 4 |
The octet 'F'represents false and the octet T represents true.
F # false
Figure 5 |
date ::= x4a b7 b6 b5 b4 b3 b2 b1 b0
::= x4b b4 b3 b2 b1b0
Figure 6 |
Daterepresented by a 64-bit long of milliseconds since Jan 1 1970 00:00H, UTC.
4.3.1. Compact: date in minutes
The second formcontains a 32-bit int of minutes since Jan 1 1970 00:00H, UTC.
x4a x00 x00 x00 xd0 x4bx92 x84 xb8 # 09:51:31 May 8, 1998 UTC
Figure 7 |
x4b x4b x92 x0b xa0 # 09:51:00 May 8, 1998 UTC
Figure 8 |
double ::= D b7 b6 b5 b4 b3 b2 b1 b0
::= x5b
::= x5c
::= x5d b0
::= x5e b1 b0
::= x5f b3 b2 b1b0
Figure 9 |
A 64-bit IEEEfloating pointer number.
The double 0.0can be represented by the octet x5b
The double 1.0can be represented by the octet x5c
Doubles between-128.0 and 127.0 with no fractional component can be represented in two octetsby casting the byte value to a double.
value =(double) b0
Doubles between-32768.0 and 32767.0 with no fractional component can be represented in threeoctets by casting the short value to a double.
value =(double) (256 * b1 + b0)
Doubles whichare equivalent to their 32-bit float representation can be represented as the4-octet float and then cast to double.
x5c # 1.0
x5d x00 # 0.0
x5d x80 # -128.0
x5d x7f # 127.0
x5e x00 x00 # 0.0
x5e x80 x00 # -32768.0
x5e x7f xff # 32767.0
D x40 x28 x80 x00 x00 x00 x00 x00 # 12.25
Figure 10 |
int ::= 'I' b3 b2 b1 b0
::= [x80-xbf]
::= [xc0-xcf] b0
::= [xd0-xd7] b1 b0
Figure 11 |
A 32-bit signedinteger. An integer is represented by the octet x49 ('I') followed by the 4octets of the integer in big-endian order.
value = (b3<< 24) + (b2 << 16) + (b1 << 8) + b0;
4.5.1. Compact: single octet integers
Integersbetween -16 and 47 can be encoded by a single octet in the range x80 to xbf.
value = code -0x90
4.5.2. Compact: two octet integers
Integersbetween -2048 and 2047 can be encoded in two octets with the leading byte inthe range xc0 to xcf.
value = ((code- 0xc8) << 8) + b0;
4.5.3. Compact: three octet integers
Integersbetween -262144 and 262143 can be encoded in three bytes with the leading bytein the range xd0 to xd7.
value = ((code- 0xd4) << 16) + (b1 << 8) + b0;
x80 # -16
xbf # 47
xc8 x00 # 0
xc0 x00 #-2048
xc7 x00 # -256
xcf xff # 2047
xd4 x00 x00 # 0
xd0 x00 x00 #-262144
xd7 xff xff #262143
I x00 x00 x00 x00 # 0
I x00 x00 x01 x2c # 300
Figure 12 |
list ::= x55 type value* 'Z' # variable-length list
::= 'V' type intvalue* # fixed-length list
::= x57 value*'Z' # variable-length untyped list
::= x58 intvalue* # fixed-length untyped list
::= [x70-77] typevalue* # fixed-length typed list
::= [x78-7f]value* # fixed-length untyped list
Figure 13 |
An orderedlist, like an array. The two list productions are a fixed-length list and avariable length list. Both lists have a type. The type string may be anarbitrary UTF-8 string understood by the service.
Each list itemis added to the reference list to handle shared and circular elements. See theref element.
Any parserexpecting a list must also accept a null or a shared ref.
The validvalues of type are not specified in this document and may depend on thespecific application. For example, a server implemented in a language withstatic typing which exposes an Hessian interface can use the type informationto instantiate the specific array type. On the other hand, a server written ina dynamicly-typed language would likely ignore the contents of type entirelyand create a generic array.
4.6.1. Compact: fixed length list
Hessian 2.0allows a compact form of the list for successive lists of the same type wherethe length is known beforehand. The type and length are encoded by integers,where the type is a reference to an earlier specified type.
Serialization of a typed int array: int[] = {0, 1}
V #fixed length, typed list
x04 [int # encoding of int[] type
x92 # length = 2
x90 # integer 0
x91 # integer 1
Figure 14 |
untyped variable-length list = {0, 1}
x57 #variable-length, untyped
x90 # integer 0
x91 # integer 1
Z
Figure 15 |
x72 #typed list length=2
x04 [int # type for int[] (save as type #0)
x90 # integer 0
x91 # integer 1
x73 #typed list length = 3
x90 # type reference to int[](integer #0)
x92 # integer 2
x93 # integer 3
x94 # integer 4
Figure 16 |
long ::= L b7 b6 b5 b4 b3 b2 b1 b0
::= [xd8-xef]
::= [xf0-xff] b0
::= [x38-x3f] b1 b0
::= x4c b3 b2 b1 b0
Figure 17 |
A 64-bit signedinteger. An long is represented by the octet x4c ('L' ) followed by the 8-bytesof the integer in big-endian order.
4.7.1. Compact: single octet longs
Longs between-8 and 15 are represented by a single octet in the range xd8 to xef.
value = (code -0xe0)
4.7.2. Compact: two octet longs
Longs between-2048 and 2047 are encoded in two octets with the leading byte in the range xf0to xff.
value = ((code- 0xf8) << 8) + b0
4.7.3. Compact: three octet longs
Longs between-262144 and 262143 are encoded in three octets with the leading byte in therange x38 to x3f.
value = ((code- 0x3c) << 16) + (b1 << 8) + b0
4.7.4. Compact: four octet longs
Longs betweenwhich fit into 32-bits are encoded in five octets with the leading byte x4c.
value = (b3<< 24) + (b2 << 16) + (b1 << 8) + b0
xd8 # -8
xef # 15
xf8 x00 # 0
xf0 x00 #-2048
xf7 x00 #-256
xff xff #2047
x3c x00 x00 # 0
x38 x00 x00 #-262144
x3f xff xff #262143
x4c x00 x00 x00 x00 # 0
x4c x00 x00 x01 x2c #300
L x00 x00 x00 x00 x00 x00 x01 x2c # 300
Figure 18 |
map ::= M type(value value)* Z
Figure 19 |
Representsserialized maps and can represent objects. The type element describes the typeof the map.
The type may beempty, i.e. a zero length. The parser is responsible for choosing a type if oneis not specified. For objects, unrecognized keys will be ignored.
Each map isadded to the reference list. Any time the parser expects a map, it must also beable to support a null or a ref.
The type ischosen by the service.
map = new HashMap();
map.put(new Integer(1), "fee");
map.put(new Integer(16), "fie");
map.put(new Integer(256), "foe");
---
H # untyped map(HashMap for Java)
x91 # 1
x03 fee # "fee"
xa0 # 16
x03 fie # "fie"
xc9 x00 # 256
x03 foe # "foe"
Z
Figure 20 |
Map Representation of a Java Object
public class Car implements Serializable {
String color = "aquamarine";
String model ="Beetle";
int mileage = 65536;
}
---
M
x13com.caucho.test.Car # type
x05 color # color field
x0a aquamarine
x05 model # model field
x06 Beetle
x07 mileage # mileage field
I x00 x01 x00 x00
Z
Figure 21 |
null ::= N
Figure 22 |
Null representsa null pointer.
The octet 'N'represents the null value.
class-def ::= 'C' stringint string*
object ::= 'O' intvalue*
::= [x60-x6f]value*
Figure 23 |
4.10.1. Compact: class definition
Hessian 2.0 hasa compact object form where the field names are only serialized once. Followingobjects only need to serialize their values.
The objectdefinition includes a mandatory type string, the number of fields, and thefield names. The object definition is stored in the object definition map andwill be referenced by object instances with an integer reference.
4.10.2. Compact: object instantiation
Hessian 2.0 hasa compact object form where the field names are only serialized once. Followingobjects only need to serialize their values.
The objectinstantiation creates a new object based on a previous definition. The integervalue refers to the object definition.
class Car {
String color;
String model;
}
out.writeObject(new Car("red","corvette"));
out.writeObject(new Car("green", "civic"));
---
C # object definition (#0)
x0b example.Car # type is example.Car
x92 # two fields
x05 color # color field name
x05 model # model field name
O # object def (long form)
x90 # object definition #0
x03 red # color field value
x08 corvette # model field value
x60 # object def #0 (short form)
x05 green # color field value
x05 civic # model field value
Figure 24 |
RED,
GREEN,
BLUE,
}
out.writeObject(Color.RED);
out.writeObject(Color.GREEN);
out.writeObject(Color.BLUE);
out.writeObject(Color.GREEN);
---
C # class definition #0
x0b example.Color # type is example.Color
x91 # one field
x04 name # enumeration field is"name"
x60 # object #0 (class def #0)
x03 RED # RED value
x60 # object #1 (class def #0)
x90 # object definition ref #0
x05 GREEN # GREEN value
x60 # object #2 (class def #0)
x04 BLUE # BLUE value
x51 x91 # object ref #1, i.e. Color.GREEN
Figure 25 |
ref ::= x51 int
Figure 26 |
An integerreferring to a previous list, map, or object instance. As each list, map orobject is read from the input stream, it is assigned the integer position inthe stream, i.e. the first list or map is '0', the next is '1', etc. A laterref can then use the previous object. Writers MAY generate refs. Parsers MUSTbe able to recognize them.
ref can referto incompletely-read items. For example, a circular linked-list will refer tothe first link before the entire list has been read.
A possibleimplementation would add each map, list, and object to an array as it is read.The ref will return the corresponding value from the array. To support circularstructures, the implementation would store the map, list or object immediately,before filling in the contents.
Each map orlist is stored into an array as it is parsed. ref selects one of the storedobjects. The first object is numbered '0'.
list = new LinkedList();
list.data = 1;
list.tail = list;
---
C
x0a LinkedList
x92
x04 head
x04 tail
o x90 # objectstores ref #0
x91 # data = 1
x51 x90 # next field refers to itself, i.e. ref #0
Figure 27 |
ref only refersto list, map and objects elements. Strings and binary data, in particular, willonly share references if they're wrapped in a list or map.
string ::= x52 b1 b0 <utf8-data> string
::= S b1 b0<utf8-data>
::= [x00-x1f]<utf8-data>
::= [x30-x33] b0<utf8-data>
Figure 28 |
A 16-bitunicode character string encoded in UTF-8. Strings are encoded in chunks. x53('S') represents the final chunk and x52 ('R') represents any non-final chunk.Each chunk has a 16-bit unsigned integer length value.
The length isthe number of 16-bit characters, which may be different than the number ofbytes.
String chunksmay not split surrogate pairs.
4.12.1. Compact: short strings
Strings withlength less than 32 may be encoded with a single octet length [x00-x1f].
value = code
x05 hello #"hello"
x01 xc3 x83 #"\u00c3"
S x00 x05 hello #"hello" in long form
x52 x00 x07 hello, #"hello, world" split into two chunks
x05 world
Figure 29 |
type ::= string
::= int
Figure 30 |
A map or list includesa type attribute indicating the type name of the map or list forobject-oriented languages.
Each type isadded to the type map forfuture reference.
4.14. Compact: type references
Repeated typestrings MAY use the type map torefer to a previously used type. The type reference is zero-based over all thetypes encountered during parsing.
Hessian 2.0 has3 internal reference maps:
1. An map/object/list reference map.
2. An class definition map.
3. A type (class name) map.
The valuereference map lets Hessian support arbitrary graphs, and recursive and circulardata structures.
The class andtype maps improve Hessian efficiency by avoiding repetition of common stringdata.
Hessiansupports arbitrary graphs by adding list, object,and map asit encounters them in the bytecode stream.
Parsers MUSTstore each list, object and map in the reference map as they are encountered.
The storedobjects can be used with a ref bytecode.
Each objectdefinition is automatically added to the class-map. ParsersMUST add a class definition to the class map as each is encountered. Followingobject instances will refer to the defined class.
The type stringsfor map and list valuesare stored in a type map for reference.
Parsers MUSTadd a type string to the type map as each is encountered.
Hessian isorganized as a bytecode protocol. A Hessian reader is essentially a switchstatement on the initial octet.
x00 - x1f # utf-8string length 0-32
x20 - x2f # binarydata length 0-16
x30 - x33 # utf-8string length 0-1023
x34 - x37 # binarydata length 0-1023
x38 - x3f #three-octet compact long (-x40000 to x3ffff)
x40 # reserved(expansion/escape)
x41 # 8-bitbinary data non-final chunk ('A')
x42 # 8-bitbinary data final chunk ('B')
x43 # objecttype definition ('C')
x44 # 64-bitIEEE encoded double ('D')
x45 # reserved
x46 # booleanfalse ('F')
x47 # reserved
x48 # untypedmap ('H')
x49 # 32-bitsigned integer ('I')
x4a # 64-bit UTCmillisecond date
x4b # 32-bit UTCminute date
x4c # 64-bitsigned long integer ('L')
x4d # map withtype ('M')
x4e # null ('N')
x4f # objectinstance ('O')
x50 # reserved
x51 # referenceto map/list/object - integer ('Q')
x52 # utf-8string non-final chunk ('R')
x53 # utf-8string final chunk ('S')
x54 # booleantrue ('T')
x55 #variable-length list/vector ('U')
x56 #fixed-length list/vector ('V')
x57 #variable-length untyped list/vector ('W')
x58 #fixed-length untyped list/vector ('X')
x59 # longencoded as 32-bit int ('Y')
x5a # list/mapterminator ('Z')
x5b # double 0.0
x5c # double 1.0
x5d # doublerepresented as byte (-128.0 to 127.0)
x5e # doublerepresented as short (-32768.0 to 327676.0)
x5f # doublerepresented as float
x60 - x6f # objectwith direct type
x70 - x77 # fixed listwith direct length
x78 - x7f # fixeduntyped list with direct length
x80 - xbf # one-octetcompact int (-x10 to x3f, x90 is 0)
xc0 - xcf # two-octetcompact int (-x800 to x7ff)
xd0 - xd7 #three-octet compact int (-x40000 to x3ffff)
xd8 - xef # one-octetcompact long (-x8 to xf, xe0 is 0)
xf0 - xff # two-octetcompact long (-x800 to x7ff, xf8 is 0)
Figure 31 |
Authors' Addresses
| Scott Ferguson |
| Caucho Technology Inc. |
| P.O. Box 9001 |
| La Jolla, CA 92038 |
| USA |
Email: | |
|
|
| Emil Ong |
| Caucho Technology Inc. |
| P.O. Box 9001 |
| La Jolla, CA 92038 |
| USA |
Email: |
Full Copyright Statement
Copyright © TheIETF Trust (2007).
This documentis subject to the rights, licenses and restrictions contained in BCP 78,and except as set forth therein, the authors retain all their rights.
This documentand the information contained herein are provided on an “AS IS” basis and THECONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS OR IS SPONSORED BY (IF ANY),THE INTERNET SOCIETY, THE IETF TRUST AND THE INTERNET ENGINEERING TASK FORCEDISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANYWARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS ORANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Intellectual Property
The IETF takesno position regarding the validity or scope of any Intellectual Property Rightsor other rights that might be claimed to pertain to the implementation or useof the technology described in this document or the extent to which any licenseunder such rights might or might not be available; nor does it represent thatit has made any independent effort to identify any such rights. Information onthe procedures with respect to rights in RFC documents can be found inBCP 78 and BCP 79.
Copies of IPRdisclosures made to the IETF Secretariat and any assurances of licenses to bemade available, or the result of an attempt made to obtain a general license orpermission for the use of such proprietary rights by implementers or users ofthis specification can be obtained from the IETF on-line IPR repository at http://www.ietf.org/ipr.
The IETFinvites any interested party to bring to its attention any copyrights, patentsor patent applications, or other proprietary rights that may cover technologythat may be required to implement this standard. Please address the informationto the IETF at ietf-ipr@ietf.org.