Introduction to Algorithms (Hashing with Chaining)

最新推荐文章于 2020-10-17 10:56:37 发布

长安一片月噢

最新推荐文章于 2020-10-17 10:56:37 发布

阅读量306

点赞数

Maintain a set of items each with a key

Dictionaries are perhaps the most popular data structure in CS

Less obvious, using hashing techniques:

built into most modern programming languages (Python, Perl, Ruby, JavaScript, Java, C++, C#, . . . )
e.g. best docdist code: word counts & inner product
implement databases: (DB HASH in Berkeley DB)
- English word → definition (literal dict.)
- English words: for a spelling correction
- word → all web pages containing that word
- username → account object
compilers & interpreters: names → variables
network routers: IP address → wire
network server: port number → socket/app.
virtual memory: virtual address → physical
substring search (grep, Google)
string commonalities (DNA)
file or directory synchronization
cryptography: file transfer & identification

Solution to 1: “prehash” keys to integers

Solution to 2: hashing

the linked list of colliding items in each slot of the table

Analysis

division method: h(k) = k mod m
multiplication method: $h(k) = [(ak)mod2^{w}]>>(w-r),m=2^{r}$
universal hashing:, where a and b are random ∈ {0, 1, . . . p−1} and p is a large prime (> |U|).