Data Structure Lecture Note (Week 2, Lecture 6)

最新推荐文章于 2021-08-02 19:54:52 发布

ZJ_Frank

最新推荐文章于 2021-08-02 19:54:52 发布

阅读量155

点赞数

分类专栏：数据结构与算法

本文链接：https://blog.csdn.net/ZJ_11701/article/details/106867723

版权

数据结构与算法专栏收录该内容

28 篇文章 0 订阅

订阅专栏

Homework 1 is very important to understand. In the final exam, many questions are just of such forms.

Amortized analysis practices: Binary Counter

Let

A be anarray of size K, with 0, 1 in eah slot, A is a counter and the count is the binary stored in A. (Assume right most is the lowest end).

e.g. A = [0,0,0,0,01] then the counter = 1. If A=[0,1,0,1,0] then the counter is 2^3 + 2^1 = 10

The counter = $\sum_{0\le i\le K-1} 2^i A[i]$

A.tick() is an op taht tick the counter up 1. N % 2^K . If binary exceeds, A will continue ticking the available bits

Naive analysis yield $O (K N)$

But carefully observation says, A[0] with N operations, tick every time, A[1] with N operations, will tick N/2 times, A[2] with N operations will tick N/4 times …

So, in total,

$\# flips = \sum_{0\le i \le K-1} N/2^i = O(N)$

Potential function method

Let $D_j$ be the state of the data structure at j’th operation

Let P(D_j) be a function (势能函数) from the state of the data structure to some real value

Let $x_j$ be the actual cost of j’th operation

Let $d_j = c_j + P(D_j)-P(D_{j-1})$ then
$\sum_j d_j = \sum_j (c_j + P(D_j) - P(D_{j-1})) = (\sum_j c_j) + P(D_N) - P(D_0)$
The idea is $d_i$ is easier to calculate than $c_j$ , and $P(D_N), P(D_0)$ are easy to find.

Dynamic Programming

Closest pair of points in 2D

problem: given N points with coordinate (x_i, y_i) Find the closet 2 points. (naive algorithm is O(n^2))

How about a divide and conquer approach?

There is a way to O(N(logN)^2). Which could be improved to O(N log N)

Finding the k-th largest number in A

Naive algorithm is O(N logN)

There is a smart algorithm gives $O (N)$ page 220 MIT book

suppose we are going to find median for N - 25 numbers

First break them into groups of 5 numbers, in general, N/5 groups. Suppose each is sorted

It’s easy to figure out the median for each group

Pick the median out. get N/5 numbers. Then find the median of N/5 numbers. Take it as a pivot, divide the entire array by the pivot (just like in quick sort). Say left A, right B

Claim: $∣ A ∣ o r ∣ B ∣ < 0.7 N$

Suppose F(N) is running time of finding the k-th number in an array of size N

we have

F(N) = F(N/5) + O(N) + F(7/10 N + 6)

Types of algorithms

Try to finish a task
- e.g. sorting, simulating a stack, evaluate the repression, find the median ,etc.
Yes/no problem, decision problem
- Whether there is a solution to sth.
Optimization Problems
- Solved by optimization algorithms

Example: knapsack problem: I have a bag of X volume, and N items, each item has a weight and value, given by array W and V (all numbers are int)

(Input size: N, log X.----> O(N) + log X)

Optimization problem, try to optimiza a value/goal with constaints

yes/no decision version: whether there is a way to pack the backpack to at least value Y

Claim: if you can solve the yes/no problem with a polynomial time algorithm A, you can solve the optimization problem with polynomial time.

pf: do a binary search, it takes log(sum(value)) * T(A)

Methodology to deal with optimization problems

Greedy algo. : easy to implement, fast running; usually not optimal solutions [example: sort the items by value/weight, from large to small. pack from the bset item until it can’t fit in (O(Nlog N))]

Searching algo. : pro: can find the optimal solution; con: usually slow. [example: ]