kmeanspython例子_python K-Means 实例二则 | 学步园

最新推荐文章于 2023-02-17 15:07:59 发布

weixin_39795325

最新推荐文章于 2023-02-17 15:07:59 发布

阅读量100

点赞数

文章标签： kmeanspython例子

I'm working on a little photography website for my Dad and thought it would be neat to extract color information from photographs. I tried a couple of different approaches before finding one that works pretty well. This approach uses k-means

clustering to cluster the pixels in groups based on their color. The center of those resulting clusters are then the "dominant" colors. k-means is a great fit for this problem because it is (usually) fast. It has the caveat of requiring you to specify

up-front how many clusters you want -- I found that it works well when I specified around 3.

A warning

I'm no expert on data-mining -- almost all my experience comes from reading Toby Segaran's excellent book Programming Collective Intelligence . In one of

the first chapters Toby covers clustering algorithms, including a nice treatment of k-means, so if you want to really learn from an expert I'd suggest picking up a copy. You won't be disappointed.

How it works

The way I understand it to work is you start with a bunch of data points. For simplicity let's say they're numbers on a number-line. You want to group the numbers into "k" clusters, so pick "k" points randomly from the data to use as your "clusters".

Now loop over every point in the data and calculate its distance to each of the "k" clusters. Find the nearest cluster and associate that point with the cluster. When you've looped over all the points they should all be assigned to one of the "k" clusters.

Now, for each cluster recalculate its center by averaging the distances of all the associated points and start over.

When the centers stop moving very much you can stop looping. You will end up with something like this -- the points are colored based on what "cluster" they are in and the dark-black circles indicate the centers of each cluster.

Applying it to photographs

The neat thing about this algorithm is, since it relies only on a simple distance calculation, you can extend it out to multi-dimensional data. Color is often represented using 3 channels, Red, Green, and Blue. So what I did was treat all the pixels in the

image like points on a 3-dimensional space. That's all there was to it!

I made a few optimizations along the way:

resize the image down to 200x200 or so using PIL

instead of storing "duplicate" points, store a count with each -- saves on calculations

Looking at some results

The results:

The source code

Below is the source code. It requires PIL to resize the image down to 200x200 and to extract the colors/counts. The "colorz" function is the one that returns the actual color codes for a filename.

from collections import namedtuple

from math import sqrt

import random

try:

import Image

except ImportError:

from PIL import Image

Point = namedtuple('Point', ('coords', 'n', 'ct'))

Cluster = namedtuple('Cluster', ('points', 'center', 'n'))

defget_points(img):

points = []

w, h = img.size

for count, color in img.getcolors(w * h):

points.append(Point(color, 3, count))

return points

rtoh = lambda rgb: '#%s' % ''.join(('%02x' % p for p in rgb))

defcolorz(filename, n=3):

img = Image.open(filename)

img.thumbnail((200, 200))

w, h = img.size

points = get_points(img)

clusters = kmeans(points, n, 1)

rgbs = [map(int, c.center.coords) for c in clusters]

return map(rtoh, rgbs)

defeuclidean(p1, p2):

return sqrt(sum([

(p1.coords[i] - p2.coords[i]) ** 2 for i in range(p1.n)

]))

defcalculate_center(points, n):

vals = [0.0 for i in range(n)]

plen = 0

for p in points:

plen += p.ct

for i in range(n):

vals[i] += (p.coords[i] * p.ct)

return Point([(v / plen) for v in vals], n, 1)

defkmeans(points, k, min_diff):

clusters = [Cluster([p], p, p.n) for p in random.sample(points, k)]

while 1:

plists = [[] for i in range(k)]

for p in points:

smallest_distance = float('Inf')

for i in range(k):

distance = euclidean(p, clusters[i].center)

if distance < smallest_distance:

smallest_distance = distance

idx = i

plists[idx].append(p)

diff = 0

for i in range(k):

old = clusters[i]

center = calculate_center(plists[i], old.n)

new = Cluster(plists[i], center, old.n)

clusters[i] = new

diff = max(diff, euclidean(old.center, new.center))

if diff < min_diff:

break

return clusters

Playing with it in the browser

I ported the code over to JavaScript -- let me tell you, its pretty rough, but it works and is fast. If you'd like to take a look at a live example, check out:

http://charlesleifer.com/static/colors/ -- you can view the source to see the js version, but basically it is just using the HTML5 canvas and its getImageDatamethod.

Thanks for reading

Thanks for reading, I hope you found this post interesting. I am sure this is not the only approach so if you have other ideas please feel free to leave

a comment orcontact me directly.