Canonicalize NFL team brandsthis is really a machine learning classification Ndamukong Suh NFL Jersey problem www.footballlionsshop.com/lions-calvin-johnson-jersey-c-3.html but I imagine there's a perfectly good quickanddirty way to do Reggie Bush Women's Jersey it. I want to map a string talking about an NFL team, much like "sf" or simply "49ers" and "bay area 49ers" or even a "SF fortyniners, To a canonical www.footballlionsshop.com name for they. (There are 32 NFL teams so it really just means searching out the nearest of 32 bins to put Ziggy Ansah Youth Jersey a given string in.)
I should also add that in case anyone knows of a source of data containing both moneyline Vegas odds as well as actual game outcomes for the past few years of NFL games, That would obviate the need for this. The reason I need the canonicalization is to match these two disparate data sets, One with odds www.footballlionsshop.com/lions-matthew-stafford-jersey-c-4.html and one with outcome:
Ideas for improve, more and more parsable, options for data are very welcome!
inserted: The substring matching idea might well suffice for this data; pleased, )! Could it be www.footballlionsshop.com/lions-nick-fairley-jersey-c-8.html made somewhat robust by picking the team name with the nearest levenshtein distance?
http://www.i-pharm.org/se4/index.php/blogs/2033/2136/sebby-s-mother-and-a-workforce-a
http://rrrus.ru/dsite/index.php?q=node/179662
http://remoclass.com/index.php?do=/blog/32398/wells-has-been-listed-as-suspicious-on-eight-separate-occasions-besides-the/
来自 “ ITPUB博客 ” ,链接:http://blog.itpub.net/29354992/viewspace-1061606/,如需转载,请注明出处,否则将追究法律责任。
转载于:http://blog.itpub.net/29354992/viewspace-1061606/