The Foundation Collection Classes

最新推荐文章于 2024-08-05 10:05:03 发布

weixin_33877092

最新推荐文章于 2024-08-05 10:05:03 发布

阅读量140

点赞数

文章标签： runtime c/c++ python

原文链接：https://my.oschina.net/grant110/blog/186850

版权

2019独角兽企业重金招聘Python工程师标准>>>

NSArray, NSSet, NSOrderedSet, and NSDictionary

Foundation’s collection classes are the basic building blocks of every Mac/iOS application. In this article, we’re going to have an in-depth look at both the “old” (NSArray,NSSet) and the “new” (NSMapTable,NSHashTable,NSPointerArray) classes, explore detailed performance of each of them, and discuss when to use what.

Author Note: This article contains several benchmark results, however they are by no means meant to be exact and there’s no variation/multiple runs applied. Their goal is to give you a direction of what’s faster and general runtime statistics. All tests have been made on an iPhone 5s with Xcode 5.1b1 and iOS 7.1b1 and a 64-bit binary. Compiler settings were release built with -Ofast. Vectorize loops and unroll loops (default settings) have both been disabled .

Big O Notation

First, we need some theoretical background. Performance is usually described with the Big O Notation. It defines the limiting behavior of a function and is often used to characterize algorithms on their performance. O defines the upper bound of the growth rate of the function. To see just how big the difference is, see commonly used O notations and the number of operations needed.

For example, if you sort an array with 50 elements, and your sorting algorithm has a complexity of O(n^2), there will be 2,500 operations necessary to complete the task. Furthermore, there’s also overhead in internal management and calling that method - so it’s 2,500 operations times constant. O(1) is the ideal complexity, meaning constant time. Good sorting algorithms usually need O(n*log n) time.

Mutability

Most collection classes exist in two versions: mutable and immutable (default). This is quite different than most other frameworks and feels a bit weird at first. However, others are now adopting this as well: .NET introduced immutable collections as an official extension only a few months ago.

What’s the big advantage? Thread safety. Immutable collections are fully thread safe and can be iterated from multiple threads at the same time, without any risk of mutation exceptions. Your API should never expose mutable collections.

Of course there’s a cost when going from immutable and mutable and back - the object has to be copied twice, and all objects within will be retained/released. Sometimes it’s more efficient to hold an internal mutable collection and return a copied, immutable object on access.

Unlike other frameworks, Apple does not provide thread-safe mutable variants of its collection classes, with the exception ofNSCache- which really doesn’t count since it’s not meant to be a generic container. Most of the time, you really don’t want synchronization at the collection level, but rather higher up in the hierarchy. Imagine some code that checks for the existence of a key in a dictionary, and depending on the result, sets a new key or returns something else - you usually want to group multiple operations together, and a thread-safe mutable variant would not help you here.

There are some valid use cases for a synchronized, thread-safe mutable collection, and it takes only a few lines to build something like that via subclassing and composition, e.g. for NSDictionary or NSArray.

Notably, some of the more modern collection classes likeNSHashTable,NSMapTable, andNSPointerArrayare mutable by default and don’t have immutable counterparts. They are meant for internal class use, and a use case where you would want those immutable would be quite unusual.

NSArray

NSArraystores objects as ordered collections and is probably the most-used collection class. That’s why it even got its own syntactic sugar syntax with the shorthand-literal@[...], which is much shorter than the old[NSArray arrayWithObjects:..., nil].

NSArrayimplementsobjectAtIndexedSubscript:and thus we can use a C-like syntax likearray[0]instead of the older[array objectAtIndex:0].

Performance Characteristics

There’s a lot more toNSArraythan you might think, and it uses a variety of internal variants depending on how many objects are being stored. The most interesting part is that Apple doesn’t guarantee O(1) access time on individual object access - as you can read in the note about Computational Complexity in the CFArray.h CoreFoundation header:

The access time for a value in the array is guaranteed to be at worst O(lg N) for any implementation, current and future, but will often be O(1) (constant time). Linear search operations similarly have a worst case complexity of O(Nlg N), though typically the bounds will be tighter, and so on. Insertion or deletion operations will typically be linear in the number of values in the array, but may be O(Nlg N) clearly in the worst case in some implementations. There are no favored positions within the array for performance; that is, it is not necessarily faster to access values with low indices, or to insert or delete values with high indices, or whatever.

When measuring, it turns out thatNSArrayhas some additional interesting performance characteristics. Inserting/deleting elements at the beginning/end is usually an O(1) operation, where random insertion/deletion usually will be O(N).

Useful Methods

Most methods ofNSArrayuseisEqual:to check against other objects (likecontainsObject:). There’s a special method namedindexOfObjectIdenticalTo:that goes down to pointer equality, and thus can speed up searching for objects a lot - if you can ensure that you’re searching within the same set.

With iOS 7, we finally got a publicfirstObjectmethod, which joinslastObject, and both simply returnnilfor an empty array - regular access would throw anNSRangeException.

There’s a nice detail about the construction of (mutable) arrays that can be used to save code. If you are creating a mutable array from a source that might be nil, you usually have some code like this:

NSMutableArray *mutableObjects = [array mutableCopy]; if (!mutableObjects) {
    mutableObjects = [NSMutableArray array];
}

or via the more concise ternary operator:

NSMutableArray *mutableObjects = [array mutableCopy] ?: [NSMutableArray array];

The better solution is to use the fact thatarrayWithArray:will return an object in either way - even if the source array is nil:

NSMutableArray *mutableObjects = [NSMutableArray arrayWithArray:array];

The two operations are almost equal in performance. Usingcopyis a bit faster, but then again, it’s highly unlikely that this will be your app bottleneck. Side Note: Please don’t use[@[] mutableCopy]. The classic[NSMutableArray array]is a lot better to read.

Reversing an array is really easy:array.reverseObjectEnumerator.allObjects. We’ll use the fact thatreverseObjectEnumeratoris pre-supplied and everyNSEnumeratorimplementsallObjects, which returns a new array. And while there’s no nativerandomObjectEnumerator, you can write a custom enumerator that shuffles the array or use some great open source options.

Sorting Arrays

There are various ways to sort an array. If it’s string based,sortedArrayUsingSelector:is your first choice:

NSArray *array = @[@"John Appleseed", @"Tim Cook", @"Hair Force One", @"Michael Jurewitz"]; NSArray *sortedArray = [array sortedArrayUsingSelector:@selector(localizedCaseInsensitiveCompare:)];

This works equally well for number-based content, sinceNSNumberimplementscompare:as well:

NSArray *numbers = @[@9, @5, @11, @3, @1]; NSArray *sortedNumbers = [numbers sortedArrayUsingSelector:@selector(compare:)];

For more control, you can use the function-pointer-based sorting methods:

- (NSData *)sortedArrayHint;
- (NSArray *)sortedArrayUsingFunction:(NSInteger (*)(id, id, void *))comparator 
                              context:(void *)context;
- (NSArray *)sortedArrayUsingFunction:(NSInteger (*)(id, id, void *))comparator 
                              context:(void *)context hint:(NSData *)hint;

Apple added an (opaque) way to speed up sorting usingsortedArrayHint.

The hinted sort is most efficient when you have a large array (N entries) that you sort once and then change only slightly (P additions and deletions, where P is much smaller than N). You can reuse the work you did in the original sort by conceptually doing a merge sort between the N “old” items and the P “new” items. To obtain an appropriate hint, you usesortedArrayHintwhen the original array has been sorted, and keep hold of it until you need it (when you want to re-sort the array after it has been modified).

Since blocks are around, there are also the newer block-based sorting methods:

- (NSArray *)sortedArrayUsingComparator:(NSComparator)cmptr;
- (NSArray *)sortedArrayWithOptions:(NSSortOptions)opts 
                    usingComparator:(NSComparator)cmptr;

Performance-wise, there’s not much difference between the different methods. Interestingly, the selector-based approach is actually the fastest. You’ll find the source code the benchmarks used here on GitHub.:

Sorting 1000000 elements. selector: 4947.90[ms] function: 5618.93[ms] block: 5082.98[ms].

Binary Search

NSArrayhas come with built-in binary search since iOS 4 / Snow Leopard:

typedef NS_OPTIONS(NSUInteger, NSBinarySearchingOptions) {
        NSBinarySearchingFirstEqual     = (1UL << 8),
        NSBinarySearchingLastEqual      = (1UL << 9),
        NSBinarySearchingInsertionIndex = (1UL << 10),
};

- (NSUInteger)indexOfObject:(id)obj 
              inSortedRange:(NSRange)r 
                    options:(NSBinarySearchingOptions)opts 
            usingComparator:(NSComparator)cmp;

Why would you want to use this? Methods likecontainsObject:andindexOfObject:start at index 0 and search every object until the match is found - they don’t require the array to be sorted but have a performance characteristic of O(n). Binary search, on the other hand, requires the array to be sorted, but only needs O(log n) time. Thus, for one million entries, binary search requires, at most, 21 comparisons, while the naive linear search would require an average of 500,000 comparisons.

Here’s a simple benchmark of just how much faster binary search is:

Time to search for 1000 entries within 1000000 objects. Linear: 54130.38[ms]. Binary: 7.62[ms]

For comparison, the search for a specific index withNSOrderedSettook 0.23 ms - that’s more than 30 times faster, even compared to binary search.

Keep in mind that sorting is expensive as well. Apple uses merge sort, which takes O(n*log n), so if you just have to callindexOfObject:once, there’s no need for binary search.

With specifyingNSBinarySearchingInsertionIndex, you can find the correct insertion index to keep an already sorted array sorted after inserting new elements.

Enumeration and Higher-Order Messaging

For a benchmark, we look at a common use case. Filter elements from an array into another array. This tests both the various enumeration ways, as well as the APIs specific to filtering:

// First variant, using `indexesOfObjectsWithOptions:passingTest:`. NSIndexSet *indexes = [randomArray indexesOfObjectsWithOptions:NSEnumerationConcurrent 
                                                   passingTest:^BOOL(id obj, NSUInteger idx, BOOL *stop) { return testObj(obj);
}]; NSArray *filteredArray = [randomArray objectsAtIndexes:indexes]; // Filtering using predicates (block-based or text)  NSArray *filteredArray2 = [randomArray filteredArrayUsingPredicate:[NSPredicate predicateWithBlock:^BOOL(id obj, NSDictionary *bindings) { return testObj(obj);
}]]; // Block-based enumeration  NSMutableArray *mutableArray = [NSMutableArray array];
[randomArray enumerateObjectsUsingBlock:^(id obj, NSUInteger idx, BOOL *stop) { if (testObj(obj)) {
        [mutableArray addObject:obj];
    }
}]; // Classic enumeration NSMutableArray *mutableArray = [NSMutableArray array]; for (id obj in randomArray) { if (testObj(obj)) {
        [mutableArray addObject:obj];
    }
} // Using NSEnumerator, old school. NSMutableArray *mutableArray = [NSMutableArray array]; NSEnumerator *enumerator = [randomArray objectEnumerator]; id obj = nil; while ((obj = [enumerator nextObject]) != nil) { if (testObj(obj)) {
        [mutableArray addObject:obj];
    }
} // Using objectAtIndex: (via subscripting) NSMutableArray *mutableArray = [NSMutableArray array]; for (NSUInteger idx = 0; idx < randomArray.count; idx++) { id obj = randomArray[idx]; if (testObj(obj)) {
        [mutableArray addObject:obj];
    }
}

Enumeration Method / Time [ms]	10.000.000 elements	10.000 elements
indexesOfObjects:, concurrent	1844.73	2.25
NSFastEnumeration(for in)	3223.45	3.21
indexesOfObjects:	4221.23	3.36
enumerateObjectsUsingBlock:	5459.43	5.43
objectAtIndex:	5282.67	5.53
NSEnumerator	5566.92	5.75
filteredArrayUsingPredicate:	6466.95	6.31

To better understand the performance measured here, we first have to look at how the array is enumerated.

indexesOfObjectsWithOptions:passingTest:has to call a block each time and thus is slightly less efficient than the classical for-based enumeration that uses theNSFastEnumerationtechnique. However, if we enable concurrent enumeration on the former, then it wins by a wide margin, and is almost twice as fast. Which makes sense, considering that the iPhone 5s has two cores. What’s not visible here is thatNSEnumerationConcurrentonly makes sense for a large number of objects - if your data set is small, it really doesn’t matter much what method you are going to use. Even worse, the additional thread management overhead forNSEnumerationConcurrentwill actually make results slower than without.

The real “loser” here isfilteredArrayUsingPredicate:.NSPredicatestill has a reason to be mentioned here, since one can write quite sophisticated expressions, especially with the non-block-based variant. People who use Core Data should be familiar with that.

For completeness, we also added a benchmark usingNSEnumerator- however there really is no reason to use this anymore. While it is surprisingly fast (still faster than using theNSPredicate-based filtering), it certainly has more runtime overhead than fast enumeration - nowadays it only exists for backward compatibility. Even a non-optimized access for viaobjectAtIndex:is faster here.

NSFastEnumeration

Apple added NSFastEnumeration in OS X 10.5 and it has been in iOS ever since the first release. Before that, there wasNSEnumeration, which returned one element at a time, and thus as quite a runtime overhead with each iteration. With fast enumeration, Apple returns a chunk of data withcountByEnumeratingWithState:objects:count:. The chunk is parsed as a C array ofids. This is where the additional speed comes from; iterating a C array is much faster, and can potentially be even further optimized by the compiler. Manually implementing fast enumeration is quite tricky, so Apple’s FastEnumerationSample is a good starting point, and there’s also an excellent article by Mike Ash on this topic.

Should I Use arrayWithCapacity:?

When initializingNSArray, you can optionally specify the expected count. When benchmarking this, it turns out that there is no difference in performance - the measured timings are almost equal and within the statistical uncertainty. A little birdie at Apple told me that this hint is indeed not used. However, usingarrayWithCapacity:can still be useful, as it can help understanding the code as part of an implicit documentation:

Adding 10.000.000 elements to NSArray. no count 1067.35[ms] with count: 1083.13[ms].

Subclassing Notes

There is rarely a reason why you would want to subclass the basic collection classes. Most of the time, the better solution is going down to CoreFoundation level and using custom callbacks to customize the behavior.

To create a case-insensitive dictionary, one could subclassNSDictionaryand write custom accessors that always lowercase (or uppercase) the string, and similar changes for storing. The better and faster solution is to instead provide a different set ofCFDictionaryKeyCallBackswhere you can provide customhashandisEqual:callbacks. You’ll find an example in this gist. The beauty is that - thanks to toll-free bridging - it’s still a simple dictionary and can be consumed by any API that takes anNSDictionary.

One example where a subclass is useful is the use case for an ordered dictionary. .NET has aSortedDictionary, Java hasTreeMap, C++ hasstd::map. While you could use C++’s STL container, you won’t get any automatedretain/release, which would make using those much more cumbersome. BecauseNSDictionaryis a class cluster, subclassing is quite different than one would expect. It’s outside of the boundaries of this article, but one real-world example of an ordered dictionary is here.

NSDictionary

A dictionary stores arbitrary key/value pairs of objects. For historical reasons, the initializer uses the reversed object to key notation,[NSDictionary dictionaryWithObjectsAndKeys:object, key, nil],while the newer literal shorthand starts with key@{key : value, ...}.

Keys inNSDictionaryare copied and they need to be constant. If the key changes after being used to put a value in the dictionary, the value may not be retrievable. As an interesting detail, keys are copied when using anNSDictionary, but are only retained when using a toll-free bridgedCFDictionary. There’s no notion of a generic object copy for CoreFoundation-classes, thus copy wasn’t possible at that time (*). This only applies if you useCFDictionarySetValue(). If you use a toll-free bridgedCFDictionaryviasetObject:forKey, Apple added additional logic that will still copy your key. This does not work the other way around - using anNSDictionaryobject casted toCFDictionaryand used viaCFDictionarySetValue()will call back tosetObject:forKeyand copy the key.

(*) There is one prepared key callbackkCFCopyStringDictionaryKeyCallBacksthat will copy strings, and becauseCFStringCreateCopy()calls back to[NSObject copy]for an ObjC class we could abuse this callback to create a key-copyingCFDictionary.

Performance Characteristics

Apple is rather quiet when it comes to defining computational complexity. The only note around this can be found in the CoreFoundation headers ofCFDictionary:

The access time for a value in the dictionary is guaranteed to be at worst O(N) for any implementation, current and future, but will often be O(1) (constant time). Insertion or deletion operations will typically be constant time as well, but are O(N*N) in the worst case in some implementations. Access of values through a key is faster than accessing values directly (if there are any such operations). Dictionaries will tend to use significantly more memory than a array with the same number of values.

The dictionary - much like array - uses different implementations depending on the size and switches between them transparently.

Enumeration and Higher-Order Messaging

Again, there are several ways how to best filter a dictionary:

// Using keysOfEntriesWithOptions:passingTest:,optionally concurrent NSSet *matchingKeys = [randomDict keysOfEntriesWithOptions:NSEnumerationConcurrent 
                                               passingTest:^BOOL(id key, id obj, BOOL *stop) 
{ return testObj(obj);
}]; NSArray *keys = matchingKeys.allObjects; NSArray *values = [randomDict objectsForKeys:keys notFoundMarker:NSNull.null];
__unused NSDictionary *filteredDictionary = [NSDictionary dictionaryWithObjects:values 
                                                                        forKeys:keys]; // Block-based enumeration. NSMutableDictionary *mutableDictionary = [NSMutableDictionary dictionary];
[randomDict enumerateKeysAndObjectsUsingBlock:^(id key, id obj, BOOL *stop) { if (testObj(obj)) {
        mutableDictionary[key] = obj;
    }
}]; // NSFastEnumeration NSMutableDictionary *mutableDictionary = [NSMutableDictionary dictionary]; for (id key in randomDict) { id obj = randomDict[key]; if (testObj(obj)) {
        mutableDictionary[key] = obj;
    }
} // NSEnumeration NSMutableDictionary *mutableDictionary = [NSMutableDictionary dictionary]; NSEnumerator *enumerator = [randomDict keyEnumerator]; id key = nil; while ((key = [enumerator nextObject]) != nil) { id obj = randomDict[key]; if (testObj(obj)) {
           mutableDictionary[key] = obj;
       }
 } // C-based array enumeration via getObjects:andKeys: NSMutableDictionary *mutableDictionary = [NSMutableDictionary dictionary]; id __unsafe_unretained objects[numberOfEntries]; id __unsafe_unretained keys[numberOfEntries];
[randomDict getObjects:objects andKeys:keys]; for (int i = 0; i < numberOfEntries; i++) { id obj = objects[i]; id key = keys[i]; if (testObj(obj)) {
       mutableDictionary[key] = obj;
    }
 }

Filtering/Enumeration Method	Time [ms], 50.000 elements	1.000.000 elements
keysOfEntriesWithOptions:, concurrent	16.65	425.24
getObjects:andKeys:	30.33	798.49*
keysOfEntriesWithOptions:	30.59	856.93
enumerateKeysAndObjectsUsingBlock:	36.33	882.93
NSFastEnumeration	41.20	1043.42
NSEnumeration	42.21	1113.08

(*) There’s a caveat when usinggetObjects:andKeys:. In the above code example, we’re using a C99 feature called variable-length arrays (as normally, the array count needs to be a fixed variable). This will allocate memory on the stack, which is a bit more convenient, but also limited. The above code example will crash for a large number of elements, so usemalloc/calloc-based allocation (andfree) to be on the safe side.

Why isNSFastEnumerationso slow here? Iterating the dictionary usually requires both key and object; fast enumeration can only help for the key, and we have to fetch the object every time ourselves. Using the block-basedenumerateKeysAndObjectsUsingBlock:is more efficient since both objects can be more efficiently prefetched.

The winner - again - is concurrent iteration viakeysOfEntriesWithOptions:passingTest:andobjectsForKeys:notFoundMarker:. This is a bit more code, but this can be nicely encapsulated in a category.

Should I Use dictionaryWithCapacity:?

By now you already now how this test works, and the short answer is NO, thecountparameter doesn’t change anything:

Adding 10000000 elements toNSDictionary. no count 10786.60[ms] with count: 10798.40[ms].

Sorting

There’s not much to say about dictionary sorting. You can only sort the key array as a new object, thus you can use any of the regularNSArraysorting methods as well:

- (NSArray *)keysSortedByValueUsingSelector:(SEL)comparator;
- (NSArray *)keysSortedByValueUsingComparator:(NSComparator)cmptr;
- (NSArray *)keysSortedByValueWithOptions:(NSSortOptions)opts 
                          usingComparator:(NSComparator)cmptr;

Shared Keys

Starting with iOS 6 and 10.8, it’s possible to have a pre-generated key set for a new dictionary, usingsharedKeySetForKeys:to create the key set from an array, anddictionaryWithSharedKeySet:to create the dictionary. UsuallyNSDictionarycopies its keys. When using a shared key set it will instead reuse those objects, which saves memory. According to the Foundation Release Notes,sharedKeySetForKeys:will calculate a minimal perfect hash that eliminates any need for probe looping during a dictionary lookup, thus making keyed access even faster.

This makes it perfect for use cases like a JSON parser, although in our limited testing we couldn’t see Apple using it inNSJSONSerialization. (Dictionaries created with shared key sets are of subclassNSSharedKeyDictionary; regular dictionaries are__NSDictionaryI/__NSDictionaryM, with I/M indicating mutability; and toll-free bridged dictionaries are of class_NSCFDictionary, both mutable and immutable variants.)

Interesting detail: Shared-key dictionaries are always mutable, even when calling ‘copy’ on them. This behavior is not documented but can be easily tested:

id sharedKeySet = [NSDictionary sharedKeySetForKeys:@[@1, @2, @3]]; // returns NSSharedKeySet NSMutableDictionary *test = [NSMutableDictionary dictionaryWithSharedKeySet:sharedKeySet];
test[@4] = @"First element (not in the shared key set, but will work as well)"; NSDictionary *immutable = [test copy];
NSParameterAssert(immutable == 1);
((NSMutableDictionary *)immutable)[@5] = @"Adding objects to an immutable collection should throw an exception.";
NSParameterAssert(immutable == 2);

NSSet

NSSetand its mutable variantNSMutableSetare an unordered collection of objects. Checking for existence is usually an O(1) operation, making this much faster for this use case thanNSArray.NSSetcan only work efficiently if the hashing method used is balanced; if all objects are in the same hash bucket, thenNSSetis not much faster in object-existence checking thanNSArray.

Variants ofNSSetare alsoNSCountedSet, and the non-toll-free counter-variantCFBag/CFMutableBag.

NSSetretains its object, but per the set contract, that object needs to be immutable. Adding objects to a set and then later changing that object will result in weird bugs and will corrupt the state of the set.

NSSethas far less methods thanNSArray. There is no sorting method but there are a few convenience enumeration methods. Some important methods areallObjectsto convert the objects into anNSArrayandanyObject, which returns either any object or nil, if the set is empty.

Set Manipulation

NSMutableSethas several powerful set methods likeintersectSet:,minusSet:, andunionSet:.

Set-Union

Should I Use setWithCapacity:?

Again, we test if there is any noticeable speed difference when we initialize a set with a given capacity:

Adding 1.000.000 elements toNSSet. no count 2928.49[ms] with count: 2947.52[ms].

This falls under measurement uncertainty - there’s no noticeable time difference. There is evidence that at least in the previous version of the runtime, this had much more of a performance impact.

Performance Characteristics of NSSet

Apple doesn’t provide any notes about the computational complexity in the CFSet headers:

Class / Time [ms]	1.000.000 elements
NSMutableSet, adding	2504.38
NSMutableArray, adding	1413.38
NSMutableSet, random access	4.40
NSMutableArray, random access	7.95

This benchmark is pretty much what we expected:NSSetcallshashandisEqual:on each added object and manages a buckets of hashes, so it takes more time on adding elements. Random access is hard to test with a set, since all there is isanyObject.

There was no need for includingcontainsObject:in the benchmark. It is magnitudes faster on a set - that’s their speciality, after all.

NSOrderedSet

NSOrderedSetwas first introduced in iOS 5 and Mac OS X 10.7, and there’s almost no API directly using it, except for CoreData. It seems like a great class with the best of bothNSArrayandNSSet: having the benefits of instant object-existence checking, uniqueness, and fast random access.

NSOrderedSethas great API methods, which makes it convenient to work with other set or ordered set objects. Union, intersection, and minus are supported just like inNSSet. It has most of the sort methods that are inNSArray, with the exception of the old function-based sort methods and binary search - after all,containsObject:is super fast, so there’s no need for that.

Thearrayandsetaccessors will respectively return anNSArrayorNSSet, but with a twist! Those objects are facade objects that act like immutable objects and will update themselves as the ordered set is updated. This is good to know when you’re planning to iterate those objects on different threads and get mutation exceptions. Internally, the classes used are are__NSOrderedSetSetProxyand__NSOrderedSetArrayProxy.

Side Note: If you’re wondering whyNSOrderedSetisn’t a subclass ofNSSet, there’s a great article on NSHipster explaining the downsides of mutable/immutable class clusters.

Performance Characteristics of NSOrderedSet

If you look at this benchmark, you see whereNSOrderedSetstarts getting expensive. All those benefits can’t come for free:

Class / Time [ms]	1.000.000 elements
NSMutableOrderedSet, adding	3190.52
NSMutableSet, adding	2511.96
NSMutableArray, adding	1423.26
NSMutableOrderedSet, random access	10.74
NSMutableSet, random access	4.47
NSMutableArray, random access	8.08

This benchmark adds custom strings to each of these collection classes, and later randomly accesses those.

NSOrderedSetwill also take up more memory than eitherNSSetorNSArray, since it needs to maintain both hashed values and indexes.

NSHashTable

NSHashTableis modeled afterNSSet, but is much more flexible when it comes to object/memory handling. While some of the features ofNSHashTablecan be achieved via custom callbacks onCFSet, hash table can hold objects weakly and will properly nil out itself when the object is deallocated - something that would be quite ugly when manually added to anNSSet. It’s also mutable by default - there is no immutable counterpart.

NSHashTablehas both an ObjC and a raw C API, where the C API can be used to store arbitrary objects. Apple introduced this class in 10.5 Leopard, but only added it quite recently in iOS 6. Interestingly enough, they only ported the ObjC API; the more powerful C API is excluded on iOS.

NSHashTableis wildly configurable via theinitWithPointerFunctions:capacity:- we’re only picking the most common use cases, which are also predefined usinghashTableWithOptions:. The most useful option has its own convenience constructor viaweakObjectsHashTable.

NSPointerFunctions

These pointer functions are valid forNSHashTable,NSMapTable, andNSPointerArray, and define the acquisition and retention behavior for the objects saved in these collections. Here are the most useful options. For the full list, seeNSPointerFunctions.h.

There are two groups of options. Memory options determine memory management, and personalities define hashing and equality.

NSPointerFunctionsStrongMemorycreates a collection that retains/releases objects, much like a regularNSSetorNSArray.

NSPointerFunctionsWeakMemoryuses an equivalent of__weakto store objects and will automatically evict deallocated objects.

NSPointerFunctionsCopyIncopies the objects before they are added to the collection.

NSPointerFunctionsObjectPersonalityuseshashandisEqual:from the object (default).

NSPointerFunctionsObjectPointerPersonalityuses direct-pointer comparison forisEqual:andhash.

Performance Characteristics of NSHashTable

Class / Time [ms]	1.000.000 elements
NSHashTable, adding	2511.96
NSMutableSet, adding	1423.26
NSHashTable, random access	3.13
NSMutableSet, random access	4.39
NSHashTable, containsObject	6.56
NSMutableSet, containsObject	6.77
NSHashTable, NSFastEnumeration	39.03
NSMutableSet, NSFastEnumeration	30.43

If you just need the features of anNSSet, then stick atNSSet.NSHashTabletakes almost twice as long to add objects, but has quite similar performance characteristics.

NSMapTable

NSMapTableis similar toNSHashTable, but modeled afterNSDictionary. Thus, we can control object acquisition/retention for both the keys and objects separately, viamapTableWithKeyOptions:valueOptions:. Since storing one part weak is again the most useful feature ofNSMapTable, there are now four convenience constructors for this use case:

strongToStrongObjectsMapTable
weakToStrongObjectsMapTable
strongToWeakObjectsMapTable
weakToWeakObjectsMapTable

Note that - unless created withNSPointerFunctionsCopyIn- any of the defaults will retain (or weakly reference) the key object, and not copy it, thus matching the behavior ofCFDictionaryand notNSDictionary. This can be quite useful if you need a dictionary whose key does not implementNSCopying, likeUIView.

If you’re wondering why Apple “forgot” adding subscripting toNSMapTable, you now know why. Subscripting requires anid<NSCopying>as key, which is not necessary forNSMapTable. There’s no way to add subscripting to it without having an invalid API contract or weakening subscripting globally with removing theNSCopyingprotocol.

You can convert the contents to an ordinaryNSDictionaryusingdictionaryRepresentation. This returns a regular dictionary and not a proxy - unlikeNSOrderedSet:

Performance of NSMapTable

Class / Time [ms]	1.000.000 elements
NSMapTable, adding	2958.48
NSMutableDictionary, adding	2522.47
NSMapTable, random access	13.25
NSMutableDictionary, random access	9.18

NSMapTableis only marginally slower thanNSDictionary. If you need a dictionary that doesn’t retain its keys, go for it, and leaveCFDictionarybehind.

NSPointerArray

TheNSPointerArrayclass is a sparse array that works similar to anNSMutableArray, but can also holdNULLvalues, and thecountmethod will reflect those empty spots. It can be configured with various options fromNSPointerFunctions, and has convenience constructors for the common use cases,strongObjectsPointerArray, andweakObjectsPointerArray.

Before you can useinsertPointer:atIndex:, we need to make space by directly setting thecountproperty, or you will get an exception. Alternatively, usingaddPointer:will automatically increase array size if needed.

You can convert anNSPointerArrayinto a regularNSArrayviaallObjects. In that case, allNULLvalues are compacted, and only existing objects are added - thus the object indexes of this array will most likely be different than in the pointer array. Careful: if you are storing anything other than objects into the pointer array, attempting to callallObjectswill crash withEXC_BAD_ACCESS, as it tries to retain the “objects” one by one.

From a debugging point of view,NSPointerArraydidn’t get much love. Thedescriptionmethod simply returns<NSConcretePointerArray: 0x17015ac50>. To get to the objects you need to, call[pointerArray allObjects], which, of course, will change the indexes if there are anyNULLsin between.

Performance of NSPointerArray

When it comes to performance,NSPointerArrayis really, really slow, so think twice if you plan to use it on a large data set. In our benchmark we’re comparingNSMutableArraywithNSNullas an empty marker andNSPointerArraywith aNSPointerFunctionsStrongMemoryconfiguration (so that objects are properly retained). Then in an array of 10,000 elements, we fill every tenth entry with a string “Entry %d”. The benchmark includes the total time it takes forNSMutableArrayto be filled withNSNull.null. ForNSPointerArray, we usesetCount:instead:

Class / Time [ms]	10.000 elements
NSMutableArray, adding	15.28
NSPointerArray, adding	3851.51
NSMutableArray, random access	0.23
NSPointerArray, random access	0.34

Notice thatNSPointerArrayrequires more than 250x (!) more time thanNSMutableArray. This is really surprising and unexpected. Tracking memory is harder and it’s likely thatNSPointerArrayis more efficient here, but since we use one shared instance forNSNullto mark empty objects, there shouldn’t be much overhead except pointers.

NSCache

NSCacheis quite an odd collection. Added in iOS 4 / Snow Leopard, it’s mutable by default, and also thread safe. This makes it perfect to cache objects that are expensive to create. It automatically reacts to memory warnings and will clean itself up based on a configurable “cost.” In contrast toNSDictionary, keys are retained and not copied.

The eviction method ofNSCacheis non-deterministic and not documented. It’s not a good idea to put in super-large objects like images that might fill up your cache faster than it can evict itself. (This was the case of many memory-related crashes in PSPDFKit, where we initially usedNSCachefor storing pre-rendered images of pages, before switching to custom caching code based on a LRU linked list.)

NSCachecan also be configured to automatically evict objects that implement theNSDiscardableContentprotocol. A popular class implementing this property isNSPurgeableData, which as been added at the same time, but was “not fully thread safe” until OS X 10.9 (there’s no information if this has affected iOS as well, or if this fix landed in iOS 7).

Performance of NSCache

So how doesNSCachehold up compared to anNSMutableDictionary? The added thread safety surely takes some overhead. Out of curiosity, I’ve also added a custom, thread-safe dictionary subclass (PSPDFThreadSafeMutableDictionary) that synchronizes access via anOSSpinLock:

Class / Time [ms]	1.000.000 elements	iOS 7x64 Simulator	iPad Mini iOS 6
NSMutableDictionary, adding	195.35	51.90	921.02
PSPDFThreadSafeMutableDictionary, adding	248.95	57.03	1043.79
NSCache, adding	557.68	395.92	1754.59
NSMutableDictionary, random access	6.82	2.31	23.70
PSPDFThreadSafeMutableDictionary, random access	9.09	2.80	32.33
NSCache, random access	9.01	29.06	53.25

NSCacheholds up pretty well, and random access is equally fast as our custom thread-safe dictionary. Adding is slower, as expected, becauseNSCachealso keeps an optional cost factor around, has to determine when to evict objects, and so on - it’s not a very fair comparison in that regard. Interestingly, it performs almost ten times worse when run in the Simulator. This is true for all variants, 32 or 64 bit. It also looks like it has been optimized in iOS 7 or simply benefits from the 64-bit runtime. When testing with an older device, the performance overhead of usingNSCacheis far more noticeable.

The difference between iOS 6 (32 bit) and iOS 7 (64 bit) is also far more noticeable since the 64-bit runtime uses tagged pointers, and thus our@(idx)boxing is much more efficient there.

NSIndexSet

There are a few use cases whereNSIndexSet(and its mutable variant,NSMutableIndexSet) really shines, and you will find various usages throughout Foundation. It can save a collection of unsigned integers in a very efficient way, especially if it’s only one or a few ranges. As the name “set” already implies, eachNSUIntegeris either in the index set or isn’t. If you need to store an arbitrary number of integers that are not unique, better use anNSArray.

This is how you would convert an array of integers to anNSIndexSet:

NSIndexSet *PSPDFIndexSetFromArray(NSArray *array) { NSMutableIndexSet *indexSet = [NSMutableIndexSet indexSet]; for (NSNumber *number in array) {
        [indexSet addIndex:[number unsignedIntegerValue]];
    } return [indexSet copy];
}

Getting all indexes out of the index set was a bit fiddly before we had blocks, withgetIndexes:maxCount:inIndexRange:being the fastest way, next to usingfirstIndexand iterating untilindexGreaterThanIndex:returnedNSNotFound. With the arrival of blocks, working withNSIndexSethas become a lot more convenient:

NSArray *PSPDFArrayFromIndexSet(NSIndexSet *indexSet) { NSMutableArray *indexesArray = [NSMutableArray arrayWithCapacity:indexSet.count];
    [indexSet enumerateIndexesUsingBlock:^(NSUInteger idx, BOOL *stop) {
       [indexesArray addObject:@(idx)];
    }]; return [indexesArray copy];
}

Performance of NSIndexSet

There’s no equivalent toNSIndexSetin Core Foundation, and Apple doesn’t make any promises to performance. A comparison betweenNSIndexSetandNSSetis also relatively unfair to begin with, since the regular set requires boxing for the numbers. To mitigate this, the benchmark will prepare pre-boxedNSUIntegers, and will callunsignedIntegerValueon both loops:

Class / Time per Entries [ms]	#1.000	#10.000	#1.000.000	#10.000.000	#1.000.000, iPad Mini iOS 6
NSIndexSet, adding	0.28	4.58	98.60	9396.72	179.27
NSSet, adding	0.30	2.60	8.03	91.93	37.43
NSIndexSet, random access	0.10	1.00	3.51	58.67	13.44
NSSet, random access	0.17	1.32	3.56	34.42	18.60

We’ll see that at around 1 million entries,NSIndexSetstarts becoming slower thanNSSet, but only because of the new runtime and tagged pointers. Running the same test on iOS 6 shows thatNSIndexSetis faster, even with this high number of entries. Realistically, in most apps, you won’t add that many integers into the index set. What’s not measured here is thatNSIndexSetcertainly has a greatly optimized memory layout compared toNSSet

Conclusion

This article provides you with some real-world benchmarks to make informed choices when using Foundation’s collection classes. Next to the discussed classes, there are some less common but still useful ones, especiallyNSCountedSet, CFBag, CFTree, CFBitVector, and CFBinaryHeap.

转载于:https://my.oschina.net/grant110/blog/186850