转贴一篇文章,喜欢STL的朋友别错过:Overdoing Templates(过度使用模板)

转载 2002年04月02日 10:29:00

这篇文章是Steve Donovan写的,他是“C++ By Examples”的作者,他开发了一个叫做UnderC的C++解释器,可以把C++当成脚本语言来解释执行。跟Bjarne Stroustrup,Andy Koenig, Scott Meyers们比起来,在C++社区,他的名气要小得多。但是,他的这篇文章公然对大师们的观点提出异议,而且我认为说的很在理。大师们为了展示STL能力的范例,不应该作为日常编程的榜样。而过度强调编码技巧,正在使C++一步步远离普通程序员。当所有“菜鸟”都抛弃一种语言的时候,就算“大虾”们的技巧再高超,也只能自娱自乐。我认为这篇文章是一声呐喊,虽然不够响亮,但是是难得的一声呐喊--

让我们已经无比强大的C++朴实一些吧!

Overdoing C++ Templates
Every decade or so, a new fashion in programming comes along and announces itself the successor to previous discredited paradigms. Again we believe that from now on, software will be more reliable, cheaper to build, and more fun to make. (No one believes it will be smaller or faster.) In the 70s, there was structured programming; in the 80s, object-oriented programming; and from the mid-90s onward, generic programming. This last one gets its name from powerful techniques for code reuse using templates (generic classes and functions).

Parameterized classes and functions (templates) are very useful beasts. A function sqr() can be written that will square anything that defines the multiplication operator—complex numbers, matrices, whatever. The standard container classes such as list<> are templates—one doesn't have to rewrite them for every new type. This was a genuine pain in plain old C++, and I think the ISO standard is a great improvement. However, some things were overdone in the process.

For instance, the standard library strings and iostreams are templates parameterized by a "character traits" type. This means that the same basic_string<> class definition can generate ASCII strings, Unicode strings, and Martian Trilobyte strings. (In principle. Many implementations have still only implemented it for ASCII characters, which seems particularly silly.) The standard mandates that these very common classes, which are used in nearly every C++ application, must be implemented as templates.

But there is a price to pay in performance and debuggability. Some experimentation (using Microsoft Visual C++ 6.0) illustrates the problem. This compiler supports the new-style iostreams library, as well as the "classic" iostreams, so we can compare the implementations. The first test program is of course "Hello, World," which compiles more than twice as fast using classic iostreams. A more serious program involved 200 lines, each writing out 10 variables to count. What is most striking is the compilation speed: It took nearly 10 sec using the standard library versus 1.5 sec for the classic implementation. Ten seconds is a lot of time; one can cram a whole highly irritating commercial into that kind of break. Executable size was 115K for standard and 70K for classic. Your mileage may vary, but the overall picture is of slower builds and larger executables when using the new iostreams library. And this isn't a peculiarity of the Microsoft compiler because GCC behaves in the same way.

It's true that executable size is not as important as it used to be, but these days the fastest growing class of programmable devices consists of information appliances (in which memory will remain at a premium for the next few years): hand-helds, cell phones, smart refrigerators, Bluetooth-enabled espresso machines, and so on. The extra size from using the standard iostreams comes from the wholesale inlining of template code, which makes it hard to fine-tune crucial operations without code bloat. For me, the build times are a more crucial problem because it means waiting longer and losing the "conversational flow" that is important in development.

Then, consider debuggability. The template implementation of the standard string class is highly ingenious, but not pretty to the applications debugger. She is faced with compiler and debugger messages using the following fully expanded name:

class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char>>
And as for the very useful map<string,string>, I leave it to your imagination. The name is so long that one gets dozens of compiler warnings about the internal name being truncated. std::string should be as transparent as possible to the beginner, who should not be penalized for treating it as a built-in feature of the language. It is technically feasible to hunt for any defined typedefs in scope when putting out compiler error messages, and I may get around to doing that with the UnderC project. Verity Stob suggests writing a postprocessor to massage C++ error messages, and I hope she's joking. It is simpler just not to use such an overcomplicated type. My secret weapon in C++ development (which I'll confess here in public for the first time) is an interface-compatible string class that replaces the <string> header in my larger projects. Occasionally, I'll rebuild with the standard headers to see if my libraries are still honest, but I generally let other people struggle with being correct at the price of performance.

Let me say that there are indeed applications that need the kind of flexibility offered by std::string programs that manipulate both ASCII and Unicode strings or that need to customize the allocation strategy, etc. These are not generally very common (usually, programs either use ASCII or Unicode), and it seems unfair to burden programmers with the general machinery. It does make the library implementer's job more interesting, but it complicates that of the application programmers. This seems to be an inversion of the general principle that a well-designed library hides the difficulty of its implementation and is straightforward to use. That is, std::string does not hide its implementation sufficiently because application programmers are constantly being made aware of it during their development process. And we cannot assume that the users of these classes are rocket scientists. The standard is insisting on a particular implementation strategy, which goes against the idea of standards ideally specifying only the public interface and expected behavior of classes. Naturally, the general templates can always be available to those who genuinely need them.

These considerations apply as well to the standard containers such as list<>, which have an extra default template argument that defines an allocator. Most people don't need this facility, although it's very useful if you do, of course. Again, these more general versions can be defined as separate templates. I appreciate that this makes the standard library less technically interesting, but libraries must be primarily designed with the end users in mind. To rephrase the C++ mantra, users should not be bothered by what they don't use.

Apart from the danger of making things templates when they don't need to be templates, there is another problem that occurs with generic programming in C++. Most people agree that the standard algorithms can be very useful. If I have a vector of integers, then sort(v.begin(),v.end()) will sort it. Because the comparison is inlined, this generic algorithm will be faster than the old-fashioned qsort(), as well as being easier to use, especially if this was a vector of a user-defined type. copy() will copy anything, anywhere, and in the most efficient way possible.

But some uses are unnecessarily opaque:

copy_if(v.begin(),v.end(),ostream_iterator<int>(cout) bind2nd(greater<int>(),7));
To be pedantic, each name should be qualified with std::, but assume that everything has been brought into the global namespace, either by separate using clauses or by some other unmentionable means) This example from Stroustrup1 can be more conventionally expressed in a form that makes it obvious that all the integers will run together on the output:

 vector<int>::iterator li;
 for (li = v.begin(); li != v.end(); ++li)
  if (*li > 7) cout << *li;
Stroustrup tells us that using explicit loops is "tedious and error-prone," but I cannot see any advantages in using the first version. Obviously, one can get used to this notation; humans are very adaptable, and as professionals, we have to learn the new idiom. But it is certainly not less tedious, and it is arguably much less readable and flexible. Also, it can constrain design decisions. For example, given that we have a list of Shape * pointers, we can tell them all to draw themselves with either

 for_each(ls.begin(),ls.end(),
     bind2nd(mem_fun(&Shape::draw),canvas));
or

 ShapeList::iterator li;
 for (li = ls.begin(); li != ls.end(); ++li)
    (*li)->draw(canvas);
Now, if I want to modify this so I only draw shapes matching some criteria (and specifically don't want to put that behavior into the shapes themselves), then I just have to add an if statement in the explicit loop. The only way I can think of doing this using the generic idiom is to actually define a function as the "payload" of the for_each() algorithm. Using the terminology of the software patterns book2, the first example is an internal iterator, and the second is an external iterator. The authors observe that C++ is not particularly good at internal iterators, and I think we should respect the limitations of the language. The problem here is the overenthusiastic application of the generic paradigm in C++—again leading to unnecessary difficulties. C++ simply does not do general anonymous functions such as LISP, SmallTalk, Ruby, etc. An anonymous function (or lambda expression) in C++ would look rather like the following3; perhaps one day somebody will implement it:

for_each(ls.begin(),ls.end(),
  void lambda(Shape *p) { p->draw(canvas); });
C++ is a marvelous language that is found in everything from cell phones to continental switching networks. It flexibly supports many programming styles, but this flexibility can be a problem. The art of good taste in programming is picking an appropriate idiom for the particular problem, just as good style in prose writing needs always to be aware of the audience. I don't wish to undermine the standard, which involved a mammoth effort and has given us a common platform. My issue with the standard is that it is too bound up with the generic programming paradigm, and hence becomes prescriptive about what constitutes good programming style (for instance, that algorithms are to be strongly preferred over explicit loops). It is also burdening programmers with implementation details (such as basic_string<>), which reinforces the perception of C++ as a hardcore engineer's language.

 

过度使用C++模板(overdoing C++ templates)

过度使用C++模板 不喜欢模板.
  • dreamcs
  • dreamcs
  • 2010年12月22日 11:18
  • 535

STL模板库algorithm常用方法

#include #include #include//c++的标准输入输出 #include using namespace std; int main() { int a,b; s...
  • qq_33506160
  • qq_33506160
  • 2016年07月19日 20:54
  • 519

转贴一篇文章,使用UML朋友别错过:怎么避免Use_Case的缺陷?

How to Avoid Use-Case Pitfalls January 2000Whether your system boundaries are blurred or youve tangl...
  • duanfeng
  • duanfeng
  • 2002年05月02日 10:18
  • 692

一篇关于STL标准模板库使用方法的经典文章!

STL 简介,标准模板库 作者:Scott Field         这篇文章是关于C++语言的一个新的扩展——标准模板库的(Standard Template Library),也叫STL。 ...
  • hygzxf
  • hygzxf
  • 2011年09月13日 16:10
  • 2794

ThinkPHP实现文章上一篇和下一篇

控制器代码: (可以根据自己的文章排序不同而修改)$id= $_GET['id']; //上一篇 $front=M('article')->where("id>".$id)->ord...
  • LIU_YANZHAO
  • LIU_YANZHAO
  • 2016年12月03日 10:24
  • 557

java统计一篇文章中单词出现的次数

import java.util.HashMap; import java.util.Iterator;   /**  * @author Administrator  ...
  • sz_bdqn
  • sz_bdqn
  • 2014年04月08日 23:23
  • 12021

如何写一篇英文文章的Summary?

摘要是对一篇文章的主题思想的简单陈述。它用最简洁的语言概括了原文的主题。写摘要主要包括三个步骤:(1)阅读;(2)写作;(3)修改成文。        第一步:阅读 A.认真阅读给定的原文...
  • Algorithmguy
  • Algorithmguy
  • 2014年09月28日 08:25
  • 3713

Overdoing C++ Templates

by Steve Donovan, the Author of C++ by Example: "UnderC" Learning EditionMAR 22, 2002Steve Donovan e...
  • domustdo
  • domustdo
  • 2002年04月22日 09:36
  • 667

有一篇文章,共有3行文字,每行有8…

有一篇文章,共有3行文字,每行有80个字符。要求分别统计出其他英文大写字母、小写字母、数字、空格以及其他字符的个数。 #include void main() { char ch[3][80]; i...
  • jingzhewangzi
  • jingzhewangzi
  • 2014年08月29日 22:25
  • 4215

WordPress-当文章属于两个或多个分类时,使其在不同的分类使用不同文章模板的方法

假设有两个分类A,B,分类目录显示页分别为category-A,category-B,两个页面均显示该分类的文章列表,要使同一篇文章在两个分类页中打开后调用不同的模板single-A,single-B...
  • u011023132
  • u011023132
  • 2015年06月01日 14:59
  • 925
内容举报
返回顶部
收藏助手
不良信息举报
您举报文章:转贴一篇文章,喜欢STL的朋友别错过:Overdoing Templates(过度使用模板)
举报原因:
原因补充:

(最多只允许输入30个字)