Here’s another Microsoft internal newsletter about some pretty cool techniques from Microsoft Research that have been integrated into Microsoft’s new Digital Image software.  IMHO, there’s just no decent imaging software.  I was a huge Jasc Software Paint Shop Pro 6 fan.  That’s version 6, mind you.  It was a fantastic editor.  It had powerful techniques à la Photoshop with a simple and elegant interface.  Later versions became far too complex, with a schizophrenic move towards supporting vectors, rasters, and layers all in one confusing interface littered with toolbars and toolboxes.

Photoshop is too advanced for the average user, and Paint is a joke.  The original Microsoft Digital Image Suite was a train-wreck called “Picture-It” that had the unfortunate habit of crashing by disappearing.  You know these special crashes…no Doctor Watson, just -poof-, gone.

The new Adobe PhotoAlbum suite has some pretty impressive organizational skills, mostly associating event and subject tags with the time the photo was taken.  But will Microsoft get it right this time with a little help from Microsoft Research?

Side Note: It would be nice if Microsoft would decide what App or Applet should be used to view JPEGs.  I’m writing this post on a fairly fresh Tablet PC, and I’m presented with this menu when right-clicking a JPEG:

Ok, so I can open it in the “Windows Picture and Fax Viewer” – hm, I open pictures a LOT more than Faxes.  I’ve also got Paint.  Hm, then IE.  Also the Microsoft Office Picture Manager, which is a GREAT tool, but: 1. it is relegated to the never-used “Office Tools” program group, and 2. it doesn’t associate itself with graphic files until you run it once…but you have to discover it before you can run it!  Seems that since Microsoft is so gung-ho on Digital Photography, someone should organize a Unified Front within the org and work this out.


History is being written in a new way. It is being written by people through the big and the small events in their lives. We write history through personal Web sites, discussion boards, and the legacy of photos, taken at moments that are important to us.

Researchers at Microsoft have been working on a wide range of technologies that will help people write their personal histories through digital photography. To tell any history, it's best to start at the beginning, and move forward.

In the beginning, you buy a digital camera, take a quick look at the manual, throw it to the side, and start pushing buttons. Digital photography has made it possible to take an almost endless number of photos. Some of these photos are good; some get deleted before anyone else sees them. Many of them are almost good; with a little tweaking they'd be just fine.

Adjustable Light

A common problem with digital photography is lighting. Photos turn out either too light or too dark. Since digital cameras allow us to take lots of shots without running out of film, we're willing to throw away a lot of the bad shots. But if you want to take a great picture, it might be nice to be able to control the lighting without purchasing professional lighting equipment.

One of the research projects at Microsoft Research is called Continuous Flash. This technology allows you to take the same picture with flash and without flash and later adjust the balance between light and dark. It's better than contrast filters in photo editing tools, because it considers the reflection characteristics of each object in your picture.

"You can't compensate for having one area underexposed and one area overexposed," said Hugues Hoppe, one of the project researchers. "If an area is underexposed, you can't really get the detail back by increasing brightness, because it wasn't captured in the first place. By having two different images which both have useful information, you can merge them together."
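The newsletter doesn't say how Continuous Flash merges the two shots, so here is only a minimal sketch of the general idea, assuming the two photos are already aligned and that a plain per-pixel linear blend is good enough. The function name `blend_flash` and the `alpha` slider are made up for illustration; the real feature is surely more sophisticated.

```python
def blend_flash(no_flash, flash, alpha):
    """Blend two aligned grayscale images (lists of rows, values 0-255).

    alpha = 0.0 returns the no-flash image and alpha = 1.0 the flash image.
    Values outside [0, 1] extrapolate and are clamped to the valid range.
    This linear blend is an assumption, not the shipped algorithm.
    """
    if len(no_flash) != len(flash):
        raise ValueError("images must have the same height")
    out = []
    for row_a, row_b in zip(no_flash, flash):
        if len(row_a) != len(row_b):
            raise ValueError("images must have the same width")
        out.append([
            # Weighted average of the two exposures for this pixel.
            max(0, min(255, round((1 - alpha) * a + alpha * b)))
            for a, b in zip(row_a, row_b)
        ])
    return out
```

Because each pixel is blended from its own pair of values, a surface that barely responds to the flash changes little as you move the slider, while a surface that lights up strongly changes a lot.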

Image Stacks

A similar technology that combines the best of multiple photographs is a project called Image Stacks. Image Stacks aligns multiple images of the same subject, allowing the user to pick and choose the best pieces from each photograph. Researchers Michael Cohen, Steven Drucker and Alex Colburn thought this would come in handy for special events, when you want to get a picture of the entire group that's suitable for framing.

Taking group photographs is difficult, because capturing a single image in which everyone looks good is almost impossible. What usually happens is that in one shot, someone has their eyes closed, but someone else has got the most adorable smile. Check the next shot, everyone has their eyes open, but one person is picking a poppy seed out of their teeth. The third shot, both previous people are behaving, but grandma is yawning, tired of waiting through multiple shots. Which shot do you pick? With Image Stacks, you can easily cut and paste to present everyone's best face. The images are automatically registered into a single composite image.

Print to Digital

Print photos are still around. They're around in shoeboxes. They're shoved under the bed and in the back of closets. Some of us have so many print photos we don't know what to do with them. Neatnik types tediously scan all of their print photos one-by-one, converting them to digital form. But most of us don't want to go to all this trouble.

Cormac Herley, a researcher in the Communications, Collaboration and Signal Processing (CCSP) group, has developed a way to allow people to scan multiple photos at one time. You can put as many photos as will fit on your scanner, and the software will recognize each photo separately. It can 'read' the edges of the photos, even if they're crowded together or tilted. When it converts them to digital photos, it will correct for orientation and position. It's a quick way to get the family memories out of the shoebox and onto your hard drive.

"It's a harder problem than it looks like on the surface. Many scanner makers have tried, but it hasn't worked before," said Herley. "But this really works, it's not just a demo."
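To get a feel for the easy part of that problem, here is a rough sketch that finds each photo on a scan as a connected blob of non-background pixels and returns its bounding box. It assumes a grayscale scan with a uniform white background and photos that don't touch, which is exactly the case Herley says is not the hard one. The name `find_photos` is hypothetical, and nothing here handles tilt or crowding the way the real software does.

```python
from collections import deque


def find_photos(scan, background=255):
    """Return inclusive bounding boxes (top, left, bottom, right) of each
    connected non-background region in a grayscale scan (list of rows)."""
    height, width = len(scan), len(scan[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if seen[r][c] or scan[r][c] == background:
                continue
            # Breadth-first flood fill over 4-connected neighbors.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx]
                            and scan[ny][nx] != background):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes
```

Each box could then be cropped out and saved as its own image. Two photos that touch would come back as one box, which hints at why the real problem is harder than it looks.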

Take Out the Red

The Media Computing group at Microsoft Research Asia has developed several cool image editing techniques to help you fix some common problems with any photograph. One of the worst that comes to mind is the glowing red eyes that result from the flash hitting the pupil in just the wrong way. Unless you're way into the vampire look, this just isn't right. The red eye fix, which currently ships in Digital Image Suite and Windows XP Media Center Edition, is one of the best photo retouch features around. All you do is move the 'target' over the red eye, click, and the red is out.

[Image: red eye fix]

The same group has also developed a quick and effective way to fix the brightness and contrast in a picture. Even if you took a picture that is too dark, you can use the Levels Auto-Fix feature in Digital Image Pro to lighten up your picture.
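The newsletter doesn't describe how Levels Auto-Fix works internally. A common way to do this kind of correction is a histogram stretch, so here is a small sketch of that technique, assuming grayscale pixel values from 0 to 255. The function name `auto_levels` and the `clip_fraction` parameter are illustrative, not taken from Digital Image Pro.

```python
def auto_levels(pixels, clip_fraction=0.01):
    """Stretch grayscale values (0-255) so the darkest and brightest pixels,
    after ignoring a small fraction of outliers, span the full 0-255 range."""
    if not pixels:
        return []
    ordered = sorted(pixels)
    n = len(ordered)
    k = int(n * clip_fraction)       # number of outliers to ignore per end
    low = ordered[k]
    high = ordered[n - 1 - k]
    if high == low:
        return list(pixels)          # flat image: nothing to stretch
    scale = 255 / (high - low)
    return [max(0, min(255, round((p - low) * scale))) for p in pixels]
```

A too-dark photo whose values only run from, say, 20 to 100 gets spread across the whole range, which is what makes it look brighter and more contrasty.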

Organize the Digital Shoebox

The Media Computing group has also made it easy to organize your digital photos using image recognition algorithms. Some people do the same thing they did with the shoebox, and scatter pictures all over their hard drive. Then they can't find the one picture they want to use for their holiday card. The group's algorithms can distinguish between indoor and outdoor shots, shots with people in them and shots without, and city and non-city scenes. Combined with other technologies, such as keyword annotation, it makes it a breeze to find any photo in your collection.

John Platt, a researcher in the CCSP group, has developed another way to manage your photos online. His image clustering algorithm helps users find their photos by one of the most prominent markers: events. The software is effective because it doesn't look only at timestamps, which could be misled by a faulty camera clock. Instead, the software looks at photograph order plus color to find pictures taken during one particular event.

"We only compare colors locally in time," said Platt. "So if you have a pumpkin in one shot, and a few months later you wear an orange shirt, later, when you're searching for the day you visited the pumpkin patch, it won't show photographs from when you were wearing the orange shirt."
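Platt's actual algorithm isn't spelled out here, so the following is only a toy sketch of the "compare colors locally in time" idea. It assumes each photo comes with a timestamp and a normalized color histogram, and that photos are already in capture order. The thresholds `max_gap` and `max_color_dist`, and both function names, are invented for the example.

```python
def histogram_distance(h1, h2):
    """L1 distance between two normalized color histograms."""
    return sum(abs(a - b) for a, b in zip(h1, h2))


def cluster_events(photos, max_gap=3600, max_color_dist=0.5):
    """Group photos (already in capture order) into events.

    Each photo is a (timestamp_seconds, histogram) pair. A new event starts
    when the time gap to the previous photo is large or when its colors
    differ sharply from the previous photo. Colors are only ever compared
    between neighbors in the sequence, never across the whole collection.
    """
    events = []
    previous = None
    for photo in photos:
        if previous is None:
            events.append([photo])
        else:
            gap = photo[0] - previous[0]
            dist = histogram_distance(photo[1], previous[1])
            if gap > max_gap or dist > max_color_dist:
                events.append([photo])
            else:
                events[-1].append(photo)
        previous = photo
    return events
```

Since a photo is only compared with the one right before it, the orange shirt from months later can never be pulled into the pumpkin patch event, no matter how similar the colors are.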

His algorithm underlies several other technologies designed to help people find their digital photos, including the Microsoft Research Media Browser. The Media Browser, developed by researchers in the Next Media group, takes advantage of the photo recognition research from the Media Computing group, and Platt's algorithms to build a unique visual experience that helps you search for and identify your photos. And it looks darn cool as it works. The interface is an impressive, futuristic presentation of photos that rearrange themselves before your eyes, sliding into place in a typical 2D presentation or a 3D stack.

"The idea behind this is annotation of large collections of photos," said Steven Drucker, the lead researcher on the project. "We know that if you put annotations on photos, that it's much easier to retrieve them. But we also know that it's tedious and difficult to do. We use the advanced techniques that are available, such as face detection and image clustering, to make it easier for you to interact with your photos. We also use a game graphics card for higher visual quality."

Fill It In

Smart Erase is a photo editing tool found in Digital Image Pro and invented by researcher Patrick Perez in Cambridge. The feature allows users to remove objects from a picture. This can come in handy in case you want to remove your ex from the family reunion picture, or yourself from before you lost the 30 pounds.

The algorithm looks at areas of the image to see which patch of texture can be "stolen" to fill in the holes left behind when the unwanted object in the image is removed.

To fill in the hole, Smart Erase does some reasoning about texture. It views the pixels outside the object as potential replacement material. The program has some strategies for knowing exactly where to look to get this material. "The algorithm constantly reviews what pieces it's got and makes comparisons very quickly to come up with the right fit," said Andrew Blake, Senior Researcher in Cambridge.
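The "stealing" step can be illustrated with a brute-force patch search. This is a sketch under my own assumptions (grayscale image, square patches, sum of squared differences as the comparison), and the names `best_patch` and `fill_patch` are hypothetical. Smart Erase's real search strategy is not described in the newsletter and is certainly smarter than scanning every patch.

```python
def best_patch(image, hole, top, left, size):
    """Find the fully known size x size patch that best matches the known
    pixels of the target patch at (top, left).

    image: list of rows of grayscale values; hole: same shape, True where
    a pixel was erased. Returns (row, col) of the best source patch, or
    None if no fully known patch exists.
    """
    height, width = len(image), len(image[0])
    offsets = [(dy, dx) for dy in range(size) for dx in range(size)]
    best, best_cost = None, None
    for r in range(height - size + 1):
        for c in range(width - size + 1):
            # Candidate patches must not contain erased pixels themselves.
            if any(hole[r + dy][c + dx] for dy, dx in offsets):
                continue
            # Sum of squared differences over the target's known pixels only.
            cost = sum(
                (image[r + dy][c + dx] - image[top + dy][left + dx]) ** 2
                for dy, dx in offsets
                if not hole[top + dy][left + dx]
            )
            if best_cost is None or cost < best_cost:
                best, best_cost = (r, c), cost
    return best


def fill_patch(image, hole, top, left, size):
    """Copy pixels from the best matching patch into the erased pixels."""
    source = best_patch(image, hole, top, left, size)
    if source is None:
        return False
    for dy in range(size):
        for dx in range(size):
            if hole[top + dy][left + dx]:
                image[top + dy][left + dx] = image[source[0] + dy][source[1] + dx]
                hole[top + dy][left + dx] = False
    return True
```

Repeating `fill_patch` over patches along the edge of the hole, working inward, would fill a larger erased region one piece at a time.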

[Image: Smart Erase]

Blend It

Another photo editing feature from the Cambridge lab, a tool code-named Blender, appeared in Digital Image Pro this year as the "Blending Brush." Blender is a seamless cloning tool that can take the wrinkles out of your face, insert a new object into a scene, and combine parts of one scene with another - all without the usual difficulties and distortions that most photo editing techniques present.

If an object inserted into a new background has complex outlines, standard cloning may not work because of the incompatibility of color and intensity between the background and the new object. And even the best, most careful cutting and pasting often yields poor results because the outlines are fuzzy or jagged. Blender 'blends' pieces of the inserted object and the background together to form a seamless whole.
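The newsletter doesn't name the math behind Blender. One well-known way to get this kind of seamless result is gradient-domain blending: keep the pasted object's pixel-to-pixel differences, but force its border to agree with the background. Here is a one-dimensional sketch of that technique, assuming a single row of grayscale pixels; the function name and the fixed iteration count are my own choices, not anything from the product.

```python
def seamless_clone_1d(target, source, start, end, iterations=2000):
    """Paste source[start:end] into target so the pasted region keeps the
    source's pixel-to-pixel differences but meets the target's values at
    both borders (a 1-D gradient-domain blend).

    Requires 1 <= start < end <= len(target) - 1 so both borders exist.
    """
    if len(source) != len(target):
        raise ValueError("source and target must have the same length")
    if not (1 <= start < end <= len(target) - 1):
        raise ValueError("region needs a target pixel on each side")
    result = [float(v) for v in target]
    for _ in range(iterations):
        for i in range(start, end):
            # Discrete Poisson equation: the second difference of the result
            # must equal the second difference of the source.
            laplacian = source[i - 1] - 2 * source[i] + source[i + 1]
            result[i] = (result[i - 1] + result[i + 1] - laplacian) / 2
    return result
```

In the test case I tried, a bright source with a small bump pasted onto a dark background comes out as the same bump sitting at the background's brightness, which is the "no visible seam" effect in miniature.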

Cut It Out

Cutting out an image and putting it somewhere else has always held a lot of fascination for photo aficionados. Blake and his team are developing a new algorithm they call GrabCut, a 'no-brainer' way to do this important task. Instead of having to carefully trace the outlines of the object you want to cut out, all you have to do is draw a rectangle around the object. The algorithm selects the object and eliminates the old background. You can then paste the cut-out object onto a different background.

Cartoon Wizard

Doesn't everyone want to star in their own anime or Disney cartoon?

Though Microsoft Research Asia researchers can't get you a Disney contract, they can turn your digital photograph into a cartoon. Their technology, developed in cooperation with MPD Japan, is called the Cartoon Wizard. It is currently offered in the Japanese version of Office 2003. Westerners will have to wait, as the Cartoon Wizard is only trained to work with Asian faces.

Their system is based on statistical learning techniques. The algorithm automatically generates a cartoon from an image using face detection and alignment, and training data generated by studying how a human artist renders a human image into a caricature. The resulting cartoons can be used in e-cards or personalized emoticons for chat programs.

Tell a Photostory

Now that you've stepped through the process of improving and organizing your digital photos, perhaps you'd like to share them. Microsoft Research has developed several ways to do this, in small and large ways.

When researcher Dave Vronay was working on PhotoStory, he wanted to recreate the feeling of a family sitting around an old-fashioned photo album and telling the stories connected to the pictures.

"A picture is not just a description of what is there," he said. "For instance, if you have a picture of a hotel, and you showed it to a friend, you probably wouldn't just say, 'and that's the hotel we stayed at.' You might instead launch into a story about the waiter with purple hair who served you duck soup at the hotel restaurant, even though you didn't have a picture of him. The photo would be a reminder of the stories that surrounded that photo."

With PhotoStory, you can add images, music, and background narration to tell the stories behind the pictures and send them to the people you'd love to have sitting on the couch next to you while you share your memories.

Share Your Photos with Friends

The Social Computing group is experimenting with an online blog and photo sharing application code-named Wallop, a project designed to help people connect with those close to them: families and friends, and friends of friends.

The group considers Wallop a "social networking" application that provides a way for small, closely connected groups of people to share personal information and photographs online. The beta testers can send photos to their Wallop interface through email or instant messages to easily update their blog interface.

[Image: Wallop screen shots]

Share Your Photos with the World

The World Wide Media Exchange (WWMX) offers users from around the world the chance to upload and share their photos with millions. It provides MapPoint maps and TerraServer maps so that you can view your photos by location as well as time.

One of the advantages to this interface is the ability to communicate with people across the world. If you're planning a trip to London, for instance, maybe some nice tourist who has gone before you has posted their pictures of a trip around the city, complete with shots of their favorite tea stops. Then other tourists or locals can jump in and write annotations on the photographs, such as, "don't eat here, the crumpets aren't up to the usual standards."

Some of the contributors to the WWMX have contributed to history by adding photos of 'news' events, such as fires in Southern California and search and rescue operations.

A Visual Journey

David Salesin, a senior researcher in the Document Processing and Understanding group, has inspired many digital projects at Microsoft Research. He is also on the faculty at the University of Washington. Salesin recently became actively involved in a large digital photography project. A very large project. He contributed original digital photographs from his trip to Bhutan to the world's largest published book, a visual journey across the last unspoiled Himalayan kingdom on the planet.

The project was funded by several sources, including the Bill & Melinda Gates Foundation and the iCampus program at Microsoft Research. "Mike's project seemed like an interesting, original take on how we might be able to use technology for education," said Salesin. He helped convince the iCampus funding committee to support the project.


Original post: https://www.hanselman.com/blog/new-digital-photo-techniques-from-microsoft-research-integrated-into-microsoft-digital-image-suite-9






