Boost.Regex

Contents

Introduction Types Regular Expression Creation Overloaded Algorithms
regex_match regex_search regex_replace
Iterators
regex_iterator creation helper regex_token_iterator creation helpers

Introduction

The header <boost/regex/mfc.hpp> provides Boost.Regex support for MFC string types: note that this support requires Visual Studio .NET (Visual C++ 7) or later, where all of the MFC and ATL string types are based around the CSimpleStringT class template. 

In the following documentation, whenever you see CSimpleStringT<charT>, then you can substitute any of the following MFC/ATL types (all of which inherit from CSimpleStringT):

CString
CStringA
CStringW
CAtlString
CAtlStringA
CAtlStringW
CStringT<charT,traits>
CFixedStringT<charT,N>
CSimpleStringT<charT>

Types

The following typedefs are provided for the convenience of those working with TCHAR's:

typedef basic_regex<TCHAR> tregex; 
typedef match_results<TCHAR const*> tmatch; 
typedef regex_iterator<TCHAR const*> tregex_iterator; 
typedef regex_token_iterator<TCHAR const*> tregex_token_iterator; 
 

If you are working with explicitly narrow or wide characters rather than TCHAR, then use the regular Boost.Regex types instead.

Regular Expression Creation

The following helper function is available to assist in the creation of a regular expression from an MFC/ATL string type:

template <class charT>
basic_regex<charT> 
   make_regex(const ATL::CSimpleStringT<charT>& s, 
              ::boost::regex_constants::syntax_option_type f = boost::regex_constants::normal);
 

Effects: returns basic_regex<charT>(s.GetString(), s.GetString() + s.GetLength(), f);

 

Overloaded Algorithms

For each regular expression algorithm that's overloaded for a std::basic_string argument, there is also one overloaded for the MFC/ATL string types.  These algorithm signatures all look a lot more complex than they actually are, but for completeness here they are anyway:

regex_match

There are two overloads, the first reports what matched in a match_results structure, the second does not. 

All the usual caveats for regex_match apply, in particular the algorithm will only report a successful match if all of the input text matches the expression, if this isn't what you want then use regex_search instead.

template <class charT, class T, class A>
bool regex_match(
   const ATL::CSimpleStringT<charT>& s, 
   match_results<const B*, A>& what, 
   const basic_regex<charT, T>& e, 
   boost::regex_constants::match_flag_type f = boost::regex_constants::match_default);
 

 

Effects: returns ::boost::regex_match(s.GetString(), s.GetString() + s.GetLength(), what, e, f);

Example:

//
// Extract filename part of a path from a CString and return the result
// as another CString:
//
CString get_filename(const CString& path)
{
   boost::tregex r(__T("(?://A|.*)([^]+)"));
   boost::tmatch what;
   if(boost::regex_match(path, what, r))
   {
      // extract $1 as a CString:
      return CString(what[1].first, what.length(1));
   }
   else
   {
      throw std::runtime_error("Invalid pathname");
 
 
}
}
      

template <class charT, class T>
bool regex_match(
   const ATL::CSimpleStringT<charT>& s,
   const basic_regex<B, T>& e,
   boost::regex_constants::match_flag_type f = boost::regex_constants::match_default)

 

Effects: returns ::boost::regex_match(s.GetString(), s.GetString() + s.GetLength(), e, f);

Example:

//
// Find out if *password* meets our password requirements,
// as defined by the regular expression *requirements*.
//
bool is_valid_password(const CString& password, const CString& requirements)
{
   return boost::regex_match(password, boost::make_regex(requirements));
}      

regex_search

There are two additional overloads for regex_search, the first reports what matched the second does not:

template <class charT, class A, class T>
bool regex_search(const ATL::CSimpleStringT<charT>& s,
                  match_results<const charT*, A>& what,
                  const basic_regex<charT, T>& e,
                  boost::regex_constants::match_flag_type f = boost::regex_constants::match_default)

Effects: returns ::boost::regex_search(s.GetString(), s.GetString() + s.GetLength(), what, e, f);

Example:: Postcode extraction from an address string.

CString extract_postcode(const CString& address)
{
   // searches throw address for a UK postcode and returns the result,
   // the expression used is by Phil A. on www.regxlib.com:
   boost::tregex r(__T("^(([A-Z]{1,2}[0-9]{1,2})|([A-Z]{1,2}[0-9][A-Z]))//s?([0-9][A-Z]{2})___FCKpd___7quot;));
   boost::tmatch what;
   if(boost::regex_search(address, what, r))
   {
      // extract $0 as a CString:
	return CString(what[0].first, what.length());
   }
   else
   {
      throw std::runtime_error("No postcode found");
   }
}      

template <class charT, class T>
inline bool regex_search(const ATL::CSimpleStringT<charT>& s,
                 const basic_regex<charT, T>& e,
                 boost::regex_constants::match_flag_type f = boost::regex_constants::match_default)

Effects: returns ::boost::regex_search(s.GetString(), s.GetString() + s.GetLength(), e, f);


regex_replace

There are two additional overloads for regex_replace, the first sends output to an output iterator, while the second creates a new string

template <class OutputIterator, class BidirectionalIterator, class traits, class
          charT>
OutputIterator regex_replace(OutputIterator out,
                           BidirectionalIterator first,
                           BidirectionalIterator last,
                           const basic_regex<charT, traits>& e,
                           const ATL::CSimpleStringT<charT>& fmt,
                           match_flag_type flags = match_default)
 

Effects: returns ::boost::regex_replace(out, first, last, e, fmt.GetString(), flags);

template <class traits, charT>
ATL::CSimpleStringT<charT> regex_replace(const ATL::CSimpleStringT<charT>& s,
                            const basic_regex<charT, traits>& e,
                            const ATL::CSimpleStringT<charT>& fmt,
 
			match_flag_type flags = match_default)

Effects: returns a new string created using regex_replace, and the same memory manager as string s.

Example:

//
// Take a credit card number as a string of digits, 
// and reformat it as a human readable string with "-"
// separating each group of four digits:
//
const boost::tregex e(__T("//A(//d{3,4})[- ]?(//d{4})[- ]?(//d{4})[- ]?(//d{4})//z"));
const CString human_format = __T("$1-$2-$3-$4");

CString human_readable_card_number(const CString& s)
 
{
   return boost::regex_replace(s, e, human_format);
}
      

Iterators

The following helper functions are provided to ease the conversion from an MFC/ATL string to a regex_iterator or regex_token_iterator:

regex_iterator creation helper
template <class charT>
regex_iterator<charT const*> 
   make_regex_iterator(
      const ATL::CSimpleStringT<charT>& s, 
      const basic_regex<charT>& e, 
      ::boost::regex_constants::match_flag_type f = boost::regex_constants::match_default);

Effects:returns regex_iterator(s.GetString(), s.GetString() + s.GetLength(), e, f);

Example:

void enumerate_links(const CString& html)
{
   // enumerate and print all the  links in some HTML text,
   // the expression used is by Andew Lee on www.regxlib.com:
   boost::tregex r(__T("href=[/"/']((http://|//.///|///)?//w+(//.//w+)*(/w+(//.//w+)?)*(///|//?//w*=//w*(&//w*=//w*)*)?)[/"/']"));
   boost::tregex_iterator i(boost::make_regex_iterator(html, r)), j;
   while(i != j)
   {
      	std::cout << (*i)[1] << std::endl;
 
	++i;
   }
}
      

regex_token_iterator creation helpers
template <class charT> 
regex_token_iterator<charT const*> 
   make_regex_token_iterator(
      const ATL::CSimpleStringT<charT>& s, 
      const basic_regex<charT>& e, 
      int sub = 0, 
      ::boost::regex_constants::match_flag_type f = boost::regex_constants::match_default);

Effects:returns regex_token_iterator(s.GetString(), s.GetString() + s.GetLength(), e, sub, f);

template <class charT> 
regex_token_iterator<charT const*> 
   make_regex_token_iterator(
      const ATL::CSimpleStringT<charT>& s, 
      const basic_regex<charT>& e, 
      const std::vector<int>& subs, 
      ::boost::regex_constants::match_flag_type f = boost::regex_constants::match_default);

Effects:returns regex_token_iterator(s.GetString(), s.GetString() + s.GetLength(), e, subs, f);

template <class charT, std::size_t N> 
regex_token_iterator<charT const*> 
   make_regex_token_iterator(
      const ATL::CSimpleStringT<charT>& s, 
      const basic_regex<charT>& e, 
      const int (& subs)[N], 
      ::boost::regex_constants::match_flag_type f = boost::regex_constants::match_default);

Effects: returns regex_token_iterator(s.GetString(), s.GetString() + s.GetLength(), e, subs, f);

Example:

void enumerate_links2(const CString& html)
{
   // enumerate and print all the  links in some HTML text,
   // the expression used is by Andew Lee on www.regxlib.com:
   boost::tregex r(__T("href=[/"/']((http://|//.///|///)?//w+(//.//w+)*(/w+(//.//w+)?)*(///|//?//w*=//w*(&//w*=//w*)*)?)[/"/']"));
   boost::tregex_token_iterator i(boost::make_regex_token_iterator(html, r, 1)), j;
   while(i != j)
   {
      std::cout << *i << std::endl;
      ++i;
   }
}      

 

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
提供的源码资源涵盖了Java应用等多个领域,每个领域都包含了丰富的实例和项目。这些源码都是基于各自平台的最新技术和标准编写,确保了在对应环境下能够无缝运行。同时,源码中配备了详细的注释和文档,帮助用户快速理解代码结构和实现逻辑。 适用人群: 适合毕业设计、课程设计作业。这些源码资源特别适合大学生群体。无论你是计算机相关专业的学生,还是对其他领域编程感兴趣的学生,这些资源都能为你提供宝贵的学习和实践机会。通过学习和运行这些源码,你可以掌握各平台开发的基础知识,提升编程能力和项目实战经验。 使用场景及目标: 在学习阶段,你可以利用这些源码资源进行课程实践、课外项目或毕业设计。通过分析和运行源码,你将深入了解各平台开发的技术细节和最佳实践,逐步培养起自己的项目开发和问题解决能力。此外,在求职或创业过程中,具备跨平台开发能力的大学生将更具竞争力。 其他说明: 为了确保源码资源的可运行性和易用性,特别注意了以下几点:首先,每份源码都提供了详细的运行环境和依赖说明,确保用户能够轻松搭建起开发环境;其次,源码中的注释和文档都非常完善,方便用户快速上手和理解代码;最后,我会定期更新这些源码资源,以适应各平台技术的最新发展和市场需求。 所有源码均经过严格测试,可以直接运行,可以放心下载使用。有任何使用问题欢迎随时与博主沟通,第一时间进行解答!

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值