最近有人跟我说有没有可以批量把文件名翻译成中文的,我就尝试写了一下。本想调用读取文件名后调用百度翻译进行翻译,但是我用fiddler 修改翻译的字符串重新请求都不行,里面生成的各种参数较为复杂,所以懒人的我选择了百度翻译中间的过渡翻译(就是按一个字母就能立即触发事件给你跳出一系列你可能需要翻译的词汇,算是即时翻译吧)。我本意是用百度翻译把其他语言都转成汉语,但是调用这个过渡的翻译,它只支持英文转汉语,但是怎么说也是写了一点,故留点代码供以后参考。
其中用到了php 的 curl ,以及php对于文件的操作 和 正则匹配
具体代码如下:
<?php
//修改test目录下的文件名
//目录是相对该文件的目录
change('test');
/**
* 转换文件名成中文(暂时只支持英文转中文)
* @param $dir string 相对于当前目录的文件夹(必选参数)
*/
function change($dir = '')
{
if(is_dir($dir)) //判断是否为一个目录
{
if($handler = opendir($dir)) //读取所有文件夹下的文件名
{
while( ($filename = readdir($handler)) !== false )
{
//过滤 . 和 .. 两个文件夹
if($filename == "." || $filename == "..") continue;
$name = substr($filename, 0, strrpos($filename,'.')); //获取文件名
$suffix = substr($filename, strrpos($filename,'.')); //获取文件的后缀
$new_name = bdTranslation($name); //获取翻译后的新名字
if(empty($new_name)) continue;
$new_filename = $dir.'/'.$new_name.$suffix; //新文件名的路径
$old_filename = $dir.'/'.$filename; //旧文件名的路径
if(file_exists($new_filename) || !file_exists($old_filename)){
//判断新文件是否存在或者旧文件已经不存在导致的无法修改
echo "{$new_filename} already exists or {$old_filename} does not exist";
echo "</br>";
}else{
//重命名文件
@rename(iconv('UTF-8','GBK',$old_filename), iconv('UTF-8','GBK',$new_filename));
}
}
//释放资源
closedir($handler);
echo "success";
}else{
echo "{$dir} is open failed";
}
}else{
echo "{$dir} is not a dir";
}
}
/**
* 调用百度翻译(抓取的地址只是中间过度的地址,翻译的并非完全正确)
* @param $name
* @return mixed
*/
function bdTranslation($name)
{
$url = "https://fanyi.baidu.com/sug";
$cfg['ssl'] = true;
$cfg['post'] = [
'kw' => $name
];
$result = curlOpen($url,$cfg);
$result = json_decode($result,true);
if(empty($result['data'][0]['v'])) return '';
//优先匹配名名词
preg_match("/n\. ([\x{4e00}-\x{9fa5}]+)/u",$result['data'][0]['v'],$match);
if($match){
return $match[1];
}
else{
//名词不存在则匹配第一个解释
preg_match("/([\x{4e00}-\x{9fa5}]+)/u",$result['data'][0]['v'],$match);
if($match) return $match[0];
else return '';
}
}
/**
* 封装好的curl
* @param $url
* @param array $config
* @return mixed
*/
function curlOpen($url, $config = array())
{
$arr = array('post' => false,'referer' => $url,'cookie' => '', 'useragent' => 'Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.0; Trident/4.0; SLCC1; .NET CLR 2.0.50727; .NET CLR 3.0.04506; customie8)', 'connectout'=>3, 'timeout' => 10, 'return' => true, 'proxy' => '', 'userpwd' => '', 'nobody' => false,'header'=>array(),'gzip'=>true,'ssl'=>false,'isupfile'=>false,'returnheader'=>false);
$arr = array_merge($arr, $config);
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, $arr['return']);
curl_setopt($ch, CURLOPT_NOBODY, $arr['nobody']);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 1);
curl_setopt($ch, CURLOPT_USERAGENT, $arr['useragent']);
curl_setopt($ch, CURLOPT_REFERER, $arr['referer']);
curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, $arr['connectout']);
curl_setopt($ch, CURLOPT_TIMEOUT, $arr['timeout']);
curl_setopt($ch, CURLOPT_HEADER, $arr['returnheader']);//��ȡheader
if($arr['gzip']) curl_setopt($ch, CURLOPT_ENCODING, 'gzip,deflate');
if($arr['ssl'])
{
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);
curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, false);
}
if(!empty($arr['cookie']))
{
if(substr($arr['cookie'], -4) == '.txt'){
curl_setopt($ch, CURLOPT_COOKIEJAR, $arr['cookie']);
curl_setopt($ch, CURLOPT_COOKIEFILE, $arr['cookie']);
}else{
curl_setopt($ch, CURLOPT_COOKIE, $arr['cookie']);
}
}
if(!empty($arr['proxy']))
{
//curl_setopt($ch, CURLOPT_PROXYTYPE, CURLPROXY_HTTP);
curl_setopt ($ch, CURLOPT_PROXY, $arr['proxy']);
if(!empty($arr['userpwd']))
{
curl_setopt($ch,CURLOPT_PROXYUSERPWD,$arr['userpwd']);
}
}
if(!empty($arr['header']['ip']))
{
array_push($arr['header'],'X-FORWARDED-FOR:'.$arr['header']['ip'],'CLIENT-IP:'.$arr['header']['ip']);
unset($arr['header']['ip']);
}
$arr['header'] = array_filter($arr['header']);
if(!empty($arr['header']))
{
curl_setopt($ch, CURLOPT_HTTPHEADER, $arr['header']);
}
if ($arr['post'] != false)
{
curl_setopt($ch, CURLOPT_POST, true);
if(is_array($arr['post']) && $arr['isupfile'] === false)
{
$post = http_build_query($arr['post']);
}
else
{
$post = $arr['post'];
}
curl_setopt($ch, CURLOPT_POSTFIELDS, $post);
}
$result = curl_exec($ch);
//var_dump(curl_getinfo($ch));
curl_close($ch);
return $result;
}
若有什么错误的地方或者需要改进的地方,欢迎大家提出