最近弄了一個工具,希望能獲取自己百度網盤里面的數據但又不想公開數據,于是想到了模擬登陸百度,用常規的模擬登陸測試了下發現不行,抓取登陸時的數據才發現,其實百度登陸過程中跳轉了幾次頁面,如果僅僅對http://passport.baidu.com/v2/api/?login一個頁面獲取cookie是不完整的那樣就只有BAIDUID的值,而僅僅這個cookie值是沒有多少作用的.
通過對抓包數據的分析,實際登陸過程中是先請求了一次http://passport.baidu.com/v2/api/?getapi&class=login&tpl=mn&tangram=true這個頁面,服務器同時給瀏覽器設置兩個cookie,一個BAIDUID的cookie值,這個應該是與seesion id相關的;另一個是
Set-Cookie:
HOSUPPORT=1; expires=Thu,19-Aug-2021 15:41:37 GMT; path=/; domain=passport.baidu.com; httponly
推測這個應該是百度檢測瀏覽器是否支持cookie;
再次請求該頁面,獲取網頁數據會得到一個token值用于登陸;
然后登陸成功會得到BDUSS等相關的cookie值,以上才是登陸成功,記錄下上面的cookie即可!
下面是簡單的請求及登陸函數集合,作為基礎類吧,可能簡單了點,以后再完善吧!
代碼如下:
- <?php
- /**
- * 百度基礎類
- * @author qaulau@hotmail.com
- * @file baidu.php
- * @date 2013-6-2 www.49028c.com
- */
- class baidu{
- private $cookie = '';
- private $username = '';
- private $password = '';
- const COOKIE_DIR = 'temp'; //cookie存放目錄
- const COOKIE_VALIDATE = 604800; //cookie有效期,默認為7天
- const SECRET_KEY = 'hAFS6as8askNBVSuiealkkw'; //密鑰用于加密cookie文件名,防止保存的cookie路徑被猜測
- private function http_request($url, $post_data, $referef,$header = true){
- $ch = curl_init();
- curl_setopt($ch, CURLOPT_URL, $url);
- if ($post_data != ""){
- curl_setopt($ch, CURLOPT_POST, 1);
- curl_setopt($ch, CURLOPT_POSTFIELDS, $post_data);
- }
- if ($referef != ""){
- curl_setopt($ch, CURLOPT_REFERER, $referef);
- }
- curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
- curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, 0);
- curl_setopt($ch, CURLOPT_HEADER, $header);
- curl_setopt($ch, CURLOPT_USERAGENT, "Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.31 (KHTML, like Gecko) Chrome/26.0.1410.64 Safari/537.31");
- if ($this->cookie != ""){
- curl_setopt($ch, CURLOPT_COOKIE, $this->cookie);
- }
- $data = curl_exec($ch);
- curl_close($ch);
- if ($header){
- preg_match_all('/Set-Cookie:((.+)=(.+))$/m ', $data, $cookies);
- if(is_array($cookies) && count($cookies) > 1 && count($cookies[1]) > 0){
- foreach($cookies[1] as $i => $k){
- $cookieinfos = explode(";", $k);
- if(is_array($cookieinfos) && count($cookieinfos) > 1){
- $this->cookie .= $cookieinfos[0];
- $this->cookie .= "; ";
- }
- }
- }
- }
- return $data;
- }
- private function login(){
- //生成一個cookie
- $ret = $this->http_request("https://passport.baidu.com/v2/api/?getapi&class=login&tpl=mn&tangram=true", "", "");
- //獲取token并保存cookie
- $ret = $this->http_request("https://passport.baidu.com/v2/api/?getapi&class=login&tpl=mn&tangram=true", "", "");
- preg_match_all('/login_token='(.+)'/', $ret, $tokens);
- $login_token = $tokens[1][0];
- //登陸并保存cookie
- $post_data = array();
- $post_data['username'] = $this->username;
- $post_data['password'] = $this->password;
- $post_data['token'] = $login_token;
- $post_data['charset'] = "UTF-8";
- $post_data['callback'] = "parent.bd12Pass.api.login._postCallback";
- $post_data['index'] = "0";
- $post_data['isPhone'] = "false";
- $post_data['mem_pass'] = "on";
- $post_data['loginType'] = "1";
- $post_data['safeflg'] = "0";
- $post_data['staticpage'] = "https://passport.baidu.com/v2Jump.html";
- $post_data['tpl'] = "mn";
- $post_data['u'] = "http://www.baidu.com/";
- $post_data['verifycode'] = "";
- $ret = $this->http_request("http://passport.baidu.com/v2/api/?login", $post_data, "https://passport.baidu.com/v2/?login&tpl=mn&u=http%3A%2F%2Fwww.baidu.com%2F");
- //記錄下所有cookie
- $this->writeCookie();
- }
- private function writeCookie(){
- if(!file_exists(self::COOKIE_DIR)){
- @mkdir(self::COOKIE_DIR) && touch(self::COOKIE_DIR.'/index.html');
- }
- $filename = self::COOKIE_DIR.'/'.md5($this->username.self::SECRET_KEY);
- file_put_contents($filename, $this->cookie);
- }
- public function baidu($username,$password){
- $this->username = $username;
- $this->password = $password;
- $filename = self::COOKIE_DIR.'/'.md5($this->username.self::SECRET_KEY);
- if ((@filemtime($filename)+ self::COOKIE_VALIDATE > time()) && ($cookie = file_get_contents($filename))!= ''){
- //如果cookie在有效期內且不為空
- $this->cookie = $cookie;
- }else {
- $this->login();
- }
- }
- /** www.49028c.com
- * 請求頁面
- * @param string $url :頁面地址
- * @param string $referef :引用頁面
- * @param string $post_data :post數據,如果填寫則為post方式否則為get方式
- * 返回頁面數據
- */
- public function request($url,$referef = '',$post_data = ''){
- return $this->http_request($url,$referef,$post_data,false);
- }
- }
這個只是基本的類,只涉及登陸及請求與提交數據,可在此基礎上使用,例如請求百度云網盤,代碼如下:
- $baidu = new baidu('用戶名','密碼');
- $data = $baidu->request('http://pan.baidu.com/api/list?channel=chunlei&clienttype=0&web=1&num=100&page=1&dir=%2F','http://pan.baidu.com');
- echo $data;
還可以用來作為貼吧的發帖或百度空間更新工具.
新聞熱點
疑難解答