I want to mirror a site with wget command, and I ran:
wget --debug -Sv --limit-rate=100k --user-agent='Ineternet-Explorer' --exclude-domains="bbshema.cga.com.cn,game.cga.com.cn" --exclude-directories=/Soft/,/Photo/,/GuestBook/,/guestbook/,/User/,/user/ -R "SendMail.asp*,Print.asp\?ArticleID*,GetHits.asp\? ArticleID*,Comment.asp\?Action*,User*.asp*,Vote.as p*,Show*.asp\?page=*," -m -l0
http://ro.cga.com.cn
Quote:
Deciding whether to enqueue "http://ro.cga.com.cn/images/xj_49.gif".
http://ro.cga.com.cn/images/xj_49.gif (xj_49.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/t1.gif".
http://ro.cga.com.cn/images/t1.gif (t1.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/t2.gif".
http://ro.cga.com.cn/images/t2.gif (t2.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/Article/Class14/Class21/Index.html".
http://ro.cga.com.cn/Article/Class14/Class21/Index.html (Index.html) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/ziliaologo.jpg".
http://ro.cga.com.cn/images/ziliaologo.jpg (ziliaologo.jpg) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/t3.gif".
http://ro.cga.com.cn/images/t3.gif (t3.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/w1.gif".
http://ro.cga.com.cn/images/w1.gif (w1.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/more.gif".
http://ro.cga.com.cn/images/more.gif (more.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/t1.gif".
http://ro.cga.com.cn/images/t1.gif (t1.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/images/t2.gif".
http://ro.cga.com.cn/images/t2.gif (t2.gif) does not match acc/rej rules.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/".
Already on the black list.
Decided NOT to load it.
Deciding whether to enqueue "http://ro.cga.com.cn/".
Already on the black list.
|
whether I had been added black list???
when , I ran
wget -m www.cga.com.cn
it worked properly, but I got too many pages that I didn't want to download.
what's mean content above???
who can help me??
thx for any reply.