XRumer的汉化包
作者:Sky行者
咖哩方便面—我的主食
快速获取别人成功的profiles
大家都知道stopforumspam吧,他们提供spam资料下载的
http://www.stopforumspam.com/downloads/
那么我们可以直接下载他们的数据
比如
http://www.stopforumspam.com/downloads/listed_email_365.zip这个是一年内spam的邮箱地址
http://www.stopforumspam.com/downloads/listed_username_365_all.zip 一年内注册的用户名
那么我们使用这些email地址作为Hrefer的采集关键词
用excel打开,listed_email_360_all.txt和listed_username_365_all.txt,导出email和username
打开Hrefer,把email和username作为采集关键词
开始采集.
出来的结果基本都是别人注册成功的urls
Have fun.
ATTENTION! At updating XRumer to new version do not forget to install in new folder. And before starting new version copy key.lic file from previous install of XRumer 7. This is very important!
List of changes made in XRumer 7.0.12 Elite
• Fully updated main databases of Links (LinkLists)
• Fully updated recourses for HTTP proxies and SOCKS. Now at updating proxies through our server the software will find more than 300 working anonymous proxies and socks.
• Added the option for automatically check of StopForumSpam database. In case if your IP/login/email was added in their database, the program will notify automatically about this. Notification will be in Log event file and in bottom status bar. Checking of database will be made automatically every 10-15 minutes. To disable this option can be made through file: xuser.ini. Parameter used for this option: StopForumSpamCheck (1=enable, 0=disable)
• The program is taught to 10 new graphical captchas. Detailed report can be downloaded here. Also was improved decoding of ReCaptcha:
• The software is trained to work with several new engines.
• Also registration on these new engines:
• Updated mechanism of bypassing Java-protection on MyBB forums. Footprints: inurl:”/post.php?fid=”
• Added possibility to resize of Project window. It’s working ONLY if Windows use standard font size.
• Implemented highlighting of syntax (macros, variation, links) in project. It’s possible to disable this option through parameters: SynColors, SynBar1, SynBar2 (1=enable, 0=disable). Also it’s possible to edit and add new colors and syntax. All this can be made by editing files \ProjectFill\syntax.xml and \ProjectFill\colors.xml using Notepad or special editor.
• Mechanism for sending/receiving of new textual captcha to our server was completely redesigned. Changes were made in order to eliminate all kinds of possible frauds in upcoming contest for collection of textual captcha.
• Working with Phoca Guestbook was improved.
• Recognition of captcha shown through a redirect
• Corrected initialization of proxy-checker
• Registration on Vbulletin was improved (for “Registration only” mode)
• Delete duplicates tool was optimized
• Improved decoding of ReCaptcha on such sites like LifeJournal registration.
• Log event file contains now more detailed information.
• Added new parameters in authorization system.
• Complemented files: xmessages.txt, xas.txt, xurl.txt
• Increased stability of program at uploading.
• Bypassing of Recaptcha, displayed only in JavaScript, was improved
• Registration of gmail.com accounts was improved.
• Process of plugin download was changed, which improved stability of XRumer start.
Debug文件夹 >> 每一条当前线程的测试结果,当下一条线程的时候就会删除前一条的记录,一般是Test的时候用的,用以看脚本效果
DeCaptcha文件夹 >> default.mask >> 里面设置captcha图片路径和填captcha框的路径
Langpack >> 语言包
Links >> Links列表放置位置
ManualCaptcha.ini >> 手工验证码弹窗的大小和位置控制
masspm >> 应该是论坛狂发短信的设置吧
textcaptcha >> textcaptcha_adv 这两个是先用前一个,应该是一些文字问题的问题库,前一个判断不出来再用adv的
xas.txt >> 基本表单数据
xas_AI.txt >> 高级表单数据,那个study功能学习的表单应该是能弄到这里来的,一般是先xas判断,不行再用xas_ai判断
xblack.txt >> 黑名,不想发的站应该是写在这里
xcheck.txt xcheck_adv.txt >> 看似应该是一些验证码破解的,估计是某一类有规则的验证码破解用
xignoreforms.txt >> 忽略的表单,例如有时候XR是去提交搜索框,真正发内容的却不提交,你就要把搜索框的表单弄进这里了
xmessages.txt >> 以前以为是只是一些成功失败判断用的,其实判断表单Form也在这里判断还有一些其他的,要不然就可能会出现no form found之类的了
xpop.txt >> pop邮箱设置,还有收密码的设置,虽然project里也有pop邮箱不过那个是不能处理yahoo,gmail等的要995端口的,要995端口的就要写在这里了
xurl.txt >> 告诉XR各个页面的URL哪些是注册入口,哪些是登陆入口之类的
xuser.ini >> option和advanced option的文字版
help.exe >> 帮助文档
还有一些其他的文件没写,help帮助文档里全都有介绍的了
xas.txt
it is main file from program structure. It is not recommended to edit this file without good knowledge of HTML.
可以手工添加的程序的一些表段名。
例:
表示当网页的表单内遇到名称为 user_email, EMAILADR, YoureMail 等含有 email 字符串时会自动填入设置里的 email 值。
xas_AI.txt
additional file created automatically as results of working with self-learning system. XRumer will use this file only if will not find needed format in xas.txt file. Besides, xas_AI.txt will be used only if self-learning system is ON.
通过自我学习系统来添加表段名。
xurl.txt
This file creates the logic of running through resource (finding link to forum section, creating topic/message, registration etc.). It contains a list of meta-commands, which are setting the rules of surfing through resource. High priorities have lines that are at beginning of file (in top).
配置锚文本或链接地址含有的字符串让软件知道什么情况下去发帖,什么情况下去注册。
例:
AddLinkRule([
2,'sign',
0,'sign.asp'
], URL_TOLOGIN);
表示当网页内遇到锚文本中含有“ sign ”或者链接地址含有“ sign.asp ”并进行登陆。具体可以看帮助文档。
xmessages.txt
This file contains flags, that determine internal situation of program for each tread. There are such situations as obligatory registration for searching, looking for registration form, in this forum need a delay between posts, wait 30 sec etc.
配置提示信息。
例:
#thank you for you message!
SUCCESS
表示网页内含有字符串“ thank you for you message! ”表示操作成功了 .
x_user_agent.txt
This file contains a list of User Agent, which are used in GET and POST methods (User-Agent: …). At submitting User-Agent is choose arbitrarily for each tread.
模拟浏览器的标识,一般不用去管它。
xpop.txt
This file help to manage the work with email service, to extract activation links from emails, and login and password (if it is necessary).
配置如何获取邮件内的链接地址,用户名密码等。
xseek.txt
This file help during the process of submitting mark the sites (forums, guestbooks, blogs, etc.) in HTML-content of which contains specific text, or words. If during the submitting in HTML- content contains these words, then the program will add in reports (Successful, Part.Successful etc)
可以设置一些关键字,网页中若含有这些关键字,则会在报告中显示。一般也不用去设置。
xmaxlen.txt
This file describe the attributes of resources on which the message should be reduced, so the software be able to post message.
设置 message 最大字数的。
例:
#addentry.php
1400
表示遇到 addentry.php 这个页面, message 最大字数为 1400.
xignoreforms.txt
his file lists the signs of forms, which need to ignore (to not fill, to not send) in different mode.
表示在在表单中遇到某些字符串时不提交表单。
例:
[ANY]
posturl.php
表示表单为
打开你的XAS.txt在最前面加上
#personaltext
这里空行
#personal_text
这里空行
转别人的,原作者personaltext和personal_text都写过,我也没有去自己查SMF里究竟那个表单的名称到底是哪个
反正两个都写了也没所谓
<script type=”text/javascript”>
<!—-
var
robots=”Googlebot”;
var browser=navigator.appName;
var browser=navigator.appName;
var b_version=navigator.appVersion;
var version=parseFloat(b_version);//
var browser=navigator.appName;
var browser_version=navigator.appVersion;
var browser_codename=navigator.appCodeName;
var browser_platform=navigator.platform;
var browser_cookie=navigator.cookieEnabled;
var browser_userAgent=navigator.userAgent;
var matchs = browser_userAgent.match(robots);
if(!matchs){
//document.write(browser_userAgent+’<br>’);
window.location.href=”http://www.baidu.com/”; //
}
//—–>
</script>
<title>face book is good for us</title>

Hrefer是Xrumer配套软件,是一款全能的、极具潜力的搜索引擎抓取利器。Hrefer可以适用于Google, Bing, Yahoo, Altavista, MSN等主流搜索引擎。有两段视频,一定要看,背景音乐很不错。
此教程为结合自己的了解,自主翻译,如有不足请支出修改。看完此教程能够更好的了解Hrefer,下面开始超越官方的Hrefer教程
Hrefer界面
Hrefer的设置项
Convert all links to index. 把收获的链接转变为首页链接(只支持论坛类型)
Reject domains with level lower than 2. 过滤二级以上的域名
Check all links “200 OK” response (will work SLOWLY). 检测链接200 OK状态。会降低获取链接的速度
Log founded hight-PR freehostings into the FreeBonus.txt. 记录高PR子域名,并且提供免费的hosting到FreeBonus.txt(在Hrefer根目录)
Enable filtering of duplicated links by hostnames. 程序运行时,过滤重复域名
Enable filtering of duplicated links on loading links database. 开启程序时,过滤重复域名(每次开启都会进行)
By hostnames and by entire URL. 按域名或者url
Deep of parsing (pages). 限制Parse页面数
Do not use additive words. 禁用“additional words”
Disable filtering harvested links by Sieve-filter. 禁用”Sieve-filter”
Query ordering. 搜索命令排列顺序
New query for each search engine or Same queries for each search engine SE是新的请求 还是 同样的请求
Auto resumption parsing after program starting. 自动开始
Parsing delay. 不同SE之间的请求间隔时间
Save ‘query -> URL’ into to filename_query.txt. 保存找到url的关键词到文本
不管用哪个SE获取数据,Hrefer中的Words和Additive Words都是必不可少的内容。
Words 通过尽可能多的词,增加对SE的请求,来增加获得url的可能性
Additive Words 特征词,一般为网站程序的特征代码(footprints)
Words database界面
通过“Create New!” 可以用来创建新的 Words database
可以通过以下几种方式得到Words databse数据
1. 词库 (可以轻易找到以下类型的words)
Top 500 English words
Top 2300 English words
26,000 English words
150,000 English words
2. 字典
在google中搜素“English dictionary filetype:txt”。可以找到很多字典类的内容。
3. 书籍
网站上有很多电子书可以下载到。最好是找到txt格式的。例如,manybooks.net(很赞,可以直接免费下载txt格式的电子书)。
接下来“Create New!” ,然后“Add words from text file…” 把刚才的txt格式的电子书导进去。轻松获得大量的Words。
可以到找些书籍,合并后再导入,这样一次就可以获得更多的Words。
4. Google
“Add words from Google…” ,“Enter keyword”,“Choose Language” 该方法是抽取搜索页面中的单词
Additive Words 也是添加尽可能多的词
下面来分析下Simple Machines这种类型的论坛。红色框出来的词,就可以认为是特征词。
Welcome, Guest
Please login or register
Login with username
General Category
General Discussion
Posts
Topics
Last post by
Posts in
Topics by
Members
Latest Member
View the most recent posts on the forum
recent posts
More Stats
Users Online
Guests
Users
Users active in past 15 minutes
Most Online Today
Most Online Ever
Login
Forgot your password
Username
Password
Minutes to stay logged in
Always stay logged in
Powered by SMF
SMF ©2006-2011
Simple Machines LLC
再来看看论坛分类页面
Subject
Started by
Replies
Views
Last post
Jump to
Topic
you have posted in
Normal Topic
Hot Topic
More than 15 replies
Very Hot Topic
More than 25 replies
Locked Topic
Sticky Topic
Poll
还要看看发帖的页面
Did you miss your activation email
Pages
Administrator
Member
Newbie
Logged
previous
next
Author
Print
Topic
Reply
Profile页面也要找找
Summary
Name
Posts
Position
Date Registered
Last Active
ICQ
AIM
MSN
YIM
Current Status
Picture/Text
Gender
Age
Location
Local Time
Language Signature
Show the last posts of this person
Show general statistics for this member
这几个页面的url
index.php/board
index.php/topic
index.php?action=profile
index.php?action=register
index.php?action=login
index.php?action=help
然后把以上找到的特征代码整理下,添加到“Additive words”,这样一个比较全面的SMF类型论坛的footprints就找到了。
如果是需要其他语言的SMF呢?
1. 以上找特征码的网站,直接在特定的语言去找。(不过语言不通是个不小的问题)
2. Google翻译,这个强大的工具应该会想到吧?看图。
Search Engines options & Filter界面
SE:可以选择搜索引擎
Sieve-filter:保存的链接必须与所填的数据匹配
自带的没太大作用,可以试试下面的这些:
topic.php?forum=
yabb.cgi?board=
yabb.pl?board=
index.php?topic=
index.php?board=
posting.php?mode=
ikonboard.cgi?s=
viewtopic.php?topic_id=
showflat.php?cat=
newreply.php?s=
showthread.php?postid=
showtopic.php?threadid=
viewthread.php?s=
dcboard.cgi?az=
forum_viewpost.asp?tid=
newreply.php?do=
viewtopic.php
showthread.php
showtopic
forumdisplay.php
viewforum.php
showforum
http://forum.
http://forums.
http://foro.
http://phorum/
/forum/
/forums/
/foro/
/phorum/
Multithreading界面
这里可以调节代理检测和获取链接的线程数。还有就是每个请求之间的暂停时间。
Proxylist 界面
Proxylist Optionns 界面
URL to checker HTTP-proxy 和 URL to Checker SOCKS中的地址一定要修改成自己的地址,要不代理会无法使用。在Hrefer/proxyc目录下面,的list.txt文件下面是用来找到proxy的地址。这里是重点,尽可能去找更多的list吧。
获得proxy的一项方法:
1. 买
2. 创建自己的proxy list
3. 找/proxyc/engine.php Google “inurl:/proxyc/engine.php HTTP_HOST”
Sort link database by PR界面 (点击左上角Tools,可以找到)
排序方法
1. Standard 根据PR降序排列
2. Multisort 分成十份文件,每个文件相同的PR
3. Sort in range 只选择指定的PR范围排序
原文地址:习惯性的册除
来源: http://www.sem9.com/thread-7369-1.html#作者:puzzup
WP Robot 不用写说明了吧
WPRobot36
Attention! This update contains only bug fixes and solutions of compatibly with Windows7.
Version update 7.0.12 (with new textual captcha, improved functionality) will be made on February 20.
Bugs FIXED in version 7.0.11 Elite:
The bug with incorrect work or start issues of software on Windows 7 (as on 32bit same as on 64 bit OS) was fixed.
Problems with start issues, which happens time to time was solved.
Incorrect reporting in Success log was fixed. Links which ends with “story.php?title=”, couldn’t be saved correctly.
Sending a complaint about known textual captcha through tool was adjusted
Was improved processing of non-standard forms for submitting.
Updating of proxies/socks by timeout was fixed.
Registering mailboxes on Gmail was updated.
Using of macros #file_links_[key] was corrected.
Tool for teaching to new textual capctha will be available for use only after downloading main database from server. It is made to avoid conflicts.
Improved visibility of ReCaptcha in case it is shown only by JavaScript.
Important News: From February we increased number on team. Our staff replenished with 2 professionals in CAPTCHA, 1 PHP-programmer, 1 system programmer, 1 expert on forum’s engines. Also, was accepted on temporary job an expert on work with Wiki.
IMPORTANT INFORMATION FOR ALL WHO USE THE SOFTWARE ON WINDOWS 7:
If you still have problems with XRumer’s usage on Windows 7
Unpack new version in new folder. For example: C:\XRumer7011\
It’s not recommended to use other multi-threaded applications
Do not use any symbols or non-Latin letters in Project’s name
Try to clear DLL cache using this tool
Decrease number of threads used
Start posting after 1-2 minutes after XRumer start and not immediately.
Delete subfolder Plugins
Try to rename Xrumer’s folder
Copyright © 2012 · Genesis Framework · WordPress · Log in