Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
W
webmagic
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
沈俊林
webmagic
Commits
0a2b9137
Commit
0a2b9137
authored
Jul 24, 2013
by
yihua.huang
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
release
parent
2b3554c1
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
3 additions
and
9 deletions
+3
-9
README.md
README.md
+0
-4
release-note.md
release-note.md
+3
-5
No files found.
README.md
View file @
0a2b9137
...
...
@@ -9,10 +9,6 @@ webmagic的发起源于工作中的需要,其定位是帮助开发者更便捷
webmagic的功能覆盖整个爬虫的生命周期(链接提取、页面下载、内容抽取、持久化),开发者可以便捷的使用xpath和正则表达式进行链接和内容的提取,只需编写少量代码即可完成一个定制爬虫。
#### 请注意
webmagic正处于开发阶段,目前还没有稳定版本。欢迎开发者参与到webmagic的试用和修改中来。
**如果只是想以外部jar包的方式,引用webmagic并进行自己的业务开发,建议你等待webmagic的第一个稳定版本。**
###特色###
*
####垂直爬虫####
...
...
release-note.md
View file @
0a2b9137
...
...
@@ -8,10 +8,8 @@ Release Notes
增加下载的重试机制,支持gzip,支持自定义UA/cookie。
增加多线程抓取功能,只需在初始化的时候指定线程数即可。
增加jquery形式的CSS Selector API,可以通过
`page.getHtml().$("div.body")`
形式抽取元素。
完善了文档,架构说明:
[
webmagic的设计机制及原理-如何开发一个Java爬虫
](
http://my.oschina.net/flashsword/blog/145796
)
,Javadoc:
[
http://code4craft.github.io/webmagic/docs
](
http://code4craft.github.io/webmagic/docs
)
。
\ No newline at end of file
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment