Skip to content
This repository has been archived by the owner on Sep 11, 2020. It is now read-only.

上下册的书下册爬不了 #60

Open
darkness112 opened this issue Feb 6, 2020 · 7 comments
Open

上下册的书下册爬不了 #60

darkness112 opened this issue Feb 6, 2020 · 7 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@darkness112
Copy link

https://lib-nuanxin.wqxuetang.com/#/Book/2115

下册下不了

@kajweb
Copy link
Owner

kajweb commented Feb 6, 2020

2020-02-07 00:58:15,169 [INFO] 成功创建文件夹 books/IMG/2115
2020-02-07 00:58:15,564 [INFO] 2115开始下载天坑(上、下册),共 313 页
2020-02-07 00:58:16,674 [INFO] 2115下载成功 第1页(1/313) 随机10.2s

测试正常。

@kajweb kajweb closed this as completed Feb 6, 2020
@BillXuce BillXuce added enhancement New feature or request help wanted Extra attention is needed labels Feb 7, 2020
@BillXuce
Copy link
Collaborator

BillXuce commented Feb 7, 2020

发现问题,下册书阅读地址多一个v=2参数

@BillXuce BillXuce reopened this Feb 7, 2020
@kajweb
Copy link
Owner

kajweb commented Feb 7, 2020

果然是天坑……

@BillXuce
Copy link
Collaborator

BillXuce commented Feb 7, 2020

果然是天坑……

已经在搞了,然而我想弃了,这全要加一层循环

@kajweb
Copy link
Owner

kajweb commented Feb 7, 2020

没啥必要,很少要下载上下册的书的。在wiki上面教他们怎么改代码就行了

@kajweb
Copy link
Owner

kajweb commented Feb 7, 2020

应该是修改def getBaseUrl( self, page ):这里吧

@BillXuce
Copy link
Collaborator

BillXuce commented Feb 7, 2020

并不可以,相应的页码、目录也要重新获取。在获得BookInfo的过程中,目前的代码只可以获得第一本书的info,而在"https://lib-nuanxin.wqxuetang.com/v1/book/initbook?bid={}"中才可以同时获取到上下册的相应信息,而且代码中的self.name,self.catatree等全部要变为列表的形式,代码重构量有些大。

你可以试试看其他人的脚本可不可以下载上下册。可以分享给我们看一下

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

3 participants