采集的内容和网页内容不同怎么办?
例如原网址是这个:
https://detail.1688.com/offer/527173741316.html?spm=b26110380.sw1688.mof001.52.LVkzmq&tracelog=p4p
爬虫爬到链接访问后变成这样:
https://dj.1688.com/ci_bb?a=2000671507&e=sZm3tJ8i-7-QWUEkBm9Cm1LNOjNmb6NyUi7fH7USgJdVFqGUifkH0DJ.1fCQuiCmMPxJzcK2HCHOeUpm6Y.tv9KzKymIGgM4lbUP.4Ep9j5qbQFcj3aUK3CBjdjzeMemG-iU4zmZKJ-yjYXpLyymSEATBbBXackC.ozbDfh-8bwlKFMwBtw3if.nrl5Kcjk.MS53QRob2q42DrTtfItdGfuC4KUGQ.wCbRyk8MhxTkdBphtmjcAxNr9pI8FvPA8woGEL1mhC-eg8mur7F87N1XDbjr9ycYhHmRvJIQU.Jo5rs9Yp62zb216IIzIZ.Xe-W1C7fzGmUi99rKOlt-CHg8YHpqxHRNBaQ-ByUM0EnRFgY098CL6aoGfTewGtjwIBZK7IL.EuAANXYP9UFPYJlpW1D.-BKfY-SLzgWmecFMxiaI5xqTirrM4DgNrY.qQupBNNAlxRLaF6B2vISaYfCTY-7ZXT01fFN6g.xeS2xpjVM566HrVLUEEU.9wNxGtxmsKoBO71-H8ZU-GLytI7pNrLYzgm5Ks19P7p4z6ygGQpcdUqonc19w__&v=4&ap=1&rp=1
请问各位大大,管理们,这个要怎么办?
求告知。感激不尽
|
|
|
|
|
共 3 个关于本帖的回复 最后回复于 2017-12-6 09:16