Skip to content
This repository has been archived by the owner on May 11, 2020. It is now read-only.

Mis-extract image's source in CNET #20

Open
XeonKHJ opened this issue Apr 7, 2019 · 0 comments
Open

Mis-extract image's source in CNET #20

XeonKHJ opened this issue Apr 7, 2019 · 0 comments

Comments

@XeonKHJ
Copy link

XeonKHJ commented Apr 7, 2019

the link is below:
How to hide cables, clean your screen and organize your TV system
It seems like ReadSharp will mis-extract the image's source to the page's url;
Here's the HTML code comparison:
Original
<figure class="image image-large pull-none shortcode" section="shortcodeImage"><span class="imageContainer"><span><img style="display: block;" alt="how-to-tidy-your-home-theatre-01.jpg" src="https://cnet1.cbsistatic.com/img/4XEO9v8rVP3h1D1dpvT6pqLYT7g=/2017/03/10/b3a7ca7d-e81b-493e-b593-a822e4f4e357/how-to-tidy-your-home-theatre-01.jpg" data-original="https://cnet1.cbsistatic.com/img/4XEO9v8rVP3h1D1dpvT6pqLYT7g=/2017/03/10/b3a7ca7d-e81b-493e-b593-a822e4f4e357/how-to-tidy-your-home-theatre-01.jpg"></span><noscript>&lt;span&gt;&lt;img src="https://cnet1.cbsistatic.com/img/4XEO9v8rVP3h1D1dpvT6pqLYT7g=/2017/03/10/b3a7ca7d-e81b-493e-b593-a822e4f4e357/how-to-tidy-your-home-theatre-01.jpg" class="" alt="how-to-tidy-your-home-theatre-01.jpg"&gt;&lt;/span&gt;</noscript></span><figcaption><span class="credit"> Sarah Tew/CNET </span></figcaption></figure>
Extracted
<figure class="image image-large pull-none shortcode" section="shortcodeImage"><span class="imageContainer"><span><img src="https://www.cnet.com/how-to/how-to-hide-cables-clean-your-screen-and-organize-your-tv-system/" class="lazy " alt="how-to-tidy-your-home-theatre-01.jpg" /></span></span><figcaption><span class="credit"> Sarah Tew/CNET </span></figcaption></figure>

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant