Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

not working for gnews.org #576

Closed
AndyTheFactory opened this issue Oct 24, 2023 · 1 comment
Closed

not working for gnews.org #576

AndyTheFactory opened this issue Oct 24, 2023 · 1 comment
Labels
documentation Improvements or additions to documentation site has protection sites not working

Comments

@AndyTheFactory
Copy link
Owner

Issue by Jooey233
Mon Apr 3 16:20:52 2023
Originally opened as codelucas/newspaper#968


https://gnews.org/articles/1068907
used article.text for this page, and no text got.
and build for gnews is not working too.

import newspaper

gnews = newspaper.build('https://gnews.org/', language='zh')

article = gnews.articles[0]
article.download()
article.parse()
print(article.text)

it will give u a 'index out of range'

and this:

import newspaper

page = newspaper.Article('https://gnews.org/articles/1065912')
page.download()
page.parse()
print(page.title)
print(page.text)

Only part of the title is caught, the text is not working at all

@AndyTheFactory AndyTheFactory added the bug Something isn't working label Oct 25, 2023
@AndyTheFactory AndyTheFactory added this to the First release milestone Oct 25, 2023
@AndyTheFactory AndyTheFactory added sites not working site has protection documentation Improvements or additions to documentation and removed bug Something isn't working labels Oct 28, 2023
@AndyTheFactory AndyTheFactory modified the milestone: Release 0.9.2 Nov 12, 2023
@AndyTheFactory
Copy link
Owner Author

Site has Cloudflair protection. it returns status_code 200, so it looks "ok" for the downloader...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation site has protection sites not working
Projects
None yet
Development

No branches or pull requests

1 participant