Ya corri el codigo que hicimos en la clase, ha estado encontrando los links pero despues de un tiempo me sale un error. <code>INFO:__main...

Mario Emiliano Gordon Pico

Mario Emiliano Gordon Pico

Pregunta
student
hace 7 años

Ya corri el codigo que hicimos en la clase, ha estado encontrando los links pero despues de un tiempo me sale un error.

INFO:__main__:Star fetching article at /metropoli/cdmx/ella-es-maricela-taquillera-en-el-metro-desde-1970 Traceback (most recent call last): File "/usr/lib/python3/dist-packages/urllib3/connection.py", line 141, in _new_conn (self.host, self.port), self.timeout, **extra_kw) File "/usr/lib/python3/dist-packages/urllib3/util/connection.py", line 60, in create_connection for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM): File "/usr/lib/python3.6/socket.py", line 745, in getaddrinfo for res in _socket.getaddrinfo(host, port, family, type, proto, flags): socket.gaierror: [Errno -2] Name or service not known During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 601, in urlopen chunked=chunked) File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 357, in _make_request conn.request(method, url, **httplib_request_kw) File "/usr/lib/python3.6/http/client.py", line 1239, in request self._send_request(method, url, body, headers, encode_chunked) File "/usr/lib/python3.6/http/client.py", line 1285, in _send_request self.endheaders(body, encode_chunked=encode_chunked) File "/usr/lib/python3.6/http/client.py", line 1234, in endheaders self._send_output(message_body, encode_chunked=encode_chunked) File "/usr/lib/python3.6/http/client.py", line 1026, in _send_output self.send(msg) File "/usr/lib/python3.6/http/client.py", line 964, in send self.connect() File "/usr/lib/python3/dist-packages/urllib3/connection.py", line 166, in connect conn = self._new_conn() File "/usr/lib/python3/dist-packages/urllib3/connection.py", line 150, in _new_conn self, "Failed to establish a new connection: %s" % e) urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7f5d425fb358>: Failed to establish a new connection: [Errno -2] Name or service not known During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/usr/lib/python3/dist-packages/requests/adapters.py", line 440, in send timeout=timeout File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 639, in urlopen _stacktrace=sys.exc_info()[2]) File "/usr/lib/python3/dist-packages/urllib3/util/retry.py", line 398, in increment raise MaxRetryError(_pool, url, error or ResponseError(cause)) urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='www.eluniversal.com.mx', port=80): Max retries exceeded with url: /metropoli/cdmx/ella-es-maricela-taquillera-en-el-metro-desde-1970 (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f5d425fb358>: Failed to establish a new connection: [Errno -2] Name or service not known',)) During handling of the above exception, another exception occurred: Traceback (most recent call last): File "main.py", line 64, in <module> _news_scrapper(args.news_site) File "main.py", line 23, in _news_scrapper article = _fetch_article(news_site_uid, host, link) File "main.py", line 37, in _fetch_article article = news.ArticlePage(news_site_uid, _build_link(host, link)) File "/home/maego/webscraper/news_page_objects.py", line 39, in __init__ super().__init__(news_site_uid, url) File "/home/maego/webscraper/news_page_objects.py", line 12, in __init__ self._visit(url) File "/home/maego/webscraper/news_page_objects.py", line 18, in _visit response = requests.get(url) File "/usr/lib/python3/dist-packages/requests/api.py", line 72, in get return request('get', url, params=params, **kwargs) File "/usr/lib/python3/dist-packages/requests/api.py", line 58, in request return session.request(method=method, url=url, **kwargs) File "/usr/lib/python3/dist-packages/requests/sessions.py", line 520, in request resp = self.send(prep, **send_kwargs) File "/usr/lib/python3/dist-packages/requests/sessions.py", line 630, in send r = adapter.send(request, **kwargs) File "/usr/lib/python3/dist-packages/requests/adapters.py", line 508, in send raise ConnectionError(e, request=request) requests.exceptions.ConnectionError: HTTPConnectionPool(host='www.eluniversal.com.mx', port=80): Max retries exceeded with url: /metropoli/cdmx/ella-es-maricela-taquillera-en-el-metro-desde-1970 (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f5d425fb358>: Failed to establish a new connection: [Errno -2] Name or service not known',))
2 respuestas
    Osmandi Gomez

    Osmandi Gomez

    student
    hace 7 años

    Posiblemente haya un error en la conexión con la URL, esto puede ser por un corte abrupto del internet o bien que la página haya cortado la conexión. Puedes probar esto utilizando otra página.

    Mario Emiliano Gordon Pico

    Mario Emiliano Gordon Pico

    student
    hace 7 años

    gracias, voy a intentarlo

Curso de Ingeniería de Datos con Python

Curso de Ingeniería de Datos con Python

Genera un set de datos capaz de generar predicciones a partir de algoritmos de machine learning. Encuentra fuentes ideales de datos. Trata, limpia y alista información para crear tu dataset y da el primer paso para formarte en la Carrera de Ciencia de Datos.

Curso de Ingeniería de Datos con Python
Curso de Ingeniería de Datos con Python

Curso de Ingeniería de Datos con Python

Genera un set de datos capaz de generar predicciones a partir de algoritmos de machine learning. Encuentra fuentes ideales de datos. Trata, limpia y alista información para crear tu dataset y da el primer paso para formarte en la Carrera de Ciencia de Datos.