Fundamentos de los Agentes Inteligentes y LangChain
Introducci贸n a LangChain
Agiliza procesos usando Agentes AI
Agentes inteligentes de LangChain
Instalaci贸n y configuraci贸n de LangChain
Quiz: Fundamentos de los Agentes Inteligentes y LangChain
Chat Models y Prompt templates
Chat Messages con OpenAI
Introducci贸n a los modelos de chat
Output parsers
Prompt templates en LangChain
Tipos de ChatTemplates: Few-Shot Prompting
Quiz: Chat Models y Prompt templates
Cadenas en LangChain
Introducci贸n a Chains y LCEL
Chat con historial
Integraci贸n de cadena: Runnable y OutputParser
Chat Memory
Implementaci贸n de memoria en cadenas
Quiz: Cadenas en LangChain
Carga de documentos en LangChain
Cargar HTML y Directorio con LangChain
Carga de PDF y CSV con LangChain
Text Splitters
Quiz: Carga de documentos en LangChain
Retrieval-augmented generation (RAG)
VectorStore: Chroma
Introducci贸n a Embeddings
Vectorstore: Pinecone
Chatbot RAG: carga de documentos a Vectorstore
Chatbot RAG: prompt templates, cadenas y memoria
Quiz: Retrieval-augmented generation (RAG)
Agentes en LangChain
Construcci贸n de agentes en LangChain
LangChain Tools
Construcci贸n de agentes con memoria
Quiz: Agentes en LangChain
Ecosistema de LangChain
Ecosistema de LangChain
You don't have access to this class
Keep learning! Join and start boosting your career
In this class, we explore how LangChain allows us to load and process HTML files using specialized tools called document loaders. Below, we break down the key points for taking advantage of this functionality, as well as some concepts and best practices for managing multiple files and directories.
HTML files are essential for structuring and displaying content on the web. LangChain allows you to load these files and treat them as actionable documents, which means you can extract text, metadata and do analysis on the content using document loaders.
LangChain has specific loaders for different file types, including HTML. One of the most common is Beautiful Soup, a Python library used to work with HTML files, facilitating the extraction of structured information.
HTML loading process:
When loading an HTML file, LangChain not only allows you to access the text of the document, but also the associated metadata, such as the file name, path, title, etc. This metadata can be useful in projects where you need to reference the source of the information or in cases of analysis of large quantities of documents.
One of the most powerful features of LangChain is the ability to load entire directories containing multiple files, such as PDFs, CSVs or HTMLs. This is especially useful when working with large volumes of data distributed across different files.
You can load the entire contents of a directory and specify which type of files you want to process (for example, HTML only or CSV only). This is done through a directory loader, which automates the loading of multiple files and allows you to process them in sequence.
When working with many files, LangChain offers an option to display a progress bar, which allows you to view the loading status of each document in real time. This is useful in large projects where the loading time may vary depending on the number of files and their size.
If you work in Google Colab, a good practice is to use Google Drive to store and access files. LangChain allows you to easily connect your work environment with Google Drive, making it easy to upload and process files.
Contributions 6
Questions 1
Want to see more contributions, questions and answers from the community?