cheerio load from file
Cheerio Load from File
As a web developer, I have worked with many HTML parsing libraries and one of my favorites is Cheerio. Cheerio is a fast, flexible, and lean implementation of core jQuery designed specifically for the server. It allows you to manipulate and traverse HTML documents using the same syntax as jQuery, which makes it a great choice for web scraping, testing, and data mining.
How to Load HTML from a File using Cheerio
One of the most common tasks when working with Cheerio is to load an HTML file and parse it. Cheerio provides a simple API for loading HTML from a file:
const fs = require('fs');
const cheerio = require('cheerio');
const html = fs.readFileSync('path/to/file.html', 'utf8');
const $ = cheerio.load(html);
- The
fs.readFileSync()
method reads the contents of the file synchronously and returns a string. - The
cheerio.load()
method loads the HTML string into a Cheerio object.
Once you have loaded the HTML into a Cheerio object, you can use the same syntax as jQuery to manipulate and traverse the HTML:
$('h1').text('Hello Cheerio!');
This code selects all <h1>
elements in the HTML document and sets their text content to Hello Cheerio!
.
Other Ways to Load HTML using Cheerio
Aside from loading HTML from a file, Cheerio provides several other ways to load HTML:
- Load from a URL: You can use the
cheerio.load()
method with therequest()
module to load HTML from a URL:
const request = require('request');
const cheerio = require('cheerio');
request('http://example.com', (error, response, html) => {
const $ = cheerio.load(html);
});
- Load from a String: You can use the
cheerio.load()
method to load HTML from a string:
const cheerio = require('cheerio');
const html = '<html><body><p>Hello Cheerio!</p></body></html>';
const $ = cheerio.load(html);
These are just a few examples of how you can load HTML using Cheerio. With its simple and powerful API, Cheerio is an excellent tool for working with HTML on the server.