cheerio load from file

Cheerio Load from File

As a web developer, I have worked with many HTML parsing libraries and one of my favorites is Cheerio. Cheerio is a fast, flexible, and lean implementation of core jQuery designed specifically for the server. It allows you to manipulate and traverse HTML documents using the same syntax as jQuery, which makes it a great choice for web scraping, testing, and data mining.

How to Load HTML from a File using Cheerio

One of the most common tasks when working with Cheerio is to load an HTML file and parse it. Cheerio provides a simple API for loading HTML from a file:

const fs = require('fs');
const cheerio = require('cheerio');

const html = fs.readFileSync('path/to/file.html', 'utf8');
const $ = cheerio.load(html);
  • The fs.readFileSync() method reads the contents of the file synchronously and returns a string.
  • The cheerio.load() method loads the HTML string into a Cheerio object.

Once you have loaded the HTML into a Cheerio object, you can use the same syntax as jQuery to manipulate and traverse the HTML:

$('h1').text('Hello Cheerio!');

This code selects all <h1> elements in the HTML document and sets their text content to Hello Cheerio!.

Other Ways to Load HTML using Cheerio

Aside from loading HTML from a file, Cheerio provides several other ways to load HTML:

  • Load from a URL: You can use the cheerio.load() method with the request() module to load HTML from a URL:

const request = require('request');
const cheerio = require('cheerio');

request('', (error, response, html) => {
  const $ = cheerio.load(html);
  • Load from a String: You can use the cheerio.load() method to load HTML from a string:

const cheerio = require('cheerio');

const html = '<html><body><p>Hello Cheerio!</p></body></html>';
const $ = cheerio.load(html);

These are just a few examples of how you can load HTML using Cheerio. With its simple and powerful API, Cheerio is an excellent tool for working with HTML on the server.

Subscribe to The Poor Coder | Algorithm Solutions

Don’t miss out on the latest issues. Sign up now to get access to the library of members-only issues.
[email protected]