GitHub - juyal-ahmed/web-scraper: A PHP web scraper class that uses cURL to scrape web pages with GET or POST requests, including form posts to ASP.NET-based websites.

PHP Web Scraping Class

A very simple single-page PHP web scraper class that uses the cURL library to scrape web page content. It can scrape web pages using GET or POST requests, and it can also scrape content from ASP.NET-based websites by submitting form POSTs.
Support for:
1. GET Method
2. POST Method
3. ASP Calls
4. Retrieve Page Contents by Markup Tag Names
5. Retrieve Values from Form Fields
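
ASP.NET Web Forms pages typically expect hidden state fields such as `__VIEWSTATE` and `__EVENTVALIDATION` to be sent back with a form POST, which is why reading form field values matters when scraping them. The following is a minimal sketch of that idea using PHP's built-in DOM extension rather than this class. The function name `extractHiddenFields` and the sample markup are made up for illustration and are not part of the library.

```php
<?php
// Hypothetical helper (not part of this library): collect the hidden
// <input> fields of a page so they can be sent back in a form POST.
function extractHiddenFields(string $html): array
{
    $doc = new DOMDocument();
    // Collect parser warnings internally; real-world markup is rarely valid.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $fields = [];
    foreach ($doc->getElementsByTagName('input') as $input) {
        if (strtolower($input->getAttribute('type')) === 'hidden') {
            $fields[$input->getAttribute('name')] = $input->getAttribute('value');
        }
    }
    return $fields;
}

// Sample markup, invented for this example.
$sampleHtml = '<html><body><form method="post" action="search.aspx">'
    . '<input type="hidden" name="__VIEWSTATE" value="abc123" />'
    . '<input type="hidden" name="__EVENTVALIDATION" value="xyz789" />'
    . '<input type="text" name="query" value="" />'
    . '</form></body></html>';

$hiddenFields = extractHiddenFields($sampleHtml);

// Merge the state fields with the values you actually want to submit,
// then encode them as a POST body.
$postFields = array_merge($hiddenFields, ['query' => 'php']);
$postBody = http_build_query($postFields);
```

The resulting `$postBody` is what you would send as the body of the follow-up POST request to the same form.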

Installation

composer require juyal-ahmed/web-scraper

Getting the full content of a web page:

<?php
require 'vendor/autoload.php';

// Create a Scraper instance with only the URL specified
$scraper = new \PhpFarmer\WebScraper\Scraper('https://example.com');
$pageHtmlContent = $scraper->getPageContent('https://example.com/page.html');
?>

Getting the full content of a web page with caching:

<?php
require 'vendor/autoload.php';

// Create a Scraper instance with custom cache settings
$scraperWithCache = new \PhpFarmer\WebScraper\Scraper('https://example.com', true, './custom_cache/', 600);
$pageHtmlContent = $scraperWithCache->getPageContent('https://example.com/page.html');
?>
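
The constructor arguments above appear to enable caching, choose a cache directory, and set a lifetime (presumably in seconds), although the README does not spell this out. As a rough sketch of how file-based caching with an expiry works in general, the example below uses a hypothetical `fetchWithCache` function and a stand-in fetcher. None of these names come from the library, and the class's internals may differ.

```php
<?php
// Hypothetical helper: return a cached copy of $url's content if it is
// younger than $ttlSeconds, otherwise call $fetch and store the result.
function fetchWithCache(string $url, string $cacheDir, int $ttlSeconds, callable $fetch): string
{
    if (!is_dir($cacheDir)) {
        mkdir($cacheDir, 0775, true);
    }
    $cacheFile = rtrim($cacheDir, '/') . '/' . md5($url) . '.html';

    // Make sure file checks below do not use stale stat information.
    clearstatcache(true, $cacheFile);

    if (is_file($cacheFile) && (time() - filemtime($cacheFile)) < $ttlSeconds) {
        return file_get_contents($cacheFile);
    }

    $content = $fetch($url);
    file_put_contents($cacheFile, $content);
    return $content;
}

// Demo with a stand-in fetcher so that no network access is needed.
$calls = 0;
$fakeFetch = function (string $url) use (&$calls): string {
    $calls++;
    return "<html><body>content of {$url}</body></html>";
};

$cacheDir = sys_get_temp_dir() . '/scraper_cache_demo_' . uniqid();
$first = fetchWithCache('https://example.com/page.html', $cacheDir, 600, $fakeFetch);
$second = fetchWithCache('https://example.com/page.html', $cacheDir, 600, $fakeFetch);
```

In the demo, the second call is served from the cache file, so the stand-in fetcher runs only once.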

Getting the full content of a web page through a proxy:

<?php
require 'vendor/autoload.php';

// Create a Scraper instance with only the URL specified
$scraper = new \PhpFarmer\WebScraper\Scraper('https://example.com');
$pageHtmlContent = $scraper->curl('https://example.com/page.html', "93.118.xx.141:8800", "6USERR:8PASS1");
?>
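
In plain cURL terms, a proxy address and a `user:password` pair like the ones passed above correspond to the `CURLOPT_PROXY` and `CURLOPT_PROXYUSERPWD` options. The sketch below uses PHP's cURL extension directly. Whether the class sets exactly these options is an assumption, `buildProxyOptions` is a hypothetical helper, and the proxy address and credentials are placeholders.

```php
<?php
// Hypothetical helper: build a cURL option array for a GET request,
// optionally routed through an authenticated proxy.
function buildProxyOptions(string $url, ?string $proxy = null, ?string $proxyAuth = null): array
{
    $options = [
        CURLOPT_URL            => $url,
        CURLOPT_RETURNTRANSFER => true, // return the body instead of printing it
        CURLOPT_FOLLOWLOCATION => true, // follow HTTP redirects
        CURLOPT_TIMEOUT        => 30,   // give up after 30 seconds
    ];
    if ($proxy !== null) {
        $options[CURLOPT_PROXY] = $proxy;            // "host:port"
    }
    if ($proxyAuth !== null) {
        $options[CURLOPT_PROXYUSERPWD] = $proxyAuth; // "user:password"
    }
    return $options;
}

// Placeholder proxy and credentials, for illustration only.
$options = buildProxyOptions('https://example.com/page.html', '203.0.113.10:8800', 'user:secret');

// To actually perform the request:
// $handle = curl_init();
// curl_setopt_array($handle, $options);
// $html = curl_exec($handle);
// curl_close($handle);
```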

Parsing a page's HTML content:

<?php
$subHtmlContent =  $scraper->getHtmlContentBetweenTags($pageHtmlContent, '', '');
?>

How It Works:

1. Include the class (scraper.php) at the top of your working page.
2. Set some default settings.
3. Get the page content using the class's existing methods.
4. If you are looking for a single piece of content, extract it with the getHtmlContentBetweenTags method.
5. If you need grid data, split the content on a delimiter, e.g. with explode().
6. Loop over the resulting pieces and call getHtmlContentBetweenTags on each one to build the final array of grid data.

That's all.
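
The splitting and looping described above can be sketched as follows. To keep the example self-contained, it uses a stand-in function, `contentBetween`, that returns the text between a start marker and an end marker. It mimics what getHtmlContentBetweenTags appears to do but is not the library's code, and the sample HTML is invented.

```php
<?php
// Stand-in for getHtmlContentBetweenTags (an assumption about its behavior):
// return the text between the first $start marker and the next $end marker.
function contentBetween(string $html, string $start, string $end): string
{
    $from = strpos($html, $start);
    if ($from === false) {
        return '';
    }
    $from += strlen($start);
    $to = strpos($html, $end, $from);
    if ($to === false) {
        return '';
    }
    return trim(substr($html, $from, $to - $from));
}

// Sample page, invented for this example.
$pageHtml = '<table id="prices">'
    . '<tr><td class="name">Apple</td><td class="price">1.20</td></tr>'
    . '<tr><td class="name">Pear</td><td class="price">0.95</td></tr>'
    . '</table>';

// Single content: cut out the table body.
$tableHtml = contentBetween($pageHtml, '<table id="prices">', '</table>');

// Grid data: split on the row delimiter, then extract each cell.
$rows = [];
foreach (explode('<tr>', $tableHtml) as $chunk) {
    if (trim($chunk) === '') {
        continue; // skip the empty piece before the first <tr>
    }
    $rows[] = [
        'name'  => contentBetween($chunk, '<td class="name">', '</td>'),
        'price' => contentBetween($chunk, '<td class="price">', '</td>'),
    ];
}
```

String splitting like this is simple but brittle: it depends on the exact markup, so a small change in the page's HTML can break the extraction.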

Thanks
