Web scraping is a computer science technique for extracting information and data from web sites. In data mining research scraping and analysing of information is discussed. Practically web scraping is necessary if you want to develop a web application where you want to show customised information from various websites. For this you’ve to first scrap data from the sites and then apply some logic to filter the information.
Practically you can use different languages to write the program that will automatically search and collect the information. But if you’re PHP experts and want to use PHP for this kind of stuff here I would like to recommend a book web-bots/spider using PHP/Curl
What is web scraping?
I am going to expalin this with the help of an example .Suppose you want to develop an application which collects contents from various websites and thus shows a customised user
specific contents ,for that we use web scrapers .
Say for example ,Suppose you want to develop a web application where you want to show classified information about current news from different newspaper. So that people who are interested on a particular news can get all the newspaper link in your classified category.
How it works ?
The application will retrieve all the news from different websites, then classify and categorised information based on interest, like politics, sports, entertainment etc. The above book and the library(Library ) will help you to make this kind of application.
We can use this for any kind of application like Android and mobile applications.On that case your scraping script should be installed in a server where it automatically will collect and classify information and in the smart phone application you just retrieved the data from your server.