Dec
18
Ever wanted to get a list of information such as URLs, Articles, tabular data, or whatever else that you know is on one website or across multiple websites, then manipulate it to reuse elsewhere? Stop wondering, because we are about to get down to business! There’re many ways to scrape / mine data, but I’ve found that the easiest and most efficient way is to use a combination of cURL and XPATH. cURL is neat because it will easily let you use proxies, manipulate browser information, catch errors, etc. XPATH is great because you don’t need to write a bunch of regular expressions or other functions to manage the data [...]

