Four Programming Languages Creating a Complete Website Scraper Application Four Programming Languages Creating a Complete Website Scraper Application

Four Programming Languages Creating a Complete Website Scraper Application

    • €11.99
    • €11.99

Publisher Description

Four Programming Languages Creating a Complete Website Scraper Application

After finishing these pages you will have a complete application which will work for either console or desktop platform. You will be utilizing three languages - C#,VB.Net and Java for creating this application. Each chapter covers a single language and either the desktop or console application coded in that language (Java does not natively allow a console application, so it includes only Desktop). For console program automation purposes, we will be using an Excel sheet and VBA coding. Using the desktop application allows for more flexibility in web page processing, with entry fields for beginning and ending text along with DIVs and other processing options. Enjoy this learning experience.
This list includes some of the types/commands and the languages that use them

WebResponse, WebRequest, HttpWebRequest, StreamReader (C#/VB)
GetResponse, Regex.Replace, String.Replace, IndexOf (C#/VB)
Substring, ReadLine, Trim, WriteLine (C#/VB)
EndsWith, AddRange, ReadToEnd, Count (C#/VB)
GetCommandLineArgs, GetResponseStream (VB)
getText, endsWith, split, length, openConnection (Java)
toString, BufferedReader, getSelectedIndex, replaceAll (Java)
isEmpty, substring,indexOf, readLine, PrintWriter, write (Java)
ActiveCell,Value,ChDir,Shell,Activate (VBA)

Why would you want to work with the same program in multiple languages? A simple answer to this is "versatility." You may come across a need for Java where a .Net-based language just won't work. A perfect example of this is Windows versus Linux web hosting. If you have designed a .Net program and placed it on your site based on Windows, it will work beautifully. If you then change the hosting plan to Linux, the .Net program will not work without some tweaking or an interpreter. If that were written in Java, however, it would have moved over fine.
Why would you want a web site text extraction program? Well, if you had a need to capture the main text from a few web pages, this would be too much trouble. If you are migrating a web site designed in ASP.NET into another format, maybe a CMS, this approach can be quite useful. If you have 1,000 pages in the site and all are similarly structured, it may take a week for a single person to manually copy and paste the body text from these pages. Using the automated approach, with a pause between each page for accuracy purposes, approximately 700 pages per hour can be processed. That equates to a tremendous labor savings.

GENRE
Computing & Internet
RELEASED
2014
6 September
LANGUAGE
EN
English
LENGTH
116
Pages
PUBLISHER
Stephen J Link
SIZE
327.1
KB

More Books Like This

How To Write A Zotero Translator How To Write A Zotero Translator
2011
Programming A Beginner's Guide Programming A Beginner's Guide
2009
Javascript Javascript
2017
Simply Programming C# and Visual Basic … Simply Programming C# and Visual Basic …
2013
Computer Programming JavaScript, Python, HTML, SQL, CSS Computer Programming JavaScript, Python, HTML, SQL, CSS
2019
Beginning C# 7 Hands-On – The Core Language Beginning C# 7 Hands-On – The Core Language
2017

More Books by Stephen J Link

HTML5,CSS3,Javascript and JQuery Mobile Programming: Beginning to End Cross-Platform App Design HTML5,CSS3,Javascript and JQuery Mobile Programming: Beginning to End Cross-Platform App Design
2014
The Journey Along God's Road to Revelation The Journey Along God's Road to Revelation
2015
Wisdom of Proverbs: Take the 31 Day Journey Wisdom of Proverbs: Take the 31 Day Journey
2020
WordPress 4 Business Website Redesign: With Custom Coding Of Imported Database WordPress 4 Business Website Redesign: With Custom Coding Of Imported Database
2018
Excel Programming through VBA: A Complete Macro Driven Excel 2010 Application Excel Programming through VBA: A Complete Macro Driven Excel 2010 Application
2014
Power Outlook Power Outlook
2004