About ComputerScienceExpert

Levels Tought:
Elementary,Middle School,High School,College,University,PHD

Expertise:

Applied Sciences,Calculus See all

Applied Sciences,Calculus,Chemistry,Computer Science,Environmental science,Information Systems,Science Hide all

Teaching Since:	Apr 2017
Last Sign in:	122 Weeks Ago, 4 Days Ago
Questions Answered:	4870
Tutorials Posted:	4863

Education

MBA IT, Mater in Science and Technology
Devry
Jul-1996 - Jul-2000

Experience

Professor
Devry University
Mar-2010 - Oct-2016

Category > Programming Posted 26 May 2017 My Price 8.00

A robot (also known as a bot or spider or crawler

1.34 A robot (also known as a bot or spider or crawler ) is a program that accesses web documents automatically rather than in direct response to a user input. For example, the Google search engine uses a program called googlebot to automatically crawl the World Wide Web and build its searchable index of Web pages. An indexing robot such as googlebot begins by reading some Web document, then reading documents linked to by the initial document, and recursively continuing this process on previously unread documents. Some informal standards have been developed to allow Web site administrators and document authors to request robots not to read certain documents. (a) Read the first part of Section 4.1 of Appendix B of the HTML 4.01 Recommendation [W3C-HTML-4.01], and explain what you would do in order to request that robots not crawl the documents accessible from your Tomcat web server. (See http://www.robotstxt.org/wc/norobots.html for more information on the Robot Exclusion Standard.) (b) For one or more Web sites as directed by your instructor, list for each the robots (if any) that are explicitly excluded from crawling one or more of the files at that site.

Answers

ComputerScienceExpert

(11)

Status NEW Posted 26 May 2017 04:05 AM My Price 8.00

-----------

Not Rated(0)

Buy Answer

Hire Dedicated Virtual Team / Business Solution for SMEs.