Bulk Spell Checker (Java)

This is a relatively simple program for bulk spell-checking and correcting an entire website. It has several pieces:

1. A parser that scans a local copy of the entire website, ignoring HTML (Hypertext Markup Language) markup, and builds a HashMap of all the words used anywhere on the website, with a count of how many times each word was used. The list is case-sensitive.

2. An interface to a traditional spell check program to mark the words in the list as good or bad and provide a list of likely alternates for bad words.

3. The program then removes the good words from the list, leaving only the suspected misspellings.

4. It then displays the remaining words sorted by frequency, least frequently used first.

5. The user can peruse the list and select a replacement word, or type one in.

6. The user runs a batch replace program that replaces the uses of the bad words with replacements.

7. That deals with the errors that do not depend on context.

8. Then the user prepares a list of the remaining bad words, each shown in context.

9. The user can peruse the list and select a replacement word, or type one in.

10. The user runs a batch replace program that replaces the uses of the bad words with the replacements, but only in the one context.
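The context-free part of the steps above (counting, filtering, sorting and batch replacing) might be sketched roughly as follows. This is only an illustration under stated assumptions, not a finished design: the class name BulkSpellSketch and its method names are hypothetical, the regular expression that strips tags is a crude stand-in for a real HTML parser, and a plain Set of known-good words stands in for the interface to a traditional spell checker.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch of steps 1, 3, 4 and 6; all names are illustrative.
final class BulkSpellSketch {
    // Crude tag stripper (assumption): no '>' inside attribute values,
    // no special handling of scripts, comments or entities.
    private static final Pattern TAG = Pattern.compile("<[^>]*>");
    // Assumption: a word is a run of letters, with optional internal apostrophes.
    private static final Pattern WORD = Pattern.compile("\\p{L}+(?:'\\p{L}+)*");

    /** Step 1: add the words of one page to the site-wide, case-sensitive counts. */
    static void countWords(String html, Map<String, Integer> counts) {
        String text = TAG.matcher(html).replaceAll(" ");
        Matcher m = WORD.matcher(text);
        while (m.find()) {
            counts.merge(m.group(), 1, Integer::sum);
        }
    }

    /** Step 4: entries sorted by count, rarest first, ties broken by word. */
    static List<Map.Entry<String, Integer>> rarestFirst(Map<String, Integer> counts) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort(Map.Entry.<String, Integer>comparingByValue()
                .thenComparing(Map.Entry.<String, Integer>comparingByKey()));
        return entries;
    }

    /**
     * Step 6: case-sensitive, whole-word replacement. Simplification: this runs
     * over the raw text, so it would also touch matches inside tags or attributes.
     */
    static String replaceWord(String text, String bad, String good) {
        Pattern p = Pattern.compile("(?<!\\p{L})" + Pattern.quote(bad) + "(?!\\p{L})");
        return p.matcher(text).replaceAll(Matcher.quoteReplacement(good));
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = new HashMap<>();
        countWords("<p>The cat sat on teh mat. The cat purred.</p>", counts);

        // Step 3: drop the words the (stand-in) spell checker accepts.
        Set<String> knownGood = Set.of("The", "cat", "sat", "on", "mat", "purred");
        counts.keySet().removeIf(knownGood::contains);

        for (Map.Entry<String, Integer> e : rarestFirst(counts)) {
            System.out.println(e.getValue() + " " + e.getKey());
        }
        System.out.println(replaceWord("<p>teh mat, not tehran</p>", "teh", "the"));
    }
}
```

The lookarounds in replaceWord keep a short bad word from being rewritten inside a longer word. The context-dependent replacement in steps 8 to 10 would additionally need to record where each occurrence was found, for example a file name and offset, which this sketch does not attempt.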

A large website might overflow RAM (Random Access Memory). So you might implement this with a 64-bit JVM (Java Virtual Machine) and a ton of RAM, and offer it as a service on the web.
