/----------------------------------------------------------------------\
| Title        : HTML to text converter and markup remover             |
|                                                                      |
| File name    : example.txt                                           |
| File size    : 8,263 bytes (approx)                                  |
| Create date  : 6-Jan-2003                                            |
\----------------------------------------------------------------------/

HTML to text converter and markup removal
=========================================

------------------------------------------------------------------------

 [1]  [2]              30-day no-risk                                  
                       money-back guarantee [3] !                      
------------------------------------------------------------------------

Detagger  is a utility that removes some  or all of the tags form a HTML
file. Detagger makes it easy to extract text from your web pages for use
elsewhere,  or to tidy  up your HTML code  to make clean, faster-loading
web pages.

Detagger can act as a full HTML  to text converter, and has a number  of
options  for producing good-looking  text file. For  example here is the
result of converting this page.

Detagger can also can act as a markup remover, selectively removing  and
editing the tags that make up the HTML code in your file.

The  utility  supports  wildcards and  drag  and drop  operation,  and a
console version is available for batch operations, making Detagger  well
suited  to whatever  mode of  operation you  prefer. An  API version may
interest software developers.

Whether you're trying to collate text from multiple sources on the  web,
or  simply looking for some way  to remove all the JavaScript, FONT tags
and comments from  your HTML  archives, Detagger  is the  tool for  you.

There are a number of evaluation downloads available.

Detagger  is  produced  by  JafSoft  [4]  who  are  the  authors  of the
highly-praised  AscToHTM  [5]  text-to-HTML  converter  and  other  text
conversion products.

Detagger as a HTML-to-text converter
====================================

As  an  HTML-to-Text  converter,  Detagger allows  you  to  convert HTML
newsletters into  a  more  compact and  email-friendly  format,  helping
authors  easily maintain HTML and text versions. The program will output
the document as text, preserving  the marked up headings, lists,  tables
of  the original document  and turning them  into suitable text formats.
Text will  be  laid  out  as faithfully  as  possible  to  the  original
document, within the constraints of your chosen page width.

There  are many formatting options which  can be saved in "policy" files
so that they may be easily reloaded in later sessions.

Detagger allows you to:-

    - Remove all  the  HTML  tags,  using  the  heading,  paragraph  and
    list tags etc. to decide how the text should be formatted

    - Parse  tables  and  layout  the  text  accordingly.  Simple tables
    can also be  converted into comma-delimited  (CSV) or  tab-delimited
    data, ready for import into spreadsheets.

    - Replace  hyperlinks  by  the  display  text.  URLs  may  either be
    placed in the main text, or added  as an entry in a reference  table
    added at the end of the text. (See the example for this page).

    - Format  the  output  to  your  desired page  width  (may  not work
    when parsing complex tables)

    - Format   any   "dialogue"  intelligently.   This  is  particularly
    useful when converting short stories

    - Replace  Image  tags  by an  Image  marker. This  can  be labelled
    with the Image URL or the ALT attribute text.

    - Add custom  header  and footers  to  the output.  These  can  have
    merged   in  data  fields  such  as  convert  date,  title  etc. The
    evaluation version,  adds  a  standard  header,  in  the  registered
    version  this is omitted and you can choose to add your own headers.

    - Convert  all  HTML  entities  into  the  correct  characters.  You
    can  choose to have 8-bit  characters replaced by 7-bit alternatives
    where available to give greatest compatibility of the output

    - Support   the  creation  of   Unicode  text  files  from  advanced
    HTML character sets.


Detagger as a markup remover and tag manipulator
================================================

As  a markup remover, Detagger allows you to "tidy up" your HTML code in
a number of ways. You  simply select classes of  tags you want  removed,
sections  of code you  want stripped out, or  tag manipulations you want
performed.

Options include:-

    - remove all non-HTML tags (e.g. MS Office tags)
    - remove all non-standard tags
    - remove the <HEAD>...</HEAD> section
    - remove all <STYLE> tags, style sheets and CSS attributes
    - remove all <SCRIPT> and JavaScript from the document
    - remove all <FORM>,<INPUT>,<SELECT> etc tags

    - remove all <FONT> tags
    - remove all comment tags
    - remove   all  hyperlinks  (replacing  them  by  the  display  text
    only)


and

    - convert all tags to UPPER or lower case.
    - replace    character   entities   such   as   "&nbsp;"   by  ASCII
    near-equivalents


Downloads
=========

Windows version

Download a 30-day trial of the Windows program from here :-

                - Site 1 (965Kb) [6]
                - Site 2 (965Kb) [7]


Console version

Registered users will  get access to  a console version  that is  better
suited for use in batch processing and automated conversions procedures.
You can also download evaluation of the console version

API demonstration package

Developers who  want to  integrate the  functionality of  Detagger  into
their own software may be interested in the API version.

                - download an API demonstration package


The  API can be called from C/C++, Visual Basic etc. Sample projects are
included in the demonstration.

Please note  The API  version  is sold  under  separate license  to  the
windows utility. Contact info@jafsoft.com for details

Awards and reviews
==================

It's  early days  yet. But here  are the awards  Detagger has received:-

   [8]                    [9]                    [10]    [11]     [12]  
 ShareUp     4 stars at SoftList (Russian)      ShareUp SofoTex ListSoft
------------------------------------------------------------------------

 [13]          home - contact us [14] - news [15] - products [16] -    
                      ordering [17] - search this site [18]            
                  For more information contact info@jafsoft.com.       

[1] http://www.jafsoft.co.uk/download/windows/detagger.zip?from=top_icon
[2] http://www.swreg.org/soft_shop/296/shopscr5.shtml
[3] http://www.jafsoft.com/asctohtm/guarantee.html
[4] http://www.jafsoft.com/
[5] http://www.jafsoft.com/asctohtm/
[6] http://www.jafsoft.co.uk/download/windows/detagger.zip
[7] http://www.jafsoft.com/download/windows/detagger.zip
[8] http://www.topshareware.com/detagger-download-1878.htm
[9] http://www.softlist.net/cgi-bin/program.cgi?id=11592
[10] http://www.shareup.com/showapp.php?id=166
[11] http://www.sofotex.com/download/software/7100.html
[12] http://www.listsoft.ru/?id=12403
[13] http://t.extreme-dm.com/?login=_Detagger_
[14] http://www.jafsoft.com/getintouch.html
[15] http://www.jafsoft.com/news/news.html
[16] http://www.jafsoft.com/products/
[17] http://www.jafsoft.com/products/pricing.html
[18] http://www.jafsoft.com/search.html

========================================================================
Converted by an unregistered version of Detagger 2.0
Visit http://www.jafsoft.com/detagger/
(this message is omitted in registered version)
========================================================================