SPONSORS:






User Tag List

Thanks Thanks:  0
Likes Likes:  0
Dislikes Dislikes:  0
Results 1 to 9 of 9
  1. #1
    Junior Member
    Join Date
    Feb 2009
    Posts
    4
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Content testing for HTML Pages

    Hi,

    I am working in an eLearning organization. Here we get the print version of the books that is PDF's for the book and we develop the eBook for the same that is we change these PDF's to HTML pages and then make a build for the eBook that can run on the LMS that the client has provided.
    So far we had been using the manual approach to verify the correctness and completeness of content in the HTML pages.
    Would like to know that is there any tool available that would take the PDF page as an input and then map the entire content to that on the HTML page which the developer has created?

    Kapil

  2. #2
    Member
    Join Date
    May 2006
    Location
    mumbai
    Posts
    81
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    There are many tools that help you convert from pdf to html. COnvert Doc is a pretty good tool which you get in the trial version.I havent found any good one in open source. Google for many more better tools which suits your budget the best.
    Regards
    Saju Thomas

  3. #3
    Junior Member
    Join Date
    Feb 2009
    Posts
    4
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    Hi Saju,

    Thank you for your reply. however could you please let me know as to how would this tool help me in testing? This might help the developer in changing the PDF to HTML.
    But the testing team here would like to know only that they have copied all the content from the provided PDF in to the HTML page or not?

    Regards,
    Kapil

  4. #4
    Member
    Join Date
    Jul 2007
    Posts
    96
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    look for some file comparison tools.
    google is the answere
    I have nothing to declare except my genius. -Oscar Wilde

  5. #5
    Member
    Join Date
    Sep 2008
    Location
    India
    Posts
    394
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    Hi Sanju,

    One of the alternatives is to convert pdf as well as html to txt files using some of free converters available by using google.

    Note:- images & hyperlinks won't be displayed in txt file. However that can be verified simply by looking at html. Also You need to remove if there are <img> tags in text file.

    Now you can compare both files (word by word or line by line) by creating a simple program in any programming language you are comfortable with or using file compare programs as Mr_Perfect mentioned.
    Best Regards,
    Sanket Vaidya

    Om - Effortless Text Generation http://sourceforge.net/p/omfortesting/home/description/

  6. #6
    Junior Member
    Join Date
    Feb 2009
    Posts
    4
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    Hi Sanket,

    Thank you so much for this suggestion. Let me check the same and I would give you all an update for this.

    Regards,
    Kapil

  7. #7
    Member
    Join Date
    Sep 2008
    Location
    India
    Posts
    394
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    [ QUOTE ]
    Also You need to remove if there are <img> tags in text file.

    [/ QUOTE ]

    Hi Kapil,

    If you see lot of such tags & formatting related tags in any of the .txt file & its quite troublesome to remove them before starting comparison then you can try converting pdf & html to .doc & try some .doc file comparison program.
    Best Regards,
    Sanket Vaidya

    Om - Effortless Text Generation http://sourceforge.net/p/omfortesting/home/description/

  8. #8
    Senior Member
    Join Date
    Jun 2008
    Location
    The Land of Snake Charmers
    Posts
    212
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    Hi

    Instead of "comparing" both the HTML and pdf in totallity, you may consider breaking down the convertor in terms of functionality

    1. How does the convertor handle paragraphs
    2. How does the convertor handle single space/double spaces
    3. How does the convertor handle text like "bush hid the facts" (You might want to google this one up - incase you are not aware on this notepad defect)
    4. How does the convertor handle formatting
    5. how does the convertor handle page breaks
    6. How does the converot handle font size, color

    Look at in this way

    INPUT -> Convertor -> OUTPUT

    You have to change the input parameters and observe the behaviour in Output.

    The inputs would be the possible attributes of a pdf document - A mapping table might assist

  9. #9
    Senior Member
    Join Date
    Jun 2008
    Location
    The Land of Snake Charmers
    Posts
    212
    Post Thanks / Like
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Total Downloaded
    0

    Re: Content testing for HTML Pages

    [ QUOTE ]

    Would like to know that is there any tool available that would take the PDF page as an input and then map the entire content to that on the HTML page which the developer has created?


    [/ QUOTE ]

    Forgive me!!
    I did not read the question completely

    Sorry - I am not aware of an off the shelf tool which has been specifically designed for this kind of comparision.

    QTP might be able to perform this, however some effort for scripting would have to performed!

 

 

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Search Engine Optimisation provided by DragonByte SEO v2.0.36 (Pro) - vBulletin Mods & Addons Copyright © 2016 DragonByte Technologies Ltd.
Resources saved on this page: MySQL 9.09%
vBulletin Optimisation provided by vB Optimise v2.6.4 (Pro) - vBulletin Mods & Addons Copyright © 2016 DragonByte Technologies Ltd.
User Alert System provided by Advanced User Tagging v3.2.8 (Pro) - vBulletin Mods & Addons Copyright © 2016 DragonByte Technologies Ltd.
vBNominate (Lite) - vBulletin Mods & Addons Copyright © 2016 DragonByte Technologies Ltd.
Feedback Buttons provided by Advanced Post Thanks / Like (Pro) - vBulletin Mods & Addons Copyright © 2016 DragonByte Technologies Ltd.
Username Changing provided by Username Change (Free) - vBulletin Mods & Addons Copyright © 2016 DragonByte Technologies Ltd.
BetaSoft Inc.
Digital Point modules: Sphinx-based search
All times are GMT -8. The time now is 08:03 PM.

Copyright BetaSoft Inc.