Difference between revisions of "Previews Parsing"

From Komic Box Docs
Jump to: navigation, search
(updating info on URL format)
(url)
Line 19: Line 19:
 
I appears the the URL for each COF in the archive follow the format below:
 
I appears the the URL for each COF in the archive follow the format below:
  
https://www.previewsworld.com/Catalog/CustomerOrderForm/<format>/<MONYY>
+
    https://www.previewsworld.com/Catalog/CustomerOrderForm/<format>/<MONYY>
  
 
where <format> is either PDF or TXT
 
where <format> is either PDF or TXT

Revision as of 18:14, 19 October 2018

Previews Parsing

Purpose

To take the previews order form and parse its contents into database tables

The Previews Web Site

The home page is located at previewsworld.com

They now have a digital version of previews which you can view on the web site or on a mobile app. Each issue is $3.99

The Customer Order Form (COF) can be downloaded in Text or PDF format from the Archive page. The have issues as far back as Jan 2012.

I have down loaded some of the text format COFs and have them located here JAN 2009 thru APR 2013

I also downloaded some of the PDF format COFs which are located here: JAN 2009 thru DEC 2010, SEP 2011, JAN 2012 thru OCT 2012 and JAN 2013 thru APR 2013

I appears the the URL for each COF in the archive follow the format below:

   https://www.previewsworld.com/Catalog/CustomerOrderForm/<format>/<MONYY>

where <format> is either PDF or TXT

and <MONYY> is the 3 letter month abbreviation and the 2 digit year.

Apparently, it may be posible to go back as far as JAN 2010 using this process

High Level Functions

File Loader

Parsing of Loaded Data