Difference between revisions of "Previews Parsing"
Gskluzacek (Talk | contribs) (adding content about previews web site)
Gskluzacek (Talk | contribs) (updating info on URL format)
Revision as of 18:12, 19 October 2018
Previews Parsing
Purpose
To take the Previews order form and parse its contents into database tables.
The Previews Web Site
The home page is located at previewsworld.com.
They now have a digital version of Previews, which you can view on the web site or in a mobile app. Each issue costs $3.99.
The Customer Order Form (COF) can be downloaded in text or PDF format from the Archive page. They have issues as far back as Jan 2012.
I have downloaded some of the text-format COFs and have them located here: JAN 2009 thru APR 2013
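I also downloaded some of the PDF-format COFs, which are located here: JAN 2009 thru DEC 2010, SEP 2011, JAN 2012 thru OCT 2012 and JAN 2013 thru APR 2013 (https://github.com/gskluzacek/previews/tree/master/old_code/Preview%20Parsing/previews%20parsing/pdf)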
It appears that the URL for each COF in the archive follows the format below:
https://www.previewsworld.com/Catalog/CustomerOrderForm/<format>/<MONYY>
where <format> is either PDF or TXT
and <MONYY> is the 3-letter month abbreviation followed by the 2-digit year.
Apparently, it may be possible to go back as far as JAN 2010 by building URLs this way, even though the Archive page only lists issues back to Jan 2012.
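The URL format above can be sketched in Python as follows. This is only a rough sketch and has not been checked against the live site. It assumes the month abbreviation is the upper-case English one (JAN, FEB, ...) and that <format> is upper case, as written above. The names cof_url and cof_urls are made up for this example, and nothing here actually downloads a file.

```python
# Sketch: build Customer Order Form (COF) URLs from the observed pattern
#   https://www.previewsworld.com/Catalog/CustomerOrderForm/<format>/<MONYY>
# Assumption: upper-case English month abbreviations and upper-case format.

BASE_URL = "https://www.previewsworld.com/Catalog/CustomerOrderForm"
MONTHS = ("JAN", "FEB", "MAR", "APR", "MAY", "JUN",
          "JUL", "AUG", "SEP", "OCT", "NOV", "DEC")


def cof_url(year, month, fmt="TXT"):
    """Return the COF URL for one issue (hypothetical helper)."""
    fmt = fmt.upper()
    if fmt not in ("PDF", "TXT"):
        raise ValueError("format must be PDF or TXT")
    if not 1 <= month <= 12:
        raise ValueError("month must be between 1 and 12")
    # <MONYY>: 3-letter month abbreviation followed by the 2-digit year
    return f"{BASE_URL}/{fmt}/{MONTHS[month - 1]}{year % 100:02d}"


def cof_urls(start, end, fmt="TXT"):
    """Yield COF URLs for every month from start to end, inclusive.

    start and end are (year, month) tuples.
    """
    year, month = start
    while (year, month) <= end:
        yield cof_url(year, month, fmt)
        month += 1
        if month > 12:
            year, month = year + 1, 1
```

For example, cof_url(2013, 1, "pdf") builds the URL ending in /PDF/JAN13, and list(cof_urls((2010, 1), (2010, 12))) gives the twelve candidate text-format URLs for 2010. Whether each of those issues really exists on the site would still have to be checked by requesting it.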