Post #496209

Author
none
Parent topic
Starwars.com closes its forums
Link to post in topic
https://originaltrilogy.com/post/id/496209/action/topic#496209
Date created
4-May-2011, 10:02 AM
Main forums.starwars.com Page Types and Preliminary Counts
Page Type | File Name | Preliminary Count Estimate
Announcements | ann.jspa?annID=# | Uncertain (probably under 10)
Category | category.jspa?categoryID=# and category.jspa?categoryID=#&start=## (## = 0, 15, 30, 45, etc.) | IDs 1 thru 20, each with multiple start values
Forum | forum.jspa?forumID=# and forum.jspa?forumID=#&start=## | IDs 1 thru ~193 (don't seem to be sequential); heavily used forumID=61 has at least 1102 pages (start=16515)
Messages | message.jspa?messageID=# | ~2 million according to stats on main forum page; quick scrape found a high value of 17965717
Profiles | profile.jspa?userID=# | Unknown; quick scrape found a high value of 9782310; IDs seem to be sequential (earlier IDs have earlier creation dates)
RSS | rss.jspa?feed=rss%2Frssmessages.jspa?forumID=# | Please confirm if these are scrape worthy
Tag | tag.jspa?tagName=__NAME__ | Main Star Wars terms each have their own __NAME__ tag
Thread Message | thread.jspa?messageID=# | Unknown; quick scrape found a high value of 17966647
Thread Thread | thread.jspa?threadID=# | 50,574 according to stats on main forum page; quick scrape found a high value of 275287
Other | Folders 'dwf', 'resources' & 'scripts' have JavaScript (.js); folders 'images' & 'share' have .gifs; file type 'index' and a few other misc. types |
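The start values for Category and Forum listings step by 15, so a highest observed start of 16515 for forumID=61 works out to 16515 / 15 + 1 = 1102 pages, which matches the count above. Here is a rough Python sketch of how those listing URLs could be enumerated. The http:// scheme, the helper names, and leaving out start on the first page are my assumptions, not something confirmed by the scrape.

```python
from urllib.parse import urlencode

BASE_URL = "http://forums.starwars.com"  # assumption: scheme and host
PAGE_SIZE = 15  # step between start values (0, 15, 30, 45, ...)


def page_count(highest_start, page_size=PAGE_SIZE):
    """Number of listing pages implied by the highest observed start offset."""
    return highest_start // page_size + 1


def listing_urls(file_name, id_param, id_value, highest_start, page_size=PAGE_SIZE):
    """Yield one URL per start offset from 0 up to highest_start (inclusive)."""
    for start in range(0, highest_start + 1, page_size):
        query = {id_param: id_value}
        if start:  # first page uses the plain form without a start parameter
            query["start"] = start
        yield f"{BASE_URL}/{file_name}?{urlencode(query)}"


if __name__ == "__main__":
    urls = list(listing_urls("forum.jspa", "forumID", 61, 16515))
    print(page_count(16515), "pages")
    print(urls[0])
    print(urls[-1])
```

The same helper should work for category.jspa with categoryID, provided the highest start value for each category is found first.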
Note: these are the main categories found from a quick scrape.
Possible repetition : page type 'Messages' might be the same as 'Thread Message'
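To firm up the counts and check the possible repetition, scraped URLs could be sorted into the page types listed above by file name and query parameter. This is only a sketch: the function names are made up for illustration, and finding the same messageID under both message.jspa and thread.jspa only flags candidates. The page contents would still need to be compared to confirm they are duplicates.

```python
from collections import Counter
from urllib.parse import parse_qs, urlparse

# (file name, identifying query parameter) -> page type from the list above
PAGE_TYPES = {
    ("ann.jspa", "annID"): "Announcements",
    ("category.jspa", "categoryID"): "Category",
    ("forum.jspa", "forumID"): "Forum",
    ("message.jspa", "messageID"): "Messages",
    ("profile.jspa", "userID"): "Profiles",
    ("rss.jspa", "feed"): "RSS",
    ("tag.jspa", "tagName"): "Tag",
    ("thread.jspa", "messageID"): "Thread Message",
    ("thread.jspa", "threadID"): "Thread Thread",
}


def classify(url):
    """Return (page type, identifier) for a URL, or ("Other", None)."""
    parts = urlparse(url)
    file_name = parts.path.rsplit("/", 1)[-1]
    params = parse_qs(parts.query)
    for (name, key), label in PAGE_TYPES.items():
        if file_name == name and key in params:
            return label, params[key][0]
    return "Other", None


def count_page_types(urls):
    """Tally how many URLs fall under each page type."""
    return Counter(classify(url)[0] for url in urls)


def shared_message_ids(urls):
    """messageIDs seen under both 'Messages' and 'Thread Message' URLs."""
    seen = {"Messages": set(), "Thread Message": set()}
    for url in urls:
        label, ident = classify(url)
        if label in seen:
            seen[label].add(ident)
    return seen["Messages"] & seen["Thread Message"]
```

Running count_page_types over the full list of scraped URLs would give a per-type tally to set against the preliminary estimates.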