CAD CAM EDM DRO - Yahoo Group Archive

Re: Searching Archives

Posted by echnidna
on 2002-11-11 15:01:52 UTC
I liked Jason's suggestion of the webscraper as it would probably be great for offline research but Daves comments about the 7x10mini-lathe site really seem the way to go. I wonder if he would show us how to do a similar setup for this site.

Regards
Bob Thomas

--- In CAD_CAM_EDM_DRO@y..., "turbulatordude" <davemucha@j...> wrote:
> It's all text, and most are less than a page, so you are talking
> about 51k or more likely 20k text pages.
>
> I would think less than 10 meg.
>
> There is a guy who did something sililar on the 7x10mini-lathe site
> he made an index that links to the yahoo posts. he has a database
> that allows you to search on your words, but instead of posting his
> copy of the list, it links you to the yahoo board. no copyright
> infringment, no complaints, and you can go up and down the thread as
> you choose.
>
> Dave
>
>
>
> --- In CAD_CAM_EDM_DRO@y..., "Askew, Jason" <jaskew@u...> wrote:
> > Bonus question: how large do you think all 51000 messages would
> be? I
> > could make a screen (html) scraper to go through yahoo and get them
> all...
> > but what to do with them then? I might be able to provide a
> location for
> > them, but it would definately be based on the size.
> >
> > My 2 cents.
> >
> > -----Original Message-----
> > From: echnidna [mailto:echnidna@y...]
> > Sent: Sunday, November 10, 2002 5:00 PM
> > To: CAD_CAM_EDM_DRO@y...
> > Subject: [CAD_CAM_EDM_DRO] Searching Archives
> >
> >
> > Hi Group
> > When we do a keyword search of the archives only the last 1000
> posts are
> > searched. Do a simple trial yourself by searching for "welcome"
> which was
> > the very first message posted. You won't find it!
> >
> > So finding info in the previous 51000 odd messages is impossible
> without
> > opening each message individually.
> >
> > Because of the huge amount of info posted by this group it seems
> that some
> > way of searching all messages would benefit all of us.
> >
> > So how can we organise this info so any of us can find things
> easily?
> >
> > Some Possibilities;
> > put it all into a word document so we can use the find facility.
> > put into htm format on a freebie website so we can use search
> facility.
> >
> > As there are over 52000 messages, whatever is done will probably
> mean that a
> > team of us will need to set it all up.
> >
> > What are your suggestions?
> >
> > Regards
> > Bob Thomas

Discussion Thread

echnidna 2002-11-10 14:59:50 UTC Searching Archives Tim Goldstein 2002-11-10 15:06:18 UTC RE: [CAD_CAM_EDM_DRO] Searching Archives turbulatordude 2002-11-10 18:26:09 UTC Re: Searching Archives Marv Frankel 2002-11-10 19:38:09 UTC Re: [CAD_CAM_EDM_DRO] Searching Archives Askew, Jason 2002-11-10 19:38:11 UTC RE: [CAD_CAM_EDM_DRO] Searching Archives Jon Elson 2002-11-10 22:29:29 UTC Re: [CAD_CAM_EDM_DRO] Searching Archives echnidna 2002-11-11 01:17:53 UTC Re: Searching Archives alenz2002 2002-11-11 03:03:44 UTC Re: Searching Archives bkpryor 2002-11-11 03:40:45 UTC Re: Searching Archives turbulatordude 2002-11-11 04:30:44 UTC Re: Searching Archives turbulatordude 2002-11-11 05:41:28 UTC Re: Searching Archives echnidna 2002-11-11 15:01:52 UTC Re: Searching Archives