Re: Lazy man's google search of the group...
Posted by
turbulatordude
on 2002-11-14 05:06:46 UTC
<snip>
There was some question of the percentage of use of the list.
we are currently 149.7 meg. of 512 meg. not sure how that works.
or what it means. but assuming yahoo is reporting text file size,
the is huge. The limit in the files section is only about 30 meg.
Don't forget that you can also read a message index. I just started
looking at that.
go on-line to any post and the link 'message index' is just after 'p
thread' this only works when you are looking at a post, not the
general messages. What it looks like, might be the last 30 messages,
all in text format, all without ad's. That would cut one's slurping
down to 2,000 slurps instead of 60,000.
Pumping this into excel it would be rather simple to put it into a
database, strip off the much larger header, but only one per 30
posts. Also, each post is preceeded by the post number so a lengthy
ripping of data is not needed.
With some clever database work, it seems one can organize and relate
posts with the exact same seuject line. Means our changing subject
lines is more important when the topic changes.
Dave
There was some question of the percentage of use of the list.
we are currently 149.7 meg. of 512 meg. not sure how that works.
or what it means. but assuming yahoo is reporting text file size,
the is huge. The limit in the files section is only about 30 meg.
Don't forget that you can also read a message index. I just started
looking at that.
go on-line to any post and the link 'message index' is just after 'p
thread' this only works when you are looking at a post, not the
general messages. What it looks like, might be the last 30 messages,
all in text format, all without ad's. That would cut one's slurping
down to 2,000 slurps instead of 60,000.
Pumping this into excel it would be rather simple to put it into a
database, strip off the much larger header, but only one per 30
posts. Also, each post is preceeded by the post number so a lengthy
ripping of data is not needed.
With some clever database work, it seems one can organize and relate
posts with the exact same seuject line. Means our changing subject
lines is more important when the topic changes.
Dave
> IANAL, so YMMV and all that. You're staring down about 200M of
> data if you want all the posts for CCED as uncompressed plaintext.
>
> Dave Kowalczyk
> Everett WA
> TurboCNC software --> http://www.dakeng.com
>
> >
> > Thanks for the feedback... there are shortcomings, of that there
> is
> > no doubt. But I am trying! ;)
> >
> > I'm to the point now where it seems the only real way to have a
> > comprehensive handle on things is to have all the messages stored
> > somewhere I can get my hands on them.
> >
> > I might just screen scrape the whole darn thing. Someone said
> there
> > may be copyright issues... someone care to explain them?
> >
> > How about if I make a search but never expose my copy of the
> messages
> > and just refer the person searching to the yahoo post?
> >
> > Jason
Discussion Thread
Askew, Jason
2002-11-13 07:48:28 UTC
Lazy man's google search of the group...
Marv Frankel
2002-11-13 08:15:01 UTC
Re: [CAD_CAM_EDM_DRO] Lazy man's google search of the group...
turbulatordude
2002-11-13 08:32:32 UTC
Re: Lazy man's google search of the group...
echnidna
2002-11-13 18:57:13 UTC
Re: Lazy man's google search of the group...
killthiskid
2002-11-13 19:13:24 UTC
Re: Lazy man's google search of the group...
echnidna
2002-11-13 22:06:01 UTC
Re: Lazy man's google search of the group...
Dave Kowalczyk
2002-11-13 22:23:44 UTC
Re: Lazy man's google search of the group...
echnidna
2002-11-13 23:35:26 UTC
Re: Lazy man's google search of the group...
turbulatordude
2002-11-14 05:06:46 UTC
Re: Lazy man's google search of the group...
jmkasunich
2002-11-14 05:56:56 UTC
Re: Lazy man's google search of the group...
JJ
2002-11-14 09:21:23 UTC
RE: [CAD_CAM_EDM_DRO] Re: Lazy man's google search of the group...
Raymond Heckert
2002-11-14 19:48:08 UTC
Re: [CAD_CAM_EDM_DRO] Re: Lazy man's google search of the group...
echnidna
2002-11-14 20:19:31 UTC
Re: Lazy man's google search of the group...
Fred Smith
2002-11-14 21:31:23 UTC
Re: Lazy man's google search of the group...
Chris L
2002-11-15 21:30:09 UTC
Re: [CAD_CAM_EDM_DRO] Re: Lazy man's google search of the group...