Scraping and ad-stripping Google's results

Decius
Topic: Technology 1:14 pm EST, Jan 11, 2005

] This step that we have taken has implications for all
] search engines. These engines crawl the public web
] without asking permission, and cache and reproduce the
] content without asking permission, and then use this
] information as a carrier for ads that generate private
] profit. We are convinced that if citizens scrape Google
] and strip the ads, and make the scraped results available
] as a nonprofit public service, that this is legal. This
] is especially the case if there are public policy
] concerns behind the scraping.
]
] Google Watch has been the most prominent critic of
] Google's outrageous privacy policies for more than two
] years. This is why we started the proxy, and it's why we
] continue the proxy. We invite Google to serve us with a
] cease and desist letter as a first step toward resolving
] this issue. So far, we have yet to hear from Google's
] lawyers. By releasing the source code for our proxy,
] we're trying to escalate the issue.

Google should not store all four octets of your IP address. There is no "good" use for that data. Of course, MemeStreams does this too, so yes, I'm being hypocritical, but I didn't just make a fortune in a public offering, either. Google should instead store a SHA-1 hash of the last two octets, so it can identify unique visitors without storing identifying data. And the cookie could use some end-user control.
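
As a rough illustration of the hashing idea above, here is a minimal Python sketch. The function name `visitor_key` is hypothetical, and the choice to hash the raw two bytes with no salt is an assumption made for the example; nothing here describes what Google or MemeStreams actually does.

```python
import hashlib
import ipaddress


def visitor_key(ip: str) -> str:
    """Return a SHA-1 hex digest of the last two octets of an IPv4 address.

    Hypothetical helper: the name and the unsalted scheme are assumptions
    for illustration only.
    """
    # IPv4Address validates the string and raises ValueError on bad input.
    packed = ipaddress.IPv4Address(ip).packed  # 4 raw bytes
    last_two = packed[2:]                      # drop the first two octets
    return hashlib.sha1(last_two).hexdigest()


if __name__ == "__main__":
    # The stored key no longer contains the full address.
    print(visitor_key("192.168.1.20"))
    # Addresses that share their last two octets map to the same key.
    print(visitor_key("10.0.1.20") == visitor_key("192.168.1.20"))
```

Two things to keep in mind with a scheme like this: visitors who share their last two octets collapse into one key, and since there are only 65,536 possible inputs, an unsalted hash can be reversed by simply enumerating them. What the hash really protects is the discarded first two octets.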
