[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: More dumb ad sites for your killfile




[email protected] (Anonymous) writes:
> ads.lycos.com
> ad.doubleclick.net
> ads.altavista.digital.com
> ad.preferences.com
> ph-ad*.focalink.com
> www.news.com/Banners
> ads.lycos.com
> static.wired.com/advertising

A desirable feature for a proxy would be to filter portions of what it gets.
If we're getting a URL that matches a pattern (like www.stocks.hotshit.com),
then once we see a certain pattern in the HTML (start of embedded ad),
excise the HTML until we see another pattern.  A set of triples.

My block file so far is much bigger than yours.  Anyone cares to keep the
official big blocklist? :-)

**************************************************************************
victory.cnn.com/(image|click).ng/
/fox/graphics/yuppie.gif
ads.web.aol.com
/*/rsac.jpg
/gifs/ads/
webtrack.com
# look for guesttrack
/.*/rsacirated.gif
/shell-cgi/adserv/ads
/adserver/
/ad-graphics/
/cgi-win/tracker
/cgi-bin/Count
/cgi-bin/guesttrack
webcrawler.com/icons/(tenants.*|bottom_logo).gif
netads.*.com/
mckinley.com/img/magellan/butnbar.gif
#hotbot.com
hotbot.com/images/list.*.gif
#
banner-net.com/
/cgi-bin/nph-count
# yahoo
/us.yimg.com/images/compliance/
# Internet explorer logos
/.*/ie.*_(animated|static|sm).gif
/.*/netnow.*.gif
pagecount.com
gm.preferences.com
# da Silva's stupid list of mailing lists
/internet/paml/sponsors
/gifs/mlogo3.gif
#dejanews
/gifs/tripod.gif
/gifs/browsers.gif
/gifs/dnlogo_r.*.gif
# domains
smartclicks.com/
resource-marketing.com/
valueclick.com/
bannermall.com/
iname.com/
bannerweb.com/
eads.com/
/interdex/reciprocal/
/cgi-bin/spim/sp/
adforce..*.com/
/cgi-bin/adclick
imageserv.imgis.com/images/
/g/ads/
#
/graphics/ads/
/OAS/ugo/adstream.cgi/
/content/cgi-bin/clickad/
/content/advertising/
/.*/(S|s)ponsors/.*.gif
/cgi-bin/pn/show_ad
/gif/ads/
.*banner.*.gif
/ad/
/adgenius/
/adproof/
/(A|a)ds/
/adv/
/advertising/
/adverts/
/avimages/
/banner_ds/
/banners/
/banners?/
/CategoryID=0
/cgi-bin/ad-bin/
/cgi-bin/adroll/
/cgi-bin/counter*
/cgi-bin/nph-adlick
/event.ng/
/gfx/spon/
/gifs/netfinity.gif
/gifs/tripod2.gif
/graphics/pcast.gif
/graphicsadvert
/image/ads/
/images/ABCnewsa.gif
/images/ads/
/images/deckad1.gif
/images/getpoint1.gif
/images/nyyahoo.gif
/images/partners/
/images/promo/
/img/ads/
/img/art4/home/promo/
/inserts/images/
/ml/gfx/spon/
/pictures/sponsors/
/promobar
/promos/
/promotions/
/RealMedia/ads/
/shared/images/marketing/
/sponsor.*/.*.gif
209.25.19.47/
:23
ad.*.com/
adserve.*.com/
ad.*.net/
adcount.hollywood.com/
ads*.focalink.com/
ads.*.com/
bannersolutions.com/
bannerswap.com/
counter.digits.com/
digits.com/
doubleclick.com/
flycast.com/
freestats.com/
globaltrack.com/
globaltrack.net/
gp.dejanews.com/
guide.infoseek.com/
infoseek.com/images/channel/
hitbox.com/
hollynxxx.com/
icount.com/
jcount.com/
linkexchange.com/
register-it.com
riddler.com/
sexhound.com/
sexlist.com/
stattrax.com/
style.rahul.net/altavista/adverts/
# Geocities
www.geocities.com/cgi-bin/homestead/GeoGuideLite_image*
geocities.com/MemberBanners
xpagecount.com/
xxxcounter.com/
**************************************************************************
More stuff that I need to sort out:

adsmart.net
doubleclick.net
SmartBanner
imageserv.imgis.com/images
images.yahoo.com/adv
/ad_client.cgi

# ms sucks !
/*.*/(ms)?backoff(ice)?.*.(gif|jpe?g)
/*.*/(msie|sqlbans|powrbybo|activex|backoffice|explorer|netnow|getpoint|ntbutton|hmlink).*.(gif|jpe?g)
/*.*/activex.*(gif|jpe?g)
/*.*/explorer?.(gif|jpe?g)
/*.*/freeie.(gif|jpe?g)
/*.*/ie_?(buttonlogo|static?|anim.*)?.(gif|jpe?g)
/*.*/ie_sm.(gif|jpe?g)
/*.*/msie(30)?.(gif|jpe?g)
/*.*/msnlogo.(gif|jpe?g)
/*.*/office97_ad1.(gif|jpe?g)
/*.*/pbbobansm.(gif|jpe?g)
/*.*/powrbybo.(gif|jpe?g)
/*.*/sqlbans.(gif|jpe?g)

# generally useless information and promo stuff (commented out)
#/*.*/(counter|getpcbutton|BuiltByNOF|netscape|hotmail|vcr(rated)?|rsaci(rated)?|freeloader|cache_now(_anim)?|apache_pb|now_(anim_)?button|ie_?(buttonlogo|static?|.*ani.*)?).(gif|jpe?g)

#------------------------
#
# specific servers
#
#------------------------
193.158.37.3/cgi-bin/impact
193.210.156.114
194.231.79.38
199.78.52.10
204.253.46.71:1977
204.94.67.40/wc/
205.216.163.62
205.217.103.58:1977
205.217.103.58:1977
206.50.219.33
207.159.135.72
207.82.250.9
209.1.135.144:1971
209.1.135.142:1971
ad-up.com
ads?.*\.(com|net)
ad.adsmart.net
ad.doubleclick.net
ad.infoseek.com
ad.linkexchange.com
ad.preferences.com
adbot.com
adbot.theonion.com
adcount.hollywood.com
adforce.imgis.com
adlink.deh.de
adone.com
ads*.focalink.com
ads*.zdnet.com
ads.csi.emcweb.com
ads.imagine-inc.com
ads.imdb.com
ads.infospace.com
ads.lycos.com
ads.narrowline.com
ads.realmedia.com
ads.softbank.net/bin/wadredir
ads.usatoday.com
ads.washingtonpost.com
ads.web.aol.com
ads.web21.com
adservant.mediapoint.de
banners.internetextra.com
bannerswap.com
bs.gsanet.com/gsa_bs/
ciec.org/images/countdown.gif
click1.wisewire.com
click2.wisewire.com
clickii.imagine-inc.com:1964
commonwealth.riddler.com
customad.cnn.com
cyberfirst1.web.cerf.net/image.ng/
digits.com/wc/
dino.mainz.ibm.de
flycast.com/
globaltrack.com
globaltrak.net
gm.preferences.com/image.ng
gtp.dejanews.com/gtplacer
hardware.pagecount.com/
hitbox.com/wc/
hyperbanner.net
icount.com/.*.count
images.yahoo.com/promotions/
imageserv.imgis.com
impartner.de/cgi-bin
linktrader.com/cgi-bin/
logiclink.nl/cgi-bin/
movielink.com/media/imagelinks/MF.(ad|sponsor)
nrsite.com
nt1.imagine-inc.com
nt2.imagine-inc.com
nytsyn.com/gifs
pagecount.com/aa-cgi-bin
pagecount.com/aa-cgi-bin
ph-ad*.focalink.com
promo.ads.softbank.net
resource-marketing.com/tb/
smartclicks.com/.*/smartimg
smh.com.au/adproof/
sysdoc.pair.com/cgi-sys/cgiwrap/sysdoc/sponsor.gif
victory.cnn.com/image.ng/spacedesc
videoserver.kpix.com
w20.hitbox.com
www..bigyellow.com/......mat.*
www.ads.warnerbros.com
www.fxweb.holowww.com/.*.cgi
www.iadventure.com/adserver/
www.infoworld.com/pageone/gif
www.isys.net/customer/images
www.javaworld.com/javaworld/jw-ad
www.link4link.com/cgi-bin
www.mediashower.com/ad-bin/
www.nedstat.nl/cgi-bin/
www.nj.com/adverts
www.nrsite.com
www.pagecount.com/aa-cgi-bin
www.search.com/Banners
www.smartclicks.com:81
www.swwwap.com/cgi-bin/
www.valueclick.com/cgi-bin/
www.websitepromote.com/partner/img/
www.wishing.com/webaudit
yahoo.com/CategoryID=0

#------------------------
#
# some images on servers that I frequently visit
#
#------------------------

# some images on cnn's website just suck!
/*.*/book.search.gif
/*.*/cnnpostopinionhome..gif
/*.*/custom_feature.gif
/*.*/explore.anim.*gif
/*.*/infoseek.gif
/*.*/pathnet.warner.gif
/BarnesandNoble/images/bn.recommend.box.*
/digitaljam/images/digital_ban.gif
/hotstories/companies/images/companies_banner.gif
/markets/images/markets_banner.gif
/ows-img/bnoble.gif
/ows-img/nb_Infoseek.gif
cnnfn.com/images/left_banner.gif

# die sueddeutsche
/*.*/images/artszonnet.jpg

# yahoo.de
/promotions/bankgiro/

#
/gif/buttons/banner_.*
/gif/buttons/cd_shop_.*
/gif/cd_shop/cd_shop_ani_.*

#altavista
/av/gifs/av_map.gif
/av/gifs/av_logo.gif

/*.*/banner_ads/
/*.*/banners?/
/*.*/images/addver.gif
/*.*/place-ads
/*.*/promobar.*
/*.*/publicite/
/*.*/reklame/
/*.*/sponsor.gif
/*.*/sponsors?[0-9]?/
/*.*/werb\..*
/ad_images/
/bin/nph-oma.count/ct/default.shtml
/bin/nph-oma.count/ix/default.html
/cgi-bin/nph-load
/netscapeworld/nw-ad/
/worldnet/ad.cgi
/rotads/
/rotateads/
/rotations/
/promotions/houseads/
**************************************************************************

---

<a href="mailto:[email protected]">Dr.Dimitri Vulis KOTM</a>
Brighton Beach Boardwalk BBS, Forest Hills, N.Y.: +1-718-261-2013, 14.4Kbps