Skip to content

Commit

Permalink
Merge remote-tracking branch 'upstream/master'
Browse files Browse the repository at this point in the history
  • Loading branch information
j0k3r committed Oct 25, 2016
2 parents 80b4e5f + 43e47ee commit 2cf4cb6
Show file tree
Hide file tree
Showing 9 changed files with 40 additions and 7 deletions.
4 changes: 4 additions & 0 deletions abc-luxe.com.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
title: //div[contains(concat(' ',normalize-space(@class),' '),' brandMarginT ')]//h1
body: //div[contains(concat(' ',normalize-space(@class),' '),' article ')]

test_url: http://www.abc-luxe.com/actus/produits/article/kenzo-world-une-campagne-dejantee-pour-le-premier-parfum-signe-carol-lim-et-humberto-leon
4 changes: 4 additions & 0 deletions heise.de.txt
Original file line number Diff line number Diff line change
Expand Up @@ -47,11 +47,15 @@ replace_string(<span class="bild_rechts" style="width:): <p "
replace_string(<div class="heisebox">): <blockquote>

single_page_link: //a[contains(@href, '?view=print')]
single_page_link: //a[contains(@title, 'Druck')]

next_page_link: //a[@class='next']
next_page_link: //a[@title='vor']
next_page_link: //a[@rel='next']

test_url: http://www.heise.de/open/artikel/Die-Neuerungen-von-Linux-3-15-2196231.html
test_url: http://m.heise.de/open/artikel/Die-Neuerungen-von-Linux-3-15-2196231.html
test_url: http://www.heise.de/newsticker/meldung/Ueberwachungstechnik-Die-globale-Handy-Standortueberwachung-2301494.html
test_url: http://www.heise.de/newsticker/meldung/Bodenradar-fuer-selbstfahrende-Autos-horcht-unter-die-Strasse-3273941.html
test_url: http://www.heise.de/tp/artikel/49/49473/1.html
test_url: http://www.heise.de/ct/artikel/Die-Neuerungen-von-Linux-3-15-2196231.html
3 changes: 3 additions & 0 deletions imgcert.com.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
body: //div[@id='image-viewer-container']//img

test_url: https://imgcert.com/image/fgX1
6 changes: 4 additions & 2 deletions lemonde.fr.txt
Original file line number Diff line number Diff line change
Expand Up @@ -10,8 +10,10 @@ date: //time[@itemprop='datePublished']/@datetime


body: //div[@id='articleBody']
#Shoot the insane "conjugaison.lemonde.fr" links :
#strip: //a[contains(@class, 'conjug')]

# Remove the insane "conjugaison.lemonde.fr" links:
find_string: <a target='_blank' onclick='return false;' class='lien_interne conjug'
replace_string: <input type='hidden' style='display:none;'

prune: no

Expand Down
8 changes: 3 additions & 5 deletions lowtechmagazine.com.txt
Original file line number Diff line number Diff line change
@@ -1,8 +1,6 @@
# Generated by FiveFilters.org's web-based selection tool
# Place this file inside your site_config/custom/ folder
# Source: http://siteconfig.fivefilters.org/grab.php?url=http%3A%2F%2Fwww.lowtechmagazine.com%2F2015%2F10%2Fcan-the-internet-run-on-renewable-energy.html
# Source: http://siteconfig.fivefilters.org/grab.php?url=http%3A%2F%2Fwww.lowtechmagazine.com%2F2015%2F12%2Freinventing-the-greenhouse.html

body: //div[contains(concat(' ',normalize-space(@class),' '),' entry-content ')]
strip: //hr

test_url: http://www.lowtechmagazine.com/2015/10/can-the-internet-run-on-renewable-energy.html
body: //div[contains(concat(' ',normalize-space(@class),' '),' entry-inner ')]
test_url: http://www.lowtechmagazine.com/2015/12/reinventing-the-greenhouse.html
5 changes: 5 additions & 0 deletions nrc.nl.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
body: //div[contains(concat(' ',normalize-space(@class),' '),' nmt-layout--sidebar-align-right ')]
strip_id_or_class: article__footer
strip_id_or_class: nmt-layout__sidebar

test_url: http://www.nrc.nl/nieuws/2016/10/02/de-nederlandse-school-wanorde-onrust-en-lawaai-4566603-a1524441
6 changes: 6 additions & 0 deletions servethehome.com.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
# Generated by FiveFilters.org's web-based selection tool
# Place this file inside your site_config/custom/ folder
# Source: http://siteconfig.fivefilters.org/grab.php?url=https%3A%2F%2Fwww.servethehome.com%2Ffirefox-is-eating-your-ssd-here-is-how-to-fix-it%2F

body: //div[contains(concat(' ',normalize-space(@class),' '),' the-content ')]
test_url: https://www.servethehome.com/firefox-is-eating-your-ssd-here-is-how-to-fix-it/
2 changes: 2 additions & 0 deletions theguardian.com.txt
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
title: //h1[@itemprop='headline']

body: //article
body: //div[contains(concat(' ',normalize-space(@class),' '),' content__main ')]//div[contains(concat(' ',normalize-space(@class),' '),' gs-container ')]
strip: //article/header/div[contains(@class, 'content__header')]
strip: //article/header/div[contains(@class, 'content__logo-container')]
strip: //article//div[contains(@class, 'content__secondary-column')]
Expand Down Expand Up @@ -36,6 +37,7 @@ test_contains: As the second most senior judge in the country, Lord Hoffmann, sa
test_url: http://www.theguardian.com/commentisfree/2014/jun/15/britishness-search-identity-my-part-in-camerons-odyssey
test_url: http://www.theguardian.com/world/2016/feb/17/ankara-explosion-turkey-injures-large-number-of-people-reports-say
test_url: http://www.theguardian.com/uk-news/2016/feb/11/trident-the-british-question
test_url: https://www.theguardian.com/books/live/2016/oct/13/nobel-prize-in-literature-2016-liveblog

# Native ad
test_url: http://www.theguardian.com/sustainable-business/fairtrade-partner-zone/chocolate-cocoa-production-risk
9 changes: 9 additions & 0 deletions webmasters.googleblog.com.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
body://div[@id='main']
date://div[@class='publishdate']
strip://div[@class='share']
strip://div[@class='post-footer']
strip://div[@class='cmt_iframe_holder']
strip://div[@class='blog-pager']
strip://div[@class='clear']
replace_string(noscript>): div>
test_url: https://webmasters.googleblog.com/2016/08/helping-users-easily-access-content-on.html

0 comments on commit 2cf4cb6

Please sign in to comment.