Skip to content

Commit

Permalink
Merge remote-tracking branch 'upstream/master'
Browse files Browse the repository at this point in the history
  • Loading branch information
j0k3r committed Apr 2, 2017
2 parents 213461b + eb40898 commit c09a689
Show file tree
Hide file tree
Showing 8 changed files with 61 additions and 10 deletions.
13 changes: 13 additions & 0 deletions blog.trendmicro.com.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# Generated by FiveFilters.org's web-based selection tool
# Place this file inside your site_config/custom/ folder
# Source: http://siteconfig.fivefilters.org/grab.php?url=http%3A%2F%2Fblog.trendmicro.com%2Ftrendlabs-security-intelligence%2Fwinnti-abuses-github%2F

title: //div[@id='post-title']//h1

date: //li[@class='post-date']//div[@class='meta-info']//a

author: //a[@rel='author']

body: //div[@id='pageContent']

test_url: http://blog.trendmicro.com/trendlabs-security-intelligence/winnti-abuses-github/
6 changes: 6 additions & 0 deletions gurumed.org.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
prune: no
body: //div[@class='entry']
strip: //div[@class='addthis_toolbox']
strip: //div[@class='yarpp-related']

test_url: http://www.gurumed.org/2015/06/22/nous-entrons-dsormais-dans-la-sixime-extinction-massive/
10 changes: 10 additions & 0 deletions jeuxvideo.com.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
prune: no
body: //div[@class='corps-news text-enrichi-default']
body: //div[@class='corps-article text-enrichi-default']
body: //div[@class='corps-video text-enrichi-default']
strip: //div[@class='bloc-contact-auteur']
strip: //div[@class='liens-avis-lecteur']

test_url: http://www.jeuxvideo.com/news/431383/lancement-cosmique-pour-devouring-stars.htm
test_url: http://www.jeuxvideo.com/test/431612/massive-chalice-du-tour-par-tour-medieval-independant.htm
test_url: http://www.jeuxvideo.com/videos/431381/devouring-stars-moisson-d-etoiles.htm
4 changes: 3 additions & 1 deletion linkedin.com.txt
Original file line number Diff line number Diff line change
@@ -1,2 +1,4 @@
http_header(user-agent): Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:50.0) Gecko/20100101 Firefox/50.0
single_page_link: //ul[@class='util-nav']//a[@class='close']
test_url: http://www.linkedin.com/news?actionBar=&articleID=894735221&ids=0Rdj4Qe3wQejwIczAOc3sRdzwUb3wScPoPdzkVe2MNcz8RcPsQejwIcPASdjwTcjwU&aag=true&freq=weekly
test_url: http://www.linkedin.com/news?actionBar=&articleID=894735221&ids=0Rdj4Qe3wQejwIczAOc3sRdzwUb3wScPoPdzkVe2MNcz8RcPsQejwIcPASdjwTcjwU&aag=true&freq=weekly
test_url: https://www.linkedin.com/pulse/google-facebook-ad-traffic-90-useless-omid-sadeghpour
12 changes: 10 additions & 2 deletions numerama.com.txt
Original file line number Diff line number Diff line change
@@ -1,3 +1,11 @@
strip://section[@class='related-article']
# Need html5lib or replace_string to handle correctly badly included in-content-ad inclusion script. html5lib is a lot slower.
# parser: html5lib
replace_string("</div>"): "&lt;/div&gt;"
body: //article[@class='post-content']
strip: //span[@class='summary-entry']
strip: //footer

test_url: http://www.numerama.com/sciences/231009-les-radiochats-et-lepineuse-question-de-la-memoire-des-sites-nucleaires.html
test_url: http://www.numerama.com/sciences/243352-hubble-detecte-un-trou-noir-supermassif-propulse-hors-de-sa-galaxie.html
test_url: http://www.numerama.com/tech/242703-free-mobile-et-la-4g-en-illimite-ce-quil-faut-savoir.html
# Don't know why this one get everything in bold:
test_url: http://www.numerama.com/business/243686-comme-convenu-quand-lenfer-dune-startup-se-transforme-en-succes-dauto-edition.html
9 changes: 9 additions & 0 deletions secouchermoinsbete.fr.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
prune: no
body: //article[@class='anecdote']
strip: //article[@class='anecdote']/aside/div[@class='column-wrapper']
strip: //article[@class='anecdote']/aside/div[@id='related-sources-wrapper']/div[@id='related']


test_url: http://secouchermoinsbete.fr/62836-audi-a-ete-cree-par-un-ancien-chef-de-chez-mercedes
test_url: http://secouchermoinsbete.fr/62795-sous-l-eau-a-plus-de-10-metres-votre-sang-est-vert
test_url: http://secouchermoinsbete.fr/62663-l-invention-qui-pourrait-nettoyer-les-oceans-en-quelques-annees
15 changes: 9 additions & 6 deletions thedailybeast.com.txt
Original file line number Diff line number Diff line change
@@ -1,7 +1,10 @@
title: //h1
body: //article/div[contains(@class, 'article-body')]
#strip: //header/hgroup/h1
strip: //footer[@class='storyFooter']
single_page_link: //li[@class='print']/a
body: //div[contains(@class, 'ArticleBody')]
strip_id_or_class: share
strip_id_or_class: Share
strip_id_or_class: footer
strip_id_or_class: Footer
strip_id_or_class: Newsletter
prune: no
test_url: http://www.thedailybeast.com/articles/2010/04/06/how-mastercard-predicts-divorce.html
test_url: http://www.thedailybeast.com/articles/2017/04/01/michael-flynn-failed-to-disclose-payments-from-russian-propaganda-network.html
test_url: http://www.thedailybeast.com/articles/2010/04/06/how-mastercard-predicts-divorce.html
test_contains: people who are going through a divorce are more likely to miss payments
2 changes: 1 addition & 1 deletion theverge.com.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@ author: //meta[@name="author"]/@content
title: //meta[@property="og:title"]/@content
date: //meta[@property="article:published_time"]/@content

body: //div[contains(@class, 'c-entry-content') or contains(@class, 'c-entry-hero__image')]
body: //picture[contains(@class, 'c-picture')] | //div[contains(@class, 'c-entry-content') or contains(@class, 'c-entry-hero__image')]
# for vergecasts, e.g. http://www.theverge.com/2013/8/22/4648566/the-vergecast-090-august-22th-2013-video
body: //article
body: //div[contains(concat(' ',normalize-space(@class),' '),' l-col__main ')]
Expand Down

0 comments on commit c09a689

Please sign in to comment.