pegartigo

These basic Ruby methods fetch and parse article data from the Portuguese news site publico.pt and the Brazilian folha.uol.com.br, based on input search queries, for later analysis.

  • CSV result files use the Greek letter μ as the field separator.
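
Because of the unusual separator, most CSV tools need to be told about it explicitly. A minimal sketch of reading a result file back in Ruby follows. The helper names `parse_line` and `read_results` are hypothetical (they are not part of pega.rb), and the sketch assumes that fields are not quoted and never contain μ themselves, which is why a plain `String#split` is used instead of the CSV library.

```ruby
# Field separator used by the result files, as noted above.
SEPARATOR = "μ"

# Split one line of a result file into its fields.
# Assumption: fields are unquoted and never contain the separator.
# The -1 limit keeps trailing empty fields instead of dropping them.
def parse_line(line)
  line.chomp.split(SEPARATOR, -1)
end

# Read a whole result file into an array of field arrays,
# skipping blank lines.
def read_results(path)
  File.readlines(path, chomp: true, encoding: "UTF-8")
      .reject { |line| line.strip.empty? }
      .map { |line| parse_line(line) }
end
```

Something like `read_results("results/neutralidade-da-internet-publico.csv")` would then return one array per article. The order and meaning of the fields depend on what pega.rb writes, so check a result file before relying on column positions.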

Sample:

# set the query in pega.rb
get_publico "neutralidade da internet"
# run in a terminal
ruby pega.rb >> results/neutralidade-da-internet-publico.csv
# merge all generated CSV files and remove duplicate lines
cd results
cat *.csv > all-neutralidade-publico.csv
# sort -u keeps one copy of each distinct line; grep drops empty lines
sort -u all-neutralidade-publico.csv | grep -v '^$' > clean-all-neutralidade-publico.csv
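
The merge step can also be done in Ruby, which keeps the lines in their original order instead of sorting them. This is only a sketch: `merge_results` is a hypothetical helper, not something pega.rb provides, and it assumes each article occupies exactly one line of the result file.

```ruby
# Merge several result files into one, keeping the first occurrence of
# each line and dropping blank lines. Returns the number of lines written.
# Assumption: every record fits on a single line.
def merge_results(input_paths, output_path)
  lines = input_paths.flat_map do |path|
    File.readlines(path, chomp: true, encoding: "UTF-8")
  end
  unique = lines.reject { |line| line.strip.empty? }.uniq
  File.write(output_path, unique.map { |line| "#{line}\n" }.join)
  unique.size
end

# Example call (file names are illustrative):
#   output = "results/clean-all-neutralidade-publico.csv"
#   inputs = Dir.glob("results/*-publico.csv").sort - [output]
#   merge_results(inputs, output)
```

Excluding the output file from the input list avoids feeding an earlier merge back into itself on a second run.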