Skip to content

Latest commit

 

History

History
42 lines (24 loc) · 1.45 KB

README.md

File metadata and controls

42 lines (24 loc) · 1.45 KB

MediaMonitor

Monitoring tool for Polish news sites articles popularity in social media.

Prerequisites:

  • django

  • scrapy

  • celery

  • pytz

    Roadmap:

        MediaMonitor/ : django project folder with all core settings for django
        
        linkspider/ : tools for scrapping links from sites
        
          linkspider/ : scrapy application for crawling Agora and Onet
          
            items.py : defines what information from each link is to be processed further
            
            pipelines.py : defines functions to process those items
            
            settings.py : scrapy settings: set log level, user agent and pipelines (see above)
            
            spiders/
            
              agora_spider.py : crawler defintion
              
              onet_spider.py : crawler defintion
              
        monitor/ : django app: contains :
        
          views.py -  how to display the data
          
          models.py - Database models of links and stats: writing, reading, updating links and stats
          
          lib/share_getter.py - a core library for getting sharing statistics from FB and TW
          
          templates/ - templates for displaying the data
          
        statspider/ : spider for scrapping popularity stats from social media sites
        
          spider.py: core script for getting the data (using monitor/lib/share_getter.py)