Top 1 Million ranked websites and top level domains (TLD)

cjbarker/top-domains

TOP DOMAINS

About

The repo contains the top-ranked top-level domains (TLDs) and websites tracked via Cisco's Umbrella Popularity List. Potential future enhancements may include additional record sources for merging (e.g., Alexa 1 Million).

The repo's goal is to provide simple, static comma-separated files that are easy to ingest and use.

File Downloads

The files can be downloaded in several ways:

  1. Download the archive release, in 7zip format, which includes all the files
wget https://gitlab.com/cjbarker/top-domains/uploads/top-recs-20200521.zip
  2. Download all files by cloning the repository
git clone git@gitlab.com:cjbarker/top-domains.git
cd top-domains/top-recs
  3. Download an individual file as a raw file from the top-recs directory in the repository
wget https://gitlab.com/cjbarker/top-domains/raw/master/top-recs/top-sites-1000000.csv
  4. Run the program directly via wget piped to sh (see Usage below)
wget -qO- https://gitlab.com/cjbarker/top-domains/raw/master/create-lists.sh | sh

Usage

The files can be downloaded directly from the top-recs directory in the repo, or generated locally by running the script.

If you choose to run the script locally, the following commands execute it and show the resulting files:

# Downloads and splits records accordingly
wget -qO- https://gitlab.com/cjbarker/top-domains/raw/master/create-lists.sh | sh

# Available files separated by TLD and websites
# Format <rank>,<value>
ls top-recs/
top-TLD-100.csv       top-TLD-4121.csv      top-sites-1000.csv    top-sites-100000.csv
top-TLD-1000.csv      top-sites-100.csv     top-sites-10000.csv   top-sites-1000000.csv

# Example Output of ranked Top Level Domains (TLD)
head top-recs/top-TLD-100.csv
1,com
2,net
3,googleapis.com
4,org
5,io
6,cn
7,goog
8,co
9,vn
10,tv

# Example Output of ranked Top Websites
head top-recs/top-sites-100.csv
1,google.com
2,microsoft.com
3,www.google.com
4,windowsupdate.com
5,ctldl.windowsupdate.com
6,data.microsoft.com
7,facebook.com
8,netflix.com
9,safebrowsing.googleapis.com
10,live.com
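
Because every file uses the same `<rank>,<value>` layout, lookups can be done with standard tools. The following is a minimal sketch, not part of the repository: the function names `rank_of` and `top_n` are hypothetical, and it assumes a POSIX `sh` and `awk` are available and that values contain no commas.

```shell
#!/bin/sh
# Hypothetical helpers for the "<rank>,<value>" CSV files in top-recs/.
# These functions are illustrative only; the repository does not ship them.

# Print the rank of an exact match for a domain or TLD.
# Returns a non-zero status if the value is not in the file.
rank_of() {
    # $1 = CSV file, $2 = value to look up
    awk -F, -v want="$2" '
        $2 == want { print $1; found = 1; exit }
        END { exit !found }
    ' "$1"
}

# Print the first N values (without their ranks), one per line.
top_n() {
    # $1 = CSV file, $2 = number of records to print
    awk -F, -v n="$2" 'NR > n { exit } { print $2 }' "$1"
}

# Example usage (assumes the files exist locally):
#   rank_of top-recs/top-sites-100.csv netflix.com
#   top_n top-recs/top-TLD-100.csv 3
```

Exact string matching is used rather than `grep` so that a query such as `google.com` does not also match `www.google.com`.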
