2020-10-072021_02UniProt Knowledgebase and related datasetsUniProt is updated every eight weeks (see FAQ on
how to be notified automatically of updates).
You can download small data sets and subsets directly from
this website by following the download link on any search result
page. For downloading complete data sets we recommend using
ftp.uniprot.org.
If you are located in Europe, the Middle East or Africa, you
may want to download data from our mirror site in the
United Kingdom
or in Switzerland instead.
See also: Downloaded data seems incomplete or corrupted - how can I get help with download problems?
Here are the main sections of our FTP site, with links to README files and help pages and some frequently downloaded files:
UniProtKB
Parent directoryUniRef
The UniProt Reference Clusters (UniRef) provide clustered sets of sequences from the UniProt Knowledgebase (including isoforms) and selected UniParc records.
i2020-10-072021_02
Parent directoryUniParc
The UniProt Archive (UniParc) is a comprehensive and non-redundant database that contains most of the publicly available protein sequences in the world. Proteins may exist in different source databases and in multiple copies in the same database. UniParc avoided such redundancy by storing each unique sequence only once and giving it a stable and unique identifier.
i2020-10-072021_02
Parent directory /
READMEUniProt RDF distribution
All data provided by the UniProt consortium in RDF format, including supporting datasets.
i
2020-10-072021_02RDFParent directory /
README