[3dem] Removal of ls-lR index file from the PDB archive

Justin Flatt justin at rcsb.rutgers.edu
Tue Apr 4 10:06:28 PDT 2023


With continuing growth of the PDB archive, the size of the file that 
lists all directory contents (currently 
https://urldefense.com/v3/__https://files.wwpdb.org/pub/pdb/ls-lR__;!!Mih3wA!BGWSIyRbxWfOMaTcBMURR_HO4hfOYhRyVNHTksDG78infvOOUD7CH2x2GzJ2ORQGMFP7IYdtWGPtb-SNT3FQe90S$ ) will become a challenge for long 
term maintenance. wwPDB plans to remove this file from the PDB archive 
at 00:00 UTC on July 12, 2023. We strongly encourage users to utilize 
files previously announced that contain the same data 
(https://urldefense.com/v3/__https://files.wwpdb.org/pub/pdb/holdings/__;!!Mih3wA!BGWSIyRbxWfOMaTcBMURR_HO4hfOYhRyVNHTksDG78infvOOUD7CH2x2GzJ2ORQGMFP7IYdtWGPtb-SNT0GeqH8X$ ).

These inventory data files offer a quick overview of data in the 
archive. These files are in the extensible JSON format, and can be found 
under the new /pdb/holdings/ archive tree.

The inventory lists provided include:

  	* all_removed_entries.json.gz: a list of obsoleted PDB entries 
including information for entry authors, entry title, release date, 
obsolete date, and superseding PDB ID, if any.
  	* current_file_holdings.json.gz: a list of released PDB entries and 
the file types present for each in the PDB Core Archive (e.g. coordinate 
data, experimental data, validation report).
  	* obsolete_structures_last_modified_dates.json.gz: a list of obsoleted 
PDB entries with information about the most recent modification date of 
the PDBx/mmCIF file.
  	* refdata_id_list.json.gz: a list of released chemical reference 
entries, their content types (e.g., Chemical Component, BIRD), and the 
most recent modification date of the reference file.
  	* released_structures_last_modified_dates.json.gz: a list of released 
PDB entries with the most recent modification date of the PDBx/mmCIF 
file.
  	* unreleased_entries.json.gz: a list of on-hold PDB entries, their 
entry status, deposition date, and pre-release sequence information, 
where available.

Users are encouraged to utilize these inventory files. For example, 
checking for the update of the PDB archive can be performed using 
current_file_holdings.json.gz [1] or 
released_structures_last_modified_dates.json.gz [2] in 
/pub/pdb/holdings/.

Please contact info at wwpdb.org with any questions.

Links:
------
[1] https://urldefense.com/v3/__https://s3.rcsb.org/pub/pdb/holdings/current_file_holdings.json.gz__;!!Mih3wA!BGWSIyRbxWfOMaTcBMURR_HO4hfOYhRyVNHTksDG78infvOOUD7CH2x2GzJ2ORQGMFP7IYdtWGPtb-SNT9XyJYLH$ 
[2] 
https://urldefense.com/v3/__https://s3.rcsb.org/pub/pdb/holdings/released_structures_last_modified_dates.json.gz__;!!Mih3wA!BGWSIyRbxWfOMaTcBMURR_HO4hfOYhRyVNHTksDG78infvOOUD7CH2x2GzJ2ORQGMFP7IYdtWGPtb-SNT-hwQyEI$ 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ncmir.ucsd.edu/pipermail/3dem/attachments/20230404/248af708/attachment.html>


More information about the 3dem mailing list