submit_to_dcc.py

Submits records from Pulsar to the ENCODE Portal.

usage: submit_to_dcc.py [-h] [-m DCC_MODE] -p PROFILE_ID
                        (-i INFILE | --ids IDS [IDS ...]) [--no-extend-arrays]
                        [--patch]

Named Arguments

-m, --dcc-mode
The ENCODE Portal site (‘prod’ or ‘dev’, or an explicit host name, i.e. ‘demo.encodedcc.org’) to connect to.
-p, --profile-id
 
The ID of the ENCODE profile to submit to, i.e. use ‘genetic_modification’ for https://www.encodeproject.org/profiles/genetic_modification.json. Must be a value from the set [‘biosample’, ‘library’]
-i, --infile
The input file containing Pulsar record identifiers, one per row. The record identifiers should belong to a Pulsar model that is the equivalent of the DCC profile specified by –profile-id. For example, if submitting Pulsar records to the ENCODE ‘biosample’ profile, the equivalent model in Pulsar is Biosample, thus your record identifiers in this file should thus be Biosample identifiers. Note that the record identifier to use must be either the record ID or record name if POSTING, or can additionally be the value of the Pulsar record’s upstream_identifier attribute if PATCHING.
--ids One or more Pulsar record identifiers.
--no-extend-arrays
 
Only affects updating objects on the ENCODE Portal. By default, when updating an array attribute, the array will be extended with the provided values from the input file. However, including this command-line option means to first empty the array contents.

Default: False

--patch
Presence of this option indicates to PATCH an existing DCC record rather than register a new one.

Default: False