How to upload BED files using PEP
1. Create PEP with BED files metadata
To upload BED files from PEP, first we should create a PEP project locally or on PEPhub.
PEP must include this fields in sample table:
- â "sample_name"
- â "input_file"
- â "input_type"
- â "genome"
âšī¸ All other fields are optional. Schema for PEP can be found here
âšī¸ All peps before their upload are validated against the schema using eido
tool.
Example
https://pephub.databio.org/khoroshevskyi/encode_batch_1
2. Make sure all paths or urls to input bed files are correct.
To complete this step, you should manually check that all paths or urls to input bed files are correct.
3. Run bedboss upload command:
bedboss run-pep --pep <path_to_pep> --outfolder <path_to_output_folder> --bedbase-config <path_to_bedbase_config>
Full bedboss run-pep help
bedboss run-pep --help
Usage: bedboss run-pep [OPTIONS]
Run the all bedboss pipeline for a bed files in a PEP
ââ Options âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââŽ
â * --pep TEXT PEP file. Local or remote path [default: None] [required] â
â * --outfolder TEXT Path to the output folder [default: None] [required] â
â * --bedbase-config TEXT Path to the bedbase config file [default: None] [required] â
â --create-bedset --no-create-bedset Create a new bedset [default: create-bedset] â
â --bedset-heavy --no-bedset-heavy Run the heavy version of the bedbuncher pipeline [default: no-bedset-heavy] â
â --bedset-id TEXT Bedset ID [default: None] â
â --rfg-config TEXT Path to the rfg config file [default: None] â
â --check-qc --no-check-qc Check the quality of the input file? [default: check-qc] â
â --ensdb TEXT Path to the EnsDb database file [default: None] â
â --just-db-commit --no-just-db-commit Just commit to the database? [default: no-just-db-commit] â
â --force-overwrite --no-force-overwrite Force overwrite the output files [default: no-force-overwrite] â
â --update --no-update Update the bedbase database with the new record if it exists. This overwrites 'force_overwrite' option [default: no-update] â
â --upload-qdrant --no-upload-qdrant Upload to Qdrant [default: upload-qdrant] â
â --upload-s3 --no-upload-s3 Upload to S3 [default: upload-s3] â
â --upload-pephub --no-upload-pephub Upload to PEPHub [default: upload-pephub] â
â --no-fail --no-no-fail Do not fail on error [default: no-no-fail] â
â --license-id TEXT License ID [default: DUO:0000042] â
â --standardize-pep --no-standardize-pep Standardize the PEP using bedMS [default: no-standardize-pep] â
â --lite --no-lite Run the pipeline in lite mode. [Default: False] [default: no-lite] â
â --rerun --no-rerun Rerun already processed samples [default: no-rerun] â
â --multi --no-multi Run multiple samples [default: no-multi] â
â --recover --no-recover Recover from previous run [default: recover] â
â --dirty --no-dirty Run without removing existing files [default: no-dirty] â
â --help Show this message and exit. â
â°âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¯