You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Fixes env gtex issue #290 (#294)
* Change env() to stdout to save sample_name in gen3_drs
* Fix No such property: baseName for class: String
* Gen3-DRS prints md5 "file is good" to log not stdout
* Improves gen3-drs md5 error message
* Changes gtex input to support new manifest file format [#289] (#296)
* Updates ch_gtex_gen3_ids items #289
* Remove duplicate val(obj_id) in input of gen3-drs
Co-authored-by: cgpu <[email protected]>
* Comments our fasta requirement for gen3-drs input (#297)
* Comments our fasta requirement for gen3-drs input
* Update usage.md that genome_fasta is only for CRAM
* Update usage.md typo
* Fix missing file from path issue
* change GLS executor from parameter to scope (#305)
* Remove gtex (#299)
* Remove mentions of old GTEX download option from main.nf
* Remove mentions of old GTEX download option from help
* Remove mentions of old GTEX download option from usage.md
* Renames Gen3-DRS into new GTEX download option
* Renames Gen3-DRS into new GTEX download opt in usage.md
* Dev v2.1 #287 - Simplify the Gen3-DRS download option (#304)
* Update usage.md
* Update run_on_sumner.md
* add dockerfile for csvtoolkit
* add process to convert manifest json to csv
* add process to filter manifest by file passed through --reads
* update help message
* fix bug on variable declaration
* Update nextflow.config - fix typo
* Revert "Merge branch 'master' into dev-v2.1-#287"
This reverts commit be2c2ab, reversing
changes made to 04285ef.
* Update main.nf
* patch projectDir error
* Fix oublishDir path for manifest
* Fix oublishDir path for manifest
* Fix typo
* Update filter_manifest.py
* Update filter_manifest.py
* fix bug on saving filenames that were not in manifest file
* Update filter_manifest.py
* remove logging of samples not found in manifest
* Update filter_manifest.py
* Makes filter_manifest txt output optional
Co-authored-by: angarb <[email protected]>
Co-authored-by: Vlad-Dembrovskyi <[email protected]>
Co-authored-by: Vlad-Dembrovskyi <[email protected]>
* Rename examples/gen3/README.md to examples/GTEX/README.md
Editing folder name to match new "download_from" name.
* Update and rename GEN3_DRS_config.md to GTEX_config.md
Updating parameters
* Delete examples/gen3 directory
* Update usage.md
Moving this information
* Update README.md
* Update README.md
* Delete PRJNA453538.SraRunTable.txt
Not needed
* Delete MCF10_MYCER.datafiles.csv
Not needed
* Create reads.csv
Adding reads.csv example
* Update README.md
* Create manifest.json
Adding example manifest.json
* Update README.md
* Update run_on_cloudos.md
* Update Copying_Files_From_Sumner_to_Cloud.md
Made neater
* Create Star_Index_Generation.md
Co-authored-by: cgpu <[email protected]>
Co-authored-by: imendes93 <[email protected]>
Co-authored-by: angarb <[email protected]>
# Moving files from HPC to Cloud (particular to JAX Sumner)
2
+
3
+
#### Add singularity to $PATH:
2
4
module load singularity
3
5
4
-
//make some convenience commands to reduce typing (note we changed container name so we can accommodate other cloud providers):
6
+
#### Make some convenient commands to reduce typing:
5
7
alias gcloud="singularity exec /projects/researchit/crf/containers/gcp_sdk.sif gcloud"
6
8
alias gsutil="singularity exec /projects/researchit/crf/containers/gcp_sdk.sif gsutil"
7
9
8
-
//login to gcloud; this will return a url that you need to paste into a browser, which
9
-
//will take you through the google authentication process; you can use your jax
10
-
//email as userid and jax password to get in. Once you authenticate, it will display
11
-
//a code that you need to paste into the prompt provided in your ssh session on Sumner:
12
-
10
+
#### Login to gcloud; this will return a url that you need to paste into a browser, which will take you through the google authentication process; you can use your jax email as userid and jax password to get in. Once you authenticate, it will display a code that you need to paste into the prompt provided in your ssh session on Sumner:
13
11
gcloud auth login --no-launch-browser
14
12
15
-
//see which projects you have access to:
13
+
#### See which projects you have access to:
16
14
gcloud projects list
17
15
18
-
//what is the project you are currently associated with:
16
+
#### What is the project you are currently associated with:
19
17
gcloud config list project
20
18
21
-
//change project association:
19
+
#### Change project association:
22
20
gcloud config set project my-project
23
21
24
-
//see what buckets are associated with my-project:
22
+
#### See what buckets are associated with my-project:
25
23
gsutil ls
26
24
27
-
//see contents of a particular bucket:
25
+
#### See contents of a particular bucket:
28
26
gsutil ls -l gs://my-bucket
29
27
30
-
//recursively copy large directory from filesystem accessible on Sumner to your bucket:
28
+
#### Recursively copy large directory from file system accessible on Sumner to your bucket:
Copy file name to clipboardexpand all lines: docs/usage.md
+4-45
Original file line number
Diff line number
Diff line change
@@ -103,13 +103,13 @@ Input files:
103
103
(default: no rmats_pairs specified)
104
104
--run_name User specified name used as prefix for output files
105
105
(defaut: no prefix, only date and time)
106
-
--download_from Database to download FASTQ/BAMs from (available = 'TCGA', 'GTEX' or 'GEN3-DRS',
107
-
'SRA', 'FTP') (string)
106
+
--download_from Database to download FASTQ/BAMs from (available = 'TCGA', 'GTEX', 'SRA', 'FTP')
107
+
(string)
108
108
false should be used to run local files on the HPC (Sumner).
109
109
'TCGA' can also be used to download GDC data including HCMI data.
110
110
(default: false)
111
-
--key_file For downloading reads, use TCGA authentication token (TCGA) or dbGAP repository
112
-
key (GTEx, path) or credentials.json file in case of 'GEN3-DRS'
111
+
--key_file For downloading reads, use TCGA authentication token (TCGA) or
112
+
credentials.json file in case of 'GTEX'.
113
113
(default: false)
114
114
115
115
Main arguments:
@@ -246,44 +246,3 @@ Some useful ones include (specified in main.pbs):
246
246
-`-with-trace` eg `-with-trace trace.txt` which gives a [trace report](https://www.nextflow.io/docs/latest/tracing.html?highlight=dag#trace-report) for resource consumption by the pipeline
247
247
-`-with-dag` eg `-with-dag flowchart.png` which produces the [DAG visualisation](https://www.nextflow.io/docs/latest/tracing.html?highlight=dag#dag-visualisation) graph showing each of the different processes and the connections between them (the channels)
248
248
249
-
## Run with data from AnviL Gen3-DRS
250
-
251
-
You will be needing two things from - https://gen3.theanvil.io/
252
-
253
-
1. manifest file
254
-
2. credentials file
255
-
256
-
Original downloaded `manifest.json` file need to be converted into `manifest.csv` in order to be accepted in `--reads`, for doing that you can do this -
257
-
258
-
```bash
259
-
pip install csvkit
260
-
in2csv manifest.json > manifest.csv
261
-
```
262
-
263
-
NOTE: Make sure the `manifest.csv` file have five columns, Check from [examples](../examples/gen3/)
264
-
265
-
Downloaded `credentials.json` file can be provided in `--key` param.
266
-
267
-
NOTE: Make sure `credentials.json` is a latest one. They have expiry dates when you download.
268
-
269
-
If you running with AnviL Gen3-DRS files you also need to provide a Genome fasta file with `--genome_fasta`, which will be used to convert CRAM files to BAM format.
270
-
271
-
For a minimal params list check [gen3_drs.config](../conf/examples/GEN3_DRS_config.md)
272
-
273
-
### Extract based on a bam query list
274
-
275
-
If you have a list of bam file names of interest, extract the manifest file -
Original downloaded `manifest.json` will be converted into `manifest.csv` with pipeline using: https://csvkit.readthedocs.io/en/latest/
13
+
14
+
The manifest.csv will be subset using the `reads.csv` file provided in `--reads` param. (This allows you to download a complete manifest and later select the samples of interest.) For example: [gtex.reads](https://github.com/TheJacksonLaboratory/splicing-pipelines-nf/blob/dev-v2.1/examples/GTEX/reads.csv)
15
+
16
+
Downloaded `credentials.json` file can be provided in `--key_file` param.
17
+
NOTE: Make sure `credentials.json` is a latest one. They have expiration dates when you download.
18
+
19
+
If you running with AnviL Gen3-DRS to download CRAM files you also need to provide a Genome fasta file with `--genome_fasta`, which will be used to convert CRAM files to BAM format. If you are donwloading bam files, you can skip this parameter.
20
+
21
+
For a minimal params list check [gtex.config](../conf/examples/GTEX_config.md)
0 commit comments