If SEQUENCE SLIDER is useful to you, please cite: Reference: Rafael J Borges, Guilherme H M Salvador, Daniel C Pimenta, Lucilene D dos Santos, Marcos R M Fontes, Isabel Usón, SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources, Nucleic Acids Research, Volume 50, Issue 9, 20 May 2022, Page e50, https://doi.org/10.1093/nar/gkac029
20 September 2021
Author : Rafael Borges
Objective: Generate amino acid possibilities by residue, fit best rotamer in electron density and calculate their side chain and main chain real-space correlation coefficient
Requirements: CCP4 (https://www.ccp4.ac.uk/) and Phenix (http://www.phenix-online.org/) should be installed. In the terminal, CCP4 and Phenix should be accessible by giving the source as instructed in these software installations.
USAGE: SLIDER_VENOM.py pdb_file reflections.mtz map_coeffs.mtz output TRYALL
pdb_file should be a protein coordinate file with best possible fit against electron density
reflections.mtz should contain intensities or amplitudes
map_coeffs.mtz with calculated map (2FOFCWT/PH2FOFCWT)
output should be a new path, a folder 'output' will contain all SLIDER output and files starting with output will summarize information:
type_of_run (which was TRYALL in example USAGE)
Type_of_run should be a string with keywords and can change how SLIDER is run:
Amino acid possibilities can be generated in three different scenarios:
Trying all 20 possibilities for each residue (keyword: TRYALL)
Possibilities restricted by:
mass spectrometry (keyword: MASSPEC)
alignment (keyword: ALIGN)
If the last two options are chosen, an additional file containing either mass spectrometry or alignment file should be given.
The mass spectrometry file should be a file containing text of amino acids (aa) in one letter code (FASTA), each column should contain generated aa, per example, if 1st and 3rd residue should be a F and T, and 2nd residue either L,D or N, the file should be:
FLT
D
N
Keyword SKIPTEST should be given to skip RAM memory calculation;
Links to the external software of Phenix and Coot should be accessible through the terminal.
Output files:
output_all.log contains information of chain and residue number, RSCC and delta contrast
output_Resolved.log same as output_all.log, but only resolved residues
output_Dubious.log same as output_all.log, but only dubious residues
output_runline.log has the line used to run SLIDER_VENOM.py
output_polder_coot_open_maps contains a script to open omit maps in coot (Go to Calculate -> Run Script -> select file)
20 September 2021
Author : Rafael Borges
Objective: Given Dubious and Resolved log files from SLIDER_VENOM with residues and their RSCC, return variability of sequences\
USAGE: GivenDubiousResolvedResiduesReturnSequenceVariability.py Dubious.log Resolved.log OutputFile\
OutputFile should be a new path
RETURNS: a file with amino acid possibilities (line) by residue number (column)\
19 April 2020
Author : Rafael Borges
Objective: Given ConSurf query_msa.aln and SLIDER table, return percentage of SLIDER residue\
USAGE: ConSurf_query_msa_SLIDER.py query_msa.aln output_Resolved.log OutputFile
OutputFile should be a new path
RETURNS: a table with SLIDER details plus absolute number and % of 100% of given residue\
08 April 2020
Author : Rafael Borges
Objetive: Obtain positive SLIDER peptides from overall run from PATTERNLAB
It reads a sequence and patternlab fasta file with all values
It returns each peptide among its PrimaryScore and PPM\
USAGE: ObtainPeptidesPatternLabRun.py seq.seq fasta.fasta OutputFile
seq.seq has just the sequence without >
fasta.fasta exactly like PatternLab output
OutputFile should be a new path
RETURNS: a table with peptide PrimaryScore PPM
a file with each peptide with its position in sequence\
08 April 2020
Author : Rafael Borges
Objetive: Obtain positive SLIDER peptides from overall run from PEAKS HTML output
It reads a sequence and PEAKS HTML output with all values
It returns each peptide among its -10lgP\
USAGE: ObtainPeptidesPatternLabRun.py seq.seq PEAKS.html TableRSCC_all.log OutputFile
seq.seq has just the sequence without >
PEAKS.html exactly like PEAKS html output
TableRSCC_all.log should be table that SLIDER_VENOM.py output
OutputFile should be a new path
RETURNS: a table with SLIDER calculations among its -10lgP
a file with each peptide among its -10lgP
a file with each amino acid possibility with its position in sequence\
\
22 January 2022 If SEQUENCE SLIDER is useful to you, please cite: Reference: Rafael J Borges, Guilherme H M Salvador, Daniel C Pimenta, Lucilene D dos Santos, Marcos R M Fontes, Isabel Usón, SEQUENCE SLIDER: integration of structural and genetic data to characterize isoforms from natural sources, Nucleic Acids Research, Volume 50, Issue 9, 20 May 2022, Page e50, https://doi.org/10.1093/nar/gkac029
(C) 2022 Rafael Junqueira Borges (C) 2021 Rafael Junqueira Borges
SLIDER uses phenix and CCP4 tools, please also cite them: Agirre et al, 2023. https://doi.org/10.1107/S2059798323003595 Liebschner et al, 2019. https://doi.org/10.1107/S2059798316018210 Liebschner et al, 2017. https://doi.org/10.1107/S2059798319011471 Emsley and Cowtan, 2014. https://doi.org/10.1107/S0907444904019158