Sequence Analysis¶
This module contains features for analyzing protein sequences.
MSA IO¶
MSAFile
- read/write MSA files in FASTA/SELEX/Stockholm formatsparseMSA()
- parse MSA fileswriteMSA()
- parse MSA files
Editing¶
mergeMSA()
- merge MSA data for multi-domain proteinsrefineMSA()
- refine MSA by removing gapped columns and/or sequences
Analysis¶
calcMSAOccupancy()
- calculate row (sequence) or column occupancycalcShannonEntropy()
- calculate Shannon entropybuildMutinfoMatrix()
- build mutual information matrixbuildOMESMatrix()
- build mutual observed minus expected squared covariance matrixbuildSCAMatrix()
- build statistical coupling analysis matrixbuildSeqidMatrix()
- build sequence identity matrixbuildDirectInfoMatrix()
- build direct information matrixuniqueSequences()
- select unique sequencesapplyMutinfoCorr()
- apply correction to mutual information matrixapplyMutinfoNorm()
- apply normalization to mutual information matrixcalcMeff()
- calculate sequence weightscalcRankorder()
- rank order scores
Plotting¶
showShannonEntropy()
- plot Shannon entropyshowMSAOccupancy()
- plot row (sequence) or column occupancyshowMutinfoMatrix()
- show mutual information matrix