CarboxySVM(Beta)
所属分类:人工智能/神经网络/深度学习
开发工具:Python
文件大小:158KB
下载次数:5
上传日期:2014-01-06 20:29:47
上 传 者:
逗乐的东北姑娘
说明: 蛋白质位点预测的源代码,适合生物信息学初学者。
(Protein gamma-carboxylation sites prediction of the source code, suitable for beginners bioinformatics.)
文件列表:
CarboxySVM\data\pssm\P81755_51.pssm (3990, 2013-09-03)
CarboxySVM\data\pssm\P81755_54.pssm (3990, 2013-09-03)
CarboxySVM\data\rsa\rsa.dat (448, 2013-09-04)
CarboxySVM\data\disorder-ss\disorder.ss.dat (154, 2013-09-03)
CarboxySVM\bin\csv2svm.py (732, 2013-09-04)
CarboxySVM\bin\carboxylated.print.features.name.py~ (1946, 2013-04-17)
CarboxySVM\bin\pssm.feature.py~ (3556, 2012-08-19)
CarboxySVM\bin\ss.features.v2.py (4045, 2013-04-26)
CarboxySVM\bin\pssm.feature.v2.py (6653, 2013-04-29)
CarboxySVM\bin\dis.features.v2.py (3948, 2013-04-26)
CarboxySVM\bin\rsa.features.v2.py~ (5274, 2013-04-29)
CarboxySVM\bin\aa.49.v2.py (1187, 2013-04-26)
CarboxySVM\bin\rsa.features.v2.py (5272, 2013-04-29)
CarboxySVM\bin\pssm.feature.v2.py~ (6652, 2013-04-29)
CarboxySVM\bin\sliding_features_21.py (1326, 2013-09-04)
CarboxySVM\bin\FastaClass.py (2052, 2013-09-03)
CarboxySVM\bin\FastaClass.pyc (3332, 2013-09-03)
CarboxySVM\bin\sweep_svm_model.py (2504, 2013-09-04)
CarboxySVM\bin\show.predicts.py (1233, 2013-09-04)
CarboxySVM\data\eg.12.fa (66, 2013-09-04)
CarboxySVM\data\eg.21.fa (66, 2013-09-04)
CarboxySVM\data\eg.fa (76, 2013-09-03)
CarboxySVM\model\train.select.features.id (331, 2013-09-04)
CarboxySVM\model\train.select.libsvm.model (292397, 2013-09-04)
CarboxySVM\model\svm-train (67988, 2013-09-04)
CarboxySVM\model\svm-predict (63319, 2013-09-04)
CarboxySVM\outputs\test.svm (1834, 2013-09-04)
CarboxySVM\outputs\out.txt (231, 2013-09-04)
CarboxySVM\eg.sh~ (1705, 2013-09-04)
CarboxySVM\CarboxySVM.sh (1937, 2013-09-04)
CarboxySVM\CarboxySVM.sh~ (1920, 2013-09-04)
CarboxySVM\data\pssm (0, 2013-09-03)
CarboxySVM\data\rsa (0, 2013-09-04)
CarboxySVM\data\disorder-ss (0, 2013-09-04)
CarboxySVM\bin (0, 2013-09-04)
CarboxySVM\data (0, 2013-09-04)
CarboxySVM\model (0, 2013-09-04)
CarboxySVM\outputs (0, 2013-09-04)
... ...
Jianzhao Gao Ning Zhang and Jishou Ruan, Copyright (C) 2013
This file describes how to work with CarboxySVM (V1.0 Beta)
2013-09-04
------------------------------------------------
=====
CarboxySVM (V1.0 Beta)
CarboxySVM is written by Python 2.6.
The original program was compiled on a Linux system (Ubuntu 11.04)
========
run the CarboxySVM.sh
type:
./CarboxySVM.sh ./data/eg.fa ./data/eg.21.fa
results are in the ./outputs/out.txt
If you want to run the prediction on the other datasets, please follow the steps:
(1) get the sliding windows of the gamma-carboxylation sites
./bin/sliding_features_21.py fasta_file fw_n
where
fasta_file: protein sequence in fasta format.
fw_n: the carboxylation sites are in the sliding windows of the size 21.
in fw_n:
"""
>P81755_51
RTIRTRLNIRECCEDGWCCTA
>P81755_54
RTRLNIRECCEDGWCCTAAPL
"""
(2) run the BLAST, DISOPRED, PSIPRED, REAL SPINE for fw_n.
(3)Run the blast, and copy the pssm file in ./data/pssm/
each pssm file should be "*.pssm", where * is protein ID.
(4) Copy the results (named disorder.ss.dat) from PSIPRED
and DISOPRED into ./data/disorder-ss/
the file is like this:
"""
>P81755_51
RTIRTRLNIRECCEDGWCCTA
CEEEEHHHHHHHHCCCCEEEC
**...................
"""
where 1st line is ID; 2nd line is protein segment;
3rd line is predicted secondary structure;
4th line is the predicted disorder from DISOPRED.
(5)Copy the results (named rsa.dat ) from Real Spine into ./data/rsa/
the file is like this :
"""
>P81755_51
RTIRTRLNIRECCEDGWCCTA
0.59336,0.47527,0.32557,...,0.47468,
111111110110011000011
"""
where 1st line is ID; 2nd line is protein segment;
3rd line is predicted relative solvent accessibility values;
4th line is the annotation of buried (0) or exposed (1). if the value > 0.25 annotated 1 else annotated 0.
(6) Then type
./CarboxySVM.sh fasta_f fw_n
where fasta_f and fw_n is as same as (1)
=====
install DISOPRED (v2.4)
http://bioinfadmin.cs.ucl.ac.uk/downloads/DISOPRED/
=====
install Real Spine (v3.0)
http://sparks.informatics.iupui.edu/Publications_files/publication.htm
=====
install PSIPRED (v3.3)
http://bioinfadmin.cs.ucl.ac.uk/downloads/psipred/
=====
install BLAST
http://www.ncbi.nih.gov/blast/download.shtml
=====
FEEDBACK
If you have any feedback, contact Jianzhao Gao at gaojz@nankai.edu.cn.
or Ning Zhang at zhni@tju.edu.cn
=====
REFERENCE
E. Faraggi, B. Xue, and Y. Zhou, Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion
angles of proteins by fast guided-learning through a two-layer neural network.,Proteins 74, 857-871 (2009)
Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1--27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
McGuffin L.J., Bryson K., Jones D.T., 2000. The PSIPRED protein structure prediction server. Bioinformatics. 16, 404-405.
Ward J.J., Sodhi J.S., McGuffin L.J., Buxton B.F., Jones D.T., 2004. Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J. Mol. Biol. 337, 635-***5.
近期下载者:
相关文件:
收藏者: