CarboxySVM(Beta)

所属分类:人工智能/神经网络/深度学习
开发工具:Python
文件大小:158KB
下载次数:5
上传日期:2014-01-06 20:29:47
上 传 者逗乐的东北姑娘
说明:  蛋白质位点预测的源代码,适合生物信息学初学者。
(Protein gamma-carboxylation sites prediction of the source code, suitable for beginners bioinformatics.)

文件列表:
CarboxySVM\data\pssm\P81755_51.pssm (3990, 2013-09-03)
CarboxySVM\data\pssm\P81755_54.pssm (3990, 2013-09-03)
CarboxySVM\data\rsa\rsa.dat (448, 2013-09-04)
CarboxySVM\data\disorder-ss\disorder.ss.dat (154, 2013-09-03)
CarboxySVM\bin\csv2svm.py (732, 2013-09-04)
CarboxySVM\bin\carboxylated.print.features.name.py~ (1946, 2013-04-17)
CarboxySVM\bin\pssm.feature.py~ (3556, 2012-08-19)
CarboxySVM\bin\ss.features.v2.py (4045, 2013-04-26)
CarboxySVM\bin\pssm.feature.v2.py (6653, 2013-04-29)
CarboxySVM\bin\dis.features.v2.py (3948, 2013-04-26)
CarboxySVM\bin\rsa.features.v2.py~ (5274, 2013-04-29)
CarboxySVM\bin\aa.49.v2.py (1187, 2013-04-26)
CarboxySVM\bin\rsa.features.v2.py (5272, 2013-04-29)
CarboxySVM\bin\pssm.feature.v2.py~ (6652, 2013-04-29)
CarboxySVM\bin\sliding_features_21.py (1326, 2013-09-04)
CarboxySVM\bin\FastaClass.py (2052, 2013-09-03)
CarboxySVM\bin\FastaClass.pyc (3332, 2013-09-03)
CarboxySVM\bin\sweep_svm_model.py (2504, 2013-09-04)
CarboxySVM\bin\show.predicts.py (1233, 2013-09-04)
CarboxySVM\data\eg.12.fa (66, 2013-09-04)
CarboxySVM\data\eg.21.fa (66, 2013-09-04)
CarboxySVM\data\eg.fa (76, 2013-09-03)
CarboxySVM\model\train.select.features.id (331, 2013-09-04)
CarboxySVM\model\train.select.libsvm.model (292397, 2013-09-04)
CarboxySVM\model\svm-train (67988, 2013-09-04)
CarboxySVM\model\svm-predict (63319, 2013-09-04)
CarboxySVM\outputs\test.svm (1834, 2013-09-04)
CarboxySVM\outputs\out.txt (231, 2013-09-04)
CarboxySVM\eg.sh~ (1705, 2013-09-04)
CarboxySVM\CarboxySVM.sh (1937, 2013-09-04)
CarboxySVM\CarboxySVM.sh~ (1920, 2013-09-04)
CarboxySVM\data\pssm (0, 2013-09-03)
CarboxySVM\data\rsa (0, 2013-09-04)
CarboxySVM\data\disorder-ss (0, 2013-09-04)
CarboxySVM\bin (0, 2013-09-04)
CarboxySVM\data (0, 2013-09-04)
CarboxySVM\model (0, 2013-09-04)
CarboxySVM\outputs (0, 2013-09-04)
... ...

Jianzhao Gao Ning Zhang and Jishou Ruan, Copyright (C) 2013 This file describes how to work with CarboxySVM (V1.0 Beta) 2013-09-04 ------------------------------------------------ ===== CarboxySVM (V1.0 Beta) CarboxySVM is written by Python 2.6. The original program was compiled on a Linux system (Ubuntu 11.04) ======== run the CarboxySVM.sh type: ./CarboxySVM.sh ./data/eg.fa ./data/eg.21.fa results are in the ./outputs/out.txt If you want to run the prediction on the other datasets, please follow the steps: (1) get the sliding windows of the gamma-carboxylation sites ./bin/sliding_features_21.py fasta_file fw_n where fasta_file: protein sequence in fasta format. fw_n: the carboxylation sites are in the sliding windows of the size 21. in fw_n: """ >P81755_51 RTIRTRLNIRECCEDGWCCTA >P81755_54 RTRLNIRECCEDGWCCTAAPL """ (2) run the BLAST, DISOPRED, PSIPRED, REAL SPINE for fw_n. (3)Run the blast, and copy the pssm file in ./data/pssm/ each pssm file should be "*.pssm", where * is protein ID. (4) Copy the results (named disorder.ss.dat) from PSIPRED and DISOPRED into ./data/disorder-ss/ the file is like this: """ >P81755_51 RTIRTRLNIRECCEDGWCCTA CEEEEHHHHHHHHCCCCEEEC **................... """ where 1st line is ID; 2nd line is protein segment; 3rd line is predicted secondary structure; 4th line is the predicted disorder from DISOPRED. (5)Copy the results (named rsa.dat ) from Real Spine into ./data/rsa/ the file is like this : """ >P81755_51 RTIRTRLNIRECCEDGWCCTA 0.59336,0.47527,0.32557,...,0.47468, 111111110110011000011 """ where 1st line is ID; 2nd line is protein segment; 3rd line is predicted relative solvent accessibility values; 4th line is the annotation of buried (0) or exposed (1). if the value > 0.25 annotated 1 else annotated 0. (6) Then type ./CarboxySVM.sh fasta_f fw_n where fasta_f and fw_n is as same as (1) ===== install DISOPRED (v2.4) http://bioinfadmin.cs.ucl.ac.uk/downloads/DISOPRED/ ===== install Real Spine (v3.0) http://sparks.informatics.iupui.edu/Publications_files/publication.htm ===== install PSIPRED (v3.3) http://bioinfadmin.cs.ucl.ac.uk/downloads/psipred/ ===== install BLAST http://www.ncbi.nih.gov/blast/download.shtml ===== FEEDBACK If you have any feedback, contact Jianzhao Gao at gaojz@nankai.edu.cn. or Ning Zhang at zhni@tju.edu.cn ===== REFERENCE E. Faraggi, B. Xue, and Y. Zhou, Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins by fast guided-learning through a two-layer neural network.,Proteins 74, 857-871 (2009) Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1--27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm McGuffin L.J., Bryson K., Jones D.T., 2000. The PSIPRED protein structure prediction server. Bioinformatics. 16, 404-405. Ward J.J., Sodhi J.S., McGuffin L.J., Buxton B.F., Jones D.T., 2004. Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J. Mol. Biol. 337, 635-***5.

近期下载者

相关文件


收藏者