• f2_825153
  • 1.9MB
  • zip
  • 0
  • VIP专享
  • 0
  • 2022-06-13 03:52
这是在和 JCB 中发布的 PASTA(使用 Saté 和 TrAnsitivity 的实用对齐)算法的实现: Mirarab S、Nguyen N、Warnow T. PASTA:超大多序列比对。 夏兰河,编辑。 Res Comput Mol Biol。 2014:177-191。 Mirarab S、Nguyen N、Guo S、Wang LS、Kim J、Warnow T. PASTA:核苷酸和氨基酸序列的超大多序列比对。 J 计算生物学。 2015;22(5):377-386。 。 最新版本包括此处描述的新分解技术: Balaban、Metin、Niema Moshiri、Uyen Mai 和 Siavash Mirarab。 “TreeCluster:使用系统发育树聚类生物序列。” BioRxiv, 2019, 591388. doi:10.1101/591388。 接
This is an implementation of the PASTA (Practical Alignment using Saté and TrAnsitivity) algorithm published in [RECOMB-2014]( and JCB: * Mirarab S, Nguyen N, Warnow T. PASTA: ultra-large multiple sequence alignment. Sharan R, ed. Res Comput Mol Biol. 2014:177-191. * Mirarab S, Nguyen N, Guo S, Wang L-S, Kim J, Warnow T. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences. J Comput Biol. 2015;22(5):377-386. [doi:10.1089/cmb.2014.0156]( The latest version includes a new decomposition technique described here: * Balaban, Metin, Niema Moshiri, Uyen Mai, and Siavash Mirarab. “TreeCluster : Clustering Biological Sequences Using Phylogenetic Trees.” BioRxiv, 2019, 591388. doi:10.1101/591388. #### Contact: All questions and inquires should be addressed to our user email group: ``. Please check our [Tutorial](pasta-doc/ and [previous posts](!forum/pasta-users) before sending new requests. #### Developers * The code and the algorithm are developed by Siavash Mirarab and Tandy Warnow, with help from Nam Nguyen. The latest version of the code includes a new code decomposition designed and implemented by [Uyen Mai]( * The current PASTA code is heavily based on the [SATé code]( developed by Mark Holder's group at KU. Refer to sate-doc directory for documentation of the SATé code, including the list of authors, license, etc. * [Niema Moshiri]( has contributed to the import to dendropy 4 and python 3 and to the Docker image. **Documentation**: In addition to this README file, you can consult our [Tutorial](pasta-doc/ INSTALLATION === You have four options for installing PASTA. - **Windows:** If you have a Windows machine, currently using the Docker image or the Virtual Machine (VM) image we provide is your only option. Among those two options, Docker is the preferred method. - **Linux:** If you have Linux (or other \*nix systems), you can still use Docker or VM, but downloading the code from github and installing it is what we recommend. - **MAC:** We have four options for MAC: VM, Docker, installing from the code, and downloading the .dmg file. If you mostly use the GUI, then the MAC .dmg file is a good option (although sometimes it can be behind the latest code). Otherwise, we reocmmend either Docker or the code. ### 1. From pre-build MAC image file 1. Download the MAC application `.dmg` file from [here]( Use the lastest version available 2. Open the .dmg file and copy its content to your preferred destination (do not run PASTA from the image itself). 3. Simply run the PASTA app from where you copied it. If the APP does not work, let us know. We will try to fix issues. But you can also try first installing PASTA from the source code (see below) and then run `./` from the pasta directory. This will create an app under `dist` directory, which you should be able to run and copy to any other location. ### 2. From Source Code The current version of PASTA has been developed and tested entirely on Linux and MAC. Windows won't work currently (future versions may or may not support Windows). You need to have: - [Python]( (version 2.7 or later, including python 3) - [Dendropy]( (but the setup script should automatically install dendropy for you if you don't have it) - [Java]( (only required for using OPAL) - [wxPython]( - only required if you want to use the GUI. The setup script does not automatically install this. **Installation steps**: 1. Open a terminal and create a directory where you want to keep PASTA and go to this directory. For example: ```bash mkdir ~/pasta-code cd ~/pasta-code` ``` 2. Clone the PASTA code repository from our [github repository]( For example you can use: ```bash git clone ``` If you don't have git, you can directly download a [zip file from the repository]( and decompress it into your desired directory. 3. A. Clone the relevant "tools" directory (these are also forked from the SATé project). There are different repositories for [linux]( and [MAC]( You can use ```bash git clone #for Linux ``` or ```bash git clone #for MAC ``` Or you can directly download these as zip files for [Linux]( or [MAC]( and decompress them in your target directory (e.g. `pasta-code`). * Note that the tools directory and the PASTA code directory should be under the same parent directory. * When you use the zip files instead of using `git`, after decompressing the zip file you may get a directory called `sate-tools-mac-master` or `sate-tools-linux-master` instead of `sate-tools-mac` or `sate-tools-linux`. You need to rename these directories and remove the `-master` part. * Those with 32-bit Linux machines need to be aware that the master branch has 64-bit binaries. 32-bit binaries are provided in the `32bit` branch of `sate-tools-linux` git project (so download [this zip file]( instead). 3. B. (Optional) Only if you want to use MAFFT-Homologs within PASTA: `cd sate-tools-linux` or `cd sate-tools-mac` Use `git clone` or download directly at `` * Be sure to leave this directory `cd ..` before starting the next step 4. `cd pasta` (or `cd pasta-master` if you used the zip file instead of clonning the git repository) 5. Then run: ``` bash sudo python develop ``` If you don't have root access, use: ``` bash python develop --user ``` **Common Problems:** * `Could not find SATé tools bundle directory`: this means you don't have the right tools directory at the right location. Maybe you downloaded MAC instead of Linux? Or, maybe you didn't put the directory in the parent directory of where pasta code is? Most likely, you used the zip files and forgot to remove teh `-master` from the directory name. Run `mv sate-tools-mac-master sate-tools-mac` on MAC or `mv sate-tools-linux-master sate-tools-linux` to fix this issue. * The `` script is supposed to install setuptools for you if you don't have it. This sometimes works and sometimes doesn't. If you get an error with a message like ` invalid command 'develop'`, it means that setuptools is not installed. To solve this issue, you can manually install [setup tools]( For example, on Linux, you can run `curl -o - | sudo python` (but note there are other ways of installing setuptools as well). 6. Pasta now includes additional aligners for Linux and MAC users: mafft-ginsi, mafft-homologs, contralign (version 1), and probcons. In order to use mafft-homologs and contralign, the user must set the environment variable `CONTRALIGN_DIR=/dir/to/sate-tools-linux`. You can use `export CONTRALIGN_DIR=/dir/to/sate-tools-linux` or just edit `~/.bashrc` to have `CONTRALIGN_DIR=dir/to/sate-tools-linux`. * To use these aligners, add the following to you
    • K均值聚类算法
    • 聚类算法程序
    • 谱聚类 聚类算法
      谱聚类 聚类算法 spectralClustering
    • k均值聚类算法
    • 图像聚类算法图像聚类算法
    • Ncut聚类算法
    • DBSCAN聚类算法
    • ap聚类算法
    • DBSCAN聚类算法
    • AP聚类算法代码