Forest
class representing a collection of phylogenetic trees, a Tree
class for individual phylogenetic trees, a SFS
(Site Frequency Spectrum) class for analyzing genetic diversity, and a SizeMatchedModel
class for statistical modeling based on data size. Additionally, there is a script for generating and annotating reference data in the form of phylogenetic trees.
Forest
class: For managing collections of phylogenetic trees.Tree
class: For analyzing individual phylogenetic trees.SFS
class: For calculating and analyzing Site Frequency Spectra.SizeMatchedModel
class: For statistical modeling based on data size.generate_annotate_forest.py
: Script for generating reference data.forest.py
: Defines the Forest
class for managing collections of trees.tree.py
: Defines the Tree
class for individual tree analysis.sfs.py
: Defines the SFS
class for Site Frequency Spectrum analysis.size_matched_model.py
: Defines the SizeMatchedModel
class for size-based statistical modeling.generate_annotate_forest.py
: Script for generating and annotating reference data.ete3
: For phylogenetic tree manipulation and visualization.Bio.Phylo
: For interfacing with Biopython’s phylogenetic tree representation.numpy
: For numerical computations and array manipulations.scipy
: For various scientific computing tasks and statistical functions.pandas
: For data manipulation and analysis.matplotlib
: For visualization of results and trees.sys
: For accessing command line arguments.time
: For tracking script execution time.uuid
: For generating unique identifiers.pickle
: For serializing and saving objects.gzip
: For compressing output files.Forest
, Tree
, SFS
, SizeMatchedModel
) use constructor parameters and method arguments for configuration. Key parameters include:
Forest
and Tree
:
SFS
:
SizeMatchedModel
:
generate_annotate_forest.py
script takes command-line arguments for configuration:
n_leaves
: Number of leaves (tips) in each generated tree.n_trees
: Number of trees to generate for the forest.alpha
: Shape parameter influencing the tree structure.output_dir
: Path to the directory where the output file will be saved.