rank_sequences.py
: A script that ranks sequences in a multiple sequence alignment using the Local Branching Index (LBI).
infer_fitness.py
: A script that performs full fitness inference on sequences in a multiple sequence alignment.
prediction_src/
: A directory containing the core implementation of fitness inference and sequence ranking algorithms. Key modules include:
sequence_ranking.py
: Combines sequence alignment processing with node ranking and fitness inference.fitness_inference.py
: Implements the core algorithm for inferring fitness distributions on phylogenetic trees.node_ranking.py
: Extends fitness inference functionality to provide methods for ranking and coloring tree nodes.ancestral.py
: Implements maximum likelihood estimation for ancestral sequence reconstruction.tree_utils.py
: Provides utility functions for manipulating and visualizing phylogenetic trees.rank_sequences.py
: Script for ranking sequences using the Local Branching Index.
infer_fitness.py
: Script for performing full fitness inference on sequences.
fasttree
for phylogenetic tree construction.
rank_sequences.py
and infer_fitness.py
) use command-line arguments for configuration. Key parameters include:
--aln
: Path to the input alignment file.--outgroup
: Name of the outgroup sequence.--eps_branch
: Minimal branch length for inference.--tau
: Time scale for local tree length estimation (for LBI).--diffusion
: Fitness diffusion coefficient (for full inference).--gamma
: Scale factor for time scale.--omega
: Approximate sampling fraction divided by fitness standard deviation.--collapse
: Option to collapse internal branches with identical sequences.--plot
: Option to plot trees.