Skip to content

Commit

Permalink
Complete docs
Browse files Browse the repository at this point in the history
  • Loading branch information
savitakartik committed Nov 15, 2024
1 parent 98b0aa5 commit 99b6527
Show file tree
Hide file tree
Showing 6 changed files with 118 additions and 11 deletions.
46 changes: 45 additions & 1 deletion docs/contributing.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,5 +7,49 @@ there is anything missing, please open an [issue](https://github.com/tskit-dev/t
or [pull request](https://github.com/tskit-dev/tsbrowse/pulls) on Github.

## Quick start
### Using a virtual environment
First, set up a virtual environment:
```
$ python3 -m venv tsbrowse_env
```
Then, activate the virtual environment:
```
$ source tsbrowse_env/bin/activate
```
Within the virtual environment, install requirements:
```
$ python3 -m pip install -r requirements.txt
```

TODO! Add information for developers here.
### Github workflow
1. Fork the tsbrowse github repository at https://github.com/tskit-dev/tsbrowse/ to create your own copy of the project.

2. Clone a local copy:
```
$ git clone https://github.com/your-user-name/tsbrowse.git
$ cd tsbrowse
$ git remote add upstream https://github.com/tskit-dev/tsbrowse.git
```
This creates the directory tsbrowse and connects your repository to the upstream (main project) tsbrowse repository.
3. Install pre-commit hooks. These will be executed automatically on every commit, to ensure the code meets development requirements.
```
$ pre-commit install
```
4. Run the test suite and make sure tests pass:
```
$ python3 -m pytest
```
5. Create a topic branch and add commits:
```
$ git fetch upstream
$ git checkout upstream/main
$ git checkout -b topic_branch_name
```
6. When you are ready to share your contribution, push changes to your fork:
```
git push origin branch-name
```
7. Create a [pull request] (https://help.github.com/articles/about-pull-requests/) on GitHub.

### Documentation
Tsbrowse documentation is written in markdown. These files are contained in the docs directory. To build the documentation, run `make` in the `docs` directory.
6 changes: 3 additions & 3 deletions docs/edges.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,13 +4,13 @@

![Edges Page](tsbrowse:example.tsbrowse:edges)

An edge represents the relationship between a pair of nodes (parent, child) in an ARG over time (assigning the node occurring in more recent time as the child).
An edge represents the relationship between a pair of nodes (parent, child) in the ARGs over time (assigning the node occurring in more recent time as the child).

The interactive plot on the top row allows us to visualise these relationships as horizontal lines denoting the length of the edge on the sequence (X-axis) and time of one of the nodes (Y-axis). On mouse-over of an individual edge, the tool-tip displays the parent and child node IDs, length of sequence the edge spans, and the amount of time it spans (branch length).

The bottom panel contains three static histograms which, left to right, summarise the span of edges over the sequence, the span of edges over time (branch lengths), and the product of the two spans.
The bottom panel contains three static histograms which. left to right, summarise the span of edges over the sequence, the span of edges over times (branch lengths), and the product of the two spans.


## Plot controls (sidebar):
* The `log y-axis` checkbox plots node time on log scale.
* The checkbox group allows switching between the nodes represented on the Y-axis (parent or child).
* The checkbox group allows switching between the nodes represented on the Y-axis (parent or child).
21 changes: 19 additions & 2 deletions docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,23 @@

# Tsbrowse documentation

tsbrowse is an open-source Python package for visualising ARGs in the [tskit](https://tskit.dev/tskit/docs/) tree sequence format.
This is the documentation for tsbrowse, an open-source Python package for visualising ARGs in the [tskit](https://tskit.dev/tskit/docs/) tree sequence format.

TODO! Add links to other pages
## Contents
### Getting started
This section includes an introduction to tsbrowse, amd notes on how to install and use the app.
[Introduction](intro.md)
[Installing tsbrowse](install.md)
### Pages
This section provides an overview of the various pages displayed in the webapp.
[Overview](overview.md)
[Tables](tables.md)
[Mutations](mutations.md)
[Edges](edges.md)
[Nodes](nodes.md)
[Trees](trees.md)
### Extras
Feedback and bug reports are welcome. This section contains information on contributing and engaging with the tskit community.
[Contributing](_build/html/contributing.html)
[Changelog](CHANGELOG.md)
[Community](https://tskit.dev/community/)
21 changes: 20 additions & 1 deletion docs/install.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,24 @@
(install)=

# Installing tsbrowse
```
python3 -m pip install tsbrowse
```

TODO! Add information on install and deps, then link to intro
To install the development version:
```
python3 -m pip install git+https://github.com/tskit-dev/tsbrowse
```
Please also see the documentation on [Contributing](contributing.md).

# Dependencies
|Package |Minimum supported version|
|-----|--------|
|Datashader|0.16.3|
|Panel|1.5.2|
|Tszip||
|Tskit||
|Daiquiri||
|Hvplot||

For an introduction to tsbrowse, see [Introduction](intro.md)
29 changes: 26 additions & 3 deletions docs/intro.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,31 @@
(sec_intro)=

# Introduction to tsbrowse
Tsbrowse is an open-source Python web-app intended as a genome browser for ARGs. It provides interactive visual summaries of the fundamental components of an ARG, i.e. the mutations, nodes, edges and trees it encodes. Tsbrowse operates on ARGs encoded as succinct tree sequences (see [tskit](https://tskit.dev/tskit/docs/stable/introduction.html)). It can be used to inspect biobank-scale ARGs without requiring data uploads to a server.

# Data Model
## Data Model
The foundation of tsbrowse is the [tskit data model](https://tskit.dev/tskit/docs/stable/data-model.html), which is a generalised ARG representation. Tsbrowse can therefore be applied to ARGs inferred using widely-used methods including [tsinfer](https://tskit.dev/tsinfer/docs/stable/), [Relate](https://myersgroup.github.io/relate/), [ARG-Needle](https://palamaralab.github.io/software/argneedle/) and [SINGER](https://github.com/popgenmethods/SINGER). Tsbrowse augments the underlying tree sequence tables with additional columns to create a compressed `.tsbrowse` file, containing all necessary semantic metadata required for automatic visualisation with the [Holoviz](https://holoviz.org/) ecosystem.

# How to
TODO! Add a quick start guide with example tree sequences
## How to
Consider an example tree sequence simulated with msprime:
```
def make_sweep_ts(n, Ne, L, rho=1e-8, mu=1e-8):
sweep_model = msprime.SweepGenicSelection(
position=L/2, start_frequency=0.0001, end_frequency=0.9999, s=0.25, dt=1e-6)
models = [sweep_model, msprime.StandardCoalescent()]
ts = msprime.sim_ancestry(
n, model=models, population_size=Ne, sequence_length=L, recombination_rate=rho, random_seed=1234)
return msprime.sim_mutations(ts, rate=mu, random_seed=4321)
sim_ts = make_sweep_ts(300, Ne=10_000, L=5_000_000, mu=1e-8)
sim_ts.dump("example.trees")
```
To run tsbrowse on this tree sequence, we first run the preprocessing step.
```
python -m tsbrowse preprocess example.trees
```
This creates a `.tsbrowse` file in the same directory as the input tree sequence.
We can then run the serve step to view the app in a web browser:
```
python -m tsbrowse serve example.tsbrowse --port 8080
```
6 changes: 5 additions & 1 deletion docs/trees.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,4 +2,8 @@

# Trees page

![Trees Page](tsbrowse:example.tsbrowse:trees)
![Trees Page](tsbrowse:example.tsbrowse:trees)
A tree sequence generally consists of multiple highly correlated genetic genealogies along the sequence, referred to as trees. This page summarises the individual trees in a tree sequence. The histograms show the distribution of various aspects of the trees, including the number of sites and mutations in each tree, the genomic span (i.e., the length of sequence the tree spans), the total branch length (i.e., the total sum of branch lengths over all samples in the tree, in time units), and the arity (i.e., the number of children nodes in the tree have).

## Plot controls (sidebar):
* The `log y-axis` checkbox plots the number of trees on the log scale.

0 comments on commit 99b6527

Please sign in to comment.