You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+4-8Lines changed: 4 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,22 +1,19 @@
1
1
# Contributing
2
2
3
-
4
3
## General guidelines
5
4
6
5
If you haven't contributed to open-source before, we recommend you read [this excellent guide by GitHub on how to contribute to open source](https://opensource.guide/how-to-contribute). The guide is long, so you can gloss over things you're familiar with.
7
6
8
7
If you're not already familiar with it, we follow the [fork and pull model](https://help.github.com/articles/about-collaborative-development-models) on GitHub. Also, check out this recommended [git workflow](https://www.asmeurer.com/git-workflow/).
9
8
10
-
11
9
## Contributing Code
12
10
13
11
This project has a number of requirements for all code contributed.
14
12
15
-
* We follow the [PEP-8 style](https://www.python.org/dev/peps/pep-0008/) convention.
16
-
* We use [NumPy-style docstrings](https://numpydoc.readthedocs.io/en/latest/format.html).
17
-
* It's ideal if user-facing API changes or new features have documentation added.
18
-
* It is best if all new functionality and/or bug fixes have unit tests added with each use-case.
19
-
13
+
- We follow the [PEP-8 style](https://www.python.org/dev/peps/pep-0008/) convention.
14
+
- We use [NumPy-style docstrings](https://numpydoc.readthedocs.io/en/latest/format.html).
15
+
- It's ideal if user-facing API changes or new features have documentation added.
16
+
- It is best if all new functionality and/or bug fixes have unit tests added with each use-case.
20
17
21
18
## Setting up Your Development Environment
22
19
@@ -96,7 +93,6 @@ This will build the documentation and serve it on a local http server which list
96
93
97
94
Documentation from the `main` branch and tagged releases is automatically built and hosted on [readthedocs](https://readthedocs.org/).
98
95
99
-
100
96
## Acknowledgments
101
97
102
98
This document is based off of the [guidelines from the sparse project](https://github.com/pydata/sparse/blob/master/docs/contributing.rst).
Copy file name to clipboardExpand all lines: README.md
+8-7Lines changed: 8 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,9 +14,9 @@ Bioframe enables flexible and scalable operations on genomic interval dataframes
14
14
15
15
Bioframe is built directly on top of [Pandas](https://pandas.pydata.org/). Bioframe provides:
16
16
17
-
* A variety of genomic interval operations that work directly on dataframes.
18
-
* Operations for special classes of genomic intervals, including chromosome arms and fixed-size bins.
19
-
* Conveniences for diverse tabular genomic data formats and loading genome assembly summary information.
17
+
- A variety of genomic interval operations that work directly on dataframes.
18
+
- Operations for special classes of genomic intervals, including chromosome arms and fixed-size bins.
19
+
- Conveniences for diverse tabular genomic data formats and loading genome assembly summary information.
20
20
21
21
Read the [documentation](https://bioframe.readthedocs.io/en/latest/), including the [guide](https://bioframe.readthedocs.io/en/latest/guide-intervalops.html), as well as the [publication](https://doi.org/10.1093/bioinformatics/btae088) for more information.
22
22
@@ -34,10 +34,10 @@ pip install bioframe
34
34
35
35
Interested in contributing to bioframe? That's great! To get started, check out the [contributing guide](https://github.com/open2c/bioframe/blob/main/CONTRIBUTING.md). Discussions about the project roadmap take place on the [Open2C Discord](https://discord.com/invite/qVfSbDYHNG) server and regular developer meetings scheduled there. Anyone can join and participate!
36
36
37
-
38
37
## Interval operations
39
38
40
39
Key genomic interval operations in bioframe include:
40
+
41
41
-`overlap`: Find pairs of overlapping genomic intervals between two dataframes.
42
42
-`closest`: For every interval in a dataframe, find the closest intervals in a second dataframe.
43
43
-`cluster`: Group overlapping intervals in a dataframe into clusters.
Bioframe additionally has functions that are frequently used for genomic interval operations and can be expressed as combinations of these core operations and dataframe operations, including: `coverage`, `expand`, `merge`, `select`, and `subtract`.
47
47
48
48
To `overlap` two dataframes, call:
49
+
49
50
```python
50
51
import bioframe as bf
51
52
@@ -62,8 +63,8 @@ For these two input dataframes, with intervals all on the same chromosome:
See this [jupyter notebook](https://github.com/open2c/bioframe/tree/master/docs/tutorials/tutorial_assign_motifs_to_peaks.ipynb) for an example of how to assign TF motifs to ChIP-seq peaks using bioframe.
94
94
95
+
See this [jupyter notebook](https://github.com/open2c/bioframe/tree/master/docs/tutorials/tutorial_assign_motifs_to_peaks.ipynb) for an example of how to assign TF motifs to ChIP-seq peaks using bioframe.
95
96
96
97
## Citing
97
98
98
-
If you use ***bioframe*** in your work, please cite:
99
+
If you use **_bioframe_** in your work, please cite:
Copy file name to clipboardExpand all lines: docs/guide-bedtools.md
-3Lines changed: 0 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,7 +14,6 @@ kernelspec:
14
14
15
15
# Bioframe for bedtools users
16
16
17
-
18
17
Bioframe is built around the analysis of genomic intervals as a pandas [DataFrame](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) in memory, rather than working with tab-delimited text files saved on disk.
19
18
20
19
Bioframe supports reading a number of standard genomics text file formats via [`read_table`](https://bioframe.readthedocs.io/en/latest/api-fileops.html#bioframe.io.fileops.read_table), including BED files (see [schemas](https://github.com/open2c/bioframe/blob/main/bioframe/io/schemas.py)), which will load them as pandas DataFrames, a complete list of helper functions is [available here](API_fileops).
@@ -25,7 +24,6 @@ For example, with gtf files, you do not need to turn them into bed files, you ca
25
24
26
25
Finally, if needed, bioframe provides a convenience function to write dataframes to a standard BED file using [`to_bed`](https://bioframe.readthedocs.io/en/latest/api-fileops.html#bioframe.io.bed.to_bed).
27
26
28
-
29
27
## `bedtools intersect`
30
28
31
29
### Select unique entries from the first bed overlapping the second bed `-u`
> **Note:** This gives one row per overlap and can contain duplicates. The output dataframe of the former method will use the same pandas index as the input dataframe `B`, while the latter result --- the join output --- will have an integer range index, like a pandas merge.
0 commit comments