Fragments
This part of spoc is responsible for dealing aligned fragments that have not yet been converted to contacts. It deals with label information as well as expanding fragments to contacts.
FragmentAnnotator
Responsible for annotating labels and sister identity of mapped read fragments.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
label_library |
Dict[str, bool]
|
Dictionary containing the label library. |
required |
Source code in spoc/fragments.py
annotate_fragments(fragments)
Takes fragment dataframe and returns a copy of it with its labelling state in a separate
column with name is_labelled
. If drop_uninformative is true, drops fragments that
are not in label library.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
fragments |
Fragments
|
Fragments object containing the fragment data. |
required |
Returns:
Name | Type | Description |
---|---|---|
Fragments |
Fragments
|
Fragments object with annotated fragment data. |
Source code in spoc/fragments.py
FragmentExpander
Expands n-way fragments over sequencing reads to yield contacts.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
number_fragments |
int
|
Number of fragments. |
required |
contains_metadata |
bool
|
Whether the fragment data contains metadata. Defaults to True. |
True
|
Source code in spoc/fragments.py
116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 |
|
expand(fragments)
expand contacts n-ways
Parameters:
Name | Type | Description | Default |
---|---|---|---|
fragments |
Fragments
|
Fragments object containing the fragment data. |
required |
Returns:
Name | Type | Description |
---|---|---|
Contacts |
Contacts
|
Contacts object containing the expanded contact data. |
Source code in spoc/fragments.py
Fragments
Genomic fragments that can be labelled or not.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
fragment_frame |
DataFrame
|
DataFrame containing the fragment data. |
required |
Source code in spoc/fragments.py
contains_metadata: bool
property
Returns whether the dataframe contains metadata.
Returns:
Name | Type | Description |
---|---|---|
bool |
bool
|
Whether the fragment data contains metadata. |
data: DataFrame
property
Returns the underlying dataframe.
Returns:
Name | Type | Description |
---|---|---|
DataFrame |
DataFrame
|
Fragment data. |
is_dask: bool
property
Returns whether the underlying dataframe is dask dataframe.
Returns:
Name | Type | Description |
---|---|---|
bool |
bool
|
Whether the underlying dataframe is a dask dataframe. |