Documentation for `Annotation`¶

`cnv: CNVRegionAnnotation` `instance-attribute` ¶

See CNVRegionAnnotation

`count_gene_types()` ¶

Count the number of genes of each type in GenesDB

Returns:

Type	Description
`GenesDBGeneTypesCounter`	A dictionary with the count of each gene type.

`count_regulatory_types()` ¶

Count the number of regulatory element types in the database

Returns:

Type	Description
`RegulatoryTypesCounter`	A dictionary with the count of each regulatory type.

Notes

The regulatory database entries are looped over and their "type" element is checked for presence in any of existing regulatory types.
Unknown regulatory types are counted as 'other'. To get a count of all regulatory types, sum the values.

`get_annotated_genes()` ¶

Get list of names for genes that are morbid or associated with a disease

Returns:

Type	Description
`AnnotatedGenesList`	Dictionary with two lists of gene names: 'morbid_genes' and 'associated_with_disease'

`get_benign_cnvs_gs_outer(frequency_threshold)` ¶

Get Benign CNV GS outer entries with no defined frequency or higher than the threshold

Parameters:

Name	Type	Description	Default
`frequency_threshold`	`float`	Minimum frequency for a benign CNV GS outer entry to be considered	required

Returns:

Type	Description
`list[dict[str, Any]]`	List of benign CNV GS outer entries

`get_common_variability_regions()` ¶

Get GnomAD regions that intersect with the CNV and are in the specified populations

Returns:

Type	Description
`list[CommonVariabilityRegion]`	List of common variability regions that intersect with the CNV

`get_gene_by_name(gene_name)` ¶

Get 'Genes' entry with gene_name

`get_gene_transcript_regions(gene_name)` ¶

Get transcript regions for a gene

Parameters:

Name	Type	Description	Default
`gene_name`	`str`	Name of the gene	required

Returns:

Type	Description
`list[TranscriptRegion]`	List of transcript regions for the gene

`get_genes(gene_type=None, overlap=enums.Overlap.ANY)` ¶

Get 'Genes' entries whose gene_type is 'gene_type' and overlaps the CNV region in that manner

`get_haploinsufficient_gene_names(overlap_type, valid_scores)` ¶

Get names of genes with sufficient Haploinsufficiency Score

`get_haploinsufficient_genes(overlap_type, valid_scores)` ¶

Get all HiGene entries with sufficient Haploinsufficiency Score

Parameters:

Name	Type	Description	Default
`overlap_type`	`Overlap`	Type of overlap to consider.	required
`valid_scores`	`list[int]`	List of valid scores to consider. Entry scores are coerced to integers.	required

Returns:

Type	Description
`list[HiGeneEntry]`	List of HiGene entries with sufficient Haploinsufficiency Score

`get_haploinsufficient_regions(overlap_type, valid_scores)` ¶

Get all HiRegion entries with sufficient Haploinsufficiency Score

Parameters:

Name	Type	Description	Default
`overlap_type`	`Overlap`	Type of overlap to consider.	required
`valid_scores`	`list[int]`	List of valid scores to consider. Entry scores are coerced to integers.	required

Returns:

Type	Description
`list[HiRegionEntry]`	List of HiRegion entries with sufficient Haploinsufficiency Score

`get_high_risk_loss_genes()` ¶

Get genes with high risk loss predictors

Raises:

Type	Description
`HighRiskForDuplicationError`	If the CNV region is a duplication, not a loss

Returns:

Type	Description
`list[dict[str, Any]]`	List of genes with high risk loss predictors

`get_triplosensitivity_gene_names(overlap_type, valid_scores)` ¶

Get names of genes with sufficient Triplosensitivity Score

`get_triplosensitivity_genes(overlap_type, valid_scores)` ¶

Get all HiGene entries with sufficient Triplosensitivity Score

Parameters:

Name	Type	Description	Default
`overlap_type`	`Overlap`	Type of overlap to consider.	required
`valid_scores`	`list[int]`	List of valid scores to consider. Entry scores are coerced to integers.	required

Returns:

Type	Description
`list[HiGeneEntry]`	List of HiGene entries with sufficient Triplosensitivity Score

`get_triplosensitivity_regions(overlap_type, valid_scores)` ¶

Get all HiRegion entries with sufficient Triplosensitivity Score

Parameters:

Name	Type	Description	Default
`overlap_type`	`Overlap`	Type of overlap to consider.	required
`valid_scores`	`list[int]`	List of valid scores to consider. Entry scores are coerced to integers.	required

Returns:

Type	Description
`list[HiRegionEntry]`	List of HiRegion entries with sufficient Triplosensitivity Score

`load_from_json(json_file)` `classmethod` ¶

Load annotation from a JSON file

Parameters:

Name	Type	Description	Default
`json_file`	`str`	Path to the JSON file. Can be gzipped.	required

Returns:

Type	Description
`Annotation`	Annotation object loaded from the JSON file.

Raises:

Type	Description
`FileNotFoundError`	If the file does not exist.

handler: python options: show_root_heading: false show_source: true

Documentation for Annotation¶

cnv: CNVRegionAnnotation instance-attribute ¶

count_gene_types() ¶

count_regulatory_types() ¶

get_annotated_genes() ¶

get_benign_cnvs_gs_outer(frequency_threshold) ¶

get_common_variability_regions() ¶

get_gene_by_name(gene_name) ¶

get_gene_transcript_regions(gene_name) ¶

get_genes(gene_type=None, overlap=enums.Overlap.ANY) ¶

get_haploinsufficient_gene_names(overlap_type, valid_scores) ¶

get_haploinsufficient_genes(overlap_type, valid_scores) ¶

get_haploinsufficient_regions(overlap_type, valid_scores) ¶

get_high_risk_loss_genes() ¶

get_triplosensitivity_gene_names(overlap_type, valid_scores) ¶

get_triplosensitivity_genes(overlap_type, valid_scores) ¶

get_triplosensitivity_regions(overlap_type, valid_scores) ¶

load_from_json(json_file) classmethod ¶

Documentation for `Annotation`¶

`cnv: CNVRegionAnnotation` `instance-attribute` ¶

`count_gene_types()` ¶

`count_regulatory_types()` ¶

`get_annotated_genes()` ¶

`get_benign_cnvs_gs_outer(frequency_threshold)` ¶

`get_common_variability_regions()` ¶

`get_gene_by_name(gene_name)` ¶

`get_gene_transcript_regions(gene_name)` ¶

`get_genes(gene_type=None, overlap=enums.Overlap.ANY)` ¶

`get_haploinsufficient_gene_names(overlap_type, valid_scores)` ¶

`get_haploinsufficient_genes(overlap_type, valid_scores)` ¶

`get_haploinsufficient_regions(overlap_type, valid_scores)` ¶

`get_high_risk_loss_genes()` ¶

`get_triplosensitivity_gene_names(overlap_type, valid_scores)` ¶

`get_triplosensitivity_genes(overlap_type, valid_scores)` ¶

`get_triplosensitivity_regions(overlap_type, valid_scores)` ¶

`load_from_json(json_file)` `classmethod` ¶