This function retrieves cytogenetic bands. If no cytogenetic information is available for the queried species then it will be omitted from in the returned value.
Usage
get_cytogenetic_bands(
species_name = "homo_sapiens",
verbose = FALSE,
warnings = TRUE,
progress_bar = TRUE
)Arguments
- species_name
The species name, i.e., the scientific name, all letters lowercase and space replaced by underscore. Examples:
'homo_sapiens'(human),'ovis_aries'(Domestic sheep) or'capra_hircus'(Goat).- verbose
Whether to be chatty.
- warnings
Whether to print warnings.
- progress_bar
Whether to show a progress bar.
Value
A tibble, each row being a cytogenetic band,
of 8 variables:
species_nameEnsembl species name: this is the name used internally by Ensembl to uniquely identify a species by name. It is the scientific name but formatted without capitalisation and spacing converted with an underscore, e.g.,
'homo_sapiens'.assembly_nameAssembly name.
cytogenetic_bandName of the cytogenetic_band.
chromosomeChromosome name.
startGenomic start position of the cytogenetic band. Starts at 1.
endGenomic end position of the cytogenetic band. End position is included in the band interval.
stainGiemsa stain results: Giemsa negative,
'gneg'; Giemsa positive, of increasing intensities,'gpos25','gpos50','gpos75', and'gpos100'; centromeric region,'acen'; heterochromatin, either pericentric or telomeric,'gvar'; and short arm of acrocentric chromosomes are coded as'stalk'.strandStrand.
Examples
# Get toplevel sequences for the human genome (default)
get_cytogenetic_bands()
#> # A tibble: 862 × 8
#> species_name assembly_name cytogenetic_band chromosome start end stain
#> <chr> <chr> <chr> <chr> <int> <int> <chr>
#> 1 homo_sapiens GRCh38 p11.1 Y 10300001 1.04e7 acen
#> 2 homo_sapiens GRCh38 p11.2 Y 600001 1.03e7 gneg
#> 3 homo_sapiens GRCh38 p11.31 Y 300001 6 e5 gpos…
#> 4 homo_sapiens GRCh38 p11.32 Y 1 3 e5 gneg
#> 5 homo_sapiens GRCh38 q11.1 Y 10400001 1.06e7 acen
#> 6 homo_sapiens GRCh38 q11.21 Y 10600001 1.24e7 gneg
#> 7 homo_sapiens GRCh38 q11.221 Y 12400001 1.71e7 gpos…
#> 8 homo_sapiens GRCh38 q11.222 Y 17100001 1.96e7 gneg
#> 9 homo_sapiens GRCh38 q11.223 Y 19600001 2.38e7 gpos…
#> 10 homo_sapiens GRCh38 q11.23 Y 23800001 2.66e7 gneg
#> # ℹ 852 more rows
#> # ℹ 1 more variable: strand <int>
# Get toplevel sequences for Mus musculus
get_cytogenetic_bands('mus_musculus')
#> # A tibble: 82 × 8
#> species_name assembly_name cytogenetic_band chromosome start end stain
#> <chr> <chr> <chr> <chr> <int> <int> <chr>
#> 1 mus_musculus GRCm39 "" MT 1 1.63e4 gneg
#> 2 mus_musculus GRCm39 "" Y 1 9.15e7 gneg
#> 3 mus_musculus GRCm39 "cen" 2 110001 1.56e6 acen
#> 4 mus_musculus GRCm39 "cen" 2 1555001 3 e6 acen
#> 5 mus_musculus GRCm39 "p" 2 1 1.10e5 gneg
#> 6 mus_musculus GRCm39 "q" 2 3000001 1.82e8 gneg
#> 7 mus_musculus GRCm39 "cen" 4 110001 1.56e6 acen
#> 8 mus_musculus GRCm39 "cen" 4 1555001 3 e6 acen
#> 9 mus_musculus GRCm39 "p" 4 1 1.10e5 gneg
#> 10 mus_musculus GRCm39 "q" 4 3000001 1.57e8 gneg
#> # ℹ 72 more rows
#> # ℹ 1 more variable: strand <int>