; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G21590 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G21590
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionHistone H2A
Genome locationClcChr02:33780530..33789499
RNA-Seq ExpressionClc02G21590
SyntenyClc02G21590
Gene Ontology termsGO:0031507 - heterochromatin assembly (biological process)
GO:0000775 - chromosome, centromeric region (cellular component)
GO:0000786 - nucleosome (cellular component)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR002119 - Histone H2A
IPR007125 - Histone H2A/H2B/H3
IPR009072 - Histone-fold
IPR032454 - Histone H2A, C-terminal domain
IPR032458 - Histone H2A conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570726.1 Ras-related protein RABE1c, partial [Cucurbita argyrosperma subsp. sororia]3.0e-5944.36Show/hide
Query:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
        + + +K        EKKK+VSRSVKAGLQFPVGRIARYLKNGRY+QRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
Subjt:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK

Query:  LLAGVTIASGGVLPNINPFCCRRKRIRLRKSRNLHRRPGSLH-RSQPEKVGLAGLGLDESGKPNNSFIHSPRRRWFIVELSYQLLCANSLCLRLFLRAVC
        LLAGVTIASGGVLPNINP    +K  +  K      + G    R     + L G+            + +P  R       Y  L    L     +   C
Subjt:  LLAGVTIASGGVLPNINPFCCRRKRIRLRKSRNLHRRPGSLH-RSQPEKVGLAGLGLDESGKPNNSFIHSPRRRWFIVELSYQLLCANSLCLRLFLRAVC

Query:  IPNRFDCNCYGCSTLQELGQIMILLIKLLLIGDSVSILKSEPLRLDGKTNQASNLGIQLVRSAFEQSPQLTTVEQW----GILLVYDVTDESSFNTSKSS
        +  RF    +  S +  +G                   K   + LDGK      + +Q+  +A ++  +  T   +    GILLVYDVTDESSFN  ++ 
Subjt:  IPNRFDCNCYGCSTLQELGQIMILLIKLLLIGDSVSILKSEPLRLDGKTNQASNLGIQLVRSAFEQSPQLTTVEQW----GILLVYDVTDESSFNTSKSS

Query:  SKSTLSLKPSLVSYLLI------SFSYRLCLHPKDRALADEYGIKFFETQGISSNVSL----------------STDSKAEPSTIKINQQDQGANAGQAA
         ++        V+ +L+        S R     K +ALADEYGIKFFET    +N+++                 TDSKAE STIKINQQDQGANAGQAA
Subjt:  SKSTLSLKPSLVSYLLI------SFSYRLCLHPKDRALADEYGIKFFETQGISSNVSL----------------STDSKAEPSTIKINQQDQGANAGQAA

Query:  QKSACCGS
        QKS+CCGS
Subjt:  QKSACCGS

XP_008448380.1 PREDICTED: histone H2A [Cucumis melo]4.9e-3877.05Show/hide
Query:  RQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL
        + +K        EKKKAVSRSVKAGLQFPVGRIARYLK GRYAQRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL
Subjt:  RQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL

Query:  AGVTIASGGVLPNINPFCCRRK
        AGVTIASGGVLPNINP    +K
Subjt:  AGVTIASGGVLPNINPFCCRRK

XP_022947502.1 histone H2A-like [Cucurbita moschata]6.4e-3875Show/hide
Query:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
        +A+ +K        EKKK+VSRSVKAGLQFPVGRIARYLK GRY+QRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
Subjt:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK

Query:  LLAGVTIASGGVLPNINPFCCRRK
        LLAGVTIASGGVLPNINP    +K
Subjt:  LLAGVTIASGGVLPNINPFCCRRK

XP_023006960.1 histone H2A-like [Cucurbita maxima]6.4e-3875Show/hide
Query:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
        +A+ +K        EKKK+VSRSVKAGLQFPVGRIARYLK GRY+QRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
Subjt:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK

Query:  LLAGVTIASGGVLPNINPFCCRRK
        LLAGVTIASGGVLPNINP    +K
Subjt:  LLAGVTIASGGVLPNINPFCCRRK

XP_023532320.1 histone H2A-like [Cucurbita pepo subsp. pepo]6.4e-3875Show/hide
Query:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
        +A+ +K        EKKK+VSRSVKAGLQFPVGRIARYLK GRY+QRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
Subjt:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK

Query:  LLAGVTIASGGVLPNINPFCCRRK
        LLAGVTIASGGVLPNINP    +K
Subjt:  LLAGVTIASGGVLPNINPFCCRRK

TrEMBL top hitse value%identityAlignment
A0A1S3BK65 Histone H2A2.4e-3877.05Show/hide
Query:  RQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL
        + +K        EKKKAVSRSVKAGLQFPVGRIARYLK GRYAQRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL
Subjt:  RQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL

Query:  AGVTIASGGVLPNINPFCCRRK
        AGVTIASGGVLPNINP    +K
Subjt:  AGVTIASGGVLPNINPFCCRRK

A0A5N6M734 Histone H2A5.3e-3885.58Show/hide
Query:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSRSDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNINPFC
        +KKAVSRSVKAGLQFPVGRI RYLK GRYAQRVGTGAPV L+    VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIA GGVLPNINP  
Subjt:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSRSDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNINPFC

Query:  CRRK
          +K
Subjt:  CRRK

A0A6J1D4R0 Histone H2A3.1e-3874.22Show/hide
Query:  RQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL
        + +K        EKKKAVSRSVKAGLQFPVGRIARYLK GRYAQRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL
Subjt:  RQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLL

Query:  AGVTIASGGVLPNINPFCCRRKRIRLRK
        AGVTIASGGVLPNINP    +K  +  K
Subjt:  AGVTIASGGVLPNINPFCCRRKRIRLRK

A0A6J1G6M6 Histone H2A3.1e-3875Show/hide
Query:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
        +A+ +K        EKKK+VSRSVKAGLQFPVGRIARYLK GRY+QRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
Subjt:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK

Query:  LLAGVTIASGGVLPNINPFCCRRK
        LLAGVTIASGGVLPNINP    +K
Subjt:  LLAGVTIASGGVLPNINPFCCRRK

A0A6J1L1N2 Histone H2A3.1e-3875Show/hide
Query:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
        +A+ +K        EKKK+VSRSVKAGLQFPVGRIARYLK GRY+QRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK
Subjt:  AARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGK

Query:  LLAGVTIASGGVLPNINPFCCRRK
        LLAGVTIASGGVLPNINP    +K
Subjt:  LLAGVTIASGGVLPNINPFCCRRK

SwissProt top hitse value%identityAlignment
A2WQG7 Probable histone H2A.59.3e-4080.73Show/hide
Query:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN
        +KKAVSRSVKAGLQFPVGRI RYLK GRYAQR+GTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIA GGVLPN
Subjt:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN

Query:  INPFCCRRK
        INP    +K
Subjt:  INPFCCRRK

A2XZN0 Probable histone H2A.66.0e-3975.83Show/hide
Query:  RKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAG
        +KA   +    KKK VSRSVKAGLQFPVGRI RYLK GRYAQRVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAG
Subjt:  RKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAG

Query:  VTIASGGVLPNINPFCCRRK
        VTIA GGVLPNINP    +K
Subjt:  VTIASGGVLPNINPFCCRRK

A2Y5G8 Probable histone H2A.43.5e-3979.82Show/hide
Query:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN
        KKK VSRSVKAGLQFPVGRI RYLK GRY+QR+GTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIA GGVLPN
Subjt:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN

Query:  INPFCCRRK
        INP    +K
Subjt:  INPFCCRRK

Q6L500 Probable histone H2A.43.5e-3979.82Show/hide
Query:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN
        KKK VSRSVKAGLQFPVGRI RYLK GRY+QR+GTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIA GGVLPN
Subjt:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN

Query:  INPFCCRRK
        INP    +K
Subjt:  INPFCCRRK

Q94E96 Probable histone H2A.59.3e-4080.73Show/hide
Query:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN
        +KKAVSRSVKAGLQFPVGRI RYLK GRYAQR+GTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIA GGVLPN
Subjt:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN

Query:  INPFCCRRK
        INP    +K
Subjt:  INPFCCRRK

Arabidopsis top hitse value%identityAlignment
AT1G08880.1 Histone superfamily protein1.3e-3373Show/hide
Query:  KAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNIN
        K+VSRS KAGLQFPVGRIAR+LK+G+YA+RVG GAPV LS      +  VLELAGNAARDNKK RI+PRH+ LA+RNDEEL KLL  VTIA+GGVLPNI+
Subjt:  KAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNIN

AT5G02560.1 histone H2A 122.3e-3865.41Show/hide
Query:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN
        KKK VSRSVK+GLQFPVGRI RYLK GRY++RVGTGAPV L+      +  VLELAGNAARDNKKNRIIPRHVLLA+RNDEELG LL GVTIA GGVLPN
Subjt:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPN

Query:  INPFCCRRKRIRLRKSRNLHRRPGSLHRSQPEK
        INP    +K  +   +    + P    +S P+K
Subjt:  INPFCCRRKRIRLRKSRNLHRRPGSLHRSQPEK

AT5G02560.2 histone H2A 126.4e-3657.32Show/hide
Query:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLL----------------------------SRSD-GVLELAGNAARDNKKNRIIPRHVLLA
        KKK VSRSVK+GLQFPVGRI RYLK GRY++RVGTGAPV L                            S SD  VLELAGNAARDNKKNRIIPRHVLLA
Subjt:  KKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLL----------------------------SRSD-GVLELAGNAARDNKKNRIIPRHVLLA

Query:  IRNDEELGKLLAGVTIASGGVLPNINPFCCRRKRIRLRKSRNLHRRPGSLHRSQPEK
        +RNDEELG LL GVTIA GGVLPNINP    +K  +   +    + P    +S P+K
Subjt:  IRNDEELGKLLAGVTIASGGVLPNINPFCCRRKRIRLRKSRNLHRRPGSLHRSQPEK

AT5G27670.1 histone H2A 77.5e-3773.64Show/hide
Query:  EKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLP
        ++KK+VS+SVKAGLQFPVGRIARYLK GRYA R G+GAPV L+      +  VLELAGNAARDNKKNRI PRH+ LAIRNDEELG+LL GVTIASGGVLP
Subjt:  EKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLP

Query:  NINPFCCRRK
        NINP    +K
Subjt:  NINPFCCRRK

AT5G59870.1 histone H2A 68.3e-3671.17Show/hide
Query:  AEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVL
        A K K+VS+S+KAGLQFPVGRI R+LK GRYAQR+G GAPV ++      +  VLELAGNAARDNKK+RIIPRH+LLAIRNDEELGKLL+GVTIA GGVL
Subjt:  AEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR-----SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVL

Query:  PNINPFCCRRK
        PNIN     +K
Subjt:  PNINPFCCRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTTATGTGGACTCACCCGAGTCCCCACGCACCATGCAATGACGATCACATTCAAACCTGCTTCTCCCTCAATCAACACCGACAACTCTCCGGCGAGCAACACCTA
CGTTTTCTCCACAACAATACATTCAACCAGTATCCAATGGAAACCGGCGGCAAGGCAAAGAAAGGCGCAGGAGGAAGAAGAGGAGGCGGAGAAGAAGAAGGCAGTTTCTC
GCTCCGTCAAAGCCGGTTTACAGTTCCCCGTCGGCCGTATCGCTCGTTATCTGAAGAACGGAAGATACGCACAGCGTGTCGGCACCGGCGCTCCGGTCCTACTTAGCCGC
AGTGATGGAGTTCTTGAGTTGGCTGGAAATGCGGCTAGAGATAACAAGAAGAACAGGATCATTCCAAGGCACGTTCTATTGGCGATTAGAAACGATGAAGAACTCGGAAA
GTTGCTGGCCGGCGTAACTATTGCTAGCGGTGGTGTTCTTCCGAATATCAACCCGTTCTGTTGCCGAAGAAAACGGATAAGGCTACGAAAGAGCCGAAATCTCCATCGAA
GGCCGGGAAGTCTCCATCGAAGTCAGCCTGAAAAGGTTGGATTAGCAGGGTTAGGGTTGGATGAATCTGGGAAGCCTAATAATTCATTCATTCATTCTCCCAGACGTAGA
TGGTTTATTGTCGAATTGAGTTACCAGCTCCTGTGTGCAAATTCTCTTTGCTTACGGTTATTTCTGAGGGCTGTTTGTATTCCGAATCGATTTGATTGCAATTGTTATGG
CTGCTCCACCCTGCAAGAGCTCGGGCAGATTATGATTTTACTCATAAAGCTTCTTTTAATCGGCGATAGCGTATCGATTTTAAAATCAGAACCATTGAGGCTTGATGGGA
AAACGAATCAAGCTTCAAATTTGGGGATACAGCTGGTCAGGAGCGCTTTCGAACAATCACCACAGCTTACTACCGTGGAGCAATGGGGCATTTTGCTGGTCTATGATGTA
ACTGATGAATCATCTTTTAACACTTCCAAGTCTTCTTCTAAGAGTACTTTATCTCTGAAGCCCTCACTAGTATCATATCTTTTAATATCCTTCTCTTATAGGCTGTGCCT
ACATCCAAAGGACAGGGCGCTTGCTGATGAGTATGGGATCAAATTCTTTGAAACTCAAGGGATATCAAGCAACGTCTCGCTGAGTACTGATTCAAAAGCCGAGCCTTCGA
CGATCAAGATTAATCAACAAGACCAGGGAGCCAATGCTGGTCAGGCTGCACAAAAATCAGCTTGCTGTGGTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATTTATGTGGACTCACCCGAGTCCCCACGCACCATGCAATGACGATCACATTCAAACCTGCTTCTCCCTCAATCAACACCGACAACTCTCCGGCGAGCAACACCTA
CGTTTTCTCCACAACAATACATTCAACCAGTATCCAATGGAAACCGGCGGCAAGGCAAAGAAAGGCGCAGGAGGAAGAAGAGGAGGCGGAGAAGAAGAAGGCAGTTTCTC
GCTCCGTCAAAGCCGGTTTACAGTTCCCCGTCGGCCGTATCGCTCGTTATCTGAAGAACGGAAGATACGCACAGCGTGTCGGCACCGGCGCTCCGGTCCTACTTAGCCGC
AGTGATGGAGTTCTTGAGTTGGCTGGAAATGCGGCTAGAGATAACAAGAAGAACAGGATCATTCCAAGGCACGTTCTATTGGCGATTAGAAACGATGAAGAACTCGGAAA
GTTGCTGGCCGGCGTAACTATTGCTAGCGGTGGTGTTCTTCCGAATATCAACCCGTTCTGTTGCCGAAGAAAACGGATAAGGCTACGAAAGAGCCGAAATCTCCATCGAA
GGCCGGGAAGTCTCCATCGAAGTCAGCCTGAAAAGGTTGGATTAGCAGGGTTAGGGTTGGATGAATCTGGGAAGCCTAATAATTCATTCATTCATTCTCCCAGACGTAGA
TGGTTTATTGTCGAATTGAGTTACCAGCTCCTGTGTGCAAATTCTCTTTGCTTACGGTTATTTCTGAGGGCTGTTTGTATTCCGAATCGATTTGATTGCAATTGTTATGG
CTGCTCCACCCTGCAAGAGCTCGGGCAGATTATGATTTTACTCATAAAGCTTCTTTTAATCGGCGATAGCGTATCGATTTTAAAATCAGAACCATTGAGGCTTGATGGGA
AAACGAATCAAGCTTCAAATTTGGGGATACAGCTGGTCAGGAGCGCTTTCGAACAATCACCACAGCTTACTACCGTGGAGCAATGGGGCATTTTGCTGGTCTATGATGTA
ACTGATGAATCATCTTTTAACACTTCCAAGTCTTCTTCTAAGAGTACTTTATCTCTGAAGCCCTCACTAGTATCATATCTTTTAATATCCTTCTCTTATAGGCTGTGCCT
ACATCCAAAGGACAGGGCGCTTGCTGATGAGTATGGGATCAAATTCTTTGAAACTCAAGGGATATCAAGCAACGTCTCGCTGAGTACTGATTCAAAAGCCGAGCCTTCGA
CGATCAAGATTAATCAACAAGACCAGGGAGCCAATGCTGGTCAGGCTGCACAAAAATCAGCTTGCTGTGGTTCTTAA
Protein sequenceShow/hide protein sequence
MYLCGLTRVPTHHAMTITFKPASPSINTDNSPASNTYVFSTTIHSTSIQWKPAARQRKAQEEEEEAEKKKAVSRSVKAGLQFPVGRIARYLKNGRYAQRVGTGAPVLLSR
SDGVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNINPFCCRRKRIRLRKSRNLHRRPGSLHRSQPEKVGLAGLGLDESGKPNNSFIHSPRRR
WFIVELSYQLLCANSLCLRLFLRAVCIPNRFDCNCYGCSTLQELGQIMILLIKLLLIGDSVSILKSEPLRLDGKTNQASNLGIQLVRSAFEQSPQLTTVEQWGILLVYDV
TDESSFNTSKSSSKSTLSLKPSLVSYLLISFSYRLCLHPKDRALADEYGIKFFETQGISSNVSLSTDSKAEPSTIKINQQDQGANAGQAAQKSACCGS