CuGenDBv2

Clc07G07670 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc07G07670
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: SAP30_Sin3_bdg domain-containing protein
Genome location: ClcChr07:20463337..20466153
RNA-Seq Expression: Clc07G07670
Synteny: Clc07G07670
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0000118 - histone deacetylase complex (cellular component)
  GO:0003712 - transcription coregulator activity (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
  IPR025718 - Histone deacetylase complex subunit SAP30, Sin3 binding domain
  IPR038291 - SAP30, C-terminal domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0044820.1 uncharacterized protein E6C27_scaffold74G001110 [Cucumis melo var. makuwa] (e-value: 7.9e-90; identity: 84.26%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+ WAPSSSAKSTG GSYGEIQSIHMPK GT                     VNSISN SR+QIL VVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

TYK16644.1 uncharacterized protein E5676_scaffold21G004740 [Cucumis melo var. makuwa] (e-value: 1.2e-90; identity: 84.72%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+ WAPSSSAKSTGRGSYGEIQSIHMPK GT                     VNSISN SR+QIL VVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

XP_004146519.1 uncharacterized protein LOC101209634 [Cucumis sativus] (e-value: 1.3e-92; identity: 85.65%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+PWAPSSSAKST RGSYGEIQSIHMPK GT                     VNSISN SR+QILQVVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

XP_008452044.1 PREDICTED: uncharacterized protein LOC103493170 [Cucumis melo] (e-value: 1.2e-90; identity: 84.72%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+ WAPSSSAKSTGRGSYGEIQSIHMPK GT                     VNSISN SR+QIL VVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

XP_022136865.1 uncharacterized protein LOC111008453 isoform X4 [Momordica charantia] (e-value: 1.5e-88; identity: 84.65%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        +S IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNEND+HD DNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAK-STGRGSYGEIQSIHMP-------KPGT------------VNSISNASREQILQVVQRHFALQTSLNEA
        DNDFSSSIVFHKLSK K RQIRPWAPSSSAK +TGRGSYGEIQSIH+P       K GT            VNSISNASR+Q+L VVQRHFALQT+LNEA
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAK-STGRGSYGEIQSIHMP-------KPGT------------VNSISNASREQILQVVQRHFALQTSLNEA

Query:  SVMTEFIRAVKRRRE
        SVMTEFIRAVKRRRE
Subjt:  SVMTEFIRAVKRRRE

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KUM7 SAP30_Sin3_bdg domain-containing protein (e-value: 6.3e-93; identity: 85.65%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+PWAPSSSAKST RGSYGEIQSIHMPK GT                     VNSISN SR+QILQVVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

A0A1S3BU28 uncharacterized protein LOC103493170 (e-value: 5.9e-91; identity: 84.72%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+ WAPSSSAKSTGRGSYGEIQSIHMPK GT                     VNSISN SR+QIL VVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

A0A5A7TNP9 SAP30_Sin3_bdg domain-containing protein (e-value: 3.8e-90; identity: 84.26%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+ WAPSSSAKSTG GSYGEIQSIHMPK GT                     VNSISN SR+QIL VVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

A0A5D3D093 SAP30_Sin3_bdg domain-containing protein (e-value: 5.9e-91; identity: 84.72%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        AS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE
        DNDFSSS+VFHKLSK K RQI+ WAPSSSAKSTGRGSYGEIQSIHMPK GT                     VNSISN SR+QIL VVQRHFALQTSLNE
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSIHMPKPGT---------------------VNSISNASREQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRRE
        ASVMTEFIRAVK+RRE
Subjt:  ASVMTEFIRAVKRRRE

A0A6J1C548 uncharacterized protein LOC111008453 isoform X4 (e-value: 7.2e-89; identity: 84.65%)
Query:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK
        +S IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNEND+HD DNSSCSSDIGEK
Subjt:  ASGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEK

Query:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAK-STGRGSYGEIQSIHMP-------KPGT------------VNSISNASREQILQVVQRHFALQTSLNEA
        DNDFSSSIVFHKLSK K RQIRPWAPSSSAK +TGRGSYGEIQSIH+P       K GT            VNSISNASR+Q+L VVQRHFALQT+LNEA
Subjt:  DNDFSSSIVFHKLSKTKARQIRPWAPSSSAK-STGRGSYGEIQSIHMP-------KPGT------------VNSISNASREQILQVVQRHFALQTSLNEA

Query:  SVMTEFIRAVKRRRE
        SVMTEFIRAVKRRRE
Subjt:  SVMTEFIRAVKRRRE

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G19330.1 unknown protein (e-value: 5.2e-39; identity: 47.66%)
Query:  IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKDND
        IQS + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D+ DLD  +   +  +    
Subjt:  IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKDND

Query:  FSSSIVFHKLSKTKAR-----------QIRPWAPSSSAKSTG----RGSYGEIQSIHMPKP-------GTVNSISNASREQILQVVQRHFALQTSLNEAS
        F +S    K  K+K R             R  +  S +KS+G         ++  + MP           V++I N S+EQ++ +VQRHF  Q  ++E  
Subjt:  FSSSIVFHKLSKTKAR-----------QIRPWAPSSSAKSTG----RGSYGEIQSIHMPKP-------GTVNSISNASREQILQVVQRHFALQTSLNEAS

Query:  VMTEFIRAVKRRRE
        V+  F++A KR ++
Subjt:  VMTEFIRAVKRRRE

AT1G19330.2 unknown protein (e-value: 5.2e-39; identity: 48.08%)
Query:  IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEH-DLDNSSCS-SDIGEKD
        IQS + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D+  D +N+  + SD+  +D
Subjt:  IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEH-DLDNSSCS-SDIGEKD

Query:  --NDFSSSIVFHKLSKTKARQI-RPWAPSSSAKSTG----RGSYGEIQSIHMPKP-------GTVNSISNASREQILQVVQRHFALQTSLNEASVMTEFI
              S +   + S++  + + R  +  S +KS+G         ++  + MP           V++I N S+EQ++ +VQRHF  Q  ++E  V+  F+
Subjt:  --NDFSSSIVFHKLSKTKARQI-RPWAPSSSAKSTG----RGSYGEIQSIHMPKP-------GTVNSISNASREQILQVVQRHFALQTSLNEASVMTEFI

Query:  RAVKRRRE
        +A KR ++
Subjt:  RAVKRRRE

AT1G19330.3 unknown protein (e-value: 2.3e-39; identity: 47.91%)
Query:  IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKDND
        IQS + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D+ DLD  +   +  +    
Subjt:  IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKDND

Query:  FSSSIVFHKLSKTKAR-----------QIRPWAPSSSAKSTGRGSYGEIQSIHMPK---PGTVN---------SISNASREQILQVVQRHFALQTSLNEA
        F +S    K  K+K R             R  +  S +KS+G      +Q + + K   P  +N         +I N S+EQ++ +VQRHF  Q  ++E 
Subjt:  FSSSIVFHKLSKTKAR-----------QIRPWAPSSSAKSTGRGSYGEIQSIHMPK---PGTVN---------SISNASREQILQVVQRHFALQTSLNEA

Query:  SVMTEFIRAVKRRRE
         V+  F++A KR ++
Subjt:  SVMTEFIRAVKRRRE

AT1G75060.1 unknown protein (e-value: 4.8e-37; identity: 45.19%)
Query:  SGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKD
        S +QS F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLEHPTGNE D +DL+    +      D
Subjt:  SGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKD

Query:  NDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSI--------------------HMPKPGTVNSISNASREQILQVVQRHFALQTSLNEAS
             ++  HK SK +  +    +  +  +     S+ +I SI                    +      V+++ N ++EQ++ ++QRHF  Q  ++E  
Subjt:  NDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSI--------------------HMPKPGTVNSISNASREQILQVVQRHFALQTSLNEAS

Query:  VMTEFIRA
        V+  F++A
Subjt:  VMTEFIRA

AT1G75060.2 unknown protein (e-value: 3.7e-37; identity: 45.41%)
Query:  SGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKD
        S +QS F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLEHPTGNE D +DL+    +      D
Subjt:  SGIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKD

Query:  NDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSI-------------------HMPKPGTVNSISNASREQILQVVQRHFALQTSLNEASV
             ++  HK SK +  +    +  +  +     S+ +I SI                   +      V+++ N ++EQ++ ++QRHF  Q  ++E  V
Subjt:  NDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGEIQSI-------------------HMPKPGTVNSISNASREQILQVVQRHFALQTSLNEASV

Query:  MTEFIRA
        +  F++A
Subjt:  MTEFIRA


Sequences
CDS sequence
ATGAAAGAGCCTATTTATGAGCACCAATCTTATCCTGTCTTGCTGTGGTTCATTTATTATCAGTACAATGTGCCCGTGAAATACAATATACTGCATGACACTTACAGGAG
TTCAACATCTGTGAAGTTAGATTCCACTGAAGCAAGCAAAATTCATAATGCCTTACTGATCAGAAAAATAAGGCTGTTGGTTATTATAGTTCAAGTTGAGCTCGAATTCT
TGCACTATATGAACCACAGGGCTTCTGGAATACAGTCTCCTTTCCGTGAGGAGAGTGGAGATGAAGAGCTTTCTGTTCTTCCAAGGCATACTAAAGTTATTGTCACTGGA
AATAACAGAACAAAGTCTGTTTTATTGGGATTGCAAGGTGTGGTTAAGAAGGCTGTTGGCCTTGGAGGCTGGCATTGGCTGGTTTTGAAAAATGGTGTTGAAGTGAAGCT
CCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGAACATGATTTGGACAACTCAAGTTGTAGCTCTGACATCGGGGAGAAGGACAATGATT
TCTCTAGCAGCATTGTCTTCCACAAACTTAGCAAAACAAAAGCTAGGCAGATTAGGCCATGGGCTCCATCTTCATCAGCGAAGTCAACTGGTCGGGGCAGCTATGGAGAA
ATTCAGTCTATTCACATGCCAAAGCCAGGGACGGTGAATAGCATTTCAAATGCTTCAAGGGAGCAAATCCTTCAAGTTGTGCAACGGCATTTTGCATTACAGACAAGTTT
AAACGAGGCATCGGTGATGACTGAATTCATTCGAGCTGTTAAGAGACGAAGGGAGTGA
mRNA sequence
ATGAAAGAGCCTATTTATGAGCACCAATCTTATCCTGTCTTGCTGTGGTTCATTTATTATCAGTACAATGTGCCCGTGAAATACAATATACTGCATGACACTTACAGGAG
TTCAACATCTGTGAAGTTAGATTCCACTGAAGCAAGCAAAATTCATAATGCCTTACTGATCAGAAAAATAAGGCTGTTGGTTATTATAGTTCAAGTTGAGCTCGAATTCT
TGCACTATATGAACCACAGGGCTTCTGGAATACAGTCTCCTTTCCGTGAGGAGAGTGGAGATGAAGAGCTTTCTGTTCTTCCAAGGCATACTAAAGTTATTGTCACTGGA
AATAACAGAACAAAGTCTGTTTTATTGGGATTGCAAGGTGTGGTTAAGAAGGCTGTTGGCCTTGGAGGCTGGCATTGGCTGGTTTTGAAAAATGGTGTTGAAGTGAAGCT
CCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGAACATGATTTGGACAACTCAAGTTGTAGCTCTGACATCGGGGAGAAGGACAATGATT
TCTCTAGCAGCATTGTCTTCCACAAACTTAGCAAAACAAAAGCTAGGCAGATTAGGCCATGGGCTCCATCTTCATCAGCGAAGTCAACTGGTCGGGGCAGCTATGGAGAA
ATTCAGTCTATTCACATGCCAAAGCCAGGGACGGTGAATAGCATTTCAAATGCTTCAAGGGAGCAAATCCTTCAAGTTGTGCAACGGCATTTTGCATTACAGACAAGTTT
AAACGAGGCATCGGTGATGACTGAATTCATTCGAGCTGTTAAGAGACGAAGGGAGTGAGTGATGATATTGAAAAGTGATTCACGTAAATGCATGTTTGTATAGTGATTGA
TGATTATGCCTCCTAATTGTAGCCTATAAACTATGTAAACTTCTAAACCTCTAACATTGAGTTGATATTTCCTAGACCCATAGTATATTTTGAACTTCTCTCTCCCGAGT
CTTGATGAAATCCTGAGGGGGTTTGGGCACCAATTTAGTGTCACTAC
Protein sequence
MKEPIYEHQSYPVLLWFIYYQYNVPVKYNILHDTYRSSTSVKLDSTEASKIHNALLIRKIRLLVIIVQVELEFLHYMNHRASGIQSPFREESGDEELSVLPRHTKVIVTG
NNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSDIGEKDNDFSSSIVFHKLSKTKARQIRPWAPSSSAKSTGRGSYGE
IQSIHMPKPGTVNSISNASREQILQVVQRHFALQTSLNEASVMTEFIRAVKRRRE