CuGenDBv2

HG10002039 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10002039
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SAP30_Sin3_bdg domain-containing protein
Genome location: Chr11:2816699..2819093
RNA-Seq Expression: HG10002039
Synteny: HG10002039
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000118 - histone deacetylase complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
IPR025718 - Histone deacetylase complex subunit SAP30, Sin3 binding domain
IPR038291 - SAP30, C-terminal domain superfamily
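
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the `Chr:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper, not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr11:2816699..2819093' style string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr11:2816699..2819093")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 2395 bp on Chr11.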


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAA0044820.1 uncharacterized protein E6C27_scaffold74G001110 [Cucumis melo var. makuwa] (e-value: 3.7e-89; identity: 82.06%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQIL VVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

TYK16644.1 uncharacterized protein E5676_scaffold21G004740 [Cucumis melo var. makuwa] (e-value: 3.7e-89; identity: 82.06%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQIL VVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

XP_004146519.1 uncharacterized protein LOC101209634 [Cucumis sativus] (e-value: 7.9e-92; identity: 83.41%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+P APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQILQVVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

XP_008452044.1 PREDICTED: uncharacterized protein LOC103493170 [Cucumis melo] (e-value: 3.7e-89; identity: 82.06%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQIL VVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

XP_022136865.1 uncharacterized protein LOC111008453 isoform X4 [Momordica charantia] (e-value: 1.8e-88; identity: 82.43%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFC+SRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNEND+HD DNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMP-------KPGT------------VNSISNASRDQILQVVQRHFAL
        SS+IGEKDNDFSSSIVFHKLSK KVRQIRP APSSSAK      S GEIQSIH+P       K GT            VNSISNASRDQ+L VVQRHFAL
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMP-------KPGT------------VNSISNASRDQILQVVQRHFAL

Query:  QTSLNEASVMTEFIRAVKRRRD
        QT+LNEASVMTEFIRAVKRRR+
Subjt:  QTSLNEASVMTEFIRAVKRRRD

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0KUM7 SAP30_Sin3_bdg domain-containing protein (e-value: 3.8e-92; identity: 83.41%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+P APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQILQVVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

A0A1S3BU28 uncharacterized protein LOC103493170 (e-value: 1.8e-89; identity: 82.06%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQIL VVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

A0A5A7TNP9 SAP30_Sin3_bdg domain-containing protein (e-value: 1.8e-89; identity: 82.06%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQIL VVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

A0A5D3D093 SAP30_Sin3_bdg domain-containing protein (e-value: 1.8e-89; identity: 82.06%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFCAS IQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDE DLDNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA
        SS+IGEKDNDFSSS+VFHKLSKPKVRQI+  APSSSAKS      GEIQSIHMPK GT                     VNSISN SRDQIL VVQRHFA
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSN-----GEIQSIHMPKPGT---------------------VNSISNASRDQILQVVQRHFA

Query:  LQTSLNEASVMTEFIRAVKRRRD
        LQTSLNEASVMTEFIRAVK+RR+
Subjt:  LQTSLNEASVMTEFIRAVKRRRD

A0A6J1C548 uncharacterized protein LOC111008453 isoform X4 (e-value: 8.8e-89; identity: 82.43%)
Query:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC
        M+EPEFC+SRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNEND+HD DNSSC
Subjt:  MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSC

Query:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMP-------KPGT------------VNSISNASRDQILQVVQRHFAL
        SS+IGEKDNDFSSSIVFHKLSK KVRQIRP APSSSAK      S GEIQSIH+P       K GT            VNSISNASRDQ+L VVQRHFAL
Subjt:  SSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAK------SNGEIQSIHMP-------KPGT------------VNSISNASRDQILQVVQRHFAL

Query:  QTSLNEASVMTEFIRAVKRRRD
        QT+LNEASVMTEFIRAVKRRR+
Subjt:  QTSLNEASVMTEFIRAVKRRRD

SwissProt top hits
No hits found

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G19330.1 unknown protein (e-value: 1.3e-39; identity: 47.66%)
Query:  RIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKDN
        +IQS + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D+ DLD  +      +   
Subjt:  RIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKDN

Query:  DFSSSIVFHKLSKPKVR-----------QIRPLAPSSSAKSNG---------EIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNEA
         F +S    K  K K+R             R L+  S +KS+G         ++  + MP           V++I N S++Q++ +VQRHF  Q  ++E 
Subjt:  DFSSSIVFHKLSKPKVR-----------QIRPLAPSSSAKSNG---------EIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNEA

Query:  SVMTEFIRAVKRRR
         V+  F++A KR +
Subjt:  SVMTEFIRAVKRRR

AT1G19330.2 unknown protein (e-value: 4.8e-39; identity: 47.6%)
Query:  RIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEH-DLDNSSCS-SEIGEK
        +IQS + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D+  D +N+  + S++  +
Subjt:  RIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEH-DLDNSSCS-SEIGEK

Query:  D--NDFSSSIVFHKLSKPKVRQI-RPLAPSSSAKSNG---------EIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNEASVMTEF
        D      S +   + S+   + + R L+  S +KS+G         ++  + MP           V++I N S++Q++ +VQRHF  Q  ++E  V+  F
Subjt:  D--NDFSSSIVFHKLSKPKVRQI-RPLAPSSSAKSNG---------EIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNEASVMTEF

Query:  IRAVKRRR
        ++A KR +
Subjt:  IRAVKRRR

AT1G19330.3 unknown protein (e-value: 1.7e-39; identity: 47.44%)
Query:  RIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKDN
        +IQS + + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLE PTGNE D+ DLD  +      +   
Subjt:  RIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKDN

Query:  DFSSSIVFHKLSKPKVR-----------QIRPLAPSSSAKSNG----------EIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNE
         F +S    K  K K+R             R L+  S +KS+G          ++  + MP           V++I N S++Q++ +VQRHF  Q  ++E
Subjt:  DFSSSIVFHKLSKPKVR-----------QIRPLAPSSSAKSNG----------EIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNE

Query:  ASVMTEFIRAVKRRR
          V+  F++A KR +
Subjt:  ASVMTEFIRAVKRRR

AT1G75060.1 unknown protein (e-value: 1.6e-37; identity: 44.23%)
Query:  SRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNEND-EHDLDNSS--------
        S++QS F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLEHPTGNE D + ++D+S+        
Subjt:  SRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNEND-EHDLDNSS--------

Query:  ---------CSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSNGEIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNEAS
                  S + G + +  S   ++ ++S     +I  + P  + K   ++  + M            V+++ N +++Q++ ++QRHF  Q  ++E  
Subjt:  ---------CSSEIGEKDNDFSSSIVFHKLSKPKVRQIRPLAPSSSAKSNGEIQSIHMPKP-------GTVNSISNASRDQILQVVQRHFALQTSLNEAS

Query:  VMTEFIRA
        V+  F++A
Subjt:  VMTEFIRA

AT1G75060.2 unknown protein (e-value: 1.2e-37; identity: 46.6%)
Query:  SRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKD
        S++QS F + S +EELSVLPRHTKV+VTGNNRTKSVL+GLQGVVKKAVGLGGWHWLVL NG+EVKLQRNALSVLEHPTGNE D +DL+    +      D
Subjt:  SRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKD

Query:  NDFSSSIVFHKLSKPKVRQIR----PLAPSSSAKSNGEIQSI-------------------HMPKPGTVNSISNASRDQILQVVQRHFALQTSLNEASVM
             ++  HK  K   R  R     L    S  S+ +I SI                   +      V+++ N +++Q++ ++QRHF  Q  ++E  V+
Subjt:  NDFSSSIVFHKLSKPKVRQIR----PLAPSSSAKSNGEIQSI-------------------HMPKPGTVNSISNASRDQILQVVQRHFALQTSLNEASVM

Query:  TEFIRA
          F++A
Subjt:  TEFIRA
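
The % identity values listed with the hits above are, in BLAST-style output, the number of identical columns divided by the alignment length (gap columns included). A minimal sketch of that calculation from the alignment text follows, assuming the middle line of each block shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise. The function name `percent_identity` is hypothetical, and the length is taken from the query segment because trailing spaces of the middle line may have been lost in the page text.

```python
def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Compute % identity from (query_segment, midline) pairs of one hit.

    Identical columns are the letters on the midline; the alignment length
    is the total length of the query segments, gap characters included.
    """
    total_columns = sum(len(query) for query, _ in blocks)
    if total_columns == 0:
        raise ValueError("no alignment columns supplied")
    identical = sum(ch.isalpha() for _, midline in blocks for ch in midline)
    return 100.0 * identical / total_columns

# Last block of the first GenBank hit: 23 columns, 21 identical, 2 marked '+'.
last_block = [("LQTSLNEASVMTEFIRAVKRRRD", "LQTSLNEASVMTEFIRAVK+RR+")]
print(round(percent_identity(last_block), 2))
```

To check a whole hit, pass every block of that hit in one list so that identities and lengths are summed before dividing.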


Sequences
CDS sequence
ATGCTGGAACCAGAGTTTTGTGCTTCCAGAATACAGTCTCCTTTCCGTGAGGAGAGTGGGGATGAAGAGCTTTCAGTTCTTCCAAGGCACACTAAAGTTATTGTCACTGG
AAATAACAGAACAAAGTCTGTTTTATTGGGATTGCAAGGCGTGGTTAAGAAGGCCGTTGGCCTTGGAGGCTGGCATTGGCTGGTTTTGAAAAATGGGGTTGAAGTGAAGC
TGCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGAACATGATTTGGACAACTCAAGTTGTAGCTCCGAGATCGGGGAGAAGGATAATGAT
TTCTCTAGCAGCATTGTTTTCCACAAACTTAGCAAACCAAAAGTTAGGCAGATTAGGCCATTGGCTCCATCTTCATCAGCAAAGTCAAATGGAGAAATTCAGTCTATTCA
CATGCCAAAGCCAGGGACGGTGAATAGTATTTCAAATGCGTCAAGGGACCAAATCCTTCAAGTTGTGCAACGGCATTTTGCATTACAGACAAGTTTGAACGAGGCATCGG
TGATGACTGAATTCATTCGAGCTGTTAAGAGACGAAGAGACTGA
mRNA sequence
ATGCTGGAACCAGAGTTTTGTGCTTCCAGAATACAGTCTCCTTTCCGTGAGGAGAGTGGGGATGAAGAGCTTTCAGTTCTTCCAAGGCACACTAAAGTTATTGTCACTGG
AAATAACAGAACAAAGTCTGTTTTATTGGGATTGCAAGGCGTGGTTAAGAAGGCCGTTGGCCTTGGAGGCTGGCATTGGCTGGTTTTGAAAAATGGGGTTGAAGTGAAGC
TGCAAAGGAATGCATTGAGTGTGCTGGAACATCCCACAGGAAACGAAAATGATGAACATGATTTGGACAACTCAAGTTGTAGCTCCGAGATCGGGGAGAAGGATAATGAT
TTCTCTAGCAGCATTGTTTTCCACAAACTTAGCAAACCAAAAGTTAGGCAGATTAGGCCATTGGCTCCATCTTCATCAGCAAAGTCAAATGGAGAAATTCAGTCTATTCA
CATGCCAAAGCCAGGGACGGTGAATAGTATTTCAAATGCGTCAAGGGACCAAATCCTTCAAGTTGTGCAACGGCATTTTGCATTACAGACAAGTTTGAACGAGGCATCGG
TGATGACTGAATTCATTCGAGCTGTTAAGAGACGAAGAGACTGA
Protein sequence
MLEPEFCASRIQSPFREESGDEELSVLPRHTKVIVTGNNRTKSVLLGLQGVVKKAVGLGGWHWLVLKNGVEVKLQRNALSVLEHPTGNENDEHDLDNSSCSSEIGEKDND
FSSSIVFHKLSKPKVRQIRPLAPSSSAKSNGEIQSIHMPKPGTVNSISNASRDQILQVVQRHFALQTSLNEASVMTEFIRAVKRRRD
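
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that contains only A, C, G and T; `translate` is a hypothetical helper written for illustration. The line-wrapped CDS should be joined into one string before translating.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGCTGGAACCAGAGTTTTGTGCTTCCAGA"))  # MLEPEFCASR
```

The ten residues produced from the first 30 nucleotides match the start of the listed protein sequence.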