CuGenDBv2

HG10012470 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10012470
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DNA binding protein
Genome location: Chr01:21503250..21504902
RNA-Seq Expression: HG10012470
Synteny: HG10012470
Gene Ontology terms: NA
InterPro domains: NA
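
The genome location above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `Region` and `parse_location` are hypothetical helper names, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a string such as 'Chr01:21503250..21504902'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is after end in {text!r}")
    return Region(chrom, start, end)


region = parse_location("Chr01:21503250..21504902")
print(region.chrom, region.length)  # Chr01 1653
```

Under the inclusive-coordinate assumption, the gene spans 1653 bp on Chr01.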


Homology
GenBank top hits | e value | % identity | Alignment
KAE8650412.1 hypothetical protein Csa_010702 [Cucumis sativus] | e value: 2.2e-107 | % identity: 89.96
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR H NHLQSNSISHCQ+CGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMCFRCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSGG LCQSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAV-AGGTVEEEESKLQEKEVTAIFD
        +NGDAV +   VEEEE KLQEK+VTAIFD
Subjt:  TNGDAV-AGGTVEEEESKLQEKEVTAIFD

KAG6582143.1 hypothetical protein SDJN03_22145, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.2e-106 | % identity: 72.09
Query:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC
        + IEFKIQ KR KSP  FFFF  F       + P F                            +  S SRVSVPA+A AVPPMNR H NHLQSNSIS+C
Subjt:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC

Query:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG
         ECGISQS CWILHNVR KASFRRLCTNCVLK+NLSRFCPLCFD+YDDSTPP SHQRVMCFRCPSISH+SC SFRFSSTFLCP+CSDPCFAFFDGFDS G
Subjt:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG

Query:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL
        L QSE  VA LAGR  DGKSAKAIVAAARV+AQSMRRAA DARAVAEMKI+NA FAKKQATLALERLA+LVLQEKD+NGYAKTNG+A AG   EEEE++L
Subjt:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL

Query:  Q
        Q
Subjt:  Q

KAG7018542.1 hypothetical protein SDJN02_20411, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.5e-111 | % identity: 71.61
Query:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC
        + IEFKIQ KR KSP  FFFF  F       + P F                            +  S SRVSVPA+A AVPPMNR H NHLQSNSIS+C
Subjt:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC

Query:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG
         ECGISQS CWILHNVR KASFRRLCTNCVLK+NLSRFCPLCFD+YDDSTPP SHQRVMCFRCPSISH+SC SFRFSSTFLCP+CSDPCFAFFDGFDS G
Subjt:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG

Query:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL
        L QSE  VA LAGR  DGKSAKAIVAAARV+AQSMRRAA DARAVAEMKI+NA FAKKQATLALERLA+LVLQEKD+NGYAKTNG+A AG   EEEE++L
Subjt:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL

Query:  QEKEVTAIFDRMKANQT
        Q + VTAI +RMKAN T
Subjt:  QEKEVTAIFDRMKANQT

XP_022955770.1 uncharacterized protein LOC111457661 [Cucurbita moschata] | e value: 2.5e-111 | % identity: 71.61
Query:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC
        + IEFKIQ KR KSP  FFFF  F       + P F                            +  S SRVSVPA+A AVPPMNR H NHLQSNSIS+C
Subjt:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC

Query:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG
         ECGISQS CWILHNVR KASFRRLCTNCVLK+NLSRFCPLCFD+YDDSTPP SHQRVMCFRCPSISH+SC SFRFSSTFLCP+CSDPCFAFFDGFDS G
Subjt:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG

Query:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL
        L QSE  VA LAGR  DGKSAKAIVAAARV+AQSMRRAA DARAVAEMKI+NA FAKKQATLALERLA+LVLQEKD+NGYAKTNG+A AG   EEEE++L
Subjt:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL

Query:  QEKEVTAIFDRMKANQT
        Q + VTAI +RMKAN T
Subjt:  QEKEVTAIFDRMKANQT

XP_038902666.1 uncharacterized protein LOC120089301 [Benincasa hispida] | e value: 1.3e-120 | % identity: 94.92
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNRSHCNHLQSNS+SHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSC SFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT
        LCSDPCFAFFDGFDSGGLCQSESTVAFLA RNVD KSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKD+NGYAKT
Subjt:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT

Query:  NGDAVAG-GTVEEEESKLQEKEVTAIFDRMKANQTP
        NGDAVAG  TVEEE+SKLQEKEVT+IF+RMK N+TP
Subjt:  NGDAVAG-GTVEEEESKLQEKEVTAIFDRMKANQTP

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L5G4 Uncharacterized protein | e value: 2.1e-116 | % identity: 88.89
Query:  IGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMC
        IGFSL  SHSRVSVPA+A  VPPMNR H NHLQSNSISHCQ+CGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMC
Subjt:  IGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMC

Query:  FRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQ
        FRCPSISHLSCVSFRFSSTFLCPLCSDP F FFDGFDSGG LCQSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQ
Subjt:  FRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQ

Query:  ATLALERLAYLVLQEKDKNGYAKTNGDAV-AGGTVEEEESKLQEKEVTAIFD
        ATLALE+LAYLVLQEKDKNGY+K+NGDAV +   VEEEE KLQEK+VTAIFD
Subjt:  ATLALERLAYLVLQEKDKNGYAKTNGDAV-AGGTVEEEESKLQEKEVTAIFD

A0A1S4DSP5 uncharacterized protein LOC103483924 | e value: 9.7e-101 | % identity: 86.46
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR   NHLQ+NSISHCQECGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMC+RCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSG  L QSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD
        +NGDAVA    VEEEE KL EKE   +F+
Subjt:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD

A0A5A7U9X0 DNA binding protein | e value: 2.1e-103 | % identity: 88.65
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR   NHLQ+NSISHCQECGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMC+RCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSG  L QSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD
        +NGDAVA    VEEEE KL EKEVTAIFD
Subjt:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD

A0A6J1GVZ7 uncharacterized protein LOC111457661 | e value: 1.2e-111 | % identity: 71.61
Query:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC
        + IEFKIQ KR KSP  FFFF  F       + P F                            +  S SRVSVPA+A AVPPMNR H NHLQSNSIS+C
Subjt:  MDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHC

Query:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG
         ECGISQS CWILHNVR KASFRRLCTNCVLK+NLSRFCPLCFD+YDDSTPP SHQRVMCFRCPSISH+SC SFRFSSTFLCP+CSDPCFAFFDGFDS G
Subjt:  QECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGG

Query:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL
        L QSE  VA LAGR  DGKSAKAIVAAARV+AQSMRRAA DARAVAEMKI+NA FAKKQATLALERLA+LVLQEKD+NGYAKTNG+A AG   EEEE++L
Subjt:  LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKL

Query:  QEKEVTAIFDRMKANQT
        Q + VTAI +RMKAN T
Subjt:  QEKEVTAIFDRMKANQT

A0A6J1IXP5 uncharacterized protein LOC111479468 | e value: 1.9e-104 | % identity: 84.62
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR HCNHLQSNSIS+C ECGISQS CWILHNVR KASFRRLCTNCVLK+NLSRFCPLCFD+YDDSTPP SHQRVMCFRCPSISH+SC SFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT
        LCSDPCFAFFDGFDSG L QSE  V+ LAGR  DGKSAKAIVAAARV AQSMRRAA DARAVAEMK +NA FAKKQATLALERLAYLVLQEKD+NGYAKT
Subjt:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT

Query:  NGDAVAGGTVEEEESKLQEKEVTAIFDRMKANQT
        NG+A AG   EEEE++LQ + VTAI +RMKANQT
Subjt:  NGDAVAGGTVEEEESKLQEKEVTAIFDRMKANQT

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G09520.1 LOCATED IN: chloroplast | e value: 5.0e-25 | % identity: 36.07
Query:  SNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCF--RCPSISHLSCVSFRFSSTFLCPLCSDP-CF
        +NS   C +CG S +  W++H VRL+AS R  CT+C+L+++ + FCP CF +YD S  PPS +RV C    C S++H+ C       ++LCP C DP  F
Subjt:  SNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCF--RCPSISHLSCVSFRFSSTFLCPLCSDP-CF

Query:  AFFDGF-DSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTN-----
        +FF    D  G             R VD   ++A + AA+++A SM +A + A+   + + K AA AKK+A  ALE++  L  +EK ++   K       
Subjt:  AFFDGF-DSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTN-----

Query:  -----GDAVAGGTVEEEES
               A  G TV+E ES
Subjt:  -----GDAVAGGTVEEEES

AT3G17460.1 PHD finger family protein | e value: 2.1e-15 | % identity: 33.15
Query:  LHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFS--------------STFLCPLCSDPCFAFFDGFDS
        +H V    +FRRLCT+C+LK     FC +CF+++D++ PP +  R++C  CPS +HLSC +   S              S+F C  CS+P F FF     
Subjt:  LHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFS--------------STFLCPLCSDPCFAFFDGFDS

Query:  GGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
              E+ +          KSA A+VAA  +S  +M +A    +  A  KI  A  AK +A  AL  L  +V+++    G  K
Subjt:  GGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
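
In the pairwise alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts these for a single alignment block. It is an illustration only; `midline_stats` is a hypothetical helper, and the percent identity reported for each hit is computed over the whole alignment, so a per-block figure will generally differ from it.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Count identities and positives in one BLAST-style midline.

    `aligned_length` is the number of alignment columns in the block
    (the length of the Query line's sequence, gaps included). The
    midline is padded because trailing blanks are often stripped.
    """
    padded = midline.ljust(aligned_length)[:aligned_length]
    identities = sum(ch.isalpha() for ch in padded)   # identical residues
    positives = identities + padded.count("+")        # plus conservative ones
    return {
        "identities": identities,
        "positives": positives,
        "length": aligned_length,
        "percent_identity": 100.0 * identities / aligned_length,
    }


# Last block of the first GenBank hit (KAE8650412.1), copied from above.
query = "TNGDAV-AGGTVEEEESKLQEKEVTAIFD"
midline = "+NGDAV +   VEEEE KLQEK+VTAIFD"
stats = midline_stats(midline, len(query))
print(stats["identities"], stats["positives"], stats["length"])  # 21 24 29
```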


Sequences
CDS sequence
ATGGGCAAGACATCTCTAATGGACATTGAGTTCAAAATTCAAAGAAAGAGAGCCAAATCGCCATATGCCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCAGTCCAAT
CTCCATTCCTCCATTTTTCTTCTTCTTAAGCTTCCTTTCAAGGTTTCTCTCTCTGTTCTTCAAGCTTCAGAGCCCTACCCTCCCTTCCATCGGATTCTCTCTCTTCGCCT
CCCATTCTAGGGTTTCTGTTCCGGCGGATGCTGTTGCTGTTCCTCCGATGAATCGCTCACACTGCAACCATCTTCAATCCAATTCGATTTCTCACTGCCAAGAATGCGGC
ATCTCTCAGTCCGCTTGTTGGATCCTCCACAATGTCCGTCTCAAAGCTTCCTTCCGTCGTCTCTGCACCAATTGCGTCCTCAAGCACAACCTCTCCCGTTTCTGTCCTCT
TTGCTTCGACGTTTACGACGATTCGACTCCTCCGCCGTCTCATCAGCGAGTTATGTGCTTCAGATGCCCTTCAATCTCGCATCTCTCTTGCGTTTCCTTTCGATTTTCGT
CTACCTTTCTATGCCCTCTCTGCTCCGATCCTTGTTTTGCCTTCTTCGATGGGTTTGACTCCGGTGGCCTTTGTCAATCGGAGTCCACTGTCGCCTTTTTGGCCGGGAGA
AACGTTGATGGTAAATCAGCGAAAGCGATTGTCGCTGCGGCTCGTGTCTCCGCTCAATCCATGCGGAGAGCAGCTCTTGACGCTAGGGCTGTAGCGGAGATGAAGATCAA
GAACGCTGCCTTTGCTAAGAAACAAGCTACTCTCGCATTGGAACGGCTTGCTTATCTTGTGCTTCAGGAGAAGGACAAAAATGGATATGCTAAAACTAATGGAGATGCTG
TTGCTGGTGGGACGGTTGAAGAAGAAGAATCCAAGCTACAGGAGAAAGAGGTAACAGCCATTTTCGATCGTATGAAGGCGAATCAGACTCCGTAG
mRNA sequence
ATGGGCAAGACATCTCTAATGGACATTGAGTTCAAAATTCAAAGAAAGAGAGCCAAATCGCCATATGCCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCAGTCCAAT
CTCCATTCCTCCATTTTTCTTCTTCTTAAGCTTCCTTTCAAGGTTTCTCTCTCTGTTCTTCAAGCTTCAGAGCCCTACCCTCCCTTCCATCGGATTCTCTCTCTTCGCCT
CCCATTCTAGGGTTTCTGTTCCGGCGGATGCTGTTGCTGTTCCTCCGATGAATCGCTCACACTGCAACCATCTTCAATCCAATTCGATTTCTCACTGCCAAGAATGCGGC
ATCTCTCAGTCCGCTTGTTGGATCCTCCACAATGTCCGTCTCAAAGCTTCCTTCCGTCGTCTCTGCACCAATTGCGTCCTCAAGCACAACCTCTCCCGTTTCTGTCCTCT
TTGCTTCGACGTTTACGACGATTCGACTCCTCCGCCGTCTCATCAGCGAGTTATGTGCTTCAGATGCCCTTCAATCTCGCATCTCTCTTGCGTTTCCTTTCGATTTTCGT
CTACCTTTCTATGCCCTCTCTGCTCCGATCCTTGTTTTGCCTTCTTCGATGGGTTTGACTCCGGTGGCCTTTGTCAATCGGAGTCCACTGTCGCCTTTTTGGCCGGGAGA
AACGTTGATGGTAAATCAGCGAAAGCGATTGTCGCTGCGGCTCGTGTCTCCGCTCAATCCATGCGGAGAGCAGCTCTTGACGCTAGGGCTGTAGCGGAGATGAAGATCAA
GAACGCTGCCTTTGCTAAGAAACAAGCTACTCTCGCATTGGAACGGCTTGCTTATCTTGTGCTTCAGGAGAAGGACAAAAATGGATATGCTAAAACTAATGGAGATGCTG
TTGCTGGTGGGACGGTTGAAGAAGAAGAATCCAAGCTACAGGAGAAAGAGGTAACAGCCATTTTCGATCGTATGAAGGCGAATCAGACTCCGTAG
Protein sequence
MGKTSLMDIEFKIQRKRAKSPYAFFFFFFFFFFFSPISIPPFFFFLSFLSRFLSLFFKLQSPTLPSIGFSLFASHSRVSVPADAVAVPPMNRSHCNHLQSNSISHCQECG
ISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFFDGFDSGGLCQSESTVAFLAGR
NVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKLQEKEVTAIFDRMKANQTP
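
The protein sequence should be recoverable from the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence read in frame 0.

    Whitespace and line breaks are ignored. With `to_stop` set,
    translation ends at the first stop codon, which is not emitted.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGGCAAGACATCTCTAATGGACATTGAG"))  # MGKTSLMDIE
print(translate("AATCAGACTCCGTAG"))                 # NQTP
```

Both fragments agree with the start (`MGKTSLMDIE`) and end (`NQTP`) of the protein sequence shown above. Passing the full CDS, with its line breaks, to the same function would allow the whole translation to be compared.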