; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003262 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003262
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionC2H2-type domain-containing protein
Genome locationChr11:19445906..19449752
RNA-Seq ExpressionHG10003262
SyntenyHG10003262
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily
IPR044653 - C2H2-type zinc-finger protein AZF1/2/3-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN55980.1 hypothetical protein Csa_011357 [Cucumis sativus]2.9e-4253.53Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH
        MDH+ + LRIKIK+R+ NA A  VDD+D   QF S       +ET MEKGLLFQP TPVTA  +   S SP      VVDCS  L SWLLTGRRGRKS  
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH

Query:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI
          TS T         IP D L  KR+K ET SD +E S   K H KENKN   +EKEYKCN+C KVF++ +A+GGHKS H KANKTD AI KET +    
Subjt:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI

Query:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD
         TIGT+FE K T+S+ DSST     ++ ++DFDLN+LP ++
Subjt:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD

KGN56002.2 hypothetical protein Csa_011010 [Cucumis sativus]3.4e-1957.63Show/hide
Query:  KRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKETTANTTITTIGTTFESKITLSVDSS---TGR
        KR+KIET SD E+  K+K HS E KN  +EKEYKC++CSKVF++SRA+GGHKSSHYK  KT D  I KETT         T+    IT+S+DSS   T +
Subjt:  KRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKETTANTTITTIGTTFESKITLSVDSS---TGR

Query:  KRMMDFDLN-QLPSDDDD
        KR+MDFDLN QLP DD++
Subjt:  KRMMDFDLN-QLPSDDDD

TYK19398.1 TPRXL protein [Cucumis melo var. makuwa]1.3e-4755.6Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK
        MDH+ + LRIKIK+RN +  AA VD   D+D  KQFCS+ HM  DSET M+KGLLFQ  TP+    +   S SPS  + D+VDCS  L SW LTGRRGRK
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK

Query:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN
        SI ++ T            I  D L  KR+K ET SD EKS +    +KENKN  V+EK+YKCN+CSKVF+S +A+GGHKS HYKANKT IAI KE  AN
Subjt:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN

Query:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP
            TIGTT E KI +S V SST R      MMDFDLN+LP
Subjt:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP

XP_008449183.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103491133 [Cucumis melo]9.4e-2533.96Show/hide
Query:  MKVHWEGEEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAAS---SSSSVSE
        MK + EG   EEE+ TK  C+VCNREFNSIKALYGHM                              SWSLTAKRG KGI  V+ A ++S   SSSS +E
Subjt:  MKVHWEGEEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAAS---SSSSVSE

Query:  AKPKSHFTLIDRV-DQRKENNVSSPPTKRMKK-----------------------------MMNDWDNSVIDT-QERRVPNFDLNELPPNEDDDDNNDDN
        AK +S ++  D V DQRK   +       +KK                             ++    N    T QER++ NFDLNELP  E     +DDN
Subjt:  AKPKSHFTLIDRV-DQRKENNVSSPPTKRMKK-----------------------------MMNDWDNSVIDT-QERRVPNFDLNELPPNEDDDDNNDDN

Query:  GEAKANIAYQMDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLL
         E KA+I                   GN        +  +++F  V                               S  P  +               L
Subjt:  GEAKANIAYQMDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLL

Query:  TGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKE
         GRRGRK         +  T Y I  D L  KR KIET SD ++  K+ G           KEY C+ CSKVF++ RA+GGHKSSHYK NKT D  I KE
Subjt:  TGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKE

Query:  TTANTTITTIGTTFESKITLSVDS
        TT         TT E KIT+S+DS
Subjt:  TTANTTITTIGTTFESKITLSVDS

XP_011652725.1 putative protein TPRXL [Cucumis sativus]8.4e-5040.42Show/hide
Query:  EEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAASSSSSVSEAKPKSHFTLI
        E  EEE+ TK  CKVCNREFNSIKALYGHM                              SWS TAKRG KGIG +S A  A+ SSS S +         
Subjt:  EEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAASSSSSVSEAKPKSHFTLI

Query:  DRVDQRKENNVSSPPTKRMKKMMNDWDNSVIDTQERRVPNFDLNELPPNEDDDDNNDDNGEAKANIAYQMDHDCDGLRIKIKVRNGNAVAATVDDDDSNK
           + + E+  S P               V++ Q                                           +IK + RN      ++D +D NK
Subjt:  DRVDQRKENNVSSPPTKRMKKMMNDWDNSVIDTQERRVPNFDLNELPPNEDDDDNNDDNGEAKANIAYQMDHDCDGLRIKIKVRNGNAVAATVDDDDSNK

Query:  QFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----SIHTSYTIPDDIL-------------
        Q CS+G HMR D ETTMEKGL  QPPT  T   SSS SS PS A+ +  VVDC  SL TSW LTGRRGRK    S  +S T  DD++             
Subjt:  QFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----SIHTSYTIPDDIL-------------

Query:  -------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKETTANTTITTIGTTFES
                     + KR+KIET SD E+  K+K HS E KN  +EKEYKC++CSKVF++SRA+GGHKSSHYK  KT D  I KETT         T+   
Subjt:  -------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKETTANTTITTIGTTFES

Query:  KITLSVDSS---TGRKRMMDFDLN-QLPSDDDD
         IT+S+DSS   T +KR+MDFDLN QLP DD++
Subjt:  KITLSVDSS---TGRKRMMDFDLN-QLPSDDDD

TrEMBL top hitse value%identityAlignment
A0A0A0L1Z9 C2H2-type domain-containing protein1.4e-4253.53Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH
        MDH+ + LRIKIK+R+ NA A  VDD+D   QF S       +ET MEKGLLFQP TPVTA  +   S SP      VVDCS  L SWLLTGRRGRKS  
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRKSIH

Query:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI
          TS T         IP D L  KR+K ET SD +E S   K H KENKN   +EKEYKCN+C KVF++ +A+GGHKS H KANKTD AI KET +    
Subjt:  --TSYT---------IPDDILEAKRMKIETRSD-DEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTI

Query:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD
         TIGT+FE K T+S+ DSST     ++ ++DFDLN+LP ++
Subjt:  TTIGTTFESKITLSV-DSSTG----RKRMMDFDLNQLPSDD

A0A0A0L2N1 Uncharacterized protein1.6e-0935.94Show/hide
Query:  EEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAASSSSSVS-------EAKP
        E  EEE+ TK  CKVCNREFNSIKALYGHM                              SWS TAKRG KGIG +S A  A+ SSS S       EAK 
Subjt:  EEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAASSSSSVS-------EAKP

Query:  KSHFTLIDRVDQRKE------------NNV-----------------SSPPTKRMKKMMNDWDNSVIDT--QERRVPNFDLNELPPNEDDDD
        +S F+  D V+ +++            N V                       R  K++    ++V +   QER++ NFDLNELP  EDD++
Subjt:  KSHFTLIDRVDQRKE------------NNV-----------------SSPPTKRMKKMMNDWDNSVIDT--QERRVPNFDLNELPPNEDDDD

A0A0A0L4M3 C2H2-type domain-containing protein1.7e-4050.2Show/hide
Query:  IKIKVRNGNAVAATVDDDDSNKQFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----SIHT
        +KIK R  N    ++D +D NKQ CS+G HMR D ETTMEKGL  QPPT  T   SSS SS PS A+ +  VVDC  SL TSW LTGRRGRK    S  +
Subjt:  IKIKVRNGNAVAATVDDDDSNKQFCSVG-HMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASID--VVDCSISL-TSWLLTGRRGRK----SIHT

Query:  SYTIPDDIL--------------------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-D
        S T  DD++                          + KR+KIET SD E+  K+K HS E KN  +EKEYKC++CSKVF++SRA+GGHKSSHYK  KT D
Subjt:  SYTIPDDIL--------------------------EAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-D

Query:  IAITKETTANTTITTIGTTFESKITLSVDSS---TGRKRMMDFDLN-QLPSDDDD
          I KETT         T+    IT+S+DSS   T +KR+MDFDLN QLP DD++
Subjt:  IAITKETTANTTITTIGTTFESKITLSVDSS---TGRKRMMDFDLN-QLPSDDDD

A0A1S3BLH2 LOW QUALITY PROTEIN: uncharacterized protein LOC1034911334.5e-2533.96Show/hide
Query:  MKVHWEGEEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAAS---SSSSVSE
        MK + EG   EEE+ TK  C+VCNREFNSIKALYGHM                              SWSLTAKRG KGI  V+ A ++S   SSSS +E
Subjt:  MKVHWEGEEEEEEEKTKSSCKVCNREFNSIKALYGHM------------------------------SWSLTAKRGRKGIGSVSVAVAAS---SSSSVSE

Query:  AKPKSHFTLIDRV-DQRKENNVSSPPTKRMKK-----------------------------MMNDWDNSVIDT-QERRVPNFDLNELPPNEDDDDNNDDN
        AK +S ++  D V DQRK   +       +KK                             ++    N    T QER++ NFDLNELP  E     +DDN
Subjt:  AKPKSHFTLIDRV-DQRKENNVSSPPTKRMKK-----------------------------MMNDWDNSVIDT-QERRVPNFDLNELPPNEDDDDNNDDN

Query:  GEAKANIAYQMDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLL
         E KA+I                   GN        +  +++F  V                               S  P  +               L
Subjt:  GEAKANIAYQMDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLL

Query:  TGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKE
         GRRGRK         +  T Y I  D L  KR KIET SD ++  K+ G           KEY C+ CSKVF++ RA+GGHKSSHYK NKT D  I KE
Subjt:  TGRRGRK---------SIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKT-DIAITKE

Query:  TTANTTITTIGTTFESKITLSVDS
        TT         TT E KIT+S+DS
Subjt:  TTANTTITTIGTTFESKITLSVDS

A0A5D3D748 TPRXL protein6.5e-4855.6Show/hide
Query:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK
        MDH+ + LRIKIK+RN +  AA VD   D+D  KQFCS+ HM  DSET M+KGLLFQ  TP+    +   S SPS  + D+VDCS  L SW LTGRRGRK
Subjt:  MDHDCDGLRIKIKVRNGNAVAATVD---DDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAASIDVVDCSISLTSWLLTGRRGRK

Query:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN
        SI ++ T            I  D L  KR+K ET SD EKS +    +KENKN  V+EK+YKCN+CSKVF+S +A+GGHKS HYKANKT IAI KE  AN
Subjt:  SIHTSYT------------IPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNE-VMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN

Query:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP
            TIGTT E KI +S V SST R      MMDFDLN+LP
Subjt:  TTITTIGTTFESKITLS-VDSSTGR----KRMMDFDLNQLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G49930.1 C2H2 and C2HC zinc fingers superfamily protein2.6e-0432.04Show/hide
Query:  SLTSWLLTGRRGRKSIHTSYTIP--DDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKET
        +L SW    R  R  I      P  ++ L    + +   S D  SP    HS    ++  +K+YKC++C K F S +A+GGHK+SH K    D+  +  T
Subjt:  SLTSWLLTGRRGRKSIHTSYTIP--DDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKET

Query:  TAN
          N
Subjt:  TAN

AT5G67450.1 zinc-finger protein 17.4e-0435.38Show/hide
Query:  KEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTITTIGTTFESKITLSVDSSTG
        ++YKC +C K FSS +A+GGHK+SH K   T I    +  +N + +  G+     + ++V  +TG
Subjt:  KEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTANTTITTIGTTFESKITLSVDSSTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTTCATTGGGAAGGTGAAGAAGAAGAAGAAGAAGAAAAGACCAAATCTTCATGCAAAGTATGCAATAGAGAATTCAATTCCATCAAAGCATTGTATGGTCATAT
GAGCTGGTCCCTCACCGCCAAGAGGGGCCGTAAAGGCATTGGCTCTGTTTCTGTTGCTGTTGCCGCCTCCTCCTCTTCTTCAGTTTCTGAAGCTAAACCCAAATCTCATT
TTACCCTAATCGATAGAGTGGATCAACGAAAAGAGAATAATGTGAGTAGTCCTCCAACGAAGAGAATGAAGAAAATGATGAATGATTGGGATAATTCCGTAATAGATACT
CAAGAGAGGAGGGTCCCTAATTTTGATCTCAATGAGCTCCCACCGAATGAAGACGACGACGACAACAACGACGACAACGGAGAAGCTAAAGCAAACATAGCTTATCAGAT
GGATCATGATTGCGACGGACTCAGGATCAAGATCAAGGTCAGAAACGGCAACGCCGTTGCCGCCACGGTGGACGACGACGACAGTAATAAGCAGTTTTGTTCCGTTGGTC
ATATGAGGATTGATTCTGAAACAACAATGGAGAAGGGATTATTATTCCAACCCCCAACTCCGGTCACCGCCAACTGTTCTTCCTCTTTCTCTTCCTCTCCATCTGCCGCT
TCTATCGACGTTGTTGATTGTTCTATATCTCTAACCTCCTGGTTGCTTACTGGCCGGAGAGGGCGGAAGAGTATTCATACTTCTTACACTATCCCTGATGATATACTTGA
GGCAAAGAGAATGAAGATTGAGACTCGTTCCGATGACGAGAAGTCACCAAAAAGGAAAGGGCACAGTAAAGAAAATAAGAATGAGGTTATGGAGAAGGAATATAAATGCA
ATATATGTAGCAAAGTGTTCTCAAGTTCAAGAGCAATGGGAGGCCATAAGTCCAGCCACTACAAAGCCAATAAAACAGACATTGCCATTACAAAGGAAACTACTGCCAAT
ACTACTATTACTACCATTGGAACAACCTTCGAATCGAAGATAACGTTGTCGGTAGATTCATCGACGGGTCGGAAGAGGATGATGGATTTTGATCTCAACCAGCTGCCGTC
GGACGACGACGACGACCAAGACAGAAGCCATTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTTCATTGGGAAGGTGAAGAAGAAGAAGAAGAAGAAAAGACCAAATCTTCATGCAAAGTATGCAATAGAGAATTCAATTCCATCAAAGCATTGTATGGTCATAT
GAGCTGGTCCCTCACCGCCAAGAGGGGCCGTAAAGGCATTGGCTCTGTTTCTGTTGCTGTTGCCGCCTCCTCCTCTTCTTCAGTTTCTGAAGCTAAACCCAAATCTCATT
TTACCCTAATCGATAGAGTGGATCAACGAAAAGAGAATAATGTGAGTAGTCCTCCAACGAAGAGAATGAAGAAAATGATGAATGATTGGGATAATTCCGTAATAGATACT
CAAGAGAGGAGGGTCCCTAATTTTGATCTCAATGAGCTCCCACCGAATGAAGACGACGACGACAACAACGACGACAACGGAGAAGCTAAAGCAAACATAGCTTATCAGAT
GGATCATGATTGCGACGGACTCAGGATCAAGATCAAGGTCAGAAACGGCAACGCCGTTGCCGCCACGGTGGACGACGACGACAGTAATAAGCAGTTTTGTTCCGTTGGTC
ATATGAGGATTGATTCTGAAACAACAATGGAGAAGGGATTATTATTCCAACCCCCAACTCCGGTCACCGCCAACTGTTCTTCCTCTTTCTCTTCCTCTCCATCTGCCGCT
TCTATCGACGTTGTTGATTGTTCTATATCTCTAACCTCCTGGTTGCTTACTGGCCGGAGAGGGCGGAAGAGTATTCATACTTCTTACACTATCCCTGATGATATACTTGA
GGCAAAGAGAATGAAGATTGAGACTCGTTCCGATGACGAGAAGTCACCAAAAAGGAAAGGGCACAGTAAAGAAAATAAGAATGAGGTTATGGAGAAGGAATATAAATGCA
ATATATGTAGCAAAGTGTTCTCAAGTTCAAGAGCAATGGGAGGCCATAAGTCCAGCCACTACAAAGCCAATAAAACAGACATTGCCATTACAAAGGAAACTACTGCCAAT
ACTACTATTACTACCATTGGAACAACCTTCGAATCGAAGATAACGTTGTCGGTAGATTCATCGACGGGTCGGAAGAGGATGATGGATTTTGATCTCAACCAGCTGCCGTC
GGACGACGACGACGACCAAGACAGAAGCCATTCTTGA
Protein sequenceShow/hide protein sequence
MKVHWEGEEEEEEEKTKSSCKVCNREFNSIKALYGHMSWSLTAKRGRKGIGSVSVAVAASSSSSVSEAKPKSHFTLIDRVDQRKENNVSSPPTKRMKKMMNDWDNSVIDT
QERRVPNFDLNELPPNEDDDDNNDDNGEAKANIAYQMDHDCDGLRIKIKVRNGNAVAATVDDDDSNKQFCSVGHMRIDSETTMEKGLLFQPPTPVTANCSSSFSSSPSAA
SIDVVDCSISLTSWLLTGRRGRKSIHTSYTIPDDILEAKRMKIETRSDDEKSPKRKGHSKENKNEVMEKEYKCNICSKVFSSSRAMGGHKSSHYKANKTDIAITKETTAN
TTITTIGTTFESKITLSVDSSTGRKRMMDFDLNQLPSDDDDDQDRSHS