; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G001570 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G001570
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUbiquitin carboxyl-terminal hydrolase
Genome locationCmo_Chr20:784042..788989
RNA-Seq ExpressionCmoCh20G001570
SyntenyCmoCh20G001570
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570459.1 hypothetical protein SDJN03_29374, partial [Cucurbita argyrosperma subsp. sororia]1.3e-11399.04Show/hide
Query:  NEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAA
        N GQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAA
Subjt:  NEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAA

Query:  LRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGILRGILIWIRVYDTGSRLGSSGNITWHVSTYPLPPDLDQQLQYGGLTFLSLFF
        LRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGILRGILIWIRVYDTGSRLGS+GNITWHVSTYPLPPDLDQQLQYGGLTFLSLFF
Subjt:  LRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGILRGILIWIRVYDTGSRLGSSGNITWHVSTYPLPPDLDQQLQYGGLTFLSLFF

Query:  CVIIINDIC
        CVIIINDIC
Subjt:  CVIIINDIC

XP_004140174.2 uncharacterized protein LOC101221687 isoform X1 [Cucumis sativus]5.1e-10270.63Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNE--GQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGK
        MH VPSQ +EQSHPDPFEGRLEAFTPERE+SY+ASKNEDQWRWERDESKMPNSM SHMFNE  GQGGD  RSYFQGQRPNPKL LEKGSN+D R QSHGK
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNE--GQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGK

Query:  NMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP
        NMESRFGDG LPQNFDGLEQKFIDDII  +KEQ+DAEDEENARHRERI+AIN+QYEEQLAALR +HAGRRDELLRRES+AR HQYQKGI DHYPNGGIGP
Subjt:  NMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP

Query:  ENELGFLGIL--------------------------------------RGILIWIRVYDTGSRLGSSGNITWHVST-YPLPPDLDQ
         +  G  G+                                       RG     RVYDT SRLGS+GNITWHV   + LPPDLDQ
Subjt:  ENELGFLGIL--------------------------------------RGILIWIRVYDTGSRLGSSGNITWHVST-YPLPPDLDQ

XP_022943334.1 uncharacterized protein LOC111448131 [Cucurbita moschata]9.7e-10997.1Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
        MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM

Query:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN
        ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N
Subjt:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN

Query:  ELGFLGI
          G  G+
Subjt:  ELGFLGI

XP_022986988.1 uncharacterized protein LOC111484575 [Cucurbita maxima]2.9e-10594.69Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
        MHQVPSQR+EQSHPDPFEGRLEAFTPERENSYIASKNEDQWR ERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPK VLEKGSNSDLRFQSHGKNM
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM

Query:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN
        ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQ+D EDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N
Subjt:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN

Query:  ELGFLGI
          G  G+
Subjt:  ELGFLGI

XP_023512046.1 uncharacterized protein LOC111776883 [Cucurbita pepo subsp. pepo]3.7e-10896.14Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
        MHQVPSQRMEQSHPDPFEGRLEAFTPERENSY+ASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM

Query:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN
        ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQ+DAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N
Subjt:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN

Query:  ELGFLGI
          G  G+
Subjt:  ELGFLGI

TrEMBL top hitse value%identityAlignment
A0A1S3BME9 uncharacterized protein LOC1034914401.7e-9870.76Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNE--GQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGK
        MH VPSQ +EQSHPDPFEGRLEAFTPERE+SY+ASKNEDQWRWERDESKMPNSM SHMFNE  GQGGD  RSYFQGQRPNPKL LEKGSNSD R QSHGK
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNE--GQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGK

Query:  NMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP
        NMESRFGDG LPQNFDGLEQKFIDDII  +KEQ+DAEDEENARHRERI+AIN+QYEEQLAALR +HAGRRDELLRRES+AR HQYQKGI DHYPNGGIGP
Subjt:  NMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP

Query:  ENELGFLGIL--------------------------------------RGILIWIRVYDTGSRLGSSGNITWHVSTY
         +  G  G+                                       RG     RVYDT SRL S+GNITWHV  Y
Subjt:  ENELGFLGIL--------------------------------------RGILIWIRVYDTGSRLGSSGNITWHVSTY

A0A6J1FXP5 uncharacterized protein LOC1114481314.7e-10997.1Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
        MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM

Query:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN
        ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N
Subjt:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN

Query:  ELGFLGI
          G  G+
Subjt:  ELGFLGI

A0A6J1H5U1 uncharacterized protein LOC1114598384.5e-9686.47Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
        MH VPSQR+EQSHPDPFEGRLEAFTPERENS++ASKNEDQWRWERDESKMPNSMASHMFNEGQGGD  RSYFQGQRPN KLVLEKGSNSD R QSHGKNM
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM

Query:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN
        E+RFGDGLLPQNFDGLEQKFIDDII  +KEQ+DAEDEENARHRERI+AIN+QYEEQLAALRA+HAGRRDELL+RESSAR HQYQKGI DHY NGGIGP +
Subjt:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN

Query:  ELGFLGI
          G  G+
Subjt:  ELGFLGI

A0A6J1JI55 uncharacterized protein LOC1114845751.4e-10594.69Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
        MHQVPSQR+EQSHPDPFEGRLEAFTPERENSYIASKNEDQWR ERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPK VLEKGSNSDLRFQSHGKNM
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM

Query:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN
        ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQ+D EDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N
Subjt:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN

Query:  ELGFLGI
          G  G+
Subjt:  ELGFLGI

A0A6J1K145 uncharacterized protein LOC1114907594.5e-9686.47Show/hide
Query:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM
        MH VPSQR+EQSHPDPFEGRLEAFTPERENS++ASKNEDQWRWERDESKMPNSMASHMFNEGQGGD  RSYFQGQRPN KLVLEKGSNSD R QSHGKNM
Subjt:  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNM

Query:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN
        E+RFGDGLLPQNFDGLEQKFIDDII  +KEQ+DAEDEENARHRERI+AIN+QYEEQLAALRA+HAGRRDELL+RESSAR HQYQKGI DHY NGGIGP +
Subjt:  ESRFGDGLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPEN

Query:  ELGFLGI
          G  G+
Subjt:  ELGFLGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G22040.1 unknown protein8.6e-3945.69Show/hide
Query:  RMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGD
        +++  H D F+G+LEAFTPER+  Y  S+ E QWRWERD   M   MA+ ++NEGQ G D  R+Y++GQ  +PK  +EK  +       H +N ++ + +
Subjt:  RMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGD

Query:  GLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN
            Q F+GLEQKF+DDI   +K+Q +AED E ARHRE+I  IN++YEEQLA LRA+H G+R+E++R+ES AR  Q+++   G+ D Y    +G  N
Subjt:  GLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN

AT5G22040.2 unknown protein8.6e-3945.69Show/hide
Query:  RMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGD
        +++  H D F+G+LEAFTPER+  Y  S+ E QWRWERD   M   MA+ ++NEGQ G D  R+Y++GQ  +PK  +EK  +       H +N ++ + +
Subjt:  RMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGD

Query:  GLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN
            Q F+GLEQKF+DDI   +K+Q +AED E ARHRE+I  IN++YEEQLA LRA+H G+R+E++R+ES AR  Q+++   G+ D Y    +G  N
Subjt:  GLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN

AT5G22040.3 unknown protein8.6e-3945.69Show/hide
Query:  RMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGD
        +++  H D F+G+LEAFTPER+  Y  S+ E QWRWERD   M   MA+ ++NEGQ G D  R+Y++GQ  +PK  +EK  +       H +N ++ + +
Subjt:  RMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGD

Query:  GLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN
            Q F+GLEQKF+DDI   +K+Q +AED E ARHRE+I  IN++YEEQLA LRA+H G+R+E++R+ES AR  Q+++   G+ D Y    +G  N
Subjt:  GLLPQNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCAAGTTCCCAGTCAAAGGATGGAACAAAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATATAGCTTCGAAAAA
TGAGGATCAATGGAGATGGGAAAGAGATGAGTCGAAGATGCCAAATTCTATGGCCTCTCACATGTTCAATGAAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAG
GTCAGAGACCAAATCCAAAATTGGTTCTTGAGAAAGGAAGCAACAGCGATCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCG
CAGAATTTTGACGGTCTCGAGCAGAAATTCATTGATGACATCATTAACTTTTCCAAGGAACAAAGTGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGAGAATTAT
CGCTATCAATTCTCAGTACGAGGAGCAATTAGCAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATC
AGAAGGGAATAAGGGACCATTACCCAAATGGGGGCATTGGCCCAGAGAATGAACTCGGTTTCTTGGGAATTCTGCGCGGAATTCTAATTTGGATTCGAGTTTATGACACT
GGCTCACGGTTAGGTTCCAGTGGCAACATCACGTGGCACGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCT
CTTCTTTTGTGTTATTATTATCAATGATATCTGTTAG
mRNA sequenceShow/hide mRNA sequence
GACATAATTATGAAAGGAAGGACTTGTTAGAATTTTTGTTGGAGTAGAGAGATCAACTGATCAAGCGCTACGGGGTTTGTGTAGCGGGTCACAGTTACGGGGCGTGCGGA
TCTACCATGACCCGAACTGGATCGTCCTCATCTAGCATCTCATTTGCTCGAGACGCTGCGGGCTTCGATTTCAATCTGGAGAGGATAACTTGACCATTACAGAGAAAAAA
TCATATAAGGTGGGCGTAACAATGAGACAGTAGGGGCAATATCCTGATTCAGGACTTGGCTCATATTCAGTTTCTCAAATGCATCAAGTTCCCAGTCAAAGGATGGAACA
AAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATATAGCTTCGAAAAATGAGGATCAATGGAGATGGGAAAGAGATGAGT
CGAAGATGCCAAATTCTATGGCCTCTCACATGTTCAATGAAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAGGTCAGAGACCAAATCCAAAATTGGTTCTTGAG
AAAGGAAGCAACAGCGATCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCGCAGAATTTTGACGGTCTCGAGCAGAAATTCAT
TGATGACATCATTAACTTTTCCAAGGAACAAAGTGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGAGAATTATCGCTATCAATTCTCAGTACGAGGAGCAATTAG
CAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATCAGAAGGGAATAAGGGACCATTACCCAAATGGG
GGCATTGGCCCAGAGAATGAACTCGGTTTCTTGGGAATTCTGCGCGGAATTCTAATTTGGATTCGAGTTTATGACACTGGCTCACGGTTAGGTTCCAGTGGCAACATCAC
GTGGCACGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCTCTTCTTTTGTGTTATTATTATCAATGATATCT
GTTAGGGCCTCGCTCTGAAACTGCCTGCTCAATATTCACAGCATACACACTTTTACTTGCTGGGATAAAGAAATCATGTTGTTCTTCGCTTTAATATTTCACTTTGTTAG
TGATTCTGTTAGAATTGGATGCCATTCCCTTTCTCTTTCGCCTGGGTTTGGTTTGGTTTCCTTGAAGCCAAATAGAATCCTACCTTCATTCCCAAATAAAGTTATTGGGG
TTTAAGGATTGCTTACTCTTTTACCATCACCATTACCGTCGAGCGGTTTAGTACACACGACTCAATTCCAACATAATTACTCTGGTATAGTTAGCTCTTTTAGGTACTTG
AATATTTTCTAACAGTTTTGGTCAATTTACAGTTATTGCTTTTGTAAACATAGTAGCTAAATAATCCCAAC
Protein sequenceShow/hide protein sequence
MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLP
QNFDGLEQKFIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGILRGILIWIRVYDT
GSRLGSSGNITWHVSTYPLPPDLDQQLQYGGLTFLSLFFCVIIINDIC