; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G3762 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G3762
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionRetrovirus-related Pol polyprotein from transposon RE2
Genome locationctg105:559612..561188
RNA-Seq ExpressionCucsat.G3762
SyntenyCucsat.G3762
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033068.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.64e-15077.45Show/hide
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED PKDAK+KKDWLRDDARLYLQIKNSIESEIIGLV          ++++               C++FFRAEQKAESVT+YFMRLKKI A L 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK
        LLLPFSPDVKVQQ QREKM V IFLNGLLPEFGM K QILSDSKIPSLDDAFTRVLRIESSP  VSIPQ SSAL SKNNNPRAP       QR S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT
        P S +IVCNYC KPGH+KRDCRKLLYKNSQ+SQHAQIASTCDIPEASVTISADE+ KFQNYQ+ LQASSSSTPIASTVAPGN KCLLTSSTKWVIDS AT
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT

Query:  AHMTGS
         HMTG+
Subjt:  AHMTGS

KAA0033139.1 uncharacterized protein E6C27_scaffold269G002790 [Cucumis melo var. makuwa]7.72e-11564.38Show/hide
Query:  YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKK
        YL+STDMDDHMTE+ P++AK+KK+WL DDARLYLQIKNSIESEIIGL+DH                                     +ESVT+YFMRLKK
Subjt:  YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKK

Query:  IIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKP
        I AEL LLLPF+PDVKVQQ QREKMAVMI LNGLLPEFGM KTQILS+SKIPSLDDAFTRVLRIESSP  VSIPQ +S L SKNNNPRAP+         
Subjt:  IIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKP

Query:  ESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATA
                     G+++   RK++  ++Q  Q   IASTCDIPEAS+TISADE+AKFQNYQ+SLQA SSSTP+ASTVAPGN KCLLTSSTKWVIDSGAT 
Subjt:  ESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATA

Query:  HMTGSC
        HMTG C
Subjt:  HMTGSC

KAA0038222.1 Copia protein [Cucumis melo var. makuwa]4.84e-11268.26Show/hide
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED P+DAK KKDWLRDDARLYLQIKNSI                           KEQVHRMFEVCMQF RAEQKAESVT+YFMRLKKI AEL 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK
        LLLPFSPDVK Q                          ILSDSKIPSLD+AFTRVL  ESSP  VSIPQ S++L SKNNNPRAP+         S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW
        P+S EIVCNYCRKP H KRDCRKLLYKNSQ+SQHAQIASTCDIPEAS+TISA+E AK QNYQ+SLQASSSSTPIASTV PGN KCLLTSSTKW
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW

XP_031738594.1 uncharacterized protein LOC116402733 isoform X1 [Cucumis sativus]6.13e-12699.47Show/hide
Query:  MAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLL
        M VMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLL
Subjt:  MAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLL

Query:  YKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGSCDEEDYW
        YKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGSCDEEDYW
Subjt:  YKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGSCDEEDYW

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]3.63e-19999.67Show/hide
Query:  YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKK
        YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKK
Subjt:  YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKK

Query:  IIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKP
        IIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKP
Subjt:  IIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKP

Query:  ESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATA
        ESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATA
Subjt:  ESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATA

Query:  HMTGS
        HMTG+
Subjt:  HMTGS

TrEMBL top hitse value%identityAlignment
A0A5A7SR90 Gag-pol polyprotein1.28e-15077.45Show/hide
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED PKDAK+KKDWLRDDARLYLQIKNSIESEIIGLV          ++++               C++FFRAEQKAESVT+YFMRLKKI A L 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK
        LLLPFSPDVKVQQ QREKM V IFLNGLLPEFGM K QILSDSKIPSLDDAFTRVLRIESSP  VSIPQ SSAL SKNNNPRAP       QR S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT
        P S +IVCNYC KPGH+KRDCRKLLYKNSQ+SQHAQIASTCDIPEASVTISADE+ KFQNYQ+ LQASSSSTPIASTVAPGN KCLLTSSTKWVIDS AT
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT

Query:  AHMTGS
         HMTG+
Subjt:  AHMTGS

A0A5A7SVC9 Uncharacterized protein3.74e-11564.38Show/hide
Query:  YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKK
        YL+STDMDDHMTE+ P++AK+KK+WL DDARLYLQIKNSIESEIIGL+DH                                     +ESVT+YFMRLKK
Subjt:  YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKK

Query:  IIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKP
        I AEL LLLPF+PDVKVQQ QREKMAVMI LNGLLPEFGM KTQILS+SKIPSLDDAFTRVLRIESSP  VSIPQ +S L SKNNNPRAP+         
Subjt:  IIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKP

Query:  ESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATA
                     G+++   RK++  ++Q  Q   IASTCDIPEAS+TISADE+AKFQNYQ+SLQA SSSTP+ASTVAPGN KCLLTSSTKWVIDSGAT 
Subjt:  ESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATA

Query:  HMTGSC
        HMTG C
Subjt:  HMTGSC

A0A5A7T406 Copia protein2.34e-11268.26Show/hide
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED P+DAK KKDWLRDDARLYLQIKNSI                           KEQVHRMFEVCMQF RAEQKAESVT+YFMRLKKI AEL 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK
        LLLPFSPDVK Q                          ILSDSKIPSLD+AFTRVL  ESSP  VSIPQ S++L SKNNNPRAP+         S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW
        P+S EIVCNYCRKP H KRDCRKLLYKNSQ+SQHAQIASTCDIPEAS+TISA+E AK QNYQ+SLQASSSSTPIASTV PGN KCLLTSSTKW
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW

A0A5D3D1J3 Uncharacterized protein2.07e-11164Show/hide
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTE+ P++AK+KK+WL DDARLYLQIKNSIESEIIGL+DH                                     +ESVT+YFMRLKKI AEL 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIV
        LLLPF+PDVKVQQ QREKMAVMI LNGLLPEFGM KTQILS+SKIPSLDDAFTRVLRIESSP  VSIPQ +S L SKNNNPRAP+               
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIV

Query:  CNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGSC
               G+++   RK++  ++Q  Q   IASTCDIPEAS+TISADE+AKFQNYQ+SLQA SSSTP+ASTVAPGN KCLLTSSTKWVIDSGAT HMTG C
Subjt:  CNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGSC

A0A5D3E5M8 Copia protein2.70e-11268.6Show/hide
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED P+DAK KKDWLRDDARLYLQIKNSI                           KEQVHRMFEVCMQF RAEQKAESVT+YFMRLKKI AEL 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK
        LLLPFSPDVK Q                          ILSDSKIPSLD+AFTRVLR ESSP  VSIPQ S++L SKNNNPRAP+         S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW
        P+S EIVCNYCRKP H KRDCRKLLYKNSQ+SQHAQIASTCDIPEAS+TISA+E AK QNYQ+SLQASSSSTPIASTV PGN KCLLTSSTKW
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGATCAA
GAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCATAGAA
TGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTTACCT
TTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTTCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTCTCTC
TGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGCAAGA
ACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGATTGT
CGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTAAGTT
TCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACCAAATGGGTCA
TAGACTCTGGTGCCACAGCTCATATGACAGGATCGTGTGACGAAGAAGATTATTGGTAG
mRNA sequenceShow/hide mRNA sequence
TATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGATCAA
GAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCATAGAA
TGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTTACCT
TTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTTCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTCTCTC
TGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGCAAGA
ACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGATTGT
CGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTAAGTT
TCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACCAAATGGGTCA
TAGACTCTGGTGCCACAGCTCATATGACAGGATCGTGTGACGAAGAAGATTATTGGTAG
Protein sequenceShow/hide protein sequence
YLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLP
FSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDC
RKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGSCDEEDYW