CuGenDBv2

Cucsat.G14457 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G14457
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ctg1869:1230014..1232037
RNA-Seq Expression: Cucsat.G14457
Synteny: Cucsat.G14457
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
ERM96107.1 hypothetical protein AMTR_s02760p00000080, partial [Amborella trichopoda]; e value 7.44e-106; % identity 57.23
Query:  SKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQ-KKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEV
        SKIT+HKL GSNY +W +TI  YLRS D DDH+T+DPP++    +K WLR+DARL+LQI+NSI++E+I L++HCE VKEL+ +L+FLYSGK+ + R+++V
Subjt:  SKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQ-KKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEV

Query:  CMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPS
        C  F+RAE++ +S+ +YFM  KK   EL +LLPFSPDVKVQQ QRE+MA+M FL GL  EF  AK+Q+LS S + SL D FTRVLR E   T+ S P  +
Subjt:  CMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPS

Query:  SALFSKNNNPRAPQRNST------------DHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQH-AQIASTCDIPEASVTISADEFAKFQNYQESL
        SAL S+NN+  AP+RNST            D+R P S  I+CNYC+KPGH K +CRKL +KN  R QH A IAST D  + SV ISADEFAKF  YQE+L
Subjt:  SALFSKNNNPRAPQRNST------------DHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQH-AQIASTCDIPEASVTISADEFAKFQNYQESL

Query:  QASSSS-TPIA
        ++SSSS T IA
Subjt:  QASSSS-TPIA

KAA0033068.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 2.65e-134; % identity 76.87
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED PKDAK+KKDWLRDDARLYLQIKNSIESEIIGLV          ++++               C++FFRAEQKAESVT+YFMRLKKI A L 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK
        LLLPFSPDVKVQQ QREKM V IFLNGLLPEFGM K QILSDSKIPSLDDAFTRVLRIESSP  VSIPQ SSAL SKNNNPRAP       QR S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG
        P S +IVCNYC KPGH+KRDCRKLLYKNSQ+SQHAQIASTCDIPEASVTISADE+ KFQNYQ+ LQASSSSTPIASTVAPG
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG

KAA0033139.1 uncharacterized protein E6C27_scaffold269G002790 [Cucumis melo var. makuwa]; e value 1.43e-123; % identity 65.09
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MADI+N+VVSNVI LASKITEHKLNGSNYYDWR+TI FYL+STDMDDHMTE+ P++AK+KK+WL DDARLYLQIKNSIESEIIGL+DH            
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
                                 +ESVT+YFMRLKKI AEL LLLPF+PDVKVQQ QREKMAVMI LNGLLPEFGM KTQILS+SKIPSLDDAFTRVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
        RIESSP  VSIPQ +S L SKNNNPRAP+                      G+++   RK++  ++Q  Q   IASTCDIPEAS+TISADE+AKFQNYQ+
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE

Query:  SLQASSSSTPIASTVAPG
        SLQA SSSTP+ASTVAPG
Subjt:  SLQASSSSTPIASTVAPG

XP_031738595.1 uncharacterized protein LOC116402733 isoform X2 [Cucumis sativus]; e value 5.56e-108; % identity 99.39
Query:  MAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLL
        M VMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLL
Subjt:  MAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLL

Query:  YKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGSCDEEDYW
        YKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGSCDEEDYW
Subjt:  YKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGSCDEEDYW

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]; e value 5.51e-211; % identity 100
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
        RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE

Query:  SLQASSSSTPIASTVAPG
        SLQASSSSTPIASTVAPG
Subjt:  SLQASSSSTPIASTVAPG
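
The % identity values listed with each hit can be approximated from the alignment blocks themselves. The following is a minimal sketch, assuming the middle line of each block marks an exact match with the residue letter, a conservative substitution with `+`, and a mismatch or gap with a space, and assuming identity is counted as exact matches over alignment columns. The function name `percent_identity` is hypothetical and is not part of CuGenDB; the database may compute its figure differently.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns marked as exact matches.

    `query` is the text after "Query:  " (all blocks of a hit concatenated),
    `midline` is the matching middle line. The alignment length is taken
    from `query` because trailing spaces on the midline are often stripped.
    """
    if not query:
        raise ValueError("empty alignment")
    # Pad the midline so every query column has a corresponding marker.
    padded = midline.ljust(len(query))
    # A letter on the midline means identical residues in that column;
    # '+' (similar residues) and ' ' (mismatch or gap) do not count.
    matches = sum(1 for marker in padded[: len(query)] if marker.isalpha())
    return 100.0 * matches / len(query)


if __name__ == "__main__":
    # Toy block: one mismatching column out of six.
    print(round(percent_identity("MAVMIF", "M VMIF"), 2))
```

To apply it to a hit on this page, concatenate the Query lines of all blocks for that hit, do the same for the middle lines, and pass both strings to the function.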

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SR90 Gag-pol polyprotein; e value 1.28e-134; % identity 76.87
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED PKDAK+KKDWLRDDARLYLQIKNSIESEIIGLV          ++++               C++FFRAEQKAESVT+YFMRLKKI A L 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK
        LLLPFSPDVKVQQ QREKM V IFLNGLLPEFGM K QILSDSKIPSLDDAFTRVLRIESSP  VSIPQ SSAL SKNNNPRAP       QR S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG
        P S +IVCNYC KPGH+KRDCRKLLYKNSQ+SQHAQIASTCDIPEASVTISADE+ KFQNYQ+ LQASSSSTPIASTVAPG
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG

A0A5A7SVC9 Uncharacterized protein; e value 6.91e-124; % identity 65.09
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MADI+N+VVSNVI LASKITEHKLNGSNYYDWR+TI FYL+STDMDDHMTE+ P++AK+KK+WL DDARLYLQIKNSIESEIIGL+DH            
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
                                 +ESVT+YFMRLKKI AEL LLLPF+PDVKVQQ QREKMAVMI LNGLLPEFGM KTQILS+SKIPSLDDAFTRVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
        RIESSP  VSIPQ +S L SKNNNPRAP+                      G+++   RK++  ++Q  Q   IASTCDIPEAS+TISADE+AKFQNYQ+
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE

Query:  SLQASSSSTPIASTVAPG
        SLQA SSSTP+ASTVAPG
Subjt:  SLQASSSSTPIASTVAPG

A0A5A7T406 Copia protein; e value 2.25e-104; % identity 67.26
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED P+DAK KKDWLRDDARLYLQIKNSI                           KEQVHRMFEVCMQF RAEQKAESVT+YFMRLKKI AEL 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK
        LLLPFSPDVK Q                          ILSDSKIPSLD+AFTRVL  ESSP  VSIPQ S++L SKNNNPRAP+         S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG
        P+S EIVCNYCRKP H KRDCRKLLYKNSQ+SQHAQIASTCDIPEAS+TISA+E AK QNYQ+SLQASSSSTPIASTV PG
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG

A0A5D3E5M8 Copia protein; e value 2.28e-104; % identity 67.62
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED P+DAK KKDWLRDDARLYLQIKNSI                           KEQVHRMFEVCMQF RAEQKAESVT+YFMRLKKI AEL 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK
        LLLPFSPDVK Q                          ILSDSKIPSLD+AFTRVLR ESSP  VSIPQ S++L SKNNNPRAP+         S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG
        P+S EIVCNYCRKP H KRDCRKLLYKNSQ+SQHAQIASTCDIPEAS+TISA+E AK QNYQ+SLQASSSSTPIASTV PG
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPG

U5CZW1 Uncharacterized protein (Fragment); e value 3.60e-106; % identity 57.23
Query:  SKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQ-KKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEV
        SKIT+HKL GSNY +W +TI  YLRS D DDH+T+DPP++    +K WLR+DARL+LQI+NSI++E+I L++HCE VKEL+ +L+FLYSGK+ + R+++V
Subjt:  SKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQ-KKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEV

Query:  CMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPS
        C  F+RAE++ +S+ +YFM  KK   EL +LLPFSPDVKVQQ QRE+MA+M FL GL  EF  AK+Q+LS S + SL D FTRVLR E   T+ S P  +
Subjt:  CMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPS

Query:  SALFSKNNNPRAPQRNST------------DHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQH-AQIASTCDIPEASVTISADEFAKFQNYQESL
        SAL S+NN+  AP+RNST            D+R P S  I+CNYC+KPGH K +CRKL +KN  R QH A IAST D  + SV ISADEFAKF  YQE+L
Subjt:  SALFSKNNNPRAPQRNST------------DHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQH-AQIASTCDIPEASVTISADEFAKFQNYQESL

Query:  QASSSS-TPIA
        ++SSSS T IA
Subjt:  QASSSS-TPIA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G53670.1 unknown protein; e value 3.8e-04; % identity 26.12
Query:  LNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLV-DHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRA
        L+GSN+ +W+  +L  L   D+D  +  + P   K+ K W R +    + +K  I     G+V D   + K+ L  L+  ++  E+  R          +
Subjt:  LNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLV-DHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRA

Query:  EQKAESVTSYFMRLKKIIAE---LGLLLPFSPDV
          + E+V    MR+K + A+   LG+   FS D+
Subjt:  EQKAESVTSYFMRLKKIIAE---LGLLLPFSPDV


Sequences
CDS sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCCAATTATTACGATTGGCGTCGGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTTCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGC
AAGAACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGA
TTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTA
AGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGATCGTGTGACGAAGAAGATTATTGGTAG
mRNA sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCCAATTATTACGATTGGCGTCGGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTTCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGC
AAGAACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGA
TTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTA
AGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGATCGTGTGACGAAGAAGATTATTGGTAG
Protein sequence
MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVH
RMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFS
KNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGSCDEEDYW
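
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and do not come from CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted in as-is. Codons with unexpected characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the CDS listed above.
    print(translate("ATGGCCGACATAAAAAATTTGGTAGTCTCC"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (with line breaks removed) gives a quick consistency check on the record.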