; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020653 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020653
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr03:10829760..10833255
RNA-Seq ExpressionPay0020653
SyntenyPay0020653
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]2.7e-29148.11Show/hide
Query:  ISSLKLKLREVQNENDQILKSVKMLNSGTKNLDSILKSGHNGSHRYGLGF----------------------VASASRAMHN-------------LALSG
        IS LK ++  + ++ + + KS+KMLN G+  LD +L  G N  ++ GLGF                        S  R+ H+                 G
Subjt:  ISSLKLKLREVQNENDQILKSVKMLNSGTKNLDSILKSGHNGSHRYGLGF----------------------VASASRAMHN-------------LALSG

Query:  ELNLLKDVRLPLHPF--------RPQMM---------------------------------------------HCVTGHVTFGDGTKGKIIAKGNIDKNN
        +   +K     LHP         R +MM                                              C T +VTFGDG+KGKII  G +  + 
Subjt:  ELNLLKDVRLPLHPF--------RPQMM---------------------------------------------HCVTGHVTFGDGTKGKIIAKGNIDKNN

Query:  LPRLNDL-------------------------------------------------------------------------------GHVSMKGLEKVIKN
        LP LN +                                                                               GH+ ++G++K+I  
Subjt:  LPRLNDL-------------------------------------------------------------------------------GHVSMKGLEKVIKN

Query:  EAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDTVEICKNLCLKLQRE
         A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K++T E+ K L L+LQRE
Subjt:  EAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDTVEICKNLCLKLQRE

Query:  KGKKITRIRSDHGKDFDN---------EGF------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVTIRTGMTVTLYELWK
        K   I RIRSDHG++F+N         EG             N   ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT+R G   TLYE+WK
Subjt:  KGKKITRIRSDHGKDFDN---------EGF------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVTIRTGMTVTLYELWK

Query:  EREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PNMSEA----------
         R+P++K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   N+++A          
Subjt:  EREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PNMSEA----------

Query:  -STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFRR-------------
         S T    +++ D  S    + +  K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+R             
Subjt:  -STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFRR-------------

Query:  ----------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKH
                  N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEEVYV QPKGF D  HP H
Subjt:  ----------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKH

Query:  VYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKN
        VY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VGEL+YFLGLQ+KQ  
Subjt:  VYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKN

Query:  DGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTS
        D IF+SQ +YAK++VKKFG+E A +KRTPA  H+KL++D  G  V   LYRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL  VKRILKYV+GTS
Subjt:  DGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTS

Query:  DFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYYDNMSAI
        D+G+MY + + P LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +QD MTLY DNMSAI
Subjt:  DFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYYDNMSAI

Query:  DISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        +ISKNP+QHSRTKHIDIRHH+I +LV+DKVI L H+ +  Q  DIFTK LDAN FE LR  LG+CL
Subjt:  DISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

AAO73523.1 gag-pol polyprotein [Glycine max]1.5e-28957.73Show/hide
Query:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT
        GH+ ++G++K++   A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K+ T
Subjt:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT

Query:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT
         E+ K L L+LQREK   I RIRSDHG++F+N  F  F                      ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT
Subjt:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT

Query:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN
        +R G   TLYE+WK R+P++K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   N
Subjt:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN

Query:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR
        +++A           S T    +++ D  S    + +  K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+
Subjt:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR

Query:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV
        R                       N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEEVYV
Subjt:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV

Query:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG
         QPKGF D  HP HVY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VG
Subjt:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL
        EL+YFLGLQ+KQ  D IF+SQ +YAK++VKKFG+E A +KRTPA  H+KL++D  G  V  K YRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL
Subjt:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL

Query:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD
          VKRILKYV+GTSD+G+MY + ++  LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +
Subjt:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD

Query:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        QD MTLY DNMSAI+ISKNP+QHSRTKHIDIRHH+I +LV+DKVI L H+ +  Q  DIFTK LDAN FE LR  LG+C+
Subjt:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

AAO73527.1 gag-pol polyprotein [Glycine max]2.7e-29158.07Show/hide
Query:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT
        GH+ ++G++K+I   A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K++T
Subjt:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT

Query:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT
         E+ K L L+LQREK   I RIRSDHG++F+N  F  F                      ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT
Subjt:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT

Query:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN
        +R G   TLYE+WK R+P++K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   N
Subjt:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN

Query:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR
        +++A           S T    +++ D  S    + +  K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+
Subjt:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR

Query:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV
        R                       N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEEVYV
Subjt:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV

Query:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG
         QPKGF D  HP HVY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VG
Subjt:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL
        EL+YFLGLQ+KQ  D IF+SQ +YAK++VKKFG+E A +KRTPA  H+KL++D  G  V   LYRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL
Subjt:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL

Query:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD
          VKRILKYV+GTSD+G+MY + + P LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +
Subjt:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD

Query:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        QD MTLY DNMSAI+ISKNP+QHSRTKHIDIRHH+I +LV+DKVI L H+ +  Q  DIFTK LDAN FE LR  LG+CL
Subjt:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

AAO73529.1 gag-pol polyprotein [Glycine max]3.2e-29258.18Show/hide
Query:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT
        GH+ ++G++K+I   A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K+DT
Subjt:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT

Query:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT
         E+ K L L+LQREK   I RIRSDHG++F+N  F  F                      ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT
Subjt:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT

Query:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDETPNMS
        +R G   TLYE+WK R+P +K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   +
Subjt:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDETPNMS

Query:  EASTTSTVE-VSKADNPSDDPGKSL--------------KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR
         A T  + E    +D+ +D+P  +               K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+
Subjt:  EASTTSTVE-VSKADNPSDDPGKSL--------------KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR

Query:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV
        R                       N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEE YV
Subjt:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV

Query:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG
         QPKGFVD  HP HVY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VG
Subjt:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL
        EL+YFLGLQ+KQ  D IF+SQ KYAK++VKKFG+E A +KRTPA  H+KL++D  G  V   LYRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL
Subjt:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL

Query:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD
          VKRILKYV+GTSD+G+MY + +   LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +
Subjt:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD

Query:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        QD MTLY DNMSAI+ISKNP+QHSRTKHIDIRHH+I ELV+DKVI L+H+ +  Q  DIFTK LDA  FE LR  LG+CL
Subjt:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

MCH79363.1 gag-pol polyprotein [Trifolium medium]6.1e-29158.49Show/hide
Query:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTD
        LGH+ +KG++K I  EAI  +  L +     CG+CQI KQT+ +H  L+   T+RVLELLHMDLMGPMQ ES+ GKRY  V+VDD+SR+TW+ F K K+D
Subjt:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTD

Query:  TVEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGF---------------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRV
        + E+ KNLCL+LQREK   I RIRSDHGK+F+N  F                     N   ERKNRT+QE  RVM+HAK+LP   WAEA+NT C+IHNRV
Subjt:  TVEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGF---------------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRV

Query:  TIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSA--------IKQMND
        T+R+G + TLYELWK R+P +K+FHVFGS CYILAD+E R+K D +SE+GIFLGYS NSRAY+V N+R++ +ME+INVV++D  SA        +   +D
Subjt:  TIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSA--------IKQMND

Query:  --EEDETPNMSEASTTSTVEVSKADNPSDDPGKS-LKNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR---
          EE E  +  EAST+    V+K   PS    K+  K+ IIG  + G+ T+R       + + + C+ S IEP  V  AL DE+W+ AMQEEL QF+   
Subjt:  --EEDETPNMSEASTTSTVEVSKADNPSDDPGKS-LKNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR---

Query:  --------------------RNNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQP
                            RN +DE G VT+NKARLVAQGY+QVEG++FDETFAPVARLE+I+LL+G++CI +FKLYQ+DVKS FLN YL+EEVYV QP
Subjt:  --------------------RNNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQP

Query:  KGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELS
        KGF+D  +P HVYKL K+LYGLKQAPRAWYERLT++L  + Y +G   KTLF+  K+  L++AQIYVD I+FGG   ++V +F+  MQSEFEMS+VGEL+
Subjt:  KGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELS

Query:  YFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVV
        YFLGLQ+KQ  D IF+SQ KYAK++VKKFG+E A  KRTPAA H+KLTRD  G  V   +Y+S++GSLLYLTASRPDI + VG+CAR+QA+P +SHL  V
Subjt:  YFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVV

Query:  KRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDT
        KRILKY++GTSD+G++Y       LVGYCDADW GSADDRKSTS GCFFLGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML +Y   QD 
Subjt:  KRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDT

Query:  MTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        MTL+ DN+SAI+ISKNPIQHSRTKHIDIRHHFI +LVE+ V+ L+H+S++ Q  DIFTK LDAN +E LR  LG+CL
Subjt:  MTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

TrEMBL top hitse value%identityAlignment
A0A392LWM0 Gag-pol polyprotein (Fragment)2.9e-29158.49Show/hide
Query:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTD
        LGH+ +KG++K I  EAI  +  L +     CG+CQI KQT+ +H  L+   T+RVLELLHMDLMGPMQ ES+ GKRY  V+VDD+SR+TW+ F K K+D
Subjt:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTD

Query:  TVEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGF---------------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRV
        + E+ KNLCL+LQREK   I RIRSDHGK+F+N  F                     N   ERKNRT+QE  RVM+HAK+LP   WAEA+NT C+IHNRV
Subjt:  TVEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGF---------------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRV

Query:  TIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSA--------IKQMND
        T+R+G + TLYELWK R+P +K+FHVFGS CYILAD+E R+K D +SE+GIFLGYS NSRAY+V N+R++ +ME+INVV++D  SA        +   +D
Subjt:  TIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSA--------IKQMND

Query:  --EEDETPNMSEASTTSTVEVSKADNPSDDPGKS-LKNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR---
          EE E  +  EAST+    V+K   PS    K+  K+ IIG  + G+ T+R       + + + C+ S IEP  V  AL DE+W+ AMQEEL QF+   
Subjt:  --EEDETPNMSEASTTSTVEVSKADNPSDDPGKS-LKNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR---

Query:  --------------------RNNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQP
                            RN +DE G VT+NKARLVAQGY+QVEG++FDETFAPVARLE+I+LL+G++CI +FKLYQ+DVKS FLN YL+EEVYV QP
Subjt:  --------------------RNNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQP

Query:  KGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELS
        KGF+D  +P HVYKL K+LYGLKQAPRAWYERLT++L  + Y +G   KTLF+  K+  L++AQIYVD I+FGG   ++V +F+  MQSEFEMS+VGEL+
Subjt:  KGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELS

Query:  YFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVV
        YFLGLQ+KQ  D IF+SQ KYAK++VKKFG+E A  KRTPAA H+KLTRD  G  V   +Y+S++GSLLYLTASRPDI + VG+CAR+QA+P +SHL  V
Subjt:  YFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVV

Query:  KRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDT
        KRILKY++GTSD+G++Y       LVGYCDADW GSADDRKSTS GCFFLGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML +Y   QD 
Subjt:  KRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDT

Query:  MTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        MTL+ DN+SAI+ISKNPIQHSRTKHIDIRHHFI +LVE+ V+ L+H+S++ Q  DIFTK LDAN +E LR  LG+CL
Subjt:  MTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

Q84VH6 Gag-pol polyprotein1.6e-29258.18Show/hide
Query:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT
        GH+ ++G++K+I   A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K+DT
Subjt:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT

Query:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT
         E+ K L L+LQREK   I RIRSDHG++F+N  F  F                      ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT
Subjt:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT

Query:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDETPNMS
        +R G   TLYE+WK R+P +K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   +
Subjt:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDETPNMS

Query:  EASTTSTVE-VSKADNPSDDPGKSL--------------KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR
         A T  + E    +D+ +D+P  +               K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+
Subjt:  EASTTSTVE-VSKADNPSDDPGKSL--------------KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR

Query:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV
        R                       N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEE YV
Subjt:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV

Query:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG
         QPKGFVD  HP HVY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VG
Subjt:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL
        EL+YFLGLQ+KQ  D IF+SQ KYAK++VKKFG+E A +KRTPA  H+KL++D  G  V   LYRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL
Subjt:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL

Query:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD
          VKRILKYV+GTSD+G+MY + +   LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +
Subjt:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD

Query:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        QD MTLY DNMSAI+ISKNP+QHSRTKHIDIRHH+I ELV+DKVI L+H+ +  Q  DIFTK LDA  FE LR  LG+CL
Subjt:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

Q84VH8 Gag-pol polyprotein1.3e-29158.07Show/hide
Query:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT
        GH+ ++G++K+I   A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K++T
Subjt:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT

Query:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT
         E+ K L L+LQREK   I RIRSDHG++F+N  F  F                      ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT
Subjt:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT

Query:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN
        +R G   TLYE+WK R+P++K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   N
Subjt:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN

Query:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR
        +++A           S T    +++ D  S    + +  K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+
Subjt:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR

Query:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV
        R                       N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEEVYV
Subjt:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV

Query:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG
         QPKGF D  HP HVY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VG
Subjt:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL
        EL+YFLGLQ+KQ  D IF+SQ +YAK++VKKFG+E A +KRTPA  H+KL++D  G  V   LYRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL
Subjt:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL

Query:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD
          VKRILKYV+GTSD+G+MY + + P LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +
Subjt:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD

Query:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        QD MTLY DNMSAI+ISKNP+QHSRTKHIDIRHH+I +LV+DKVI L H+ +  Q  DIFTK LDAN FE LR  LG+CL
Subjt:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

Q84VI2 Gag-pol polyprotein7.3e-29057.73Show/hide
Query:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT
        GH+ ++G++K++   A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K+ T
Subjt:  GHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDT

Query:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT
         E+ K L L+LQREK   I RIRSDHG++F+N  F  F                      ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT
Subjt:  VEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFS---------------------ERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVT

Query:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN
        +R G   TLYE+WK R+P++K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   N
Subjt:  IRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PN

Query:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR
        +++A           S T    +++ D  S    + +  K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+
Subjt:  MSEA-----------STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFR

Query:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV
        R                       N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEEVYV
Subjt:  R-----------------------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYV

Query:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG
         QPKGF D  HP HVY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VG
Subjt:  AQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL
        EL+YFLGLQ+KQ  D IF+SQ +YAK++VKKFG+E A +KRTPA  H+KL++D  G  V  K YRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL
Subjt:  ELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHL

Query:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD
          VKRILKYV+GTSD+G+MY + ++  LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +
Subjt:  EVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFD

Query:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        QD MTLY DNMSAI+ISKNP+QHSRTKHIDIRHH+I +LV+DKVI L H+ +  Q  DIFTK LDAN FE LR  LG+C+
Subjt:  QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

Q84VI4 Gag-pol polyprotein1.3e-29148.11Show/hide
Query:  ISSLKLKLREVQNENDQILKSVKMLNSGTKNLDSILKSGHNGSHRYGLGF----------------------VASASRAMHN-------------LALSG
        IS LK ++  + ++ + + KS+KMLN G+  LD +L  G N  ++ GLGF                        S  R+ H+                 G
Subjt:  ISSLKLKLREVQNENDQILKSVKMLNSGTKNLDSILKSGHNGSHRYGLGF----------------------VASASRAMHN-------------LALSG

Query:  ELNLLKDVRLPLHPF--------RPQMM---------------------------------------------HCVTGHVTFGDGTKGKIIAKGNIDKNN
        +   +K     LHP         R +MM                                              C T +VTFGDG+KGKII  G +  + 
Subjt:  ELNLLKDVRLPLHPF--------RPQMM---------------------------------------------HCVTGHVTFGDGTKGKIIAKGNIDKNN

Query:  LPRLNDL-------------------------------------------------------------------------------GHVSMKGLEKVIKN
        LP LN +                                                                               GH+ ++G++K+I  
Subjt:  LPRLNDL-------------------------------------------------------------------------------GHVSMKGLEKVIKN

Query:  EAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDTVEICKNLCLKLQRE
         A+  I +L +     CG+CQI KQ + +H+ L+   T+RVLELLHMDLMGPMQ ESL GKRY  VVVDD+SR+TWV F + K++T E+ K L L+LQRE
Subjt:  EAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTDTVEICKNLCLKLQRE

Query:  KGKKITRIRSDHGKDFDN---------EGF------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVTIRTGMTVTLYELWK
        K   I RIRSDHG++F+N         EG             N   ERKNRTLQE  RVM+HAK LP   WAEA+NT C+IHNRVT+R G   TLYE+WK
Subjt:  KGKKITRIRSDHGKDFDN---------EGF------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVTIRTGMTVTLYELWK

Query:  EREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PNMSEA----------
         R+P++K+FH+FGS CYILAD+E R+K D +S+ GIFLGYS NSRAY+VFN+R+ +VME+INVV++DL  A K+  +E+  T   N+++A          
Subjt:  EREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET--PNMSEA----------

Query:  -STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFRR-------------
         S T    +++ D  S    + +  K  IIG  + G+ T+ +E    +++V + C+ S IEP  V  AL DE+W+NAMQEEL QF+R             
Subjt:  -STTSTVEVSKADNPSDDPGKSL--KNSIIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFRR-------------

Query:  ----------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKH
                  N T+E G +T+NKARLVAQGY Q+EG++FDETFAPVARLE+I+LLLG++CI KFKLYQ+DVKS FLN YLNEEVYV QPKGF D  HP H
Subjt:  ----------NNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKH

Query:  VYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKN
        VY+L K+LYGLKQAPRAWYERLT +L  + Y +G I KTLF+ + ++ L++AQIYVD I+FGG   +++ +F+  MQSEFEMS+VGEL+YFLGLQ+KQ  
Subjt:  VYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKN

Query:  DGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTS
        D IF+SQ +YAK++VKKFG+E A +KRTPA  H+KL++D  G  V   LYRS++GSLLYLTASRPDI Y VG+CAR+QA+P ISHL  VKRILKYV+GTS
Subjt:  DGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTS

Query:  DFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYYDNMSAI
        D+G+MY + + P LVGYCDADW GSADDRKSTS GCF+LGNNLISW SKKQNCVSLSTAEAEYIA GS C+QL+WMK ML EY  +QD MTLY DNMSAI
Subjt:  DFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYYDNMSAI

Query:  DISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL
        +ISKNP+QHSRTKHIDIRHH+I +LV+DKVI L H+ +  Q  DIFTK LDAN FE LR  LG+CL
Subjt:  DISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-9327.27Show/hide
Query:  KNNLPRLND-LGHVSMKGL-----EKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKE-CYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVD
        KNN    ++  GH+S   L     + +  ++++L  L+L       C  C   KQ R   K LK+  +  R L ++H D+ GP+   +L  K Y ++ VD
Subjt:  KNNLPRLND-LGHVSMKGL-----EKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKE-CYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVD

Query:  DYSRYTWVCFPKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKDF-DNE--------------------GFNSFSERKNRTLQEMTRVMIHAKNLPLC
         ++ Y      K K+D   + ++   K +     K+  +  D+G+++  NE                      N  SER  RT+ E  R M+    L   
Subjt:  DYSRYTWVCFPKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKDF-DNE--------------------GFNSFSERKNRTLQEMTRVMIHAKNLPLC

Query:  FWAEAVNTTCHIHNRVTIR--TGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVIND
        FW EAV T  ++ NR+  R     + T YE+W  ++P +K+  VFG+T Y+   K  + K+D +S + IF+GY  N   +K+++  +E  +   +VV+++
Subjt:  FWAEAVNTTCHIHNRVTIR--TGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVIND

Query:  LDSA-----------IKQMNDEED-------------ETPNMS-EASTTSTVEVSKADNPSDDPGKS--------------------LKNSIIGGSSAGM
         +             +K   + E+             E PN S E      ++ SK     + P  S                    LK+S         
Subjt:  LDSA-----------IKQMNDEED-------------ETPNMS-EASTTSTVEVSKADNPSDDPGKS--------------------LKNSIIGGSSAGM

Query:  QTKRKEKIDY---------------------------------------------MKMVVDLCYNS--------TIEPSTVHSALKDEY-----------
        ++K++++ D+                                             +K    + YN          +   T+ + + + +           
Subjt:  QTKRKEKIDY---------------------------------------------MKMVVDLCYNS--------TIEPSTVHSALKDEY-----------

Query:  WLNAMQEELLQFRRNNT-----------------------DEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKS
        W  A+  EL   + NNT                       +E G   + KARLVA+G+ Q   I+++ETFAPVAR+ + + +L +      K++Q+DVK+
Subjt:  WLNAMQEELLQFRRNNT-----------------------DEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKS

Query:  VFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKS--DQLLVAQIYVDVIIFGGFPQDLVNN
         FLN  L EE+Y+  P+G   S +  +V KLNK++YGLKQA R W+E     LK  E+    + + ++I  K   ++ +   +YVD ++        +NN
Subjt:  VFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKS--DQLLVAQIYVDVIIFGGFPQDLVNN

Query:  FINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVK---LTRDTDGAEVGHKLYRSIVGSLLY-LTASRPDI
        F   +  +F M+ + E+ +F+G++I+ + D I++SQ  Y K ++ KF +E      TP  + +    L  D D     +   RS++G L+Y +  +RPD+
Subjt:  FINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVK---LTRDTDGAEVGHKLYRSIVGSLLY-LTASRPDI

Query:  AYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTT--PTLVGYCDADWTGSADDRKSTSEGCFFLGN-NLISWLSKKQNCVSLSTAEAEYI
           V I +R+ +       + +KR+L+Y+ GT D  +++         ++GY D+DW GS  DRKST+   F + + NLI W +K+QN V+ S+ EAEY+
Subjt:  AYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTT--PTLVGYCDADWTGSADDRKSTSEGCFFLGN-NLISWLSKKQNCVSLSTAEAEYI

Query:  ATGSGCTQLIWMKNMLHEYRFD-QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLG
        A      + +W+K +L       ++ + +Y DN   I I+ NP  H R KHIDI++HF  E V++ VI L++I +  Q  DIFTKPL A  F  LR  LG
Subjt:  ATGSGCTQLIWMKNMLHEYRFD-QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLG

Query:  V
        +
Subjt:  V

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-11931.08Show/hide
Query:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTD
        +GH+S KGL+ + K   I       V     C  C   KQ R + ++  E   N +L+L++ D+ GPM+ ES+ G +Y +  +DD SR  WV   K K  
Subjt:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKTD

Query:  TVEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGF---------------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRV
          ++ +     ++RE G+K+ R+RSD+G ++ +  F                     N  +ER NRT+ E  R M+    LP  FW EAV T C++ NR 
Subjt:  TVEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGF---------------------NSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRV

Query:  TIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET---
                    +W  +E +  +  VFG   +    KE R K D +S   IF+GY      Y++++   + V+ + +VV    +S ++   D  ++    
Subjt:  TIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDET---

Query:  --------PNMSEASTTSTVEVSKADNPSDDPGKSLKNS---IIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDE--------------
                P+ S   T++     +     + PG+ ++       G       T+ +E+   ++        S   PST +  + D+              
Subjt:  --------PNMSEASTTSTVEVSKADNPSDDPGKSLKNS---IIGGSSAGMQTKRKEKIDYMKMVVDLCYNSTIEPSTVHSALKDE--------------

Query:  --YWLNAMQEELLQFRRNNT----------------------DEAGC-VTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQID
            + AMQEE+   ++N T                       +  C + + KARLV +G+ Q +GI+FDE F+PV ++ +I+ +L ++     ++ Q+D
Subjt:  --YWLNAMQEELLQFRRNNT----------------------DEAGC-VTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQID

Query:  VKSVFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSD-QLLVAQIYVDVIIFGGFPQDLV
        VK+ FL+  L EE+Y+ QP+GF  +     V KLNKSLYGLKQAPR WY +   ++K + Y +      ++  R S+   ++  +YVD ++  G  + L+
Subjt:  VKSVFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSD-QLLVAQIYVDVIIFGGFPQDLV

Query:  NNFINIMQSEFEMSMVGELSYFLGLQIKQKNDG--IFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHK------LYRSIVGSLLY-L
              +   F+M  +G     LG++I ++     +++SQEKY + ++++F ++ A+   TP A H+KL++      V  K       Y S VGSL+Y +
Subjt:  NNFINIMQSEFEMSMVGELSYFLGLQIKQKNDG--IFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHK------LYRSIVGSLLY-L

Query:  TASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAE
          +RPDIA+ VG+ +RF  +P   H E VK IL+Y+ GT+    + F G+ P L GY DAD  G  D+RKS++   F      ISW SK Q CV+LST E
Subjt:  TASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAE

Query:  AEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRA
        AEYIA      ++IW+K  L E    Q    +Y D+ SAID+SKN + H+RTKHID+R+H+I E+V+D+ +K+  IS+N    D+ TK +  N FE  + 
Subjt:  AEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRA

Query:  DLGV
         +G+
Subjt:  DLGV

P92519 Uncharacterized mitochondrial protein AtMg008101.1e-3537.67Show/hide
Query:  IYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKL-YRS
        +YVD I+  G    L+N  I  + S F M  +G + YFLG+QIK    G+F+SQ KYA+ ++   G+   +   TP    +KL      A+      +RS
Subjt:  IYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKL-YRS

Query:  IVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQN
        IVG+L YLT +RPDI+Y V I  +   +P ++  +++KR+L+YV GT   G+     +   +  +CD+DW G    R+ST+  C FLG N+ISW +K+Q 
Subjt:  IVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQN

Query:  CVSLSTAEAEYIATGSGCTQLIW
         VS S+ E EY A      +L W
Subjt:  CVSLSTAEAEYIATGSGCTQLIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-8433.94Show/hide
Query:  NDLDSAIKQMNDEEDETPNMSEASTTSTVEVSKADNPSD---DPGKSLKNSIIGGSSAGMQTK---RKEKIDYMK------MVVDLCYNSTIEPSTVHSA
        N+  S + Q      ++ + S + TTS    S +  P      P   L   +   + A + T     + K   +K      + V L   S  EP T   A
Subjt:  NDLDSAIKQMNDEEDETPNMSEASTTSTVEVSKADNPSD---DPGKSLKNSIIGGSSAGMQTK---RKEKIDYMK------MVVDLCYNSTIEPSTVHSA

Query:  LKDEYWLNAMQEELLQFRRNNT------------------------DEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLY
        LKDE W NAM  E+     N+T                        +  G + + KARLVA+GYNQ  G+++ ETF+PV +  +I+++LG++  + + + 
Subjt:  LKDEYWLNAMQEELLQFRRNNT------------------------DEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLY

Query:  QIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQD
        Q+DV + FL   L ++VY++QP GF+D + P +V KL K+LYGLKQAPRAWY  L  YL    +       +LF+ ++   ++   +YVD I+  G    
Subjt:  QIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQD

Query:  LVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKL-----TRDTDGAEVGHKLYRSIVGSLLYLTA
        L++N ++ +   F +    EL YFLG++ K+   G+ +SQ +Y  D++ +  +  A+   TP A   KL     T+ TD  E     YR IVGSL YL  
Subjt:  LVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKL-----TRDTDGAEVGHKLYRSIVGSLLYLTA

Query:  SRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAE
        +RPDI+Y V   ++F   P   HL+ +KRIL+Y+ GT + G+    G T +L  Y DADW G  DD  ST+    +LG++ ISW SKKQ  V  S+ EAE
Subjt:  SRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAE

Query:  YIATGSGCTQLIWMKNMLHE--YRFDQDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRA
        Y +  +  +++ W+ ++L E   R  +  + +Y DN+ A  +  NP+ HSR KHI I +HFI   V+   +++ H+S++ Q  D  TKPL   +F+   +
Subjt:  YIATGSGCTQLIWMKNMLHE--YRFDQDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRA

Query:  DLGV
         +GV
Subjt:  DLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.4e-9026.74Show/hide
Query:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHK---SLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKG
        LGH S+  L  VI N + L +L+   ++   C DC I+K    +HK   S     +++ LE ++ D+       S+   RY ++ VD ++RYTW+   K 
Subjt:  LGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHK---SLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKG

Query:  KT---DTVEICKNLCLKLQREKGKKITRIRSDHGKDF-------------------DNEGFNSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHI
        K+   DT  I K+L   ++     +I  + SD+G +F                        N  SERK+R + EM   ++   ++P  +W  A +   ++
Subjt:  KT---DTVEICKNLCLKLQREKGKKITRIRSDHGKDF-------------------DNEGFNSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHI

Query:  HNRVTIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVIND-----------LDS
         NR+        + ++    + PN +   VFG  CY       R K + +S+Q  F+GYS    AY   +  +  +  + +V  ++           + +
Subjt:  HNRVTIRTGMTVTLYELWKEREPNIKYFHVFGSTCYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVIND-----------LDS

Query:  AIKQMNDEEDETPNMSEASTT---------------------------STVEVSKADNPSDD--------------------------------------
        + +Q +D     P+ +   TT                            T +VS ++ PS                                        
Subjt:  AIKQMNDEEDETPNMSEASTT---------------------------STVEVSKADNPSDD--------------------------------------

Query:  -----------------------------PGKSLKNSIIGGSSA---------------------------GMQTKRKEKI----DYMKMVVDLCYNSTI
                                     P  S+       SS+                            M T+ K+ I            L  NS  
Subjt:  -----------------------------PGKSLKNSIIGGSSA---------------------------GMQTKRKEKI----DYMKMVVDLCYNSTI

Query:  EPSTVHSALKDEYWLNAMQEELLQFRRNNT------------------------DEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGIS
        EP T   A+KD+ W  AM  E+     N+T                        +  G + + KARLVA+GYNQ  G+++ ETF+PV +  +I+++LG++
Subjt:  EPSTVHSALKDEYWLNAMQEELLQFRRNNT------------------------DEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGIS

Query:  CIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVI
          + + + Q+DV + FL   L +EVY++QP GFVD + P +V +L K++YGLKQAPRAWY  L  YL    +       +LF+ ++   ++   +YVD I
Subjt:  CIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVI

Query:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLY
        +  G    L+ + ++ +   F +    +L YFLG++ K+   G+ +SQ +Y  D++ +  +  A+   TP A   KLT  +         YR IVGSL Y
Subjt:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLY

Query:  LTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTA
        L  +RPD++Y V   +++   P   H   +KR+L+Y+ GT D G+    G T +L  Y DADW G  DD  ST+    +LG++ ISW SKKQ  V  S+ 
Subjt:  LTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTA

Query:  EAEYIATGSGCTQLIWMKNMLHEYRFD-QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYL
        EAEY +  +  ++L W+ ++L E          +Y DN+ A  +  NP+ HSR KHI + +HFI   V+   +++ H+S++ Q  D  TKPL   +F+  
Subjt:  EAEYIATGSGCTQLIWMKNMLHEYRFD-QDTMTLYYDNMSAIDISKNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYL

Query:  RADLGV
           +GV
Subjt:  RADLGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.5e-7435.11Show/hide
Query:  EKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFRRNNTDEA-----------------------GCVTKNKARLVAQGYNQVEGIEFDET
        EK+  +     +C     EPST + A +   W  AM +E+      +T E                        G + + KARLVA+GY Q EGI+F ET
Subjt:  EKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFRRNNTDEA-----------------------GCVTKNKARLVAQGYNQVEGIEFDET

Query:  FAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFV----DSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVK
        F+PV +L +++L+L IS I  F L+Q+D+ + FLN  L+EE+Y+  P G+     DS  P  V  L KS+YGLKQA R W+ + +V L G  + +     
Subjt:  FAPVARLEAIQLLLGISCIQKFKLYQIDVKSVFLNDYLNEEVYVAQPKGFV----DSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVK

Query:  TLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTR
        T F+   +   L   +YVD II        V+   + ++S F++  +G L YFLGL+I +   GI I Q KYA D++ + GL   +    P    V  + 
Subjt:  TLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTR

Query:  DTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFF
         + G  V  K YR ++G L+YL  +R DI++ V   ++F   P ++H + V +IL Y+ GT   G+ Y       L  + DA +    D R+ST+  C F
Subjt:  DTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFF

Query:  LGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYY-DNMSAIDISKNPIQHSRTKHIDIRHHFIHE
        LG +LISW SKKQ  VS S+AEAEY A      +++W+     E +      TL + DN +AI I+ N + H RTKHI+   H + E
Subjt:  LGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYY-DNMSAIDISKNPIQHSRTKHIDIRHHFIHE

ATMG00810.1 DNA/RNA polymerases superfamily protein7.5e-3737.67Show/hide
Query:  IYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKL-YRS
        +YVD I+  G    L+N  I  + S F M  +G + YFLG+QIK    G+F+SQ KYA+ ++   G+   +   TP    +KL      A+      +RS
Subjt:  IYVDVIIFGGFPQDLVNNFINIMQSEFEMSMVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKL-YRS

Query:  IVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQN
        IVG+L YLT +RPDI+Y V I  +   +P ++  +++KR+L+YV GT   G+     +   +  +CD+DW G    R+ST+  C FLG N+ISW +K+Q 
Subjt:  IVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRILKYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQN

Query:  CVSLSTAEAEYIATGSGCTQLIW
         VS S+ E EY A      +L W
Subjt:  CVSLSTAEAEYIATGSGCTQLIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-0935.2Show/hide
Query:  MQTKRKEKIDYMKMVVDLCYNSTI--EPSTVHSALKDEYWLNAMQEELLQFRRNNT-----------------------DEAGCVTKNKARLVAQGYNQV
        M T+ K  I+ +     L   +TI  EP +V  ALKD  W  AMQEEL    RN T                          G + + KARLVA+G++Q 
Subjt:  MQTKRKEKIDYMKMVVDLCYNSTI--EPSTVHSALKDEYWLNAMQEELLQFRRNNT-----------------------DEAGCVTKNKARLVAQGYNQV

Query:  EGIEFDETFAPVARLEAIQLLLGIS
        EGI F ET++PV R   I+ +L ++
Subjt:  EGIEFDETFAPVARLEAIQLLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTAATATCTTCTCTAAAGTTAAAACTAAGAGAAGTTCAGAATGAGAATGATCAGATTTTAAAATCCGTTAAAATGCTAAACTCAGGAACGAAGAATCTAGATTC
AATACTTAAGTCTGGACATAATGGTTCTCATAGATATGGGTTGGGATTTGTGGCCTCTGCAAGTAGAGCCATGCACAACCTCGCATTGTCTGGAGAATTAAATCTGCTGA
AAGATGTAAGATTGCCTTTACATCCGTTCAGACCGCAGATGATGCACTGTGTCACCGGACATGTTACCTTTGGTGATGGTACAAAAGGAAAAATTATAGCTAAAGGTAAC
ATAGACAAAAATAACCTACCACGTCTAAACGATCTGGGGCATGTCAGTATGAAAGGCTTGGAAAAAGTCATTAAAAATGAAGCAATTTTGGAAATTCTTGATTTAGACGT
AAATAGAAAATTCTTCTGTGGAGACTGTCAAATTAGCAAGCAGACAAGGTCTACTCATAAAAGTCTGAAAGAATGTTATACAAATAGAGTTTTGGAATTGTTACATATGG
ATCTCATGGGTCCAATGCAAACAGAAAGTCTAAGAGGAAAGAGGTATATGCTGGTTGTAGTTGATGATTACTCAAGATATACTTGGGTTTGCTTTCCCAAAGGAAAAACA
GATACTGTTGAAATATGCAAAAATTTGTGTTTGAAGCTACAACGTGAAAAAGGGAAGAAGATAACGAGGATCCGAAGTGATCATGGTAAAGATTTTGATAATGAAGGCTT
TAACAGTTTTTCAGAAAGAAAGAACAGGACGTTACAAGAAATGACACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTACCT
GTCACATTCATAATAGGGTAACTATTAGAACTGGAATGACTGTTACTCTTTATGAACTTTGGAAAGAGAGAGAGCCAAATATTAAATACTTCCATGTGTTTGGAAGTACA
TGTTATATCTTAGCTGACAAGGAATACCGTCAGAAATGGGATGCAAGGTCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGAGCCTATAAAGTCTTCAATAA
CAGATCTGAAAGTGTTATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAGCTA
GCACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCTGATGATCCAGGCAAAAGTTTGAAAAACTCTATTATTGGTGGTTCGTCAGCTGGGATGCAAACCAAA
AGGAAAGAAAAGATTGATTATATGAAGATGGTTGTTGATTTATGTTATAATTCCACCATTGAACCTTCTACTGTTCACTCTGCTCTCAAGGATGAGTATTGGCTAAATGC
TATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATACTGATGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGTATAATCAAGTTGAAGGTA
TTGAATTTGATGAAACGTTTGCTCCTGTTGCTCGACTTGAAGCCATTCAATTATTACTTGGTATATCATGCATACAGAAATTTAAATTGTATCAGATAGATGTAAAGAGT
GTCTTCTTAAATGATTATTTGAATGAGGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCTGAGCATCCGAAGCATGTGTATAAGCTCAACAAATCTTTATATGG
GCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTGTTTACTTGAAAGGTAAAGAATATTCCAGAGGAGAAATTGTCAAGACCTTGTTTATACACAGGAAATCTG
ATCAACTTTTGGTTGCTCAAATTTATGTTGATGTCATCATTTTTGGAGGTTTTCCTCAAGATCTAGTAAATAATTTCATTAACATCATGCAGTCAGAATTCGAAATGAGC
ATGGTTGGAGAACTTTCATACTTTTTGGGTCTTCAAATTAAGCAAAAGAATGATGGCATCTTCATATCTCAGGAAAAGTATGCCAAGGATATGGTTAAAAAGTTTGGTTT
GGAACAGGCTCGAAATAAGCGGACTCCAGCTGCGGCACATGTTAAACTTACAAGAGACACTGATGGTGCTGAAGTTGGTCACAAACTCTACAGGAGTATAGTAGGCAGCT
TATTATATTTAACAGCAAGTCGACCTGACATAGCTTATGTTGTGGGAATATGTGCTCGTTTTCAAGCGGATCCCTGCATCTCTCACTTAGAAGTTGTTAAACGAATTCTT
AAGTATGTTCATGGGACCAGTGACTTTGGAATGATGTATTTCTATGGTACCACCCCCACTCTTGTTGGATATTGTGATGCTGACTGGACAGGTTCAGCTGATGATCGTAA
AAGTACGTCTGAAGGATGTTTCTTTTTAGGAAACAACTTAATTTCTTGGTTAAGTAAGAAGCAAAACTGTGTCTCTTTATCTACAGCTGAAGCGGAATATATAGCAACAG
GTAGTGGTTGCACACAGTTGATTTGGATGAAAAATATGTTGCATGAATATCGCTTTGATCAAGACACTATGACGTTGTATTATGACAATATGAGCGCAATTGATATATCG
AAGAATCCTATTCAACATAGTCGAACAAAGCACATTGACATAAGACATCACTTTATTCATGAACTTGTTGAAGATAAAGTAATTAAGCTTGATCATATTTCTTCCAACTT
ACAATCAGTCGATATTTTCACTAAACCTCTGGATGCGAACTCATTTGAATACTTACGTGCTGATTTAGGAGTGTGTCTTACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGTAATATCTTCTCTAAAGTTAAAACTAAGAGAAGTTCAGAATGAGAATGATCAGATTTTAAAATCCGTTAAAATGCTAAACTCAGGAACGAAGAATCTAGATTC
AATACTTAAGTCTGGACATAATGGTTCTCATAGATATGGGTTGGGATTTGTGGCCTCTGCAAGTAGAGCCATGCACAACCTCGCATTGTCTGGAGAATTAAATCTGCTGA
AAGATGTAAGATTGCCTTTACATCCGTTCAGACCGCAGATGATGCACTGTGTCACCGGACATGTTACCTTTGGTGATGGTACAAAAGGAAAAATTATAGCTAAAGGTAAC
ATAGACAAAAATAACCTACCACGTCTAAACGATCTGGGGCATGTCAGTATGAAAGGCTTGGAAAAAGTCATTAAAAATGAAGCAATTTTGGAAATTCTTGATTTAGACGT
AAATAGAAAATTCTTCTGTGGAGACTGTCAAATTAGCAAGCAGACAAGGTCTACTCATAAAAGTCTGAAAGAATGTTATACAAATAGAGTTTTGGAATTGTTACATATGG
ATCTCATGGGTCCAATGCAAACAGAAAGTCTAAGAGGAAAGAGGTATATGCTGGTTGTAGTTGATGATTACTCAAGATATACTTGGGTTTGCTTTCCCAAAGGAAAAACA
GATACTGTTGAAATATGCAAAAATTTGTGTTTGAAGCTACAACGTGAAAAAGGGAAGAAGATAACGAGGATCCGAAGTGATCATGGTAAAGATTTTGATAATGAAGGCTT
TAACAGTTTTTCAGAAAGAAAGAACAGGACGTTACAAGAAATGACACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTACCT
GTCACATTCATAATAGGGTAACTATTAGAACTGGAATGACTGTTACTCTTTATGAACTTTGGAAAGAGAGAGAGCCAAATATTAAATACTTCCATGTGTTTGGAAGTACA
TGTTATATCTTAGCTGACAAGGAATACCGTCAGAAATGGGATGCAAGGTCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGAGCCTATAAAGTCTTCAATAA
CAGATCTGAAAGTGTTATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAGCTA
GCACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCTGATGATCCAGGCAAAAGTTTGAAAAACTCTATTATTGGTGGTTCGTCAGCTGGGATGCAAACCAAA
AGGAAAGAAAAGATTGATTATATGAAGATGGTTGTTGATTTATGTTATAATTCCACCATTGAACCTTCTACTGTTCACTCTGCTCTCAAGGATGAGTATTGGCTAAATGC
TATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATACTGATGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGTATAATCAAGTTGAAGGTA
TTGAATTTGATGAAACGTTTGCTCCTGTTGCTCGACTTGAAGCCATTCAATTATTACTTGGTATATCATGCATACAGAAATTTAAATTGTATCAGATAGATGTAAAGAGT
GTCTTCTTAAATGATTATTTGAATGAGGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCTGAGCATCCGAAGCATGTGTATAAGCTCAACAAATCTTTATATGG
GCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTGTTTACTTGAAAGGTAAAGAATATTCCAGAGGAGAAATTGTCAAGACCTTGTTTATACACAGGAAATCTG
ATCAACTTTTGGTTGCTCAAATTTATGTTGATGTCATCATTTTTGGAGGTTTTCCTCAAGATCTAGTAAATAATTTCATTAACATCATGCAGTCAGAATTCGAAATGAGC
ATGGTTGGAGAACTTTCATACTTTTTGGGTCTTCAAATTAAGCAAAAGAATGATGGCATCTTCATATCTCAGGAAAAGTATGCCAAGGATATGGTTAAAAAGTTTGGTTT
GGAACAGGCTCGAAATAAGCGGACTCCAGCTGCGGCACATGTTAAACTTACAAGAGACACTGATGGTGCTGAAGTTGGTCACAAACTCTACAGGAGTATAGTAGGCAGCT
TATTATATTTAACAGCAAGTCGACCTGACATAGCTTATGTTGTGGGAATATGTGCTCGTTTTCAAGCGGATCCCTGCATCTCTCACTTAGAAGTTGTTAAACGAATTCTT
AAGTATGTTCATGGGACCAGTGACTTTGGAATGATGTATTTCTATGGTACCACCCCCACTCTTGTTGGATATTGTGATGCTGACTGGACAGGTTCAGCTGATGATCGTAA
AAGTACGTCTGAAGGATGTTTCTTTTTAGGAAACAACTTAATTTCTTGGTTAAGTAAGAAGCAAAACTGTGTCTCTTTATCTACAGCTGAAGCGGAATATATAGCAACAG
GTAGTGGTTGCACACAGTTGATTTGGATGAAAAATATGTTGCATGAATATCGCTTTGATCAAGACACTATGACGTTGTATTATGACAATATGAGCGCAATTGATATATCG
AAGAATCCTATTCAACATAGTCGAACAAAGCACATTGACATAAGACATCACTTTATTCATGAACTTGTTGAAGATAAAGTAATTAAGCTTGATCATATTTCTTCCAACTT
ACAATCAGTCGATATTTTCACTAAACCTCTGGATGCGAACTCATTTGAATACTTACGTGCTGATTTAGGAGTGTGTCTTACTTAA
Protein sequenceShow/hide protein sequence
MSVISSLKLKLREVQNENDQILKSVKMLNSGTKNLDSILKSGHNGSHRYGLGFVASASRAMHNLALSGELNLLKDVRLPLHPFRPQMMHCVTGHVTFGDGTKGKIIAKGN
IDKNNLPRLNDLGHVSMKGLEKVIKNEAILEILDLDVNRKFFCGDCQISKQTRSTHKSLKECYTNRVLELLHMDLMGPMQTESLRGKRYMLVVVDDYSRYTWVCFPKGKT
DTVEICKNLCLKLQREKGKKITRIRSDHGKDFDNEGFNSFSERKNRTLQEMTRVMIHAKNLPLCFWAEAVNTTCHIHNRVTIRTGMTVTLYELWKEREPNIKYFHVFGST
CYILADKEYRQKWDARSEQGIFLGYSQNSRAYKVFNNRSESVMETINVVINDLDSAIKQMNDEEDETPNMSEASTTSTVEVSKADNPSDDPGKSLKNSIIGGSSAGMQTK
RKEKIDYMKMVVDLCYNSTIEPSTVHSALKDEYWLNAMQEELLQFRRNNTDEAGCVTKNKARLVAQGYNQVEGIEFDETFAPVARLEAIQLLLGISCIQKFKLYQIDVKS
VFLNDYLNEEVYVAQPKGFVDSEHPKHVYKLNKSLYGLKQAPRAWYERLTVYLKGKEYSRGEIVKTLFIHRKSDQLLVAQIYVDVIIFGGFPQDLVNNFINIMQSEFEMS
MVGELSYFLGLQIKQKNDGIFISQEKYAKDMVKKFGLEQARNKRTPAAAHVKLTRDTDGAEVGHKLYRSIVGSLLYLTASRPDIAYVVGICARFQADPCISHLEVVKRIL
KYVHGTSDFGMMYFYGTTPTLVGYCDADWTGSADDRKSTSEGCFFLGNNLISWLSKKQNCVSLSTAEAEYIATGSGCTQLIWMKNMLHEYRFDQDTMTLYYDNMSAIDIS
KNPIQHSRTKHIDIRHHFIHELVEDKVIKLDHISSNLQSVDIFTKPLDANSFEYLRADLGVCLT