; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0003779 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0003779
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr03:20756519..20763813
RNA-Seq ExpressionPay0003779
SyntenyPay0003779
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0050896 - response to stimulus (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052755.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0073.84Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVN+NFLV FILNSLPSEYGP
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE
        FHMNYNTLKDKWN                                      NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE
Subjt:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE

Query:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----
        NK +                                                  +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Subjt:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----

Query:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS
         YTMPGTPQQN VAER+                          +RT                                                + +   
Subjt:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS

Query:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------
         VEIPSSITSSQ+VVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ         
Subjt:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------

Query:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT
         LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Subjt:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT

Query:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
        AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
Subjt:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK

Query:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR
        EFLSKNFEMKDM EASYVIGIEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQMETI YASIVGSLLYAQTCTR
Subjt:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR

Query:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV
         DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQG KDYMLTYKRSDHLEVI YSDS+FAGCVDTRKSTFGYLFLLAEGAISWK     ++  STMEAEFV
Subjt:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV

Query:  ACFEAT
         CFEAT
Subjt:  ACFEAT

RVW32004.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.4e-29651.74Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MFMRMT+ANNIK+++  TE A EF+KSVE+  + + ADKSLAGTLM+ LT +K+DG + I +HIL MT  AA+LK +GM ++++FLV F+LNSLPS++ P
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN----NGKGNHGQLKVKQSS-----APIH----KKGQIKD------------------------KCRFCNKPGHYQKDCLKRKAWFEN
        F ++YNT  D+WN      K    +++++Q       A  H    KKG+ K                          C FC K GH +KDC+KRKAWFE 
Subjt:  FHMNYNTLKDKWN----NGKGNHGQLKVKQSS-----APIH----KKGQIKD------------------------KCRFCNKPGHYQKDCLKRKAWFEN

Query:  KGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR-------------------------------------------
        +G + + VC+ESNL EVP NTWWIDSG T HV+N MQGFLTTR    +E+F++MGNR                                           
Subjt:  KGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR-------------------------------------------

Query:  -VKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM----------------------------------
         +KI++SDRGGEYYG+       PGPFAKFLE HGI AQYTMPGTPQQNGVAERRNRTLM MVRSM                                  
Subjt:  -VKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM----------------------------------

Query:  --------------------------------------------------------------------------------------------VEIPSSIT
                                                                                                    V+IP    
Subjt:  --------------------------------------------------------------------------------------------VEIPSSIT

Query:  SSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ----------LDAMKEEL
          +++VP  V  V + ++   +G  P  +I   E V E PQ   LRRS R RR AI+DDY+VYL ES++D+ I  DPVSFSQ          ++AM EEL
Subjt:  SSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ----------LDAMKEEL

Query:  KSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDE
        KSM  N VWDL+ELP   K VGCKWVFKTKRD+ GNIER+KARLVAKG+           FSPVSKKDSLRIIMALVAH+DLELHQMDVKTAFLNGN  +
Subjt:  KSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDE

Query:  EVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEM
            +  +G   +  +H+VCKLK+SIYGLKQASRQWY+KFN+TITSFGFKENIVD+CIYLK+SGSKFI L+LYVDDILLA++D GLL +TKE+LSKNF M
Subjt:  EVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEM

Query:  KDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGM
         DMGEA+YVIGIEIFRDR+ G+LGLSQK YI++VLE+F M  CSS + PI KGDK S MQCP+N +ER QM+ IPYAS VGSL+YAQTCTRPDISFA+GM
Subjt:  KDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGM

Query:  LGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHG
        LGRYQS+PG +HWKAAKKV+RYLQGTKDYMLTYKRS+ LEV+GYSDS++ GC+D+ KST G++F+LA GAISWK     +   STMEAEFVACFEA+ H 
Subjt:  LGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHG

Query:  LWLRNFISGLGIADSIAKPLRIYCDNSSA-------------KHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERMDI
        LWLRNFISGLG+ DSIAKPLRIYCDN++A             KHM+LKY  +KE VQK++VS+E+I T LM+ADPLTKGLPPK + +HV RM +
Subjt:  LWLRNFISGLGIADSIAKPLRIYCDNSSA-------------KHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERMDI

RZB61294.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]3.4e-30161.76Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MFMRMTVA++IK+T+  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEY P
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW
        F M+YNT+KDKWN                 +G+H                         G LK+K     I KK    + C FC K GH+QKDC KRK+W
Subjt:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW

Query:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQ
        FE KG+ NAL                              GFLT +T +PNE+F+FMGNRVK      G        G     LE+  +           
Subjt:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQ

Query:  QNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI
                +R L+++  S ++I     +        V + N+ +E Q N +     ++ NEP+ E PQE+ LR+S R RR AIS+DY+VYLHE E +LSI
Subjt:  QNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI

Query:  -DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRI
         DNDPVSFSQ          L+AMKEE+ SM  N VWDLVELPK  KRVGCKWVFKTKRDS+GN+ERYKARLVAKG+           FSPVS+KDS RI
Subjt:  -DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRI

Query:  IMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVL
        IMALV HYDLELHQMDVKTAFLNG+L+E+V+MDQP GF VEGKEHMVCKLK+SIYGLKQASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVL
Subjt:  IMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVL

Query:  YVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQME
        YVDDIL+ATND GLL +TK+FLS NFEMKDMGEA+YVIGIEIFR+R+ GLLGLSQK YINKVLE+F+M+KCS+S VPIQK DKFSL QCPKN+LER QME
Subjt:  YVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQME

Query:  TIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAIS
         I YAS+VGS++YAQTCTRPDISFA GMLGRYQSNPGM+HWKAAKKVLRYLQGTKD+MLTYKRSDHLEVIGYSDS+FAGCVDTRKST G++FLLA GAIS
Subjt:  TIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAIS

Query:  WKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMI
        WK     VV  STMEAEFVACFEAT+   WLRNFISGLGI DSIA+PL++YCDNS+             AKHMELKYF +KEEVQK+RVS+EHISTKLMI
Subjt:  WKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMI

Query:  ADPLTKGLPPKMFNDHVE
        ADPLTKGLPPK F +HVE
Subjt:  ADPLTKGLPPKMFNDHVE

RZC25410.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]0.0e+0050.12Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEYGP
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW
        F M+YNT+KDKWN                 +G+H                         G LK+K     I KK    + C FC K GH+QKDC KRK+W
Subjt:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW

Query:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR----------------------------------------
        FE KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTMQGFLT +T +PNE+F+FMGNR                                        
Subjt:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------VKILRSDRGGEYY-
                                                                                              VKI+RSDRGGEYY 
Subjt:  --------------------------------------------------------------------------------------VKILRSDRGGEYY-

Query:  -----GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM-------------------------------------------------
             G+ P PFAK L+  GICAQYTMPGTPQQNGV+ERRN+TLM+MVRSM                                                 
Subjt:  -----GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM-------------------------------------------------

Query:  -----------------------------------------------------------------------------VEIPSSITSSQVVVPVVVDSVNN
                                                                                     V++P +  SS  V+   V + N+
Subjt:  -----------------------------------------------------------------------------VEIPSSITSSQVVVPVVVDSVNN

Query:  PQEQQINGQTPHND--IVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLV
         +E Q      HND  ++ NEP+ E PQE+ LR+S R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQ          L+AMKEE+ SM  N+VWDLV
Subjt:  PQEQQINGQTPHND--IVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLV

Query:  ELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV
        ELPK  KRVG KWVFKTKRDS+GN+ERYKARLVAKG+           FSPVS+KDS RIIMALVAHYDLELHQMDVKTAFLNG+L+E+V+MDQP GF V
Subjt:  ELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV

Query:  EGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGI
        EGKEHMVCKLK+SIYGLKQASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVLYVDDILLATND GL  +TK+FLS NFEMKDMGEASYVIGI
Subjt:  EGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGI

Query:  EIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDH
        EIFR+R+ GLLGLSQKAYINKVLE+F+M+KCS+S VPIQKGDKFSL QCPKN+LER QME IPYAS+VGS++YAQTCTRPDISFA GMLGRYQSNPGM+H
Subjt:  EIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDH

Query:  WKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGI
        WKAAKKVLRYLQGTKD++LTYKRSDHLEVIGYSDS+FAGCVDTRKST G++FLLA GAISWK     VV  STMEA FVACFEAT+   WLRNFISGLGI
Subjt:  WKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGI

Query:  ADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVER
         DSIA+PL++YCDNS+             AKHMELKYFA+KEEVQK+RVS+EHI+TKLMIADPLTKGLPPK F ++VE+
Subjt:  ADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVER

TYK04201.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0073.84Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVN+NFLVTFILNSLPSEYGP
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE
        FHMNYNTLKDKWN                                      NGKGNHGQLKVKQSSAPIHKKG+IKDKCRFCNKPGHYQKDCLKRKAWFE
Subjt:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE

Query:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----
        NK +                                                  +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Subjt:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----

Query:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS
         YTMPGTPQQN VAER+                          +RT                                                + +   
Subjt:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS

Query:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------
         VEIPSSITSSQ+VVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ         
Subjt:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------

Query:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT
         LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Subjt:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT

Query:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
        AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
Subjt:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK

Query:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR
        EFLSKNFEMKDM EASYVIGIEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQMETI YASIVGSLLYAQTCTR
Subjt:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR

Query:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV
         DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQG KDYMLTYKRSDHLEVI YSDS+FAGCVDTRKSTFGYLFLLAEGAISWK     ++  STMEAEFV
Subjt:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV

Query:  ACFEAT
         CFEAT
Subjt:  ACFEAT

TrEMBL top hitse value%identityAlignment
A0A438D994 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-29651.74Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MFMRMT+ANNIK+++  TE A EF+KSVE+  + + ADKSLAGTLM+ LT +K+DG + I +HIL MT  AA+LK +GM ++++FLV F+LNSLPS++ P
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN----NGKGNHGQLKVKQSS-----APIH----KKGQIKD------------------------KCRFCNKPGHYQKDCLKRKAWFEN
        F ++YNT  D+WN      K    +++++Q       A  H    KKG+ K                          C FC K GH +KDC+KRKAWFE 
Subjt:  FHMNYNTLKDKWN----NGKGNHGQLKVKQSS-----APIH----KKGQIKD------------------------KCRFCNKPGHYQKDCLKRKAWFEN

Query:  KGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR-------------------------------------------
        +G + + VC+ESNL EVP NTWWIDSG T HV+N MQGFLTTR    +E+F++MGNR                                           
Subjt:  KGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR-------------------------------------------

Query:  -VKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM----------------------------------
         +KI++SDRGGEYYG+       PGPFAKFLE HGI AQYTMPGTPQQNGVAERRNRTLM MVRSM                                  
Subjt:  -VKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM----------------------------------

Query:  --------------------------------------------------------------------------------------------VEIPSSIT
                                                                                                    V+IP    
Subjt:  --------------------------------------------------------------------------------------------VEIPSSIT

Query:  SSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ----------LDAMKEEL
          +++VP  V  V + ++   +G  P  +I   E V E PQ   LRRS R RR AI+DDY+VYL ES++D+ I  DPVSFSQ          ++AM EEL
Subjt:  SSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ----------LDAMKEEL

Query:  KSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDE
        KSM  N VWDL+ELP   K VGCKWVFKTKRD+ GNIER+KARLVAKG+           FSPVSKKDSLRIIMALVAH+DLELHQMDVKTAFLNGN  +
Subjt:  KSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDE

Query:  EVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEM
            +  +G   +  +H+VCKLK+SIYGLKQASRQWY+KFN+TITSFGFKENIVD+CIYLK+SGSKFI L+LYVDDILLA++D GLL +TKE+LSKNF M
Subjt:  EVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEM

Query:  KDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGM
         DMGEA+YVIGIEIFRDR+ G+LGLSQK YI++VLE+F M  CSS + PI KGDK S MQCP+N +ER QM+ IPYAS VGSL+YAQTCTRPDISFA+GM
Subjt:  KDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGM

Query:  LGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHG
        LGRYQS+PG +HWKAAKKV+RYLQGTKDYMLTYKRS+ LEV+GYSDS++ GC+D+ KST G++F+LA GAISWK     +   STMEAEFVACFEA+ H 
Subjt:  LGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHG

Query:  LWLRNFISGLGIADSIAKPLRIYCDNSSA-------------KHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERMDI
        LWLRNFISGLG+ DSIAKPLRIYCDN++A             KHM+LKY  +KE VQK++VS+E+I T LM+ADPLTKGLPPK + +HV RM +
Subjt:  LWLRNFISGLGIADSIAKPLRIYCDNSSA-------------KHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERMDI

A0A445GJ88 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-30161.76Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MFMRMTVA++IK+T+  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEY P
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW
        F M+YNT+KDKWN                 +G+H                         G LK+K     I KK    + C FC K GH+QKDC KRK+W
Subjt:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW

Query:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQ
        FE KG+ NAL                              GFLT +T +PNE+F+FMGNRVK      G        G     LE+  +           
Subjt:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQ

Query:  QNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI
                +R L+++  S ++I     +        V + N+ +E Q N +     ++ NEP+ E PQE+ LR+S R RR AIS+DY+VYLHE E +LSI
Subjt:  QNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI

Query:  -DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRI
         DNDPVSFSQ          L+AMKEE+ SM  N VWDLVELPK  KRVGCKWVFKTKRDS+GN+ERYKARLVAKG+           FSPVS+KDS RI
Subjt:  -DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRI

Query:  IMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVL
        IMALV HYDLELHQMDVKTAFLNG+L+E+V+MDQP GF VEGKEHMVCKLK+SIYGLKQASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVL
Subjt:  IMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVL

Query:  YVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQME
        YVDDIL+ATND GLL +TK+FLS NFEMKDMGEA+YVIGIEIFR+R+ GLLGLSQK YINKVLE+F+M+KCS+S VPIQK DKFSL QCPKN+LER QME
Subjt:  YVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQME

Query:  TIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAIS
         I YAS+VGS++YAQTCTRPDISFA GMLGRYQSNPGM+HWKAAKKVLRYLQGTKD+MLTYKRSDHLEVIGYSDS+FAGCVDTRKST G++FLLA GAIS
Subjt:  TIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAIS

Query:  WKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMI
        WK     VV  STMEAEFVACFEAT+   WLRNFISGLGI DSIA+PL++YCDNS+             AKHMELKYF +KEEVQK+RVS+EHISTKLMI
Subjt:  WKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMI

Query:  ADPLTKGLPPKMFNDHVE
        ADPLTKGLPPK F +HVE
Subjt:  ADPLTKGLPPKMFNDHVE

A0A445LQ30 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0050.12Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEYGP
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW
        F M+YNT+KDKWN                 +G+H                         G LK+K     I KK    + C FC K GH+QKDC KRK+W
Subjt:  FHMNYNTLKDKWN---------------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAW

Query:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR----------------------------------------
        FE KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTMQGFLT +T +PNE+F+FMGNR                                        
Subjt:  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------VKILRSDRGGEYY-
                                                                                              VKI+RSDRGGEYY 
Subjt:  --------------------------------------------------------------------------------------VKILRSDRGGEYY-

Query:  -----GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM-------------------------------------------------
             G+ P PFAK L+  GICAQYTMPGTPQQNGV+ERRN+TLM+MVRSM                                                 
Subjt:  -----GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSM-------------------------------------------------

Query:  -----------------------------------------------------------------------------VEIPSSITSSQVVVPVVVDSVNN
                                                                                     V++P +  SS  V+   V + N+
Subjt:  -----------------------------------------------------------------------------VEIPSSITSSQVVVPVVVDSVNN

Query:  PQEQQINGQTPHND--IVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLV
         +E Q      HND  ++ NEP+ E PQE+ LR+S R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQ          L+AMKEE+ SM  N+VWDLV
Subjt:  PQEQQINGQTPHND--IVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ----------LDAMKEELKSMNDNEVWDLV

Query:  ELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV
        ELPK  KRVG KWVFKTKRDS+GN+ERYKARLVAKG+           FSPVS+KDS RIIMALVAHYDLELHQMDVKTAFLNG+L+E+V+MDQP GF V
Subjt:  ELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV

Query:  EGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGI
        EGKEHMVCKLK+SIYGLKQASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVLYVDDILLATND GL  +TK+FLS NFEMKDMGEASYVIGI
Subjt:  EGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGI

Query:  EIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDH
        EIFR+R+ GLLGLSQKAYINKVLE+F+M+KCS+S VPIQKGDKFSL QCPKN+LER QME IPYAS+VGS++YAQTCTRPDISFA GMLGRYQSNPGM+H
Subjt:  EIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDH

Query:  WKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGI
        WKAAKKVLRYLQGTKD++LTYKRSDHLEVIGYSDS+FAGCVDTRKST G++FLLA GAISWK     VV  STMEA FVACFEAT+   WLRNFISGLGI
Subjt:  WKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGI

Query:  ADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVER
         DSIA+PL++YCDNS+             AKHMELKYFA+KEEVQK+RVS+EHI+TKLMIADPLTKGLPPK F ++VE+
Subjt:  ADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVER

A0A5A7UG95 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0073.84Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVN+NFLV FILNSLPSEYGP
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE
        FHMNYNTLKDKWN                                      NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE
Subjt:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE

Query:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----
        NK +                                                  +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Subjt:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----

Query:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS
         YTMPGTPQQN VAER+                          +RT                                                + +   
Subjt:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS

Query:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------
         VEIPSSITSSQ+VVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ         
Subjt:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------

Query:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT
         LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Subjt:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT

Query:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
        AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
Subjt:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK

Query:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR
        EFLSKNFEMKDM EASYVIGIEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQMETI YASIVGSLLYAQTCTR
Subjt:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR

Query:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV
         DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQG KDYMLTYKRSDHLEVI YSDS+FAGCVDTRKSTFGYLFLLAEGAISWK     ++  STMEAEFV
Subjt:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV

Query:  ACFEAT
         CFEAT
Subjt:  ACFEAT

A0A5D3BWW5 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0073.84Show/hide
Query:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP
        MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVN+NFLVTFILNSLPSEYGP
Subjt:  MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGP

Query:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE
        FHMNYNTLKDKWN                                      NGKGNHGQLKVKQSSAPIHKKG+IKDKCRFCNKPGHYQKDCLKRKAWFE
Subjt:  FHMNYNTLKDKWN--------------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFE

Query:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----
        NK +                                                  +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Subjt:  NKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ----

Query:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS
         YTMPGTPQQN VAER+                          +RT                                                + +   
Subjt:  -YTMPGTPQQNGVAERR--------------------------NRTL-----------------------------------------------MNMVRS

Query:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------
         VEIPSSITSSQ+VVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ         
Subjt:  MVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ---------

Query:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT
         LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Subjt:  -LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT

Query:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
        AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
Subjt:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK

Query:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR
        EFLSKNFEMKDM EASYVIGIEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQMETI YASIVGSLLYAQTCTR
Subjt:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR

Query:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV
         DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQG KDYMLTYKRSDHLEVI YSDS+FAGCVDTRKSTFGYLFLLAEGAISWK     ++  STMEAEFV
Subjt:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFV

Query:  ACFEAT
         CFEAT
Subjt:  ACFEAT

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-8737.83Show/hide
Query:  DAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAF
        +A+  EL +   N  W + + P+    V  +WVF  K +  GN  RYKARLVA+G+           F+PV++  S R I++LV  Y+L++HQMDVKTAF
Subjt:  DAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAF

Query:  LNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISG--SKFIILVLYVDDILLATNDFGLLCQTK
        LNG L EE++M  P+G  +      VCKL ++IYGLKQA+R W+  F   +    F  + VDRCIY+   G  ++ I ++LYVDD+++AT D   +   K
Subjt:  LNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISG--SKFIILVLYVDDILLATNDFGLLCQTK

Query:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR
         +L + F M D+ E  + IGI I  +     + LSQ AY+ K+L KF M+ C++   P+     + L       L  ++    P  S++G L+Y   CTR
Subjt:  EFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR

Query:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLE--VIGYSDSNFAGCVDTRKSTFGYLFLLAE-GAISW--KIPPSLVVSTMEA
        PD++ AV +L RY S    + W+  K+VLRYL+GT D  L +K++   E  +IGY DS++AG    RKST GYLF + +   I W  K   S+  S+ EA
Subjt:  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLE--VIGYSDSNFAGCVDTRKSTFGYLFLLAE-GAISW--KIPPSLVVSTMEA

Query:  EFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMF
        E++A FEA    LWL+  ++ + I   +  P++IY DN               AKH+++KY   +E+VQ   + +E+I T+  +AD  TK LP   F
Subjt:  EFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-12336.78Show/hide
Query:  GNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMV-------------------------------EIPSSI
        G ++K LRSD GGEY  +    F ++  SHGI  + T+PGTPQ NGVAER NRT++  VRSM+                               EIP  +
Subjt:  GNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMV-------------------------------EIPSSI

Query:  -TSSQV-------------------------------------------------------------------------------VVP---VVVDSVNNP
         T+ +V                                                                               ++P    +  + NNP
Subjt:  -TSSQV-------------------------------------------------------------------------------VVP---VVVDSVNNP

Query:  -------QEQQINGQTPHNDIVTNEPVTEGPQEIE-----------LRRSVRSR---RSAISDDYLVYL--HESEFDLSIDNDPVSFSQLDAMKEELKSM
                E    G+ P   I   E + EG +E+E           LRRS R R   R   S +Y++     E E    + + P     + AM+EE++S+
Subjt:  -------QEQQINGQTPHNDIVTNEPVTEGPQEIE-----------LRRSVRSR---RSAISDDYLVYL--HESEFDLSIDNDPVSFSQLDAMKEELKSM

Query:  NDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVF
          N  + LVELPK  + + CKWVFK K+D +  + RYKARLV KG+           FSPV K  S+R I++L A  DLE+ Q+DVKTAFL+G+L+EE++
Subjt:  NDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVF

Query:  MDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLK-ISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKD
        M+QPEGF V GK+HMVCKL +S+YGLKQA RQWY+KF+  + S  + +   D C+Y K  S + FIIL+LYVDD+L+   D GL+ + K  LSK+F+MKD
Subjt:  MDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLK-ISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKD

Query:  MGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLG
        +G A  ++G++I R+RT   L LSQ+ YI +VLE+F M        P+    K S   CP    E+  M  +PY+S VGSL+YA  CTRPDI+ AVG++ 
Subjt:  MGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLG

Query:  RYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW--KIPPSLVVSTMEAEFVACFEATVHGLW
        R+  NPG +HW+A K +LRYL+GT    L +  SD + + GY+D++ AG +D RKS+ GYLF  + GAISW  K+   + +ST EAE++A  E     +W
Subjt:  RYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW--KIPPSLVVSTMEAEFVACFEATVHGLW

Query:  LRNFISGLGIADSIAKPLRIYCDNSSA-------------KHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMF
        L+ F+  LG+     K   +YCD+ SA             KH++++Y  I+E V  E + V  IST    AD LTK +P   F
Subjt:  LRNFISGLGIADSIAKPLRIYCDNSSA-------------KHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMF

P25600 Putative transposon Ty5-1 protein YCL074W7.0e-3931.31Show/hide
Query:  MDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGL
        MDV TAFLN  +DE +++ QP GF+ E     V +L   +YGLKQA   W    N+T+   GF  +  +  +Y + +    I + +YVDD+L+A     +
Subjt:  MDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGL

Query:  LCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYA
          + K+ L+K + MKD+G+    +G+ I +  ++G + LS + YI K   + +++    +  P+           P      +  +  PY SIVG LL+ 
Subjt:  LCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYA

Query:  QTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW---KIPPSLVVST
            RPDIS+ V +L R+   P   H ++A++VLRYL  T+   L Y+    L +  Y D++     D   ST GY+ LLA   ++W   K+   + V +
Subjt:  QTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW---KIPPSLVVST

Query:  MEAEFVACFEATV
         EAE++   E  +
Subjt:  MEAEFVACFEATV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.8e-8035.91Show/hide
Query:  DAMKEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIMALVAHYDLELHQMDVKTA
        +AM  E+ +   N  WDLV  P      VGC+W+F  K +S+G++ RYKARLVAKGYN          FSPV K  S+RI++ +       + Q+DV  A
Subjt:  DAMKEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIMALVAHYDLELHQMDVKTA

Query:  FLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKE
        FL G L ++V+M QP GF+ + + + VCKL++++YGLKQA R WY++  + + + GF  ++ D  +++   G   + +++YVDDIL+  ND  LL  T +
Subjt:  FLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKE

Query:  FLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRP
         LS+ F +KD  E  Y +GIE  R  T   L LSQ+ YI  +L +  M        P+    K SL    K        +   Y  IVGSL Y    TRP
Subjt:  FLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRP

Query:  DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW--KIPPSLVVSTMEAEFVA
        DIS+AV  L ++   P  +H +A K++LRYL GT ++ +  K+ + L +  YSD+++AG  D   ST GY+  L    ISW  K    +V S+ EAE+ +
Subjt:  DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW--KIPPSLVVSTMEAEFVA

Query:  CFEATVHGLWLRNFISGLGIADSIAKPLRIYCDN-------------SSAKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERM
            +    W+ + ++ LGI   + +P  IYCDN             S  KH+ + Y  I+ +VQ   + V H+ST   +AD LTK L    F +   ++
Subjt:  CFEATVHGLWLRNFISGLGIADSIAKPLRIYCDN-------------SSAKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERM

Query:  DISR
         ++R
Subjt:  DISR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.7e-7833.28Show/hide
Query:  PGTPQQNGVAERRNRTLMNMVRSMVEI--PSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHE
        P +P QN    +   +  ++      I  P+S +SS    P +   +  P   Q+N Q P N   T+   T     I       S  ++++ +      E
Subjt:  PGTPQQNGVAERRNRTLMNMVRSMVEI--PSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHE

Query:  SEFDLSIDNDPVSFSQLDAMKEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIMA
            +    D        AM  E+ +   N  WDLV  P  S   VGC+W+F  K +S+G++ RYKARLVAKGYN          FSPV K  S+RI++ 
Subjt:  SEFDLSIDNDPVSFSQLDAMKEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIMA

Query:  LVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD
        +       + Q+DV  AFL G L +EV+M QP GF+ + +   VC+L+++IYGLKQA R WY++    + + GF  +I D  +++   G   I +++YVD
Subjt:  LVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD

Query:  DILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIP
        DIL+  ND  LL  T + LS+ F +K+  +  Y +GIE    R    L LSQ+ Y   +L +  M        P+    K +L    K        +   
Subjt:  DILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIP

Query:  YASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW--
        Y  IVGSL Y    TRPD+S+AV  L +Y   P  DHW A K+VLRYL GT D+ +  K+ + L +  YSD+++AG  D   ST GY+  L    ISW  
Subjt:  YASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW--

Query:  KIPPSLVVSTMEAEFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDN-------------SSAKHMELKYFAIKEEVQKERVSVEHISTKLMIADP
        K    +V S+ EAE+ +    +    W+ + ++ LGI   ++ P  IYCDN             S  KH+ L Y  I+ +VQ   + V H+ST   +AD 
Subjt:  KIPPSLVVSTMEAEFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDN-------------SSAKHMELKYFAIKEEVQKERVSVEHISTKLMIADP

Query:  LTKGLPPKMFNDHVERMDI
        LTK L    F +   ++ +
Subjt:  LTKGLPPKMFNDHVERMDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.2e-8338.24Show/hide
Query:  AMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFL
        AM +E+ +M     W++  LP   K +GCKWV+K K +S+G IERYKARLVAKGY           FSPV K  S+++I+A+ A Y+  LHQ+D+  AFL
Subjt:  AMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFL

Query:  NGNLDEEVFMDQPEGFMVEGKEHM----VCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQT
        NG+LDEE++M  P G+     + +    VC LK+SIYGLKQASRQW+LKF+ T+  FGF ++  D   +LKI+ + F+ +++YVDDI++ +N+   + + 
Subjt:  NGNLDEEVFMDQPEGFMVEGKEHM----VCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQT

Query:  KEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCT
        K  L   F+++D+G   Y +G+EI R      + + Q+ Y   +L++  +  C  S VP+     FS           + ++   Y  ++G L+Y Q  T
Subjt:  KEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCT

Query:  RPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEF
        R DISFAV  L ++   P + H +A  K+L Y++GT    L Y     +++  +SD++F  C DTR+ST GY   L    ISWK     VV  S+ EAE+
Subjt:  RPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEF

Query:  VACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSSAKHM
         A   AT   +WL  F   L +   ++KP  ++CDN++A H+
Subjt:  VACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSSAKHM

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.0e-0937.97Show/hide
Query:  TCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLL
        T TRPD++FAV  L ++ S       +A  KVL Y++GT    L Y  +  L++  ++DS++A C DTR+S  G+  L+
Subjt:  TCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLL

ATMG00810.1 DNA/RNA polymerases superfamily protein2.0e-2536.28Show/hide
Query:  LVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTH-GLLGLSQKAYINKVLEKFKMDKCS--SSVVPIQKGDKFSLMQCPKNEL
        L+LYVDDILL  +   LL      LS  F MKD+G   Y +GI+I   +TH   L LSQ  Y  ++L    M  C   S+ +P++     S  + P    
Subjt:  LVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTH-GLLGLSQKAYINKVLEKFKMDKCS--SSVVPIQKGDKFSLMQCPKNEL

Query:  ERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLL
             +   + SIVG+L Y  T TRPDIS+AV ++ +    P +  +   K+VLRY++GT  + L   ++  L V  + DS++AGC  TR+ST G+   L
Subjt:  ERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLL

Query:  AEGAISW--KIPPSLVVSTMEAEFVA
            ISW  K  P++  S+ E E+ A
Subjt:  AEGAISW--KIPPSLVVSTMEAEFVA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.8e-1143.75Show/hide
Query:  AMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIM
        AM+EEL +++ N+ W LV  P     +GCKWVFKTK  S+G ++R KARLVAKG++          +SPV +  ++R I+
Subjt:  AMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSPVSKKDSLRIIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAATCTGTGGAAAAATGTTCTCAGTCAGAGTCGGC
TGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGACGAACTTGGCAGCAAGGT
TAAAGACCATGGGAATGGAAGTTAATAAGAATTTTTTGGTAACGTTTATCCTTAATTCTTTACCTTCAGAGTATGGTCCATTTCACATGAACTATAACACTCTGAAAGAT
AAATGGAATAATGGGAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACC
TGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATA
CATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATACGATGCAGGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGA
GTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGG
AACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTC
CTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATATTGTAACAAATGAACCTGTAACTGAGGGACCACAAGAAATA
GAATTAAGAAGATCTGTAAGATCAAGAAGATCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGACTTAAGCATTGATAATGATCCAGTTTCGTT
TTCACAATTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTCT
TTAAGACCAAACGTGACTCAAATGGAAATATTGAACGATACAAGGCTAGACTTGTTGCCAAAGGTTATAACTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATT
ATGGCTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTT
TATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAGTGGTATCTTAAGTTTAATGATACCATCACATCTT
TTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGAC
TTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTCCGTGATCGAACACA
TGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTA
GTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTCCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACCAGAC
ATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAAGATTATAT
GCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTCAAATTTTGCCGGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAG
CTGAAGGAGCAATTTCATGGAAAATACCCCCGTCTCTTGTAGTGTCCACTATGGAAGCTGAATTTGTAGCATGCTTTGAGGCTACAGTTCATGGTTTATGGCTGCGGAAC
TTTATCTCAGGACTTGGAATTGCCGACAGTATTGCCAAGCCGCTGAGAATTTATTGTGATAATTCTAGTGCTAAACATATGGAATTAAAATACTTTGCCATTAAAGAAGA
AGTTCAGAAAGAGAGGGTGTCAGTTGAACACATTAGCACTAAACTTATGATTGCGGATCCACTGACTAAAGGATTGCCACCAAAGATGTTCAATGATCACGTTGAACGTA
TGGACATCAGTAGATATCATCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAATCTGTGGAAAAATGTTCTCAGTCAGAGTCGGC
TGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGACGAACTTGGCAGCAAGGT
TAAAGACCATGGGAATGGAAGTTAATAAGAATTTTTTGGTAACGTTTATCCTTAATTCTTTACCTTCAGAGTATGGTCCATTTCACATGAACTATAACACTCTGAAAGAT
AAATGGAATAATGGGAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACC
TGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATA
CATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATACGATGCAGGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGA
GTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGG
AACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTC
CTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATATTGTAACAAATGAACCTGTAACTGAGGGACCACAAGAAATA
GAATTAAGAAGATCTGTAAGATCAAGAAGATCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGACTTAAGCATTGATAATGATCCAGTTTCGTT
TTCACAATTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTCT
TTAAGACCAAACGTGACTCAAATGGAAATATTGAACGATACAAGGCTAGACTTGTTGCCAAAGGTTATAACTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATT
ATGGCTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTT
TATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAGTGGTATCTTAAGTTTAATGATACCATCACATCTT
TTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGAC
TTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTCCGTGATCGAACACA
TGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTA
GTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTCCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACCAGAC
ATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAAGATTATAT
GCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTCAAATTTTGCCGGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAG
CTGAAGGAGCAATTTCATGGAAAATACCCCCGTCTCTTGTAGTGTCCACTATGGAAGCTGAATTTGTAGCATGCTTTGAGGCTACAGTTCATGGTTTATGGCTGCGGAAC
TTTATCTCAGGACTTGGAATTGCCGACAGTATTGCCAAGCCGCTGAGAATTTATTGTGATAATTCTAGTGCTAAACATATGGAATTAAAATACTTTGCCATTAAAGAAGA
AGTTCAGAAAGAGAGGGTGTCAGTTGAACACATTAGCACTAAACTTATGATTGCGGATCCACTGACTAAAGGATTGCCACCAAAGATGTTCAATGATCACGTTGAACGTA
TGGACATCAGTAGATATCATCATTGA
Protein sequenceShow/hide protein sequence
MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKD
KWNNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNR
VKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEI
ELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQLDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYNFSPVSKKDSLRII
MALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATND
FGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPD
ISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVVSTMEAEFVACFEATVHGLWLRN
FISGLGIADSIAKPLRIYCDNSSAKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERMDISRYHH