; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0053361 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0053361
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr02:20758311..20760219
RNA-Seq ExpressionCmc02g0053361
SyntenyCmc02g0053361
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]7.4e-20557.8Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP+VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY
        +INV+V+D     K+ D EED       V TS     D +K+     N D    S + E+ +       S+  +K H    IIGDP  G+T+R ++    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY

Query:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA
         +++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVA
Subjt:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA

Query:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS
        RLE+I LLLG++CI KFK YQMDVK AFLNGYLNEE+YV QPKGF D   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + 
Subjt:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS

Query:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH
        +L++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD 
Subjt:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH

Query:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF
         LYRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHLT +KRI+KYV+GT+D+ I+Y + +  +LV YCDADWAGS +DRKSTS GCF+LGNNLISWF
Subjt:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF

Query:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        SKKQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

AAO73523.1 gag-pol polyprotein [Glycine max]9.6e-20557.8Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP+VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY
        +INV+V+D     K+ D EED       V TS     D +K+     N D    S + E+ +       S+  +K H    IIGDP  G+T+R ++    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY

Query:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA
         +++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVA
Subjt:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA

Query:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS
        RLE+I LLLG++CI KFK YQMDVK AFLNGYLNEE+YV QPKGF D   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + 
Subjt:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS

Query:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH
        +L++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD 
Subjt:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH

Query:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF
        K YRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHL  +KRI+KYV+GT+D+ I+Y + ++S+LV YCDADWAGS +DRKSTS GCF+LGNNLISWF
Subjt:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF

Query:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        SKKQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

AAO73527.1 gag-pol polyprotein [Glycine max]1.6e-20457.5Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP+VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK
        +INV+V+D     K+ D EED              D +K+     N D    S + E+ +       S+  +K H    IIGDP  G+T+R ++     +
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK

Query:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL
        ++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVARL
Subjt:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL

Query:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL
        E+I LLLG++CI KFK YQMDVK AFLNGYLNEE+YV QPKGF D   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + +L
Subjt:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL

Query:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL
        ++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD  L
Subjt:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL

Query:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK
        YRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHLT +KRI+KYV+GT+D+ I+Y + +  +LV YCDADWAGS +DRKSTS GCF+LGNNLISWFSK
Subjt:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK

Query:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        KQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

AAO73529.1 gag-pol polyprotein [Glycine max]7.4e-20558.11Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY
        +INV+V+D     K+ D EED       V TS     D +K+     N D    S + E  +      PS   +K H    IIGDP  G+T+R ++    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY

Query:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA
         +++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVA
Subjt:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA

Query:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS
        RLE+I LLLG++CI KFK YQMDVK AFLNGYLNEE YV QPKGFVD   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + 
Subjt:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS

Query:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH
        +L++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD 
Subjt:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH

Query:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF
         LYRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHL  +KRI+KYV+GT+D+ I+Y + + S+LV YCDADWAGS +DRKSTS GCF+LGNNLISWF
Subjt:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF

Query:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        SKKQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

PNY10358.1 retrotransposon-related protein, partial [Trifolium pratense]4.0e-20357.19Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAKNLPL+FWA  +NTACYIHNR+T R GT+ TLYELWK +KP VKYFH FGS CYILADRE  RK   K ++G+FLGYS N+RAYRVFN+RT T+ME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK
        +INV+++DS    +++ D E     +  +P  +  +     +   + D++  +T      +G    PS   +K+H    +IGDP  GIT+R+ + V    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK

Query:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL
         I++ C+   IE  + + AL DEYWINAMQEEL  FKR+ VW LVP+P++ N+IGTKW++KNK DE+G +TRNKARLVAQGY QIEGVDFDE FAPVARL
Subjt:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL

Query:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL
        E+I LLL ++CI KFK YQMDVK AFLNGYLNEE+YV QPKGFVD  FP HVYK+ KALY LKQAP+AWYE LT +L N+ Y +GG DKTLF+      L
Subjt:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL

Query:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL
        ++AQIYVDDI+F G  + +V+ F+  ++SEFEMS+                         YAKNIVKKFG++   HKRTPA T++K+TKD  G  VD  L
Subjt:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL

Query:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK
        YRSMIGSLLYLTASR DI +A+ +CAR+    + SHL  +KRI KYV+ T ++ ILYS+   S LV YCDADWAGS +DRKSTS  CFFLGNNL+SWFSK
Subjt:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK

Query:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        KQN VSLSTAEAEYIAAGS C+QL+WM+ M  +
Subjt:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

TrEMBL top hitse value%identityAlignment
A0A2K3P4Z0 Retrotransposon-related protein (Fragment)2.0e-20357.19Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAKNLPL+FWA  +NTACYIHNR+T R GT+ TLYELWK +KP VKYFH FGS CYILADRE  RK   K ++G+FLGYS N+RAYRVFN+RT T+ME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK
        +INV+++DS    +++ D E     +  +P  +  +     +   + D++  +T      +G    PS   +K+H    +IGDP  GIT+R+ + V    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK

Query:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL
         I++ C+   IE  + + AL DEYWINAMQEEL  FKR+ VW LVP+P++ N+IGTKW++KNK DE+G +TRNKARLVAQGY QIEGVDFDE FAPVARL
Subjt:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL

Query:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL
        E+I LLL ++CI KFK YQMDVK AFLNGYLNEE+YV QPKGFVD  FP HVYK+ KALY LKQAP+AWYE LT +L N+ Y +GG DKTLF+      L
Subjt:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL

Query:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL
        ++AQIYVDDI+F G  + +V+ F+  ++SEFEMS+                         YAKNIVKKFG++   HKRTPA T++K+TKD  G  VD  L
Subjt:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL

Query:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK
        YRSMIGSLLYLTASR DI +A+ +CAR+    + SHL  +KRI KYV+ T ++ ILYS+   S LV YCDADWAGS +DRKSTS  CFFLGNNL+SWFSK
Subjt:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK

Query:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        KQN VSLSTAEAEYIAAGS C+QL+WM+ M  +
Subjt:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

Q84VH6 Gag-pol polyprotein3.6e-20558.11Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY
        +INV+V+D     K+ D EED       V TS     D +K+     N D    S + E  +      PS   +K H    IIGDP  G+T+R ++    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY

Query:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA
         +++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVA
Subjt:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA

Query:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS
        RLE+I LLLG++CI KFK YQMDVK AFLNGYLNEE YV QPKGFVD   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + 
Subjt:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS

Query:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH
        +L++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD 
Subjt:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH

Query:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF
         LYRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHL  +KRI+KYV+GT+D+ I+Y + + S+LV YCDADWAGS +DRKSTS GCF+LGNNLISWF
Subjt:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF

Query:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        SKKQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

Q84VH8 Gag-pol polyprotein7.9e-20557.5Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP+VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK
        +INV+V+D     K+ D EED              D +K+     N D    S + E+ +       S+  +K H    IIGDP  G+T+R ++     +
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSK

Query:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL
        ++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVARL
Subjt:  MIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARL

Query:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL
        E+I LLLG++CI KFK YQMDVK AFLNGYLNEE+YV QPKGF D   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + +L
Subjt:  EAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDL

Query:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL
        ++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD  L
Subjt:  IVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKL

Query:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK
        YRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHLT +KRI+KYV+GT+D+ I+Y + +  +LV YCDADWAGS +DRKSTS GCF+LGNNLISWFSK
Subjt:  YRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSK

Query:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        KQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  KQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

Q84VI2 Gag-pol polyprotein4.7e-20557.8Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP+VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY
        +INV+V+D     K+ D EED       V TS     D +K+     N D    S + E+ +       S+  +K H    IIGDP  G+T+R ++    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY

Query:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA
         +++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVA
Subjt:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA

Query:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS
        RLE+I LLLG++CI KFK YQMDVK AFLNGYLNEE+YV QPKGF D   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + 
Subjt:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS

Query:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH
        +L++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD 
Subjt:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH

Query:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF
        K YRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHL  +KRI+KYV+GT+D+ I+Y + ++S+LV YCDADWAGS +DRKSTS GCF+LGNNLISWF
Subjt:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF

Query:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        SKKQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

Q84VI4 Gag-pol polyprotein3.6e-20557.8Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+HAK LP + WA  +NTACYIHNR+T R GT  TLYE+WKGRKP+VK+FH FGS CYILADRE  RK   K + G+FLGYS N+RAYRVFN+RT TVME
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY
        +INV+V+D     K+ D EED       V TS     D +K+     N D    S + E+ +       S+  +K H    IIGDP  G+T+R ++    
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTS--TSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDY

Query:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA
         +++++ C+   IE  + + AL DE+WINAMQEEL  FKRN VW LVP+P+  N+IGTKWIFKNK +E G +TRNKARLVAQGY QIEGVDFDETFAPVA
Subjt:  SKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVA

Query:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS
        RLE+I LLLG++CI KFK YQMDVK AFLNGYLNEE+YV QPKGF D   P HVY++ KALY LKQAP+AWYE LT +L  + Y +GG DKTLF+ + + 
Subjt:  RLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSS

Query:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH
        +L++AQIYVDDI+F G    ++  F+  ++SEFEMS+                         YAKNIVKKFG++ + HKRTPA T++K++KD  G +VD 
Subjt:  DLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSM-------------------------YAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDH

Query:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF
         LYRSMIGSLLYLTASR DI YA+ +CAR+  +P+ SHLT +KRI+KYV+GT+D+ I+Y + +  +LV YCDADWAGS +DRKSTS GCF+LGNNLISWF
Subjt:  KLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWF

Query:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        SKKQNCVSLSTAEAEYIAAGS C+QL+WMK M  +
Subjt:  SKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.9e-5632.45Show/hide
Query:  WINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGISCIRKFKFYQMDVKR
        W  A+  EL   K NN WT+  +P+  NI+ ++W+F  K +E G   R KARLVA+G+ Q   +D++ETFAPVAR+ +   +L +      K +QMDVK 
Subjt:  WINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGISCIRKFKFYQMDVKR

Query:  AFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFI--NRTSSDLIVAQIYVDDIIFRGFPKALVDS
        AFLNG L EE+Y+  P+G   S    +V K+NKA+Y LKQA + W+E     L    +     D+ ++I      ++ I   +YVDD++        +++
Subjt:  AFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFI--NRTSSDLIVAQIYVDDIIFRGFPKALVDS

Query:  FIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTP---AVTYVKITKDINGKTVDHKLYRSMIGSLLY-LTASRLDI
        F   +  +F M                         S Y K I+ KF ++      TP    + Y  +  D +  T      RS+IG L+Y +  +R D+
Subjt:  FIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTP---AVTYVKITKDINGKTVDHKLYRSMIGSLLY-LTASRLDI

Query:  AYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILY--SYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGN-NLISWFSKKQNCVSLSTAEAEYI
          A+ I +R+     +     +KR+++Y+ GT D ++++  +    + ++ Y D+DWAGS  DRKST+   F + + NLI W +K+QN V+ S+ EAEY+
Subjt:  AYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILY--SYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGN-NLISWFSKKQNCVSLSTAEAEYI

Query:  AAGSGCTQLIWMK
        A      + +W+K
Subjt:  AAGSGCTQLIWMK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-6727.88Show/hide
Query:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME
        M+    LP  FW   + TACY+ NR  +          +W  ++ +  +   FG   +    +E   K   K    +F+GY      YR+++   + V+ 
Subjt:  MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVME

Query:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGD---PFTG----------
        + +V+  +S+        +  E  K  ++P   ++        T+N   S +ST+ E + +G    P     +  Q    + +   P  G          
Subjt:  TINVLVNDSKYTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGD---PFTG----------

Query:  ----ITSRKKDKVDYSKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYA
            + SR+    +Y  +  D       E+ S     +    + AMQEE+   ++N  + LV  P     +  KW+FK K D    + R KARLV +G+ 
Subjt:  ----ITSRKKDKVDYSKMIADLCYALAIELTSFEAALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYA

Query:  QIEGVDFDETFAPVARLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYS
        Q +G+DFDE F+PV ++ +I  +L ++     +  Q+DVK AFL+G L EE+Y+ QP+GF  +     V K+NK+LY LKQAP+ WY     ++ ++ Y 
Subjt:  QIEGVDFDETFAPVARLEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYS

Query:  RGGTDKTLFINRTS-SDLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEM---------------------------SMYAKNIVKKFGLDQSQHKRTP
        +  +D  ++  R S ++ I+  +YVDD++  G  K L+      +   F+M                             Y + ++++F +  ++   TP
Subjt:  RGGTDKTLFINRTS-SDLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEM---------------------------SMYAKNIVKKFGLDQSQHKRTP

Query:  AVTYVKITKDINGKTVDHK------LYRSMIGSLLY-LTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADW
           ++K++K +   TV+ K       Y S +GSL+Y +  +R DIA+A+ + +RF  +P   H   +K I++Y+ GTT   + +   +  IL  Y DAD 
Subjt:  AVTYVKITKDINGKTVDHK------LYRSMIGSLLY-LTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADW

Query:  AGSTNDRKSTSSGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD
        AG  ++RKS++   F      ISW SK Q CV+LST EAEYIAA     ++IW+K    +
Subjt:  AGSTNDRKSTSSGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHD

P92519 Uncharacterized mitochondrial protein AtMg008102.7e-2433.78Show/hide
Query:  IYVDDIIFRGFPKALVDSFIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTPAVTYVKITKDIN-GKTVDHKLYRS
        +YVDDI+  G    L++  I  + S F M                         + YA+ I+   G+   +   TP    +K+   ++  K  D   +RS
Subjt:  IYVDDIIFRGFPKALVDSFIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTPAVTYVKITKDIN-GKTVDHKLYRS

Query:  MIGSLLYLTASRLDIAYAIEI-CARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVK-YCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKK
        ++G+L YLT +R DI+YA+ I C R H +P  +    +KR+++YV GT  F  LY +  + + V+ +CD+DWAG T+ R+ST+  C FLG N+ISW +K+
Subjt:  MIGSLLYLTASRLDIAYAIEI-CARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVK-YCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKK

Query:  QNCVSLSTAEAEYIAAGSGCTQLIW
        Q  VS S+ E EY A      +L W
Subjt:  QNCVSLSTAEAEYIAAGSGCTQLIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-6735.75Show/hide
Query:  YALAIELTSFE------AALKDEYWINAMQEELLPFKRNNVWTLV-PKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVAR
        Y+LA+ L +         ALKDE W NAM  E+     N+ W LV P P    I+G +WIF  K +  G + R KARLVA+GY Q  G+D+ ETF+PV +
Subjt:  YALAIELTSFE------AALKDEYWINAMQEELLPFKRNNVWTLV-PKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVAR

Query:  LEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSD
          +I ++LG++  R +   Q+DV  AFL G L +++Y++QP GF+D   P +V K+ KALY LKQAP+AWY  L  YL    +    +D +LF+ +    
Subjt:  LEAICLLLGISCIRKFKFYQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSD

Query:  LIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFE----------MSMYAKNIVKKFGLDQSQH-----KRTPAVTYVKITKDI----------NGKTVDHK
        ++   +YVDDI+  G    L+ + +D +   F           + + AK +     L Q ++      RT  +T   +T  +            K  D  
Subjt:  LIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFE----------MSMYAKNIVKKFGLDQSQH-----KRTPAVTYVKITKDI----------NGKTVDHK

Query:  LYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFS
         YR ++GSL YL  +R DI+YA+   ++F   P   HL  +KRI++Y+ GT +  I      T  L  Y DADWAG  +D  ST+    +LG++ ISW S
Subjt:  LYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFS

Query:  KKQNCVSLSTAEAEYIAAGSGCTQLIWM
        KKQ  V  S+ EAEY +  +  +++ W+
Subjt:  KKQNCVSLSTAEAEYIAAGSGCTQLIWM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-6735.85Show/hide
Query:  ALKDEYWINAMQEELLPFKRNNVWTLV-PKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGISCIRKFKF
        A+KD+ W  AM  E+     N+ W LV P P    I+G +WIF  K +  G + R KARLVA+GY Q  G+D+ ETF+PV +  +I ++LG++  R +  
Subjt:  ALKDEYWINAMQEELLPFKRNNVWTLV-PKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGISCIRKFKF

Query:  YQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDLIVAQIYVDDIIFRGFPK
         Q+DV  AFL G L +E+Y++QP GFVD   P +V ++ KA+Y LKQAP+AWY  L  YL    +    +D +LF+ +    +I   +YVDDI+  G   
Subjt:  YQMDVKRAFLNGYLNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDLIVAQIYVDDIIFRGFPK

Query:  ALVDSFIDIIKSEFE----------MSMYAKNIVKKFGLDQSQHK---------------RTPAVTYVKITKDINGKTVDHKLYRSMIGSLLYLTASRLD
         L+   +D +   F           + + AK + +   L Q ++                 TP  T  K+T     K  D   YR ++GSL YL  +R D
Subjt:  ALVDSFIDIIKSEFE----------MSMYAKNIVKKFGLDQSQHK---------------RTPAVTYVKITKDINGKTVDHKLYRSMIGSLLYLTASRLD

Query:  IAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAA
        ++YA+   +++   P   H   +KR+++Y+ GT D  I      T  L  Y DADWAG T+D  ST+    +LG++ ISW SKKQ  V  S+ EAEY + 
Subjt:  IAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAA

Query:  GSGCTQLIWM
         +  ++L W+
Subjt:  GSGCTQLIWM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.5e-6631.05Show/hide
Query:  TSTSVDVSKA-----DIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSKMIADLCYALAIELTSFEAALKDEYW
        +S+S+D+  +     D+   +   S++ T K A ++             H  +S+     +   S +K    Y   +  +C A A E +++  A +   W
Subjt:  TSTSVDVSKA-----DIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSKMIADLCYALAIELTSFEAALKDEYW

Query:  INAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGISCIRKFKFYQMDVKRA
          AM +E+   +  + W +   P     IG KW++K K +  G + R KARLVA+GY Q EG+DF ETF+PV +L ++ L+L IS I  F  +Q+D+  A
Subjt:  INAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGISCIRKFKFYQMDVKRA

Query:  FLNGYLNEELYVAQPKGFV----DSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDLIVAQIYVDDIIFRGFPKALVD
        FLNG L+EE+Y+  P G+     DS  P  V  + K++Y LKQA + W+   ++ L    + +  +D T F+  T++  +   +YVDDII      A VD
Subjt:  FLNGYLNEELYVAQPKGFV----DSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDLIVAQIYVDDIIFRGFPKALVD

Query:  SFIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKLYRSMIGSLLYLTASRLDIAYA
             +KS F++                           YA +++ + GL   +    P    V  +    G  VD K YR +IG L+YL  +RLDI++A
Subjt:  SFIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKLYRSMIGSLLYLTASRLDIAYA

Query:  IEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAAGSGC
        +   ++F   PR +H   + +I+ Y+ GT    + YS      L  + DA +    + R+ST+  C FLG +LISW SKKQ  VS S+AEAEY A     
Subjt:  IEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAAGSGC

Query:  TQLIWMKNMFHD
         +++W+   F +
Subjt:  TQLIWMKNMFHD

ATMG00240.1 Gag-Pol-related retrotransposon family protein7.5e-0629.55Show/hide
Query:  LYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGC-----FFLG
        +YLT +R D+ +A+   ++F    RT+ +  + +++ YV GT    + YS  +   L  + D+DWA   + R+S +  C     +FLG
Subjt:  LYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAGSTNDRKSTSSGC-----FFLG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.9e-2533.78Show/hide
Query:  IYVDDIIFRGFPKALVDSFIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTPAVTYVKITKDIN-GKTVDHKLYRS
        +YVDDI+  G    L++  I  + S F M                         + YA+ I+   G+   +   TP    +K+   ++  K  D   +RS
Subjt:  IYVDDIIFRGFPKALVDSFIDIIKSEFEM-------------------------SMYAKNIVKKFGLDQSQHKRTPAVTYVKITKDIN-GKTVDHKLYRS

Query:  MIGSLLYLTASRLDIAYAIEI-CARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVK-YCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKK
        ++G+L YLT +R DI+YA+ I C R H +P  +    +KR+++YV GT  F  LY +  + + V+ +CD+DWAG T+ R+ST+  C FLG N+ISW +K+
Subjt:  MIGSLLYLTASRLDIAYAIEI-CARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVK-YCDADWAGSTNDRKSTSSGCFFLGNNLISWFSKK

Query:  QNCVSLSTAEAEYIAAGSGCTQLIW
        Q  VS S+ E EY A      +L W
Subjt:  QNCVSLSTAEAEYIAAGSGCTQLIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.1e-2050Show/hide
Query:  ALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGIS
        ALKD  W  AMQEEL    RN  W LVP P   NI+G KW+FK K+   G + R KARLVA+G+ Q EG+ F ET++PV R   I  +L ++
Subjt:  ALKDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATACATGCCAAAAATTTACCCTTGCATTTTTGGGCAAGAACTATTAACACAGCTTGCTACATTCACAATAGAATTACCACTCGATCTGGAACTAATGTGACCTTATA
TGAGCTGTGGAAGGGCAGAAAGCCTAATGTTAAATATTTTCATAATTTTGGAAGTACTTGTTATATTCTTGCTGACAGAGAATATCATCGAAAATGGTATGCTAAATTAG
AACAAGGTCTATTTCTTGGATATTCACAAAACAACAGAGCCTACAGAGTTTTCAACAATAGAACTGAAACAGTTATGGAAACAATTAACGTTTTGGTCAATGATTCTAAA
TATACTAATAAGCAAATTGATGATGAAGAGGACGAAGCACCTAAGGTGACTCTAGTTCCAACTTCTACATCTGTTGATGTATCCAAAGCTGATATTGAGACAACTAATTT
TGACTTAAGCTATAAATCTACATCTAAGGAAGCAACAGTTGAAGGAACTTTATCAATTCCCTCATCACATTTTCGAAAAAGTCATCAATCAAGCTCAATTATTGGTGATC
CTTTTACTGGAATCACCAGTAGGAAGAAAGACAAAGTAGATTATTCGAAAATGATTGCTGATTTATGTTATGCTTTAGCAATTGAACTCACATCTTTTGAGGCTGCACTT
AAGGATGAATACTGGATAAATGCCATGCAAGAAGAGTTACTTCCATTTAAGCGTAACAATGTATGGACCTTGGTTCCTAAACCTGACGAGGCGAATATTATAGGAACCAA
GTGGATTTTTAAAAATAAGATTGATGAATCAGGGTGTGTAACAAGGAACAAAGCTCGTTTGGTGGCTCAAGGTTATGCTCAGATAGAAGGAGTTGATTTTGATGAAACTT
TTGCACCTGTTGCCAGACTTGAAGCAATTTGCCTTCTGCTCGGTATATCTTGTATTCGCAAATTTAAATTTTATCAAATGGATGTCAAGAGAGCCTTTCTAAATGGATAC
TTGAATGAGGAACTCTATGTAGCACAACCTAAAGGGTTTGTTGATTCTAAATTCCCTCAACATGTTTACAAGATCAATAAAGCCTTGTATGTGCTAAAGCAAGCTCCTAA
AGCTTGGTATGAATGCTTAACAATTTATCTGGGTAATAAAGTATATTCCAGAGGAGGGACTGACAAGACACTATTTATTAATAGAACCAGCAGTGATCTCATTGTGGCTC
AAATTTATGTTGATGATATTATATTTAGGGGATTTCCTAAAGCACTTGTTGATAGTTTTATTGACATAATCAAATCAGAATTTGAAATGAGTATGTATGCCAAGAACATA
GTCAAGAAATTTGGTCTGGATCAGTCTCAACACAAAAGGACTCCAGCTGTGACATATGTTAAAATTACCAAGGATATTAATGGTAAAACAGTAGATCACAAATTGTACAG
GAGCATGATTGGGAGTCTCCTATATTTAACAGCAAGCAGACTTGACATTGCCTATGCTATTGAAATATGTGCTCGATTTCATTTGGATCCTCGTACTTCTCACTTGACAT
TCATTAAACGAATAATGAAGTATGTTCACGGAACAACTGACTTTAGAATTTTATATTCCTATTATACAACTTCTATACTCGTGAAATATTGTGATGCAGATTGGGCTGGT
TCCACTAATGACAGGAAAAGCACCTCTAGTGGATGTTTCTTTCTTGGAAACAATCTTATCTCATGGTTTAGTAAGAAACAAAATTGTGTATCTCTCTCTACAGCAGAGGC
TGAGTATATAGCTGCAGGGAGTGGATGTACTCAATTGATATGGATGAAAAACATGTTTCATGATACAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGATACATGCCAAAAATTTACCCTTGCATTTTTGGGCAAGAACTATTAACACAGCTTGCTACATTCACAATAGAATTACCACTCGATCTGGAACTAATGTGACCTTATA
TGAGCTGTGGAAGGGCAGAAAGCCTAATGTTAAATATTTTCATAATTTTGGAAGTACTTGTTATATTCTTGCTGACAGAGAATATCATCGAAAATGGTATGCTAAATTAG
AACAAGGTCTATTTCTTGGATATTCACAAAACAACAGAGCCTACAGAGTTTTCAACAATAGAACTGAAACAGTTATGGAAACAATTAACGTTTTGGTCAATGATTCTAAA
TATACTAATAAGCAAATTGATGATGAAGAGGACGAAGCACCTAAGGTGACTCTAGTTCCAACTTCTACATCTGTTGATGTATCCAAAGCTGATATTGAGACAACTAATTT
TGACTTAAGCTATAAATCTACATCTAAGGAAGCAACAGTTGAAGGAACTTTATCAATTCCCTCATCACATTTTCGAAAAAGTCATCAATCAAGCTCAATTATTGGTGATC
CTTTTACTGGAATCACCAGTAGGAAGAAAGACAAAGTAGATTATTCGAAAATGATTGCTGATTTATGTTATGCTTTAGCAATTGAACTCACATCTTTTGAGGCTGCACTT
AAGGATGAATACTGGATAAATGCCATGCAAGAAGAGTTACTTCCATTTAAGCGTAACAATGTATGGACCTTGGTTCCTAAACCTGACGAGGCGAATATTATAGGAACCAA
GTGGATTTTTAAAAATAAGATTGATGAATCAGGGTGTGTAACAAGGAACAAAGCTCGTTTGGTGGCTCAAGGTTATGCTCAGATAGAAGGAGTTGATTTTGATGAAACTT
TTGCACCTGTTGCCAGACTTGAAGCAATTTGCCTTCTGCTCGGTATATCTTGTATTCGCAAATTTAAATTTTATCAAATGGATGTCAAGAGAGCCTTTCTAAATGGATAC
TTGAATGAGGAACTCTATGTAGCACAACCTAAAGGGTTTGTTGATTCTAAATTCCCTCAACATGTTTACAAGATCAATAAAGCCTTGTATGTGCTAAAGCAAGCTCCTAA
AGCTTGGTATGAATGCTTAACAATTTATCTGGGTAATAAAGTATATTCCAGAGGAGGGACTGACAAGACACTATTTATTAATAGAACCAGCAGTGATCTCATTGTGGCTC
AAATTTATGTTGATGATATTATATTTAGGGGATTTCCTAAAGCACTTGTTGATAGTTTTATTGACATAATCAAATCAGAATTTGAAATGAGTATGTATGCCAAGAACATA
GTCAAGAAATTTGGTCTGGATCAGTCTCAACACAAAAGGACTCCAGCTGTGACATATGTTAAAATTACCAAGGATATTAATGGTAAAACAGTAGATCACAAATTGTACAG
GAGCATGATTGGGAGTCTCCTATATTTAACAGCAAGCAGACTTGACATTGCCTATGCTATTGAAATATGTGCTCGATTTCATTTGGATCCTCGTACTTCTCACTTGACAT
TCATTAAACGAATAATGAAGTATGTTCACGGAACAACTGACTTTAGAATTTTATATTCCTATTATACAACTTCTATACTCGTGAAATATTGTGATGCAGATTGGGCTGGT
TCCACTAATGACAGGAAAAGCACCTCTAGTGGATGTTTCTTTCTTGGAAACAATCTTATCTCATGGTTTAGTAAGAAACAAAATTGTGTATCTCTCTCTACAGCAGAGGC
TGAGTATATAGCTGCAGGGAGTGGATGTACTCAATTGATATGGATGAAAAACATGTTTCATGATACAAGATAA
Protein sequenceShow/hide protein sequence
MIHAKNLPLHFWARTINTACYIHNRITTRSGTNVTLYELWKGRKPNVKYFHNFGSTCYILADREYHRKWYAKLEQGLFLGYSQNNRAYRVFNNRTETVMETINVLVNDSK
YTNKQIDDEEDEAPKVTLVPTSTSVDVSKADIETTNFDLSYKSTSKEATVEGTLSIPSSHFRKSHQSSSIIGDPFTGITSRKKDKVDYSKMIADLCYALAIELTSFEAAL
KDEYWINAMQEELLPFKRNNVWTLVPKPDEANIIGTKWIFKNKIDESGCVTRNKARLVAQGYAQIEGVDFDETFAPVARLEAICLLLGISCIRKFKFYQMDVKRAFLNGY
LNEELYVAQPKGFVDSKFPQHVYKINKALYVLKQAPKAWYECLTIYLGNKVYSRGGTDKTLFINRTSSDLIVAQIYVDDIIFRGFPKALVDSFIDIIKSEFEMSMYAKNI
VKKFGLDQSQHKRTPAVTYVKITKDINGKTVDHKLYRSMIGSLLYLTASRLDIAYAIEICARFHLDPRTSHLTFIKRIMKYVHGTTDFRILYSYYTTSILVKYCDADWAG
STNDRKSTSSGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAAGSGCTQLIWMKNMFHDTR