CuGenDBv2

Cmc02g0052491 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0052491
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Pol
Genome location: CMiso1.1chr02:19943522..19945231
RNA-Seq Expression: Cmc02g0052491
Synteny: Cmc02g0052491
Gene Ontology terms:
    GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
    GO:0006508 - proteolysis (biological process)
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
    GO:0004190 - aspartic-type endopeptidase activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (hit, e-value, % identity, alignment)
RVW50867.1 Copia protein [Vitis vinifera] | e-value 1.4e-258 | identity 75.66%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLA+YSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FY H+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

RVW51051.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value 9.1e-258 | identity 75.66%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR G+DYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKK ++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLARYSSSPT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F   GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

RVW51062.1 Copia protein [Vitis vinifera] | e-value 3.7e-259 | identity 75.84%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YL+AIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

RVW59049.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value 1.7e-259 | identity 76.01%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

RVX16158.1 Copia protein [Vitis vinifera] | e-value 4.1e-258 | identity 75.66%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLK+GY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LK FYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A438ET65 Copia protein | e-value 6.8e-259 | identity 75.66%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLA+YSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FY H+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

A0A438ETG9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value 4.4e-258 | identity 75.66%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR G+DYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKK ++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLARYSSSPT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F   GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

A0A438ETU3 Copia protein | e-value 1.8e-259 | identity 75.84%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YL+AIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

A0A438FGC5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value 8.1e-260 | identity 76.01%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLKEGY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LKRFYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

A0A438K4P3 Copia protein | e-value 2.0e-258 | identity 75.66%
Query:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR
        TG++W+R N+V++NIFA+ VA +II  +ED EP++V+ECR+R DW KWKEAIQAELNSLTK EVFGPVV TP+ VKPVG+KWVFVRKRNENNE+ RYKAR
Subjt:  TGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKAR

Query:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY
        LVAQG SQR GIDYEETYSPV+DAIT R+LISLAV E LDM LMDV+TAYLYGS+ N+IYMKIPEGFK+P++ N+  R + SIKLQRSLYGLKQS RMWY
Subjt:  LVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWY

Query:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM
        NRLSEYLLK+GY NNPI PC+FIKKS++GFAIIAVYVDDLN++GTPEEL++   YLKKEFEMKDLGKTKFCLGLQ+EH  +G+ +HQSTY KK+LK FYM
Subjt:  NRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYM

Query:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN
        DKAHPL+ PMVVRSLDVKKD FR  E +EELLGPEV YLSAIGALMYLAN TRPDIAFSVNLLARYSS+PT+RHWNG+KH+LR+LRGT DMGLFYS +S 
Subjt:  DKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSN

Query:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK
          L+GYADAGYLSDPHK  SQTGY+F C GTAISWRSVKQTM ATSSNH+EILAIHE S+E +WLRSM  HIRE+CGLS  K  PT LFE N ACI QI 
Subjt:  FDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIK

Query:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL
        GGYIKG RTKHISPK FYTH+L+++G+I +QQI S DNL DLFTK+LPTSTF+KL+H IGMR+L+++
Subjt:  GGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGMRRLREL

SwissProt top hits (hit, e-value, % identity, alignment)
P04146 Copia protein | e-value 2.9e-81 | identity 34.34%
Query:  NRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQG
        N +N VV N      AH I ++     P S DE + R D S W+EAI  ELN+  K      +   P+    V  +WVF  K NE     RYKARLVA+G
Subjt:  NRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQG

Query:  LSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSE
         +Q+  IDYEET++PV    + R+++SL +  NL +H MDV TA+L G+LK EIYM++P+G       + NS  +C  KL +++YGLKQ+ R W+    +
Subjt:  LSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSE

Query:  YLLKEGYQNNPIYPCVFI--KKSQSGFAIIAVYVDDLNI-IGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDK
         L +  + N+ +  C++I  K + +    + +YVDD+ I  G    ++    YL ++F M DL + K  +G+++E   D I++ QS Y KKIL +F M+ 
Subjt:  YLLKEGYQNNPIYPCVFI--KKSQSGFAIIAVYVDDLNI-IGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDK

Query:  AHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFD
         + ++ P+        K  + L   +E+   P     S IG LMY+   TRPD+  +VN+L+RYSS      W  +K VLR+L+GTIDM L +     F+
Subjt:  AHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFD

Query:  --LVGYADAGYLSDPHKAISQTGYMFTC-GGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQI
          ++GY D+ +        S TGY+F       I W + +Q   A SS  AE +A+ E  +E +WL+ +         ++     P  ++E N  CI  I
Subjt:  --LVGYADAGYLSDPHKAISQTGYMFTC-GGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQI

Query:  KGGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGM
                R KHI  K  +  +  +N  I ++ I +++ L D+FTK LP + F +L   +G+
Subjt:  KGGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value 1.1e-85 | identity 34.94%
Query:  NEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITL
        ++D EP+S+ E  +  + ++  +A+Q E+ SL K   +  +V  PKG +P+  KWVF  K++ + ++ RYKARLV +G  Q+ GID++E +SPVV   ++
Subjt:  NEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITL

Query:  RYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKK-S
        R ++SLA   +L++  +DV TA+L+G L+ EIYM+ PEGF++          +C  KL +SLYGLKQ+ R WY +   ++  + Y      PCV+ K+ S
Subjt:  RYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKK-S

Query:  QSGFAIIAVYVDDLNIIGTPEEL-SKAIEYLKKEFEMKDLGKTKFCLGLQV--EHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFR
        ++ F I+ +YVDD+ I+G  + L +K    L K F+MKDLG  +  LG+++  E  +  +++ Q  Y +++L+RF M  A P++ P+       KK    
Subjt:  QSGFAIIAVYVDDLNIIGTPEEL-SKAIEYLKKEFEMKDLGKTKFCLGLQV--EHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFR

Query:  LREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTG
          E+   +   +V Y SA+G+LMY    TRPDIA +V +++R+  +P K HW  VK +LR+LRGT    L +   S+  L GY DA    D     S TG
Subjt:  LREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTG

Query:  YMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLFYTHDLE
        Y+FT  G AISW+S  Q   A S+  AE +A  ET KE +WL+           L   +    +  +  +A  + +    +   RTKHI  +  +  ++ 
Subjt:  YMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLFYTHDLE

Query:  ENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGM
        ++  + + +IS+ +N  D+ TK +P + FE     +GM
Subjt:  ENGDISIQQISSKDNLVDLFTKALPTSTFEKLVHNIGM

Q03619 Transposon Ty1-ER2 Gag-Pol polyprotein | e-value 4.9e-36 | identity 26.3%
Query:  KDWSKWKEAIQAELNSLTKCEVFG-PVVYTPKGVKP---VGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYL-----ISL
        K+  K+ +A   E+N L K + +     Y  K + P   +   ++F RKR+       +KAR VA+G      I + +TY P + + T+ +      +SL
Subjt:  KDWSKWKEAIQAELNSLTKCEVFG-PVVYTPKGVKP---VGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYL-----ISL

Query:  AVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKE-GYQNNPIYPCVFIKKSQSGFAI
        A+  N  +  +D+ +AYLY  +K E+Y++ P    + +           I+L++SLYGLKQS   WY  +  YL+K+ G +    + CVF K SQ     
Subjt:  AVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKE-GYQNNPIYPCVFIKKSQSGFAI

Query:  IAVYVDDLNIIGTPEELS-KAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVR----------SLDVKKDI
        I ++VDD+ +       + K I  LKK+++ K +   +    +Q + L   I   +  Y K  ++    +K   LN+P+  +           L + +  
Subjt:  IAVYVDDLNIIGTPEELS-KAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVR----------SLDVKKDI

Query:  FRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSN----KSNFDLVGYADAGYLSDPHK
          L ED+ ++   E+  L  IG   Y+    R D+ + +N LA++   P+K+  +    +++ +  T D  L +      K    LV  +DA Y + P+ 
Subjt:  FRLREDNEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSN----KSNFDLVGYADAGYLSDPHK

Query:  AISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLF
          SQ G ++   G  I  +S K ++T TS+  AE   IH  S+    L +++H ++E      +K L T     + + I  I     +  R +    K  
Subjt:  AISQTGYMFTCGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLF

Query:  YTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVH
           D      + +  I +K N+ D+ TK LP  TF+ L +
Subjt:  YTHDLEENGDISIQQISSKDNLVDLFTKALPTSTFEKLVH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value 7.9e-71 | identity 31.02%
Query:  KWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDV
        +W+ A+ +E+N+      +  V   P  V  VG +W+F +K N +  + RYKARLVA+G +QR G+DY ET+SPV+ + ++R ++ +AV  +  +  +DV
Subjt:  KWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDV

Query:  VTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGT-
          A+L G+L +++YM  P GF   +  N     +C  KL+++LYGLKQ+ R WY  L  YLL  G+ N+     +F+ +       + VYVDD+ I G  
Subjt:  VTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGT-

Query:  PEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGAL
        P  L   ++ L + F +KD  +  + LG++ + +  G+ + Q  Y   +L R  M  A P+  PM            +L   +   L     Y   +G+L
Subjt:  PEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGAL

Query:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTAT
         YLA  TRPDI+++VN L+++   PT+ H   +K +LR+L GT + G+F    +   L  Y+DA +  D    +S  GY+   G   ISW S KQ     
Subjt:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTAT

Query:  SSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTK
        SS  AE  ++  TS E  W+ S+   +    G+  ++  P +++  N      +    +   R KHI+    +  +  ++G + +  +S+ D L D  TK
Subjt:  SSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTK

Query:  ALPTSTFEKLVHNIGMRRL
         L  + F+     IG+ R+
Subjt:  ALPTSTFEKLVHNIGMRRL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value 5.5e-72 | identity 31.59%
Query:  KWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDV
        +W++A+ +E+N+      +  V   P  V  VG +W+F +K N +  + RYKARLVA+G +QR G+DY ET+SPV+ + ++R ++ +AV  +  +  +DV
Subjt:  KWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDV

Query:  VTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTP
          A+L G+L +E+YM  P GF          R     +L++++YGLKQ+ R WY  L  YLL  G+ N+     +F+ +       + VYVDD+ I G  
Subjt:  VTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKKSQSGFAIIAVYVDDLNIIGTP

Query:  EELSK-AIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGAL
          L K  ++ L + F +K+     + LG++ + +  G+ + Q  YT  +L R  M  A P+  PM            +L   +   L     Y   +G+L
Subjt:  EELSK-AIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFRLREDNEELLGPEVSYLSAIGAL

Query:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTAT
         YLA  TRPD++++VN L++Y   PT  HWN +K VLR+L GT D G+F    +   L  Y+DA +  D    +S  GY+   G   ISW S KQ     
Subjt:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVKQTMTAT

Query:  SSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTK
        SS  AE  ++  TS E  W+ S+   +    G+  S   P +++  N      +    +   R KHI+    +  +  ++G + +  +S+ D L D  TK
Subjt:  SSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTK

Query:  ALPTSTFEKLVHNIGM
         L    F+     IG+
Subjt:  ALPTSTFEKLVHNIGM

Arabidopsis top hits (hit, e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value 8.7e-65 | identity 32.78%
Query:  EPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLI
        EP + +E    K++  W  A+  E+ ++     +  +   P   KP+G KWV+  K N +  + RYKARLVA+G +Q+ GID+ ET+SPV    +++ ++
Subjt:  EPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLI

Query:  SLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKKSQSGFA
        +++   N  +H +D+  A+L G L  EIYMK+P G+   +  +     +C +K  +S+YGLKQ+ R W+ + S  L+  G+  +      F+K + + F 
Subjt:  SLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYPCVFIKKSQSGFA

Query:  IIAVYVDDLNIIGTPE-ELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPM---VVRSLDVKKDIFRLRED
         + VYVDD+ I    +  + +    LK  F+++DLG  K+ LGL++   A GI I Q  Y   +L    +    P ++PM   V  S     D    +  
Subjt:  IIAVYVDDLNIIGTPE-ELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPM---VVRSLDVKKDIFRLRED

Query:  NEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFT
                 +Y   IG LMYL   TR DI+F+VN L+++S +P   H   V  +L +++GT+  GLFYS+++   L  ++DA + S      S  GY   
Subjt:  NEELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFT

Query:  CGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHI
         G + ISW+S KQ + + SS  AE  A+   + E +WL      ++    L  SK  PT+LF  NTA I  I    +   RTKHI
Subjt:  CGGTAISWRSVKQTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHI

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e-value 3.0e-09 | identity 40.51%
Query:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGY
        MYL   TRPD+ F+VN L+++SS+        V  VL +++GT+  GLFYS  S+  L  +AD+ + S P    S TG+
Subjt:  MYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGY

ATMG00810.1 DNA/RNA polymerases superfamily protein | e-value 2.6e-24 | identity 29.91%
Query:  IAVYVDDLNIIGTPEE-LSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFRLREDNEEL
        + +YVDD+ + G+    L+  I  L   F MKDLG   + LG+Q++    G+F+ Q+ Y ++IL    M    P++ P+ +          +L       
Subjt:  IAVYVDDLNIIGTPEE-LSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFRLREDNEEL

Query:  LGPEVS-YLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFTCGG
          P+ S + S +GAL YL   TRPDI+++VN++ +    PT   ++ +K VLR+++GTI  GL+    S  ++  + D+ +        S TG+    G 
Subjt:  LGPEVS-YLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFTCGG

Query:  TAISWRSVKQTMTATSSNHAEILAIHETSKEYVW
          ISW + +Q   + SS   E  A+  T+ E  W
Subjt:  TAISWRSVKQTMTATSSNHAEILAIHETSKEYVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 2.1e-10 | %identity: 38.83
Query:  EPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLI
        EPKSV        W    +A+Q EL++L++ + +  +V  P     +G KWVF  K + +  + R KARLVA+G  Q  GI + ETYSPVV   T+R ++
Subjt:  EPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQRSGIDYEETYSPVVDAITLRYLI

Query:  SLA
        ++A
Subjt:  SLA
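
The alignment blocks above can be summarised from the middle (match) line alone. This matters here because the Subjt rows in this record appear to repeat the Query rows, so they say nothing about the hit. The sketch below assumes the usual BLAST convention: a letter on the match line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `midline_stats` is hypothetical, and the input strings are assumed to have the `Query:  ` prefix and its matching indentation already removed.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one aligned block from its query row and match line."""
    length = len(query)
    # Trailing spaces on the match line are often stripped, so pad it back.
    midline = midline.ljust(length)[:length]
    # A letter on the match line that equals the query residue is an identity.
    identities = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    # Any non-space match character (letter or '+') counts as a positive.
    positives = sum(1 for m in midline if m != " ")
    pct = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": pct,
    }


# The final three-residue block of the ATMG00820.1 alignment.
stats = midline_stats("SLA", "++A")
print(stats)
```

To recover a whole-hit figure comparable to the %identity column, the identity and length counts would need to be summed across every block of that hit before dividing.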


Sequences
CDS sequence
ATGACAGGAAAAAGGTGGAATAGAATTAATGTAGTTGTGGACAACATTTTTGCGTATAATGTTGCACATAATATCATTCATGAAAATGAGGATTATGAACCTAAATCTGT
TGACGAATGTCGTAATAGAAAGGATTGGTCCAAGTGGAAAGAAGCCATCCAAGCAGAACTAAACTCACTCACGAAATGTGAAGTTTTTGGACCTGTAGTTTATACACCTA
AAGGTGTAAAACCTGTGGGATTTAAATGGGTATTTGTGCGTAAACGAAATGAAAATAATGAGGTCACTAGATATAAAGCACGACTTGTTGCACAAGGATTGTCTCAAAGA
TCAGGCATTGATTATGAGGAAACATATTCTCCTGTGGTGGATGCTATTACATTAAGATATTTAATTAGTCTGGCTGTATGTGAAAATCTTGATATGCATCTTATGGATGT
AGTTACAGCATACTTATATGGATCTTTGAAAAATGAAATCTATATGAAAATCCCTGAAGGATTTAAGATACCTGAATCATATAATTCAAACTCTAGAGAATTATGTTCAA
TCAAATTACAAAGATCATTATATGGATTGAAACAATCAAGACGAATGTGGTACAATCGCTTAAGTGAATATTTATTGAAAGAAGGTTATCAAAATAATCCAATTTATCCA
TGTGTTTTTATTAAGAAATCACAGTCAGGATTTGCGATTATAGCTGTATATGTTGATGATTTAAATATAATTGGAACTCCTGAAGAGCTTTCAAAGGCAATAGAATATCT
CAAAAAGGAATTTGAAATGAAAGATCTTGGTAAGACAAAATTTTGCCTTGGCTTACAAGTCGAGCATTTAGCCGATGGAATTTTTATTCATCAATCAACATATACAAAAA
AGATTTTAAAAAGATTCTACATGGACAAAGCACACCCATTAAACATTCCAATGGTGGTTCGATCACTAGATGTAAAAAAGGATATCTTTCGACTTCGAGAAGATAATGAA
GAATTACTTGGTCCTGAAGTATCGTACCTTAGTGCAATAGGTGCACTAATGTATCTTGCTAATAACACAAGACCAGATATAGCATTTTCAGTAAATTTGTTAGCAAGATA
TAGTTCTTCTCCAACAAAAAGACACTGGAATGGAGTTAAGCATGTACTTCGTCATCTGCGAGGGACAATTGATATGGGTTTGTTTTATTCAAATAAATCAAACTTTGATC
TAGTTGGTTATGCGGATGCTGGATATTTATCTGATCCACACAAAGCAATATCTCAAACAGGTTATATGTTTACATGTGGAGGAACTGCTATATCTTGGCGATCTGTAAAG
CAAACCATGACGGCCACTTCATCGAATCATGCAGAAATTCTTGCAATTCATGAAACTAGTAAAGAATATGTATGGTTGAGGTCAATGACTCATCATATTCGAGAAACATG
TGGTTTGTCTTTCAGTAAAAATTTACCAACAATATTATTTGAACATAATACCGCATGTATAGTACAAATCAAAGGAGGGTATATAAAAGGAGGTAGAACAAAGCATATCT
CACCAAAACTCTTCTATACGCATGACCTTGAAGAAAATGGTGACATCAGTATTCAACAAATTTCTTCAAAAGACAACTTGGTGGACTTATTCACAAAAGCATTACCCACA
TCAACATTTGAAAAGCTAGTGCACAACATTGGAATGCGACGACTCAGAGAACTTAAGTGA
mRNA sequence
ATGACAGGAAAAAGGTGGAATAGAATTAATGTAGTTGTGGACAACATTTTTGCGTATAATGTTGCACATAATATCATTCATGAAAATGAGGATTATGAACCTAAATCTGT
TGACGAATGTCGTAATAGAAAGGATTGGTCCAAGTGGAAAGAAGCCATCCAAGCAGAACTAAACTCACTCACGAAATGTGAAGTTTTTGGACCTGTAGTTTATACACCTA
AAGGTGTAAAACCTGTGGGATTTAAATGGGTATTTGTGCGTAAACGAAATGAAAATAATGAGGTCACTAGATATAAAGCACGACTTGTTGCACAAGGATTGTCTCAAAGA
TCAGGCATTGATTATGAGGAAACATATTCTCCTGTGGTGGATGCTATTACATTAAGATATTTAATTAGTCTGGCTGTATGTGAAAATCTTGATATGCATCTTATGGATGT
AGTTACAGCATACTTATATGGATCTTTGAAAAATGAAATCTATATGAAAATCCCTGAAGGATTTAAGATACCTGAATCATATAATTCAAACTCTAGAGAATTATGTTCAA
TCAAATTACAAAGATCATTATATGGATTGAAACAATCAAGACGAATGTGGTACAATCGCTTAAGTGAATATTTATTGAAAGAAGGTTATCAAAATAATCCAATTTATCCA
TGTGTTTTTATTAAGAAATCACAGTCAGGATTTGCGATTATAGCTGTATATGTTGATGATTTAAATATAATTGGAACTCCTGAAGAGCTTTCAAAGGCAATAGAATATCT
CAAAAAGGAATTTGAAATGAAAGATCTTGGTAAGACAAAATTTTGCCTTGGCTTACAAGTCGAGCATTTAGCCGATGGAATTTTTATTCATCAATCAACATATACAAAAA
AGATTTTAAAAAGATTCTACATGGACAAAGCACACCCATTAAACATTCCAATGGTGGTTCGATCACTAGATGTAAAAAAGGATATCTTTCGACTTCGAGAAGATAATGAA
GAATTACTTGGTCCTGAAGTATCGTACCTTAGTGCAATAGGTGCACTAATGTATCTTGCTAATAACACAAGACCAGATATAGCATTTTCAGTAAATTTGTTAGCAAGATA
TAGTTCTTCTCCAACAAAAAGACACTGGAATGGAGTTAAGCATGTACTTCGTCATCTGCGAGGGACAATTGATATGGGTTTGTTTTATTCAAATAAATCAAACTTTGATC
TAGTTGGTTATGCGGATGCTGGATATTTATCTGATCCACACAAAGCAATATCTCAAACAGGTTATATGTTTACATGTGGAGGAACTGCTATATCTTGGCGATCTGTAAAG
CAAACCATGACGGCCACTTCATCGAATCATGCAGAAATTCTTGCAATTCATGAAACTAGTAAAGAATATGTATGGTTGAGGTCAATGACTCATCATATTCGAGAAACATG
TGGTTTGTCTTTCAGTAAAAATTTACCAACAATATTATTTGAACATAATACCGCATGTATAGTACAAATCAAAGGAGGGTATATAAAAGGAGGTAGAACAAAGCATATCT
CACCAAAACTCTTCTATACGCATGACCTTGAAGAAAATGGTGACATCAGTATTCAACAAATTTCTTCAAAAGACAACTTGGTGGACTTATTCACAAAAGCATTACCCACA
TCAACATTTGAAAAGCTAGTGCACAACATTGGAATGCGACGACTCAGAGAACTTAAGTGA
Protein sequence
MTGKRWNRINVVVDNIFAYNVAHNIIHENEDYEPKSVDECRNRKDWSKWKEAIQAELNSLTKCEVFGPVVYTPKGVKPVGFKWVFVRKRNENNEVTRYKARLVAQGLSQR
SGIDYEETYSPVVDAITLRYLISLAVCENLDMHLMDVVTAYLYGSLKNEIYMKIPEGFKIPESYNSNSRELCSIKLQRSLYGLKQSRRMWYNRLSEYLLKEGYQNNPIYP
CVFIKKSQSGFAIIAVYVDDLNIIGTPEELSKAIEYLKKEFEMKDLGKTKFCLGLQVEHLADGIFIHQSTYTKKILKRFYMDKAHPLNIPMVVRSLDVKKDIFRLREDNE
ELLGPEVSYLSAIGALMYLANNTRPDIAFSVNLLARYSSSPTKRHWNGVKHVLRHLRGTIDMGLFYSNKSNFDLVGYADAGYLSDPHKAISQTGYMFTCGGTAISWRSVK
QTMTATSSNHAEILAIHETSKEYVWLRSMTHHIRETCGLSFSKNLPTILFEHNTACIVQIKGGYIKGGRTKHISPKLFYTHDLEENGDISIQQISSKDNLVDLFTKALPT
STFEKLVHNIGMRRLRELK
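
The relationship between the CDS and protein sequences listed above can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code and a CDS read in frame from its first base. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS, and its final five codons.
start = translate("ATGACAGGAAAAAGGTGGAATAGA")
end = translate("AGAGAACTTAAGTGA")
print(start, end)
```

Joining the wrapped CDS lines into one string before calling `translate` should reproduce the full protein sequence, provided the assumptions above hold for this gene model.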