CuGenDBv2

Cmc04g0106201 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0106201
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr04:24128631..24131846
RNA-Seq Expression: Cmc04g0106201
Synteny: Cmc04g0106201
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
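
The genome location string in this record (CMiso1.1chr04:24128631..24131846) can be split into a sequence ID and a coordinate pair. The sketch below is only an illustration: the names `Location` and `parse_location` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates; this is an assumption,
        # not something the record states.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'CMiso1.1chr04:24128631..24131846'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


if __name__ == "__main__":
    loc = parse_location("CMiso1.1chr04:24128631..24131846")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3216 bp on CMiso1.1chr04.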


Homology
GenBank top hits (e value, % identity, alignment)
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera] | e value: 1.5e-265 | % identity: 48.39
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEAC
        +D  T  VL+ G + DGLY F    SH  L  + S +K       S  + V   S +   DLWH+RLGHP    +K  L+  + +   INK+  NFC +C
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEAC

Query:  ALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKP
         LGK H  PFS S T YT PL+LI  DLWGP + +S++G+RYYI FVDA+SR++WI+ L  KS+A   F  FKT VE      IKSLQTD G EF+ F+ 
Subjt:  ALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKP

Query:  FLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPY
        +L ++GI HR++CP+T +QNG+ ERKHR I+E GLTLL  A+LPL FWDE+F T VYL NRLPT +L +  P+E LF   P++  L+VFGC C+P LRPY
Subjt:  FLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPY

Query:  QSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN
         +HKL  RS  CTFLGYS  HKGYKC++S+GR++IS  V+F+E SFPY+                 SH S   S  VLSP        P+ S  P S M+
Subjt:  QSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN

Query:  -----HNEDRRHTDTVFDNTDHLNPTIVYPLE---------TGTQESPRDDGNS--------GGITQSPSPMEPLHQTDS---AFNHPHWKKVMEEEFEA
             H       DT       ++  +  P++         + T+   +D  N+         GI +    +  + +  S   A     WKK M  E++A
Subjt:  -----HNEDRRHTDTVFDNTDHLNPTIVYPLE---------TGTQESPRDDGNS--------GGITQSPSPMEPLHQTDS---AFNHPHWKKVMEEEFEA

Query:  LQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENV
        LQ+N TWSL P    ++ +GCKWV+K K N DG++ +YKARLVAKGFHQ    D+ ETFSPVVKP T+R+  TIA+ + W+I+QLDVNNAFL+G+L E V
Subjt:  LQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENV

Query:  YMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKD
        +M+QP GF  + +  +VC L KALYGLKQAPRAW+EKL  +L S GF ++K+D SL +  TP    YVL+ VDD++++ S    + SL+  LNS+F+LKD
Subjt:  YMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKD

Query:  LGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKH
        LG++ YFLG++VS+ TN GL LSQ+KYI DLLQ+TKM+  KP  TP+ +G  L    G+P  D+H YRS VGALQY T+T PE+S+SVNK CQFM  P  
Subjt:  LGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKH

Query:  THWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDL
         HW++VKRILRYL+G L HGL  +KS N+ L+GF DADWASD DDR+STSG CV+ G NL+ W SKK  I+SRSS E EYR LA L  E+ W+ SLL++L
Subjt:  THWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDL

Query:  YIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG
         + L  PP++WCDNLS V LSANP+LH++TK +E D+YFVR+ + + ++ +RH+P+ +Q+AD+LTK +S+  F   ++KL + + +T+ L+G
Subjt:  YIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG

KAA0052371.1 putative mitochondrial protein [Cucumis melo var. makuwa] | e value: 3.1e-263 | % identity: 91.18
Query:  MEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLH
        MEEEFEALQKN TW LTPQNPNQKIVGCKWVFKIKRNS G I+RYKARLVAKGFHQTPNIDYNETFSPVVK +TI M LTIAIMKGWSIRQLDVNNAFLH
Subjt:  MEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLH

Query:  GNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLN
        GNLDENVYM+Q FGFEVKSSYPMVCHLKKALYGLKQAPRAWYE LS  LHSLGFRTSKADTSLLI VTPT CCY LI VDDLIIM SS+KDVNSLVHSLN
Subjt:  GNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLN

Query:  SQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQ
        SQFALKDLGKLSYFLGVEVSYPTNG LFLSQSKYITDLLQRTKML+AKPISTPMVSGPLLSAFQGEPFHDVHL RSVVGALQYATLTHPEISYSVNKACQ
Subjt:  SQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQ

Query:  FMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWI
        FMHTPKHTHWQLVKRILRYLKGVLYHGLW  KSDN SLVGFADADWASDPDDRKSTSG CVYFGNNLV WGSKK SIISRSSTEAEYRCLALLATE+VWI
Subjt:  FMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWI

Query:  CSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG
         SLLNDLYIDLPFPPIL CDNLSAVH SANPILHSKTK VE DIYFVRDLI+K KL +RHLPATEQIADILTKPLSAQSFH LKN +TVIDS  IGLQG
Subjt:  CSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 1.3e-269 | % identity: 49.95
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK
        +D     +LLQG L+ GLY+F +      K    S SN K+             N+  P+   S+  + DLWH+RLGHP   IV   LN       T + 
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK

Query:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT
         + C AC LGK H LPF  S T+YT PLQL+  DLWGPA   S  GF YY+SFVDAYSRYTW+YFL  KS    AF  FK   E   G  +K+ QTD G 
Subjt:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT

Query:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC
        EF+  K + +Q+GI HR++CP+TSKQNGI+ERKHRHI+E+GLTLL+QA+LPL +W +AFST+V+LINRLPT VL    P E LF  KPN+  L+VFGC C
Subjt:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC

Query:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP
        +P+LRPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A          SHS+     IP   N+        LS P  S   
Subjt:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP

Query:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP
        S  ++ N   D R       NTD       LN +                PL T + E P +  N+  +T    P                      +E 
Subjt:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP

Query:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM
         +    A +HP WK+ M+EEF AL KN TWSL     N+  VGC+WVFK+KRN DGS+SRYKARLVAKG+ Q P  D+ ETFSPVVKP TIR+ L IA+ 
Subjt:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM

Query:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL
        + W IRQLDVNNAFL+G L E VYM QP GF+ K++    +VC L KALYGLKQAPRAW++KL  SL   GF ++K+D SL +  T     +VL+ VDD+
Subjt:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL

Query:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ
        ++  SS ++++ L+  L   F+LKDLG+LSYFLG+EV    +GGL LSQ KYI DLL++TKM  AK + TPM+SG  LSA  G+P  +V  YRSVVGALQ
Subjt:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ

Query:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS
        Y T+T PEI++SVNK CQFM  P  THW+ VKRILRYL G    G+  + S+ M+LVGF DADW SD DDR+STSG CV+ G +LV W SKK    SRSS
Subjt:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS

Query:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT
        TEAEYR LA L +E++W+ SLL++L   +   P++WCDN+S V LSANP+LHS+TK +E D+YFVR+ + + KL + H+P  +Q+AD+ TKPLS + F  
Subjt:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT

Query:  LKNKLTV
        L+ KLTV
Subjt:  LKNKLTV

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 6.6e-261 | % identity: 48.96
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK
        +D     +LLQG L+ GLY+F +      K    S SN K+             N+  P+   S+  + DLWH+RLGHP   IV   LN       T + 
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK

Query:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT
         + C AC LGK H LPF  S T+YT PLQL+  DLWGPA   S  GF YY+SFVDAYSRYTW+YFL  KS    AF  FK   E   G  +K+ QTD G 
Subjt:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT

Query:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC
        EF+  K + +Q+GI HR++CP+TSKQNGI+ERKHRHI+E+GLTLL+QA+LPL +W +AFST+V+LINRLPT VL    P E LF  KPN+  L+VFGC C
Subjt:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC

Query:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP
        +P+LRPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A          SHS+     IP   N+        LS P  S   
Subjt:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP

Query:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP
        S  ++ N   D R       NTD       LN +                PL T + E P +  N+  +T    P                      +E 
Subjt:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP

Query:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM
         +    A +HP WK+ M+EEF AL KN TWSL     N+  VGC+WVFK+KRN DGS+SRYKARLVAKG+ Q P  D+ ETFSPVVKP TIR+ L IA+ 
Subjt:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM

Query:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL
        + W IRQLDVNNAFL+G L E VYM QP GF+ K++    +VC L KALYGLKQAPRAW++KL  SL   GF ++K+D SL +  T     +VL+ VDD+
Subjt:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL

Query:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ
        ++  SS ++++ L+  L   F+LKDLG+LSYFLG+E                  DLL++TKM  AK + TPM+SG  LSA  G+P  +V  YRSVVGALQ
Subjt:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ

Query:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS
        Y T+T PEI++SVNK CQFM  P  THW+ VKRILRYL G    G+  + S+ M+LVGF DADW SD DDR+STSG CV+ G +LV W SKK    SRSS
Subjt:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS

Query:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT
        TEAEYR LA L +E++W+ SLL++L   +   P++WCDN+S V LSANP+LHS+TK +E D+YFVR+ + + KL + H+P  +Q+AD+ TKPLS + F  
Subjt:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT

Query:  LKNKLTV
        L+ KLTV
Subjt:  LKNKLTV

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 6.3e-264 | % identity: 48.98
Query:  GLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEACALGKHHALPFS
        G + DGLY F    SH  L  + S +K       S  + V   S +   DLWH+RLG P    +K  L+  + +   INK+  NFC +C LGK H  PFS
Subjt:  GLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEACALGKHHALPFS

Query:  HSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRI
         S T YT PL+LI  DLWGPA  +S++G+RYYI FVDA+SR++WI+ L  KS+A   F  FKT VE      IKSLQTD G EF+ F+ +L ++GI HR+
Subjt:  HSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRI

Query:  TCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTP
        +CP+T +QNG+ ERKHR I+E GLTLL   +LPL FWDE+F T VYL NRLPT VL +  P+E LF   P++  L+VFGC C+P LRPY +HKL  RS  
Subjt:  TCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTP

Query:  CTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN-----HNEDRR
        CTFLGYS  HKGYKC++S+GR++ISR V+F+E SFPY+                 SH S   S  VLSP        P+ S  P S M+     H     
Subjt:  CTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN-----HNEDRR

Query:  HTDTVFDNTDHLNPTIVYPLE---------TGTQESPRD-DGNSGGITQSPSPM-----------EPLHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLT
          DT       ++  +  P++         + T+   +D D     IT++ S +           EP     +A     WKK M  E++ALQ+N TWSL 
Subjt:  HTDTVFDNTDHLNPTIVYPLE---------TGTQESPRD-DGNSGGITQSPSPM-----------EPLHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLT

Query:  PQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEV
        P    ++ +GCKWV+K K N DG++ +YKARLVAKGFHQ    D+ ETFSPVVKP TIR+  TIA+ + W+I+QLDVNNAFL+G+L E V+M+QP GF  
Subjt:  PQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEV

Query:  KSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGV
        + +  +VC L KALYGLKQAPRAW+EKL  +L S GF ++K+D SL +  TP+   YVL+ VDD++++ S    + SL+  LNS+F+LKDLG++ YFLG+
Subjt:  KSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGV

Query:  EVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRIL
        +VS+ TN GL LSQ+KYI DLLQ+TKM+  KP  TP+ +G  L A  G+P  D+H YRS VGALQY T+T PE+S+SVNK CQFM  P   HW+ VKRIL
Subjt:  EVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRIL

Query:  RYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPIL
        RYL+G L HGL  +KS N+ L+GF DADWASD DDR+STSG CV+ G NL+ W SKK   +SRSSTEAEYR LA L  E+ W+ SLL++L + L  PP++
Subjt:  RYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPIL

Query:  WCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG
        WCDNLS V LSANP+LH++TK +E D+YFV + + + ++ +RH+P+ +Q+AD+LTK +S+  F   ++KL + + +T+ L+G
Subjt:  WCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG
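
In the alignment blocks above, the unlabelled middle row follows the usual BLAST convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the rows of one block could be tallied as below. The function name `alignment_stats` is hypothetical, and the percentage it returns is computed over the rows passed in, so it need not match the %identity column, which is reported for the whole hit.

```python
def alignment_stats(query: str, midline: str, subject: str) -> dict:
    """Tally identities, positives and gaps for one aligned block.

    query, midline and subject are the three rows of a block with the
    'Query:' / 'Subjt:' labels and their padding already removed.
    """
    if len(query) != len(subject):
        raise ValueError("query and subject rows must have equal length")
    # Trailing spaces of the middle row are often lost when a page is
    # copied as text, so pad (or trim) it to the alignment length.
    midline = midline.ljust(len(query))[: len(query)]

    identities = sum(1 for m in midline if m not in " +")
    positives = identities + midline.count("+")
    gaps = sum(1 for q, s in zip(query, subject) if q == "-" or s == "-")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
        "percent_identity": round(100.0 * identities / length, 2),
    }


if __name__ == "__main__":
    # Synthetic rows for illustration only; not taken from this record.
    print(alignment_stats("MEEEF-K", "ME+ F K", "MEDAFGK"))
```

To get a figure for a whole hit, the rows of all its blocks would be concatenated before calling the function.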

TrEMBL top hits (e value, % identity, alignment)
A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 6.4e-270 | % identity: 49.95
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK
        +D     +LLQG L+ GLY+F +      K    S SN K+             N+  P+   S+  + DLWH+RLGHP   IV   LN       T + 
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK

Query:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT
         + C AC LGK H LPF  S T+YT PLQL+  DLWGPA   S  GF YY+SFVDAYSRYTW+YFL  KS    AF  FK   E   G  +K+ QTD G 
Subjt:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT

Query:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC
        EF+  K + +Q+GI HR++CP+TSKQNGI+ERKHRHI+E+GLTLL+QA+LPL +W +AFST+V+LINRLPT VL    P E LF  KPN+  L+VFGC C
Subjt:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC

Query:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP
        +P+LRPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A          SHS+     IP   N+        LS P  S   
Subjt:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP

Query:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP
        S  ++ N   D R       NTD       LN +                PL T + E P +  N+  +T    P                      +E 
Subjt:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP

Query:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM
         +    A +HP WK+ M+EEF AL KN TWSL     N+  VGC+WVFK+KRN DGS+SRYKARLVAKG+ Q P  D+ ETFSPVVKP TIR+ L IA+ 
Subjt:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM

Query:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL
        + W IRQLDVNNAFL+G L E VYM QP GF+ K++    +VC L KALYGLKQAPRAW++KL  SL   GF ++K+D SL +  T     +VL+ VDD+
Subjt:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL

Query:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ
        ++  SS ++++ L+  L   F+LKDLG+LSYFLG+EV    +GGL LSQ KYI DLL++TKM  AK + TPM+SG  LSA  G+P  +V  YRSVVGALQ
Subjt:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ

Query:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS
        Y T+T PEI++SVNK CQFM  P  THW+ VKRILRYL G    G+  + S+ M+LVGF DADW SD DDR+STSG CV+ G +LV W SKK    SRSS
Subjt:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS

Query:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT
        TEAEYR LA L +E++W+ SLL++L   +   P++WCDN+S V LSANP+LHS+TK +E D+YFVR+ + + KL + H+P  +Q+AD+ TKPLS + F  
Subjt:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT

Query:  LKNKLTV
        L+ KLTV
Subjt:  LKNKLTV

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.2e-261 | % identity: 48.96
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK
        +D     +LLQG L+ GLY+F +      K    S SN K+             N+  P+   S+  + DLWH+RLGHP   IV   LN       T + 
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF-----------NTVVPK---SNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINK

Query:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT
         + C AC LGK H LPF  S T+YT PLQL+  DLWGPA   S  GF YY+SFVDAYSRYTW+YFL  KS    AF  FK   E   G  +K+ QTD G 
Subjt:  LNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGT

Query:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC
        EF+  K + +Q+GI HR++CP+TSKQNGI+ERKHRHI+E+GLTLL+QA+LPL +W +AFST+V+LINRLPT VL    P E LF  KPN+  L+VFGC C
Subjt:  EFKPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKC

Query:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP
        +P+LRPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A          SHS+     IP   N+        LS P  S   
Subjt:  YPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA-------SFASHSS-----IPKSNNV--------LSPPLHSIIP

Query:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP
        S  ++ N   D R       NTD       LN +                PL T + E P +  N+  +T    P                      +E 
Subjt:  SSLMNHN--EDRRHTDTVFDNTDH------LNPTI-------------VYPLETGTQESPRDDGNSGGITQSPSP----------------------MEP

Query:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM
         +    A +HP WK+ M+EEF AL KN TWSL     N+  VGC+WVFK+KRN DGS+SRYKARLVAKG+ Q P  D+ ETFSPVVKP TIR+ L IA+ 
Subjt:  LHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIM

Query:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL
        + W IRQLDVNNAFL+G L E VYM QP GF+ K++    +VC L KALYGLKQAPRAW++KL  SL   GF ++K+D SL +  T     +VL+ VDD+
Subjt:  KGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSS--YPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDL

Query:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ
        ++  SS ++++ L+  L   F+LKDLG+LSYFLG+E                  DLL++TKM  AK + TPM+SG  LSA  G+P  +V  YRSVVGALQ
Subjt:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQ

Query:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS
        Y T+T PEI++SVNK CQFM  P  THW+ VKRILRYL G    G+  + S+ M+LVGF DADW SD DDR+STSG CV+ G +LV W SKK    SRSS
Subjt:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSS

Query:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT
        TEAEYR LA L +E++W+ SLL++L   +   P++WCDN+S V LSANP+LHS+TK +E D+YFVR+ + + KL + H+P  +Q+AD+ TKPLS + F  
Subjt:  TEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHT

Query:  LKNKLTV
        L+ KLTV
Subjt:  LKNKLTV

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.1e-264 | % identity: 48.98
Query:  GLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEACALGKHHALPFS
        G + DGLY F    SH  L  + S +K       S  + V   S +   DLWH+RLG P    +K  L+  + +   INK+  NFC +C LGK H  PFS
Subjt:  GLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEACALGKHHALPFS

Query:  HSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRI
         S T YT PL+LI  DLWGPA  +S++G+RYYI FVDA+SR++WI+ L  KS+A   F  FKT VE      IKSLQTD G EF+ F+ +L ++GI HR+
Subjt:  HSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRI

Query:  TCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTP
        +CP+T +QNG+ ERKHR I+E GLTLL   +LPL FWDE+F T VYL NRLPT VL +  P+E LF   P++  L+VFGC C+P LRPY +HKL  RS  
Subjt:  TCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTP

Query:  CTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN-----HNEDRR
        CTFLGYS  HKGYKC++S+GR++ISR V+F+E SFPY+                 SH S   S  VLSP        P+ S  P S M+     H     
Subjt:  CTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN-----HNEDRR

Query:  HTDTVFDNTDHLNPTIVYPLE---------TGTQESPRD-DGNSGGITQSPSPM-----------EPLHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLT
          DT       ++  +  P++         + T+   +D D     IT++ S +           EP     +A     WKK M  E++ALQ+N TWSL 
Subjt:  HTDTVFDNTDHLNPTIVYPLE---------TGTQESPRD-DGNSGGITQSPSPM-----------EPLHQTDSAFNHPHWKKVMEEEFEALQKNGTWSLT

Query:  PQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEV
        P    ++ +GCKWV+K K N DG++ +YKARLVAKGFHQ    D+ ETFSPVVKP TIR+  TIA+ + W+I+QLDVNNAFL+G+L E V+M+QP GF  
Subjt:  PQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEV

Query:  KSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGV
        + +  +VC L KALYGLKQAPRAW+EKL  +L S GF ++K+D SL +  TP+   YVL+ VDD++++ S    + SL+  LNS+F+LKDLG++ YFLG+
Subjt:  KSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGV

Query:  EVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRIL
        +VS+ TN GL LSQ+KYI DLLQ+TKM+  KP  TP+ +G  L A  G+P  D+H YRS VGALQY T+T PE+S+SVNK CQFM  P   HW+ VKRIL
Subjt:  EVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRIL

Query:  RYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPIL
        RYL+G L HGL  +KS N+ L+GF DADWASD DDR+STSG CV+ G NL+ W SKK   +SRSSTEAEYR LA L  E+ W+ SLL++L + L  PP++
Subjt:  RYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPIL

Query:  WCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG
        WCDNLS V LSANP+LH++TK +E D+YFV + + + ++ +RH+P+ +Q+AD+LTK +S+  F   ++KL + + +T+ L+G
Subjt:  WCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG

A0A5A7UFS3 Putative mitochondrial protein | e value: 1.5e-263 | % identity: 91.18
Query:  MEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLH
        MEEEFEALQKN TW LTPQNPNQKIVGCKWVFKIKRNS G I+RYKARLVAKGFHQTPNIDYNETFSPVVK +TI M LTIAIMKGWSIRQLDVNNAFLH
Subjt:  MEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLH

Query:  GNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLN
        GNLDENVYM+Q FGFEVKSSYPMVCHLKKALYGLKQAPRAWYE LS  LHSLGFRTSKADTSLLI VTPT CCY LI VDDLIIM SS+KDVNSLVHSLN
Subjt:  GNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLN

Query:  SQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQ
        SQFALKDLGKLSYFLGVEVSYPTNG LFLSQSKYITDLLQRTKML+AKPISTPMVSGPLLSAFQGEPFHDVHL RSVVGALQYATLTHPEISYSVNKACQ
Subjt:  SQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQ

Query:  FMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWI
        FMHTPKHTHWQLVKRILRYLKGVLYHGLW  KSDN SLVGFADADWASDPDDRKSTSG CVYFGNNLV WGSKK SIISRSSTEAEYRCLALLATE+VWI
Subjt:  FMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWI

Query:  CSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG
         SLLNDLYIDLPFPPIL CDNLSAVH SANPILHSKTK VE DIYFVRDLI+K KL +RHLPATEQIADILTKPLSAQSFH LKN +TVIDS  IGLQG
Subjt:  CSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG

A5BFT3 Integrase catalytic domain-containing protein | e value: 7.3e-266 | % identity: 48.39
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEAC
        +D  T  VL+ G + DGLY F    SH  L  + S +K       S  + V   S +   DLWH+RLGHP    +K  L+  + +   INK+  NFC +C
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTK-------SVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKL--NFCEAC

Query:  ALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKP
         LGK H  PFS S T YT PL+LI  DLWGP + +S++G+RYYI FVDA+SR++WI+ L  KS+A   F  FKT VE      IKSLQTD G EF+ F+ 
Subjt:  ALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKP

Query:  FLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPY
        +L ++GI HR++CP+T +QNG+ ERKHR I+E GLTLL  A+LPL FWDE+F T VYL NRLPT +L +  P+E LF   P++  L+VFGC C+P LRPY
Subjt:  FLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPY

Query:  QSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN
         +HKL  RS  CTFLGYS  HKGYKC++S+GR++IS  V+F+E SFPY+                 SH S   S  VLSP        P+ S  P S M+
Subjt:  QSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS--------------FASHSSIPKSNNVLSP--------PLHSIIPSSLMN

Query:  -----HNEDRRHTDTVFDNTDHLNPTIVYPLE---------TGTQESPRDDGNS--------GGITQSPSPMEPLHQTDS---AFNHPHWKKVMEEEFEA
             H       DT       ++  +  P++         + T+   +D  N+         GI +    +  + +  S   A     WKK M  E++A
Subjt:  -----HNEDRRHTDTVFDNTDHLNPTIVYPLE---------TGTQESPRDDGNS--------GGITQSPSPMEPLHQTDS---AFNHPHWKKVMEEEFEA

Query:  LQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENV
        LQ+N TWSL P    ++ +GCKWV+K K N DG++ +YKARLVAKGFHQ    D+ ETFSPVVKP T+R+  TIA+ + W+I+QLDVNNAFL+G+L E V
Subjt:  LQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENV

Query:  YMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKD
        +M+QP GF  + +  +VC L KALYGLKQAPRAW+EKL  +L S GF ++K+D SL +  TP    YVL+ VDD++++ S    + SL+  LNS+F+LKD
Subjt:  YMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKD

Query:  LGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKH
        LG++ YFLG++VS+ TN GL LSQ+KYI DLLQ+TKM+  KP  TP+ +G  L    G+P  D+H YRS VGALQY T+T PE+S+SVNK CQFM  P  
Subjt:  LGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKH

Query:  THWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDL
         HW++VKRILRYL+G L HGL  +KS N+ L+GF DADWASD DDR+STSG CV+ G NL+ W SKK  I+SRSS E EYR LA L  E+ W+ SLL++L
Subjt:  THWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDL

Query:  YIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG
         + L  PP++WCDNLS V LSANP+LH++TK +E D+YFVR+ + + ++ +RH+P+ +Q+AD+LTK +S+  F   ++KL + + +T+ L+G
Subjt:  YIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQG

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 1.8e-112 | % identity: 29.33
Query:  LWHRRLGH-PHLPIVKAFLNHIDHSSGTINKL----NFCEACALGKHHALPFSH--SLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWI
        LWH R GH     +++    ++      +N L      CE C  GK   LPF      T    PL ++  D+ GP   V+ +   Y++ FVD ++ Y   
Subjt:  LWHRRLGH-PHLPIVKAFLNHIDHSSGTINKL----NFCEACALGKHHALPFSH--SLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWI

Query:  YFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFST
        Y + +KSD F  FQ F    E      +  L  D G E+     + F  + GI + +T P+T + NG+ ER  R I E   T++S A L  SFW EA  T
Subjt:  YFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFST

Query:  SVYLINRLPTPVLDNIS--PLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSF-----
        + YLINR+P+  L + S  P E    +KP    LRVFG   Y +++  Q  K   +S    F+GY  +  G+K   A + +  ++R V+ DE +      
Subjt:  SVYLINRLPTPVLDNIS--PLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSF-----

Query:  -PYASFASHSSIPKSNNVLSPPLHSIIPSSLMNH----------------------NEDRRHTDTVFDN------------------------------T
          + +     S    N         II +   N                       N+ R+   T F N                               
Subjt:  -PYASFASHSSIPKSNNVLSPPLHSIIPSSLMNH----------------------NEDRRHTDTVFDN------------------------------T

Query:  DHLN--------------PTIVYPLETGTQESPRDDG----NSGGITQSPSPMEPLHQTDSAFN-------------------------HPHWKKVMEEE
        DHLN               T  +  E G     ++DG    N         P    ++ D++ N                            W++ +  E
Subjt:  DHLN--------------PTIVYPLETGTQESPRDDG----NSGGITQSPSPMEPLHQTDSAFN-------------------------HPHWKKVMEEE

Query:  FEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLD
          A + N TW++T +  N+ IV  +WVF +K N  G+  RYKARLVA+GF Q   IDY ETF+PV +  + R  L++ I     + Q+DV  AFL+G L 
Subjt:  FEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLD

Query:  ENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLI--HVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQ
        E +YM+ P G    S    VC L KA+YGLKQA R W+E    +L    F  S  D  + I          YVL+ VDD++I       +N+    L  +
Subjt:  ENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLI--HVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQ

Query:  FALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATL-THPEISYSVNKACQF
        F + DL ++ +F+G+ +    +  ++LSQS Y+  +L +  M     +STP+ S         +   +    RS++G L Y  L T P+++ +VN   ++
Subjt:  FALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATL-THPEISYSVNKACQF

Query:  MHTPKHTHWQLVKRILRYLKGVLYHGLWFRK--SDNMSLVGFADADWASDPDDRKSTSGFCV-YFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELV
                WQ +KR+LRYLKG +   L F+K  +    ++G+ D+DWA    DRKST+G+    F  NL+ W +K+ + ++ SSTEAEY  L     E +
Subjt:  MHTPKHTHWQLVKRILRYLKGVLYHGLWFRK--SDNMSLVGFADADWASDPDDRKSTSGFCV-YFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELV

Query:  WICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVI
        W+  LL  + I L  P  ++ DN   + ++ NP  H + K ++   +F R+ +Q   + + ++P   Q+ADI TKPL A  F  L++KL ++
Subjt:  WICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 5.6e-138; identity: 35.05%
Query:  LDLWHRRLGH---PHLPIVKAFLNHIDHSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYF
        +DLWH+R+GH     L I+ A  + I ++ GT  K   C+ C  GK H + F  S     + L L+  D+ GP    S  G +Y+++F+D  SR  W+Y 
Subjt:  LDLWHRRLGH---PHLPIVKAFLNHIDHSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYF

Query:  LNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSV
        L  K   F  FQKF   VE+  G+ +K L++D G E+  + F+ +   HGI H  T P T + NG+ ER +R I+E   ++L  A LP SFW EA  T+ 
Subjt:  LNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSV

Query:  YLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFI-SRHVLFDENSFPYASFASHS
        YLINR P+  L    P      ++ ++  L+VFGC+ + ++   Q  KL  +S PC F+GY     GY+      +  I SR V+F E+       A+  
Subjt:  YLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFI-SRHVLFDENSFPYASFASHS

Query:  SIPKSNNVLSPPLHSIIPSSLMNHNEDRRHTDTVFDNTDHLNPTI---------VYPLETGTQ-----------ESPRDDGNSGGITQ---SPSPMEPLH
        S    N ++  P    IPS+  N       TD V +  +     I         V  +E  TQ           E PR +      T+        EP  
Subjt:  SIPKSNNVLSPPLHSIIPSSLMNHNEDRRHTDTVFDNTDHLNPTI---------VYPLETGTQ-----------ESPRDDGNSGGITQ---SPSPMEPLH

Query:  QTDSAFNHP---HWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAI
              +HP      K M+EE E+LQKNGT+ L      ++ + CKWVFK+K++ D  + RYKARLV KGF Q   ID++E FSPVVK  +IR  L++A 
Subjt:  QTDSAFNHP---HWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAI

Query:  MKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLI-HVTPTPCCYVLICVDDL
             + QLDV  AFLHG+L+E +YM+QP GFEV     MVC L K+LYGLKQAPR WY K  S + S  +  + +D  +     +      +L+ VDD+
Subjt:  MKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLI-HVTPTPCCYVLICVDDL

Query:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVE-VSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLS------AFQGEPFHDVHLYR
        +I+   +  +  L   L+  F +KDLG     LG++ V   T+  L+LSQ KYI  +L+R  M  AKP+STP+     LS        + +       Y 
Subjt:  IIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVE-VSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLS------AFQGEPFHDVHLYR

Query:  SVVGALQYATL-THPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKK
        S VG+L YA + T P+I+++V    +F+  P   HW+ VK ILRYL+G     L F  SD + L G+ DAD A D D+RKS++G+   F    + W SK 
Subjt:  SVVGALQYATL-THPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKK

Query:  HSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKP
           ++ S+TEAEY        E++W+   L +L +      +++CD+ SA+ LS N + H++TK ++   +++R+++    L +  +   E  AD+LTK 
Subjt:  HSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKP

Query:  LSAQSFHTLK
        +    F   K
Subjt:  LSAQSFHTLK

P92519 Uncharacterized mitochondrial protein AtMg00810; e value: 2.3e-54; identity: 46.26%
Query:  YVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVH
        Y+L+ VDD+++  SS   +N L+  L+S F++KDLG + YFLG+++ ++P+  GLFLSQ+KY   +L    ML+ KP+STP+    L S+     + D  
Subjt:  YVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVH

Query:  LYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGS
         +RS+VGALQY TLT P+ISY+VN  CQ MH P    + L+KR+LRY+KG ++HGL+  K+  +++  F D+DWA     R+ST+GFC + G N++ W +
Subjt:  LYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGS

Query:  KKHSIISRSSTEAEYRCLALLATELVW
        K+   +SRSSTE EYR LAL A EL W
Subjt:  KKHSIISRSSTEAEYRCLALLATELVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 2.7e-209; identity: 39.47%
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTIN---KLNFCEACALGKHH
        +DL+TG  LLQG   D LY++ I  S      +  ++K+  ++            WH RLGHP   I+ + ++  ++S   +N   K   C  C + K +
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTIN---KLNFCEACALGKHH

Query:  ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHG
         +PFS S    T PL+ I  D+W   + +SH+ +RYY+ FVD ++RYTW+Y L  KS     F  FK  +E      I +  +D G EF     +  QHG
Subjt:  ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHG

Query:  IEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLS
        I H  + P+T + NG+ ERKHRHI+E GLTLLS A++P ++W  AF+ +VYLINRLPTP+L   SP +KLF   PN+  LRVFGC CYP+LRPY  HKL 
Subjt:  IEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLS

Query:  LRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------ASHSSIPKSNNVL-----SPPLHSIIPSS-----LM
         +S  C FLGYS +   Y CL     RL+ISRHV FDEN FP++++                  + H+++P    VL     S P H+  P S       
Subjt:  LRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------ASHSSIPKSNNVL-----SPPLHSIIPSS-----LM

Query:  NHNEDRRHTDTVF----------------------------------DNTDHLNPTIVYPLE-----------TGTQESPRDDGNSGGITQSP-----SP
        N      + D+ F                                   NT   NPT   P +           + +  SP    +S   + +P      P
Subjt:  NHNEDRRHTDTVF----------------------------------DNTDHLNPTIVYPLE-----------TGTQESPRDDGNSGGITQSP-----SP

Query:  MEPLHQ--------------------------------------------TDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQ-KIVGCKWVFKIKR
          PL Q                                               A     W+  M  E  A   N TW L P  P+   IVGC+W+F  K 
Subjt:  MEPLHQ--------------------------------------------TDSAFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQ-KIVGCKWVFKIKR

Query:  NSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQ
        NSDGS++RYKARLVAKG++Q P +DY ETFSPV+K  +IR+ L +A+ + W IRQLDVNNAFL G L ++VYM QP GF  K     VC L+KALYGLKQ
Subjt:  NSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSSYPMVCHLKKALYGLKQ

Query:  APRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVS-YPTNGGLFLSQSKYI
        APRAWY +L + L ++GF  S +DTSL +        Y+L+ VDD++I  +    +++ + +L+ +F++KD  +L YFLG+E    PT  GL LSQ +YI
Subjt:  APRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVS-YPTNGGLFLSQSKYI

Query:  TDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDN
         DLL RT M+ AKP++TPM   P LS + G    D   YR +VG+LQY   T P+ISY+VN+  QFMH P   H Q +KRILRYL G   HG++ +K + 
Subjt:  TDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDN

Query:  MSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHS
        +SL  ++DADWA D DD  ST+G+ VY G++ + W SKK   + RSSTEAEYR +A  ++E+ WICSLL +L I L  PP+++CDN+ A +L ANP+ HS
Subjt:  MSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHS

Query:  KTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTV
        + K +  D +F+R+ +Q G L + H+   +Q+AD LTKPLS  +F    +K+ V
Subjt:  KTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 1.9e-207; identity: 39.6%
Query:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTIN---KLNFCEACALGKHH
        +DL+TG  LLQG   D LY++ I         + S   S+F +   K+       WH RLGHP L I+ + ++  +HS   +N   KL  C  C + K H
Subjt:  QDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTIN---KLNFCEACALGKHH

Query:  ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHG
         +PFS+S    + PL+ I  D+W   + +S + +RYY+ FVD ++RYTW+Y L  KS     F  FK+ VE      I +L +D G EF   + +L QHG
Subjt:  ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHG

Query:  IEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLS
        I H  + P+T + NG+ ERKHRHI+EMGLTLLS A++P ++W  AFS +VYLINRLPTP+L   SP +KLF + PN+  L+VFGC CYP+LRPY  HKL 
Subjt:  IEHRITCPYTSKQNGIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLS

Query:  LRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYA------------------SFASHSSIPKSNNVLSPP--------------------
         +S  C F+GYS +   Y CL    GRL+ SRHV FDE  FP++                  ++ SH+++P +  VL  P                    
Subjt:  LRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYA------------------SFASHSSIPKSNNVLSPP--------------------

Query:  -----LHSIIPSSLM-----------NHNEDR-----RHTDTVFDNTDHLN-----------PTIVYPLETGTQESPRDDGNSGGITQ--SPS-------
               S +PSS +           +HN  +       T     N+  LN           P    PL      SP     S  I++  SPS       
Subjt:  -----LHSIIPSSLM-----------NHNEDR-----RHTDTVFDNTDHLN-----------PTIVYPLETGTQESPRDDGNSGGITQ--SPS-------

Query:  ------PMEPLHQTDS-----------------------------------------AFNHPHWKKVMEEEFEALQKNGTWSLT-PQNPNQKIVGCKWVF
              P  P+ Q ++                                         A     W++ M  E  A   N TW L  P  P+  IVGC+W+F
Subjt:  ------PMEPLHQTDS-----------------------------------------AFNHPHWKKVMEEEFEALQKNGTWSLT-PQNPNQKIVGCKWVF

Query:  KIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSSYPMVCHLKKALY
          K NSDGS++RYKARLVAKG++Q P +DY ETFSPV+K  +IR+ L +A+ + W IRQLDVNNAFL G L + VYM QP GF  K     VC L+KA+Y
Subjt:  KIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVYMKQPFGFEVKSSYPMVCHLKKALY

Query:  GLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQS
        GLKQAPRAWY +L + L ++GF  S +DTSL +        Y+L+ VDD++I  +    +   + +L+ +F++K+   L YFLG+E       GL LSQ 
Subjt:  GLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQS

Query:  KYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRK
        +Y  DLL RT ML AKP++TPM + P L+   G    D   YR +VG+LQY   T P++SY+VN+  Q+MH P   HW  +KR+LRYL G   HG++ +K
Subjt:  KYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRK

Query:  SDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPI
         + +SL  ++DADWA D DD  ST+G+ VY G++ + W SKK   + RSSTEAEYR +A  ++EL WICSLL +L I L  PP+++CDN+ A +L ANP+
Subjt:  SDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPI

Query:  LHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVI
         HS+ K +  D +F+R+ +Q G L + H+   +Q+AD LTKPLS  +F     K+ VI
Subjt:  LHSKTKCVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVI

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 7.3e-109; identity: 42.67%
Query:  WKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNN
        W   M++E  A++   TW +    PN+K +GCKWV+KIK NSDG+I RYKARLVAKG+ Q   ID+ ETFSPV K  ++++ L I+ +  +++ QLD++N
Subjt:  WKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNN

Query:  AFLHGNLDENVYMKQPFGFEVK--SSYP--MVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDV
        AFL+G+LDE +YMK P G+  +   S P   VC+LKK++YGLKQA R W+ K S +L   GF  S +D +  + +T T    VL+ VDD+II  +++  V
Subjt:  AFLHGNLDENVYMKQPFGFEVK--SSYP--MVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDV

Query:  NSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEIS
        + L   L S F L+DLG L YFLG+E++  +  G+ + Q KY  DLL  T +L  KP S PM      SA  G  F D   YR ++G L Y  +T  +IS
Subjt:  NSLVHSLNSQFALKDLGKLSYFLGVEVSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEIS

Query:  YSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLAL
        ++VNK  QF   P+  H Q V +IL Y+KG +  GL++     M L  F+DA + S  D R+ST+G+C++ G +L+ W SKK  ++S+SS EAEYR L+ 
Subjt:  YSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLAL

Query:  LATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRD
           E++W+     +L + L  P +L+CDN +A+H++ N + H +TK +E D + VR+
Subjt:  LATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTKCVEFDIYFVRD

ATMG00240.1 Gag-Pol-related retrotransposon family protein; e value: 3.1e-14; identity: 39%
Query:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVP-W--GSKKHSIIS
        Y T+T P+++++VN+  QF    +    Q V ++L Y+KG +  GL++  + ++ L  FAD+DWAS PD R+S +GFC     +LVP W  G+ + SI+S
Subjt:  YATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVP-W--GSKKHSIIS

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value: 1.6e-55; identity: 46.26%
Query:  YVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVH
        Y+L+ VDD+++  SS   +N L+  L+S F++KDLG + YFLG+++ ++P+  GLFLSQ+KY   +L    ML+ KP+STP+    L S+     + D  
Subjt:  YVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVEV-SYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVH

Query:  LYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGS
         +RS+VGALQY TLT P+ISY+VN  CQ MH P    + L+KR+LRY+KG ++HGL+  K+  +++  F D+DWA     R+ST+GFC + G N++ W +
Subjt:  LYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGLWFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGS

Query:  KKHSIISRSSTEAEYRCLALLATELVW
        K+   +SRSSTE EYR LAL A EL W
Subjt:  KKHSIISRSSTEAEYRCLALLATELVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value: 2.8e-23; identity: 55.43%
Query:  AFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIA
        A   P W + M+EE +AL +N TW L P   NQ I+GCKWVFK K +SDG++ R KARLVAKGFHQ   I + ET+SPVV+  TIR  L +A
Subjt:  AFNHPHWKKVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIA


Sequences
CDS sequence
ATGCAGCAAATGGGTCAGGATCTGGATACTGGCCAAGTACTTCTTCAAGGACTACTTAATGATGGGCTCTACAAATTTACCATTCAACCATCACATAAAAGACTTCACCA
TTCTGACTCCAACACCAAGTCTGTTTTCAATACCGTCGTACCTAAATCTAATACTCCCTTACTTGATTTATGGCATAGAAGACTAGGTCATCCCCATTTACCTATTGTTA
AAGCTTTTTTGAATCACATTGACCATTCTTCTGGTACTATAAATAAACTGAATTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTCCCTT
ACTCTTTATACACATCCTTTACAACTTATTACTTGTGATTTATGGGGTCCTGCCGTAAATGTATCTCATAATGGTTTTAGATATTACATAAGTTTTGTTGATGCCTATAG
TAGATACACCTGGATATATTTCTTAAATTTCAAGTCTGATGCCTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTC
AAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGGCATAGTTGAG
AGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTCCACTAGTGTCTATCTCATAAATCG
TTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAAGCTATTTTGCCGGAAACCTAACTTTCCTTCTCTTAGAGTTTTTGGTTGCAAGTGTTATCCCTACCTTC
GACCCTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCAGATGGGCGTCTTTTC
ATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATATGCATCATTTGCATCTCATTCTAGCATACCCAAATCCAATAATGTTCTATCCCCACCACTTCACTCAAT
AATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTTTTGATAACACTGATCATCTAAACCCTACTATTGTGTATCCTTTAGAGACAGGTA
CTCAAGAGAGCCCTAGGGATGATGGTAACAGTGGTGGTATTACTCAATCTCCAAGTCCTATGGAACCACTGCATCAAACTGATTCTGCTTTTAACCATCCTCATTGGAAA
AAGGTCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGGAGCCTTACTCCACAAAATCCTAATCAGAAAATTGTTGGTTGCAAATGGGTTTTTAAGATAAA
AAGGAATTCAGATGGGTCTATTAGTAGATATAAAGCACGCTTAGTTGCTAAAGGGTTTCATCAAACACCTAATATTGATTACAATGAAACATTTAGCCCTGTTGTGAAAC
CCGTTACTATTCGCATGCACTTAACTATAGCAATTATGAAAGGATGGAGTATACGTCAATTAGATGTTAATAATGCTTTTCTTCATGGAAATTTAGATGAAAATGTTTAC
ATGAAACAACCATTTGGTTTTGAAGTTAAAAGTTCTTATCCTATGGTTTGTCATTTGAAAAAGGCTCTGTATGGTCTTAAACAAGCCCCTCGAGCATGGTATGAAAAGTT
GAGCTCAAGTTTACATTCCCTTGGATTTAGAACTTCTAAAGCTGATACATCTTTATTAATACATGTTACTCCTACACCTTGTTGCTATGTCTTGATTTGCGTTGATGACT
TGATTATCATGGACAGCTCTGAGAAAGATGTGAATTCTTTAGTTCATTCTTTAAACAGTCAATTTGCACTTAAGGATTTGGGAAAGCTGAGCTACTTTCTTGGAGTTGAG
GTGTCATACCCAACTAATGGAGGTTTGTTTTTATCTCAATCAAAGTATATTACTGATTTATTACAGAGAACAAAAATGCTGGAAGCTAAACCTATTTCTACACCTATGGT
AAGTGGTCCGTTACTTTCTGCTTTTCAAGGGGAACCATTTCATGATGTGCATCTGTATAGAAGTGTTGTTGGTGCATTACAGTATGCCACACTTACTCATCCTGAGATAT
CCTATAGTGTTAATAAAGCTTGTCAATTTATGCATACTCCAAAACATACACATTGGCAACTTGTGAAGAGAATTCTAAGATATCTTAAAGGTGTACTATATCATGGTTTA
TGGTTTCGTAAGTCTGATAATATGTCCTTAGTTGGTTTTGCTGATGCGGATTGGGCTTCTGATCCAGATGATAGGAAGTCTACTTCTGGTTTCTGTGTTTATTTTGGAAA
TAACTTAGTACCTTGGGGTTCCAAGAAACATTCTATTATTTCCAGATCTAGTACTGAAGCTGAATATCGTTGCCTTGCTCTTTTGGCAACTGAACTGGTATGGATTTGTT
CTCTCTTGAATGACTTATATATTGATCTACCTTTTCCTCCTATTTTGTGGTGTGATAATCTAAGTGCAGTGCATCTTAGTGCAAATCCTATATTACATTCCAAGACAAAG
TGTGTTGAATTTGACATCTACTTTGTTAGAGATCTTATACAAAAGGGAAAATTAGCTATCAGACATCTTCCCGCAACTGAACAAATTGCAGATATACTCACCAAACCATT
GTCTGCTCAAAGTTTTCACACGCTGAAGAATAAACTTACTGTCATTGATTCAACAACCATTGGTTTGCAGGGGGGGTGTTAA
mRNA sequence
GGGAGCTACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATTTTAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGGATCTGGATACTGGC
CAAGTACTTCTTCAAGGACTACTTAATGATGGGCTCTACAAATTTACCATTCAACCATCACATAAAAGACTTCACCATTCTGACTCCAACACCAAGTCTGTTTTCAATAC
CGTCGTACCTAAATCTAATACTCCCTTACTTGATTTATGGCATAGAAGACTAGGTCATCCCCATTTACCTATTGTTAAAGCTTTTTTGAATCACATTGACCATTCTTCTG
GTACTATAAATAAACTGAATTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTCCCTTACTCTTTATACACATCCTTTACAACTTATTACT
TGTGATTTATGGGGTCCTGCCGTAAATGTATCTCATAATGGTTTTAGATATTACATAAGTTTTGTTGATGCCTATAGTAGATACACCTGGATATATTTCTTAAATTTCAA
GTCTGATGCCTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCA
AACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGGCATAGTTGAGAGAAAACATAGGCATATCATGGAAATGGGTCTT
ACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTCCACTAGTGTCTATCTCATAAATCGTTTGCCTACCCCAGTTCTTGATAATATAAGCCC
GTTGGAGAAGCTATTTTGCCGGAAACCTAACTTTCCTTCTCTTAGAGTTTTTGGTTGCAAGTGTTATCCCTACCTTCGACCCTACCAATCACATAAACTATCTCTCCGAT
CCACACCATGTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCAGATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCA
TTTCCATATGCATCATTTGCATCTCATTCTAGCATACCCAAATCCAATAATGTTCTATCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGA
TAGGCGACACACTGACACAGTTTTTGATAACACTGATCATCTAAACCCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCCCTAGGGATGATGGTAACAGTG
GTGGTATTACTCAATCTCCAAGTCCTATGGAACCACTGCATCAAACTGATTCTGCTTTTAACCATCCTCATTGGAAAAAGGTCATGGAAGAAGAGTTTGAAGCCTTACAA
AAAAATGGCACTTGGAGCCTTACTCCACAAAATCCTAATCAGAAAATTGTTGGTTGCAAATGGGTTTTTAAGATAAAAAGGAATTCAGATGGGTCTATTAGTAGATATAA
AGCACGCTTAGTTGCTAAAGGGTTTCATCAAACACCTAATATTGATTACAATGAAACATTTAGCCCTGTTGTGAAACCCGTTACTATTCGCATGCACTTAACTATAGCAA
TTATGAAAGGATGGAGTATACGTCAATTAGATGTTAATAATGCTTTTCTTCATGGAAATTTAGATGAAAATGTTTACATGAAACAACCATTTGGTTTTGAAGTTAAAAGT
TCTTATCCTATGGTTTGTCATTTGAAAAAGGCTCTGTATGGTCTTAAACAAGCCCCTCGAGCATGGTATGAAAAGTTGAGCTCAAGTTTACATTCCCTTGGATTTAGAAC
TTCTAAAGCTGATACATCTTTATTAATACATGTTACTCCTACACCTTGTTGCTATGTCTTGATTTGCGTTGATGACTTGATTATCATGGACAGCTCTGAGAAAGATGTGA
ATTCTTTAGTTCATTCTTTAAACAGTCAATTTGCACTTAAGGATTTGGGAAAGCTGAGCTACTTTCTTGGAGTTGAGGTGTCATACCCAACTAATGGAGGTTTGTTTTTA
TCTCAATCAAAGTATATTACTGATTTATTACAGAGAACAAAAATGCTGGAAGCTAAACCTATTTCTACACCTATGGTAAGTGGTCCGTTACTTTCTGCTTTTCAAGGGGA
ACCATTTCATGATGTGCATCTGTATAGAAGTGTTGTTGGTGCATTACAGTATGCCACACTTACTCATCCTGAGATATCCTATAGTGTTAATAAAGCTTGTCAATTTATGC
ATACTCCAAAACATACACATTGGCAACTTGTGAAGAGAATTCTAAGATATCTTAAAGGTGTACTATATCATGGTTTATGGTTTCGTAAGTCTGATAATATGTCCTTAGTT
GGTTTTGCTGATGCGGATTGGGCTTCTGATCCAGATGATAGGAAGTCTACTTCTGGTTTCTGTGTTTATTTTGGAAATAACTTAGTACCTTGGGGTTCCAAGAAACATTC
TATTATTTCCAGATCTAGTACTGAAGCTGAATATCGTTGCCTTGCTCTTTTGGCAACTGAACTGGTATGGATTTGTTCTCTCTTGAATGACTTATATATTGATCTACCTT
TTCCTCCTATTTTGTGGTGTGATAATCTAAGTGCAGTGCATCTTAGTGCAAATCCTATATTACATTCCAAGACAAAGTGTGTTGAATTTGACATCTACTTTGTTAGAGAT
CTTATACAAAAGGGAAAATTAGCTATCAGACATCTTCCCGCAACTGAACAAATTGCAGATATACTCACCAAACCATTGTCTGCTCAAAGTTTTCACACGCTGAAGAATAA
ACTTACTGTCATTGATTCAACAACCATTGGTTTGCAGGGGGGGTGTTAA
Protein sequence
MQQMGQDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAFLNHIDHSSGTINKLNFCEACALGKHHALPFSHSL
TLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNFKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNGIVE
RKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLF
ISRHVLFDENSFPYASFASHSSIPKSNNVLSPPLHSIIPSSLMNHNEDRRHTDTVFDNTDHLNPTIVYPLETGTQESPRDDGNSGGITQSPSPMEPLHQTDSAFNHPHWK
KVMEEEFEALQKNGTWSLTPQNPNQKIVGCKWVFKIKRNSDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPVTIRMHLTIAIMKGWSIRQLDVNNAFLHGNLDENVY
MKQPFGFEVKSSYPMVCHLKKALYGLKQAPRAWYEKLSSSLHSLGFRTSKADTSLLIHVTPTPCCYVLICVDDLIIMDSSEKDVNSLVHSLNSQFALKDLGKLSYFLGVE
VSYPTNGGLFLSQSKYITDLLQRTKMLEAKPISTPMVSGPLLSAFQGEPFHDVHLYRSVVGALQYATLTHPEISYSVNKACQFMHTPKHTHWQLVKRILRYLKGVLYHGL
WFRKSDNMSLVGFADADWASDPDDRKSTSGFCVYFGNNLVPWGSKKHSIISRSSTEAEYRCLALLATELVWICSLLNDLYIDLPFPPILWCDNLSAVHLSANPILHSKTK
CVEFDIYFVRDLIQKGKLAIRHLPATEQIADILTKPLSAQSFHTLKNKLTVIDSTTIGLQGGC