; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011901 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011901
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153123:539273..545507
RNA-Seq ExpressionSgr011901
SyntenySgr011901
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]0.0e+0046.16Show/hide
Query:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+ +  L   N+L+WK QI+  ++GYGL+ ++  DD     +           +  ++  +          S   LS                      
Subjt:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ V+ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGLNFRGNRG-------WNGN----K
        +  ++D S P+ ++   ++ +KG        + N Q + S Y+                     N   N    RGG   RGN+G       WN +    K
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGLNFRGNRG-------WNGN----K

Query:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS
        P CQLCG+ GH   +CY RFD  F               Q P  Q+ S  NS     YS  S  +  ++    +  D NWYPDSGASNHVT +  NL  S
Subjt:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS

Query:  SPCTSDNRVHVGNGAGLSINHIGYSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL
        +     N+VHVGNG GLSI HIG S   S  +++  LLN+LLHVP ITKNLLSVS+FAKDN VFFEFH   CFVKD+ T  +L+ G + +GLY F   HL
Subjt:  SPCTSDNRVHVGNGAGLSINHIGYSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL

Query:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS
            TQ L ++  V   S SS        CT+  + ++  DLWH+RLGH +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  + PL 
Subjt:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS

Query:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGI
        LI  DLWGP +  S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI HRV+CP+T QQNG+
Subjt:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGI

Query:  VERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR
         ERKHR IV+ GLTLL  ASLPL+FWD+ F   VY  NRLPT +L    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGYS  H+
Subjt:  VERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR

Query:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPAS--SQFTSASQPSTIVPTSQPLDPATEVAIASPSA
        GYKC+   GRV+IS  V FNE+SFPY ++   S    S+V P   H S ++  PVL    + +P S  S     S+   IV T  P  P +     +P+ 
Subjt:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPAS--SQFTSASQPSTIVPTSQPLDPATEVAIASPSA

Query:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI
          S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  +  + I
Subjt:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI

Query:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK
        GCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+T+RV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    +  LVC+
Subjt:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK

Query:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG
        LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   +   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+  + G
Subjt:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG

Query:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS
        L LSQTKYI DLL K KM    P  TP+ +G  +   +G+   D+H YRS VGALQY  ITRPE+++SVNKVCQFM +PT+ HW+ VKRILRYL+G+   
Subjt:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS

Query:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH
        GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ I+SRSS E E+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDNL  V 
Subjt:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH

Query:  LSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
        LSANPV H+RTKH+ELD+YFVR+ V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  LSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0047.25Show/hide
Query:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S      L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E                              G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC
        V   L AHE R      + D S   VN T Q S++  + S N     SS   N  N F G    RG       RG      G KPQCQLC +FGHT  +C
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC

Query:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS
        + R+DPNFHGN   NG        G  +  +  +      + +  ++  NQ+YS     M+A++  P    +  W+PDSGA+NHVT+D GNL   +    
Subjt:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS

Query:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK
        ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L      K
Subjt:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK

Query:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL
           L  ++    L+  +++  H          N+  +V DLWH+RLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ + PL L
Subjt:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL

Query:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV
        + +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  G IHR++CP+TS+QNGI+
Subjt:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV

Query:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG
        ERKHRHIV++GLTLL+QASLPL++W D FS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGYS+ H+G
Subjt:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG

Query:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA
        YKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D ++ V 
Subjt:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA

Query:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN
        I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+ ALMKN
Subjt:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN

Query:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ
         TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E+V+M+Q
Subjt:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ

Query:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG
        PPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   FSLKDLG
Subjt:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG

Query:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH
         L+YFLGIEV    DGGL LSQ KYI DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY  ITRPEIA+SVNKVCQFM  P   H
Subjt:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH

Query:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI
        W+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+LL+ELQ 
Subjt:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI

Query:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE
          +  P++WCDN+  V LSANPV HSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V     + L+ G+
Subjt:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0047.39Show/hide
Query:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S      L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E+W T+ + + + + AK M YK+QMQ LKK G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC
        V   L AHE R      + D S   VN T Q S++  + S N     SS   N  N F G    RG       RG      G KPQCQLC +FGHT  +C
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC

Query:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS
        + R+DPNFHGN   NG        G  +  +  +      + +  ++  NQ+YS     M+A++  P    +  W+PDSGA+NHVT+D GNL   +    
Subjt:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS

Query:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK
        ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L      K
Subjt:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK

Query:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL
           L  ++    L+  +++  H          N+  +V DLWH+RLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ + PL L
Subjt:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL

Query:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV
        + +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  G IHR++CP+TS+QNGI+
Subjt:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV

Query:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG
        ERKHRHIV++GLTLL+QASLPL++W D FS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGYS+ H+G
Subjt:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG

Query:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA
        YKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D ++ V 
Subjt:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA

Query:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN
        I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+ ALMKN
Subjt:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN

Query:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ
         TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E+V+M+Q
Subjt:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ

Query:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG
        PPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   FSLKDLG
Subjt:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG

Query:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH
         L+YFLGIE                  DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY  ITRPEIA+SVNKVCQFM  P   H
Subjt:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH

Query:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI
        W+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+LL+ELQ 
Subjt:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI

Query:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE
          +  P++WCDN+  V LSANPV HSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V     + L+ G+
Subjt:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0044.81Show/hide
Query:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H     L  +NY++W+ Q+   +   G ED+I   +G        T+ G      E NP  ++W R DR+I SW+ SS+T  ++  ++  ++S   W  
Subjt:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE ++  S+ A+ M  + + Q  +KG +T+ EY  K+K LAD+L AIGE V+ +D I+ +L GLG +Y++IV+ +TA+   ++L  V+ +L  HE R   
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQM
          ++   SV   N+   + +       N++++      +G N+ RG  G    G    + ++PQCQLCG+FGHT ++CY RFD NF G            
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQM

Query:  VQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLN
                ++ N      N     + +QA+M +P+   D  W+ D+GA++H++     L+   P   +++V VGNG  L I H G +  + S++++F L 
Subjt:  VQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLN

Query:  NLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVP---PLSSSSSTTAHVLACTSENTKANV
         +LHVP I  NL+SVSQF  DN+ FFEFHP   FVKD+ T  ILLQG +  GLY+F            A  VP      SSS   +  L+ T+  T    
Subjt:  NLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVP---PLSSSSSTTAHVLACTSENTKANV

Query:  IDLWHQRLGHAATPIVSQILKECNISFTNNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSK
          LWH RLGH A  I+  IL  CNIS   +  + C AC   KSH LPF  S +  S PL+L+  DLWGPA   S  G RY+I FVD +SRF+W+Y L SK
Subjt:  IDLWHQRLGHAATPIVSQILKECNISFTNNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSK

Query:  SEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRL
         +A S F+ FK  VE      I+ L++D GGEF+A + YL + GI  + +CPYT +QNG  ERK RHI++ GL LL+ ASLP +FW   F  A++ INRL
Subjt:  SEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRL

Query:  PTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISRHVQFNESSFPYLQSFLHSSSVKPL
        PT VL+  SP + LFGK P+Y  FK FGCLC+P +RPYN +KL +RS+ CVFLGYS+ H+GY CL+  TGR++++RHV F+E+ FP+  +   SSSV  +
Subjt:  PTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISRHVQFNESSFPYLQSFLHSSSVKPL

Query:  PIHSSINSFLPVLISSPASSQFTSASQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAE
        P      +FLP   SSP  S   S + PST  P      P T +    PS++ S   L  +  + I     TS P  TN HPMVTR+KNGI   KV  + 
Subjt:  PIHSSINSFLPVLISSPASSQFTSASQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAE

Query:  YIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLL
        +I  EPTT  +A++  +W+ AM+ E++AL +N TW LVP  S    IGCKWV+K+K   DG++ RYKARLVA+GF Q   +DY ETFSPV+K +TIR++L
Subjt:  YIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLL

Query:  TYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV
          AL+  W +HQLD+ NAFLHG L E VFM+QPPGF  S     VCKL+KALYGLKQAPRAW+++LS+ LL  GF+ S+AD+S+   H       +LIYV
Subjt:  TYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV

Query:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLF-LSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIV
        DDI++ GSSS++++  I+ LN  F+L+DLG +NYFLGIEV   + G +F LSQ KY  DLL +  M ++ P TTP + G  +S  +GE FSD   YRS V
Subjt:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLF-LSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIV

Query:  GALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSII
        GALQY  +TRP+I+++VNK CQFM +PT  HW AVKRILRYLKG+ + G+ +++ ++L ++GY DADWAS PDDR+ST G+ IF G NLV+W S KQ ++
Subjt:  GALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSII

Query:  SRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSAT
        SRSS E+E+R+LA+ ++E+IW+Q +L EL + +S PP+LWCDN  A HL+ANPVFH+RTKH+E+D++F+RD VL+K+L+IQ+LP+  Q+ADIFTK +S++
Subjt:  SRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSAT

Query:  SFLHIRSKLNVCDAYDIGLRGGELDY
         FL  R+KL+V  +  + LRG +  Y
Subjt:  SFLHIRSKLNVCDAYDIGLRGGELDY

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0044.89Show/hide
Query:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+ +  L   N+L+WK QI+  ++GYGL+ ++  DD  P  +L+  +  S    +E     L W +QD+L+ SWLLSS++E +L  ++ C+TS  +W  
Subjt:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ ++ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGLNFRGNRG-------WNG
        +  ++D S P+ ++   ++ +KG        + N Q N S Y+                         NG ++   FRGRGG   RGNRG       WN 
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGLNFRGNRG-------WNG

Query:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDF
        +    KP CQLCG+ GH   +CY RFD  F               Q P  Q+ SG N      YS  S  +  ++    +  D NWYPDSGASNHVT + 
Subjt:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDF

Query:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF
         NL  S      N+VHVGNG G                                                  +P    V           G + +GLY F
Subjt:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF

Query:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII
           HL    TQ L ++  V   S SS      L+ T         DLWH+RLG  +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  
Subjt:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTS
        + PL LI +DLWGPA   S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI HRV+CP+T 
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        QQNG+ ERKHR IV+ GLTLL   SLPL+FWD+ F   VY  NRLPT VL    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVLISS--PASSQFTSASQPST---IVPTSQPLDPATEVAIA
        S  H+GYKC+   GRV+ISR V FNE+SFPY ++   S    S+V P   H S ++  PVL  +  PA +   S+++P +    + ++ P  P +     
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVLISS--PASSQFTSASQPST---IVPTSQPLDPATEVAIA

Query:  SPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSST
        +P+   S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  + 
Subjt:  SPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSST

Query:  HKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSP
         + IGCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+TIRV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    +  
Subjt:  HKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSP

Query:  LVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYP
        LVC+LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   S   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+ 
Subjt:  LVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYP

Query:  KDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKG
         + GL LSQTKYI DLL K KM    P  TP+ +G  + A +G+   D+H YRS VGALQY  ITRPE+++SVNKVCQFM +PT+ HW+AVKRILRYL+G
Subjt:  KDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKG

Query:  SFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNL
        +   GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ  +SRSSTEAE+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDNL
Subjt:  SFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNL

Query:  GAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
          V LSANPV H+RTKH+ELD+YFV + V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  GAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

TrEMBL top hitse value%identityAlignment
A0A2N9IMQ9 Integrase catalytic domain-containing protein0.0e+0046.07Show/hide
Query:  LTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVT
        LT  NYL+WK Q++  L+G  L  ++      P   +++++DG+S+T+   NP    W  QD+++ S L+SS++E V+  V+ C TSR++W TLE M+  
Subjt:  LTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVT

Query:  SNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNIDG
         + A+ M    Q+  L+KG  ++ ++F     LAD+L AI + +     + ++L+GLG EYD+ V+ +  ++ P+TL  +YG L  HE+R E+S   +  
Subjt:  SNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNIDG

Query:  SVPTVNLTQQSS-SKKGTGSTNDQKNGSSYHNNGPNSF---RGRGGLNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQ
           + N   + + S+ G G  N   + +    +   SF   RGRG    RG       +P CQ+C R GH AL CY RFD NF                 
Subjt:  SVPTVNLTQQSS-SKKGTGSTNDQKNGSSYHNNGPNSF---RGRGGLNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQ

Query:  PFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAV-SSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNL
                        YS +S  MQA         D NWY D+GA+NH+T+D  NL V S      +++ VGNG GLS+ H G S L S+   SF+LNN+
Subjt:  PFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAV-SSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNL

Query:  LHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWH
        LHVP ITKNL+SV +F  D D F EFHP    VKDR T  +L +G    GLY F                    ++SST+ + LA   E      ID WH
Subjt:  LHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWH

Query:  QRLGHAATPIVSQILKECNISFT--NNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEA
         RLGH A  +VS+IL + ++     NN    C AC   KS  L F PS T ++ PL LI TD+WGP+   S NGF+YY+SF+D YSR+ W++ +  K+E 
Subjt:  QRLGHAATPIVSQILKECNISFT--NNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEA

Query:  YSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTT
        +S F+TF+  VE+L    IK +Q+D GGEFR L  +  S GI HR++CP+T QQNG +ERKHRHIV+ GL LLS A +PL++WDD FS A Y INRLPT 
Subjt:  YSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTT

Query:  VLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLDR-TGRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIH
        +L   +P E LF  KP+Y F K FGC C+P LRPYN HKLQ RS  C+FLGYS +H+GYKCL   +GR++ISR V F E++FP     L +      P  
Subjt:  VLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLDR-TGRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIH

Query:  SSINSFLPVLISSPASSQFTSASQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNL---TSTPIVTNTHPMVTRSKNGIVCPK-----
         S +S LP+L++   S Q    + P  I+  S P  P +  A    S  TS  P T       P P+L   T TPIV ++HPMVTRSK  I  PK     
Subjt:  SSINSFLPVLISSPASSQFTSASQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNL---TSTPIVTNTHPMVTRSKNGIVCPK-----

Query:  --------VLLAEYIE--VEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTE
                 LLAE      EPT    A++ P W +AM  E+ AL+KN TW+LVP +     +G KWVF++KR  DGS+ RYKARLVAKGFHQ   IDYTE
Subjt:  --------VLLAEYIE--VEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTE

Query:  TFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLL
        TFSPV+KPTT+R +L+ AL+  W + QLD+ NAFLHG L+E+V+M QPPGF+       VCKLHKALYGLKQAPRAWF RL+++LL  GF  S++D+SL 
Subjt:  TFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLL

Query:  FRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFN
          H      Y LIYVDDI+I  S +S I  L+  L  +F++KDLG LNYFLGIEV  P   G+ LSQ KYI D+L + KM EA P+++PM S + +S   
Subjt:  FRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFN

Query:  GEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFG
        G+   D   YRS VGALQY  ITRP+IA+SVNK+ QFMH+PT +HWQ+VKR+LRYLK +   GL ++  S   L G+ DADWA D DDR+ST G+CIF G
Subjt:  GEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFG

Query:  GNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAF
         NLV+W  KKQ+ ++RSSTEAE+++LAN +AE+ W  ALL EL +    PPILWCDN+GA +LS+NPVFH+RTKHVE+D +FVRD+V  + + I+ L + 
Subjt:  GNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAF

Query:  AQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGELD
         QLADIFTKPLS   F  +R+KLNV     +GLRGG  D
Subjt:  AQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGELD

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0047.25Show/hide
Query:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S      L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E                              G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC
        V   L AHE R      + D S   VN T Q S++  + S N     SS   N  N F G    RG       RG      G KPQCQLC +FGHT  +C
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC

Query:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS
        + R+DPNFHGN   NG        G  +  +  +      + +  ++  NQ+YS     M+A++  P    +  W+PDSGA+NHVT+D GNL   +    
Subjt:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS

Query:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK
        ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L      K
Subjt:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK

Query:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL
           L  ++    L+  +++  H          N+  +V DLWH+RLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ + PL L
Subjt:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL

Query:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV
        + +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  G IHR++CP+TS+QNGI+
Subjt:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV

Query:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG
        ERKHRHIV++GLTLL+QASLPL++W D FS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGYS+ H+G
Subjt:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG

Query:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA
        YKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D ++ V 
Subjt:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA

Query:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN
        I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+ ALMKN
Subjt:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN

Query:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ
         TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E+V+M+Q
Subjt:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ

Query:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG
        PPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   FSLKDLG
Subjt:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG

Query:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH
         L+YFLGIEV    DGGL LSQ KYI DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY  ITRPEIA+SVNKVCQFM  P   H
Subjt:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH

Query:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI
        W+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+LL+ELQ 
Subjt:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI

Query:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE
          +  P++WCDN+  V LSANPV HSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V     + L+ G+
Subjt:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0047.39Show/hide
Query:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S      L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E+W T+ + + + + AK M YK+QMQ LKK G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC
        V   L AHE R      + D S   VN T Q S++  + S N     SS   N  N F G    RG       RG      G KPQCQLC +FGHT  +C
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRG----RGGLNFRGNRG----WNGNKPQCQLCGRFGHTALKC

Query:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS
        + R+DPNFHGN   NG        G  +  +  +      + +  ++  NQ+YS     M+A++  P    +  W+PDSGA+NHVT+D GNL   +    
Subjt:  YQRFDPNFHGN---NG--------GFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTS

Query:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK
        ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L      K
Subjt:  DNRVHVGNGAGLSINHIGYSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHP---SK

Query:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL
           L  ++    L+  +++  H          N+  +V DLWH+RLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ + PL L
Subjt:  TQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHQRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTIISTPLSL

Query:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV
        + +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  G IHR++CP+TS+QNGI+
Subjt:  IETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQG-IHRVTCPYTSQQNGIV

Query:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG
        ERKHRHIV++GLTLL+QASLPL++W D FS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGYS+ H+G
Subjt:  ERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRG

Query:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA
        YKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D ++ V 
Subjt:  YKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQP---------STIVPTSQPL---DPATEVA

Query:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN
        I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+ ALMKN
Subjt:  IASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKN

Query:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ
         TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E+V+M+Q
Subjt:  GTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQ

Query:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG
        PPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   FSLKDLG
Subjt:  PPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLG

Query:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH
         L+YFLGIE                  DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY  ITRPEIA+SVNKVCQFM  P   H
Subjt:  RLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVH

Query:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI
        W+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+LL+ELQ 
Subjt:  WQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQI

Query:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE
          +  P++WCDN+  V LSANPV HSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V     + L+ G+
Subjt:  PTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGGE

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0044.89Show/hide
Query:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+ +  L   N+L+WK QI+  ++GYGL+ ++  DD  P  +L+  +  S    +E     L W +QD+L+ SWLLSS++E +L  ++ C+TS  +W  
Subjt:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ ++ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGLNFRGNRG-------WNG
        +  ++D S P+ ++   ++ +KG        + N Q N S Y+                         NG ++   FRGRGG   RGNRG       WN 
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGLNFRGNRG-------WNG

Query:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDF
        +    KP CQLCG+ GH   +CY RFD  F               Q P  Q+ SG N      YS  S  +  ++    +  D NWYPDSGASNHVT + 
Subjt:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDF

Query:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF
         NL  S      N+VHVGNG G                                                  +P    V           G + +GLY F
Subjt:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF

Query:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII
           HL    TQ L ++  V   S SS      L+ T         DLWH+RLG  +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  
Subjt:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTS
        + PL LI +DLWGPA   S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI HRV+CP+T 
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        QQNG+ ERKHR IV+ GLTLL   SLPL+FWD+ F   VY  NRLPT VL    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVLISS--PASSQFTSASQPST---IVPTSQPLDPATEVAIA
        S  H+GYKC+   GRV+ISR V FNE+SFPY ++   S    S+V P   H S ++  PVL  +  PA +   S+++P +    + ++ P  P +     
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVLISS--PASSQFTSASQPST---IVPTSQPLDPATEVAIA

Query:  SPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSST
        +P+   S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  + 
Subjt:  SPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSST

Query:  HKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSP
         + IGCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+TIRV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    +  
Subjt:  HKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSP

Query:  LVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYP
        LVC+LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   S   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+ 
Subjt:  LVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYP

Query:  KDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKG
         + GL LSQTKYI DLL K KM    P  TP+ +G  + A +G+   D+H YRS VGALQY  ITRPE+++SVNKVCQFM +PT+ HW+AVKRILRYL+G
Subjt:  KDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKG

Query:  SFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNL
        +   GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ  +SRSSTEAE+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDNL
Subjt:  SFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNL

Query:  GAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
          V LSANPV H+RTKH+ELD+YFV + V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  GAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

A5BFT3 Integrase catalytic domain-containing protein0.0e+0046.16Show/hide
Query:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+ +  L   N+L+WK QI+  ++GYGL+ ++  DD     +           +  ++  +          S   LS                      
Subjt:  SHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ V+ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGLNFRGNRG-------WNGN----K
        +  ++D S P+ ++   ++ +KG        + N Q + S Y+                     N   N    RGG   RGN+G       WN +    K
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGLNFRGNRG-------WNGN----K

Query:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS
        P CQLCG+ GH   +CY RFD  F               Q P  Q+ S  NS     YS  S  +  ++    +  D NWYPDSGASNHVT +  NL  S
Subjt:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS

Query:  SPCTSDNRVHVGNGAGLSINHIGYSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL
        +     N+VHVGNG GLSI HIG S   S  +++  LLN+LLHVP ITKNLLSVS+FAKDN VFFEFH   CFVKD+ T  +L+ G + +GLY F   HL
Subjt:  SPCTSDNRVHVGNGAGLSINHIGYSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL

Query:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS
            TQ L ++  V   S SS        CT+  + ++  DLWH+RLGH +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  + PL 
Subjt:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS

Query:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGI
        LI  DLWGP +  S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI HRV+CP+T QQNG+
Subjt:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGI

Query:  VERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR
         ERKHR IV+ GLTLL  ASLPL+FWD+ F   VY  NRLPT +L    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGYS  H+
Subjt:  VERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR

Query:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPAS--SQFTSASQPSTIVPTSQPLDPATEVAIASPSA
        GYKC+   GRV+IS  V FNE+SFPY ++   S    S+V P   H S ++  PVL    + +P S  S     S+   IV T  P  P +     +P+ 
Subjt:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPAS--SQFTSASQPSTIVPTSQPLDPATEVAIASPSA

Query:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI
          S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  +  + I
Subjt:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI

Query:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK
        GCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+T+RV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    +  LVC+
Subjt:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK

Query:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG
        LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   +   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+  + G
Subjt:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG

Query:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS
        L LSQTKYI DLL K KM    P  TP+ +G  +   +G+   D+H YRS VGALQY  ITRPE+++SVNKVCQFM +PT+ HW+ VKRILRYL+G+   
Subjt:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS

Query:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH
        GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ I+SRSS E E+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDNL  V 
Subjt:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH

Query:  LSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
        LSANPV H+RTKH+ELD+YFVR+ V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  LSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-12027Show/hide
Query:  WTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLK-KGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSG
        W + +R   S ++  +++  L       T+R+I + L+ +Y   +LA  ++ + ++ +LK    M+L  +F    +L   L A G K+   D I ++L  
Subjt:  WTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLK-KGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSG

Query:  LGVEYDAIVSVI-TAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNK
        L   YD I++ I T     LTL  V   L             +D  +   N    +S K      ++  N ++Y NN   +   +    F+GN  +   K
Subjt:  LGVEYDAIVSVI-TAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNK

Query:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINL----DTNWYPDSGASNHVTNDFGN
         +C  CGR GH    C       FH                 +++  +  N    +     +    A MV    N     +  +  DSGAS+H+ ND   
Subjt:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINL----DTNWYPDSGASNHVTNDFGN

Query:  LAVSSPCTSDNRVHVG-NGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFH
           S       ++ V   G  +     G   L   N+    L ++L       NL+SV +  ++  +  EF        D+   TI   GLM        
Subjt:  LAVSSPCTSDNRVHVG-NGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFH

Query:  LHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTK-ANVIDLWHQRLGHAATPIVSQILKE---CNISFTNN---STSFCSACAIGKSHALPF--YPS
              ++    + VP ++             S N K  N   LWH+R GH +   + +I ++    + S  NN   S   C  C  GK   LPF     
Subjt:  LHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTK-ANVIDLWHQRLGHAATPIVSQILKE---CNISFTNN---STSFCSACAIGKSHALPF--YPS

Query:  QTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGI-HRV
        +T I  PL ++ +D+ GP    + +   Y++ FVD ++ +   Y ++ KS+ +S F  F    E      +  L  D G E+    +  +   +GI + +
Subjt:  QTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGI-HRV

Query:  TCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVL--SGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRS
        T P+T Q NG+ ER  R I +   T++S A L   FW +    A Y INR+P+  L  S  +P E    KKP     + FG   +  ++     K   +S
Subjt:  TCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVL--SGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRS

Query:  APCVFLGYSNMHRGYKCLDRTGRVFI--------------SRHVQFN-----------------------ESSFP----------YLQSFLHSSSVK-PL
           +F+GY     G+K  D     FI              SR V+F                        ++ FP          +L+    S +   P 
Subjt:  APCVFLGYSNMHRGYKCLDRTGRVFI--------------SRHVQFN-----------------------ESSFP----------YLQSFLHSSSVK-PL

Query:  PIHSSINSFLPVLISSPASSQFTSASQPSTIV----PTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNLTSTPIV---------TNTHPMVTRS
             I +  P       + QF   S+ S          +  D     +  S + + S    T   L  I   N T    +           T P ++ +
Subjt:  PIHSSINSFLPVLISSPASSQFTSASQPSTIV----PTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNLTSTPIV---------TNTHPMVTRS

Query:  KNGIVCPKVLL----------AEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQ
        +      KV+L            + E++    K +     W +A+  E  A   N TW++         +  +WVF +K N  G+  RYKARLVA+GF Q
Subjt:  KNGIVCPKVLL----------AEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQ

Query:  MADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKC
           IDY ETF+PV + ++ R +L+  +    ++HQ+D+  AFL+G L E+++M  P G  IS +S  VCKL+KA+YGLKQA R WF+     L    F  
Subjt:  MADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKC

Query:  SKADTSLLFRHVG--SSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPM
        S  D  +     G  +   Y+L+YVDD+VI     + +      L  +F + DL  + +F+GI +   +D  ++LSQ+ Y+  +L K  M   N ++TP+
Subjt:  SKADTSLLFRHVG--SSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPM

Query:  VSGSVVSAFNGEKFSDVHFYRSIVGALQYAII-TRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLG----LYGYADADWASD
         S       N ++  +    RS++G L Y ++ TRP++  +VN + ++        WQ +KR+LRYLKG+    L+ +K  NL     + GY D+DWA  
Subjt:  VSGSVVSAFNGEKFSDVHFYRSIVGALQYAII-TRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLG----LYGYADADWASD

Query:  PDDRKSTSGFCI-FFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVR
          DRKST+G+    F  NL+ W +K+Q+ ++ SSTEAE+ +L     E +WL+ LL  + I    P  ++ DN G + ++ NP  H R KH+++  +F R
Subjt:  PDDRKSTSGFCI-FFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVR

Query:  DLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV
        + V    + ++++P   QLADIFTKPL A  F+ +R KL +
Subjt:  DLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.8e-15629.93Show/hide
Query:  WTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKG-GMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSG
        W   D   +S +   +++ V+ +++D +T+R IW  LE +Y++  L   +  K Q+  L    G     + +    L   L  +G K+  +D  I +L+ 
Subjt:  WTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKG-GMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSG

Query:  LGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNKP
        L   YD + + I      + L++V   L  +E                    ++    +G     + + G SY  +  N   GR G   +          
Subjt:  LGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNKP

Query:  QCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSS
         C  C + GH     ++R  PN     G  + Q N        Q+        N N  +  +  +  M       ++ W  D+ AS+H T    +L    
Subjt:  QCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSS

Query:  PCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDR----QTGTILLQGLMHEGLYKFHLH
               V +GN +   I  IG   + ++   + +L ++ HVP +  NL  +S  A D D +  +     F   +    +   ++ +G+    LY+    
Subjt:  PCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDR----QTGTILLQGLMHEGLYKFHLH

Query:  PSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKAN---VIDLWHQRLGHAATPIVSQILKECNISFTNNST-SFCSACAIGKSHALPFYPSQTIISTP
                             T A +  C  E   A     +DLWH+R+GH +   +  + K+  IS+   +T   C  C  GK H + F  S       
Subjt:  PSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKAN---VIDLWHQRLGHAATPIVSQILKECNISFTNNST-SFCSACAIGKSHALPFYPSQTIISTP

Query:  LSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGI-HRVTCPYTSQ
        L L+ +D+ GP    S  G +Y+++F+D  SR  WVY L++K + +  F  F   VE+  G  +K L++D GGE+  R    Y  S GI H  T P T Q
Subjt:  LSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGI-HRVTCPYTSQ

Query:  QNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYS
         NG+ ER +R IV+   ++L  A LP  FW +    A Y INR P+  L+   P      K+  YS  K FGC  F  +      KL  +S PC+F+GY 
Subjt:  QNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYS

Query:  NMHRGYKCLDRT-GRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIHSSINSFLPVLISSPASSQFTS------ASQPSTIVPTSQPLDPATEVAIASPS
        +   GY+  D    +V  SR V F ES          S  VK   I + +   +P   ++P S++ T+        QP  ++   + LD   E       
Subjt:  NMHRGYKCLDRT-GRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIHSSINSFLPVLISSPASSQFTS------ASQPSTIVPTSQPLDPATEVAIASPS

Query:  ASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCP---HWLQAMKDEYAALMKNGTWSLVPHSSTHKTI
            H PL   +   +      ST                     VL+++  + EP ++KE L  P     ++AM++E  +L KNGT+ LV      + +
Subjt:  ASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCP---HWLQAMKDEYAALMKNGTWSLVPHSSTHKTI

Query:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK
         CKWVFK+K++ D  + RYKARLV KGF Q   ID+ E FSPV+K T+IR +L+ A +   ++ QLD+  AFLHG L E+++MEQP GF ++G   +VCK
Subjt:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK

Query:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYI-LIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDG
        L+K+LYGLKQAPR W+ +  SF+ +  +  + +D  + F+    +   I L+YVDD++I+G     I +L   L+  F +KDLG     LG+++   +  
Subjt:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYI-LIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDG

Query:  -GLFLSQTKYITDLLHKAKMFEANPITTPM-----VSGSVVSAFNGEKFSDVHF-YRSIVGALQYAII-TRPEIAYSVNKVCQFMHSPTQVHWQAVKRIL
          L+LSQ KYI  +L +  M  A P++TP+     +S  +      EK +     Y S VG+L YA++ TRP+IA++V  V +F+ +P + HW+AVK IL
Subjt:  -GLFLSQTKYITDLLHKAKMFEANPITTPM-----VSGSVVSAFNGEKFSDVHF-YRSIVGALQYAII-TRPEIAYSVNKVCQFMHSPTQVHWQAVKRIL

Query:  RYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPIL
        RYL+G+ T   L    S+  L GY DAD A D D+RKS++G+   F G  ++W SK Q  ++ S+TEAE+ +   T  E+IWL+  L EL +   +  ++
Subjt:  RYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPIL

Query:  WCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSF
        +CD+  A+ LS N ++H+RTKH+++  +++R++V  + L +  +      AD+ TK +    F
Subjt:  WCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSF

P92519 Uncharacterized mitochondrial protein AtMg008101.2e-5547.6Show/hide
Query:  YILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHF
        Y+L+YVDDI++ GSS++ +  LI  L+  FS+KDLG ++YFLGI++      GLFLSQTKY   +L+ A M +  P++TP+    + S+ +  K+ D   
Subjt:  YILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHF

Query:  YRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSK
        +RSIVGALQY  +TRP+I+Y+VN VCQ MH PT   +  +KR+LRY+KG+   GL + K S L +  + D+DWA     R+ST+GFC F G N+++W +K
Subjt:  YRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSK

Query:  KQSIISRSSTEAEFRSLANTSAELIWLQA
        +Q  +SRSSTE E+R+LA T+AEL W  A
Subjt:  KQSIISRSSTEAEFRSLANTSAELIWLQA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-27138.54Show/hide
Query:  EETNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGV
        EE     +  L +N  + TK   LT +NYL+W  Q+     GY L  ++   DG+ ++  ++    ++  V   NP +  W RQD+LI S +L +++  V
Subjt:  EETNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGV

Query:  LEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTL
           V    T+ +IW+TL ++Y   +       + Q++   KG  T+ +Y   +    D L  +G+ +   + +  +L  L  EY  ++  I AK  P TL
Subjt:  LEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTL

Query:  QEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGS------SYHNNGPNSFRGRGGLNFRGNRGWNGNKP---QCQLCGRFGHTA
         E++  L  HES+     +    S   + +T  + S + T +TN+  NG+      + +NN  +    +   NF  N   N +KP   +CQ+CG  GH+A
Subjt:  QEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGS------SYHNNGPNSFRGRGGLNFRGNRGWNGNKP---QCQLCGRFGHTA

Query:  LKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINL-------DTNWYPDSGASNHVTNDFGNLAVSSPCTSD
         +C Q                         Q   S  NS        Q  P       P  NL         NW  DSGA++H+T+DF NL++  P T  
Subjt:  LKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINL-------DTNWYPDSGASNHVTNDFGNLAVSSPCTSD

Query:  NRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQ
        + V V +G+ + I+H G + L S+ ++   L+N+L+VP+I KNL+SV +    N V  EF P    VKD  TG  LLQG   + LY++ +  S+   L  
Subjt:  NRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQ

Query:  ASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGP
               +S SS   H                WH RLGH A  I++ ++   ++S  N S  F  CS C I KS+ +PF  S    + PL  I +D+W  
Subjt:  ASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGP

Query:  AVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIV
         + S  N +RYY+ FVD ++R+TW+Y L+ KS+   TF+TFK  +E      I    +D GGEF AL  Y    GI H  + P+T + NG+ ERKHRHIV
Subjt:  AVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIV

Query:  DMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RT
        + GLTLLS AS+P  +W   F+ AVY INRLPT +L   SP +KLFG  P+Y   + FGC C+P LRPYN HKL  +S  CVFLGYS     Y CL  +T
Subjt:  DMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RT

Query:  GRVFISRHVQFNESSFPY---------LQSFLHSSSVKPLPIHSSINSFLPVL-------------------------------ISSPASSQFTSASQPS
         R++ISRHV+F+E+ FP+         +Q     SS    P H+++ +  PVL                               + S  SS F S+ +P+
Subjt:  GRVFISRHVQFNESSFPY---------LQSFLHSSSVKPLPIHSSINSFLPVL-------------------------------ISSPASSQFTSASQPS

Query:  T-----IVPTSQPLDPATEV---------------------AIASP--SASTSHSPLTNID-----------LSHIPEP------NLTSTPIVTNTHPMV
                PT+QP    T+                      ++++P  S+S+S SP T+             L H P P      N    P+  NTH M 
Subjt:  T-----IVPTSQPLDPATEV---------------------AIASP--SASTSHSPLTNID-----------LSHIPEP------NLTSTPIVTNTHPMV

Query:  TRSKNGIV--CPKVLLAEYI--EVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMA
        TR+K GI+   PK  LA  +  E EP T  +AL+   W  AM  E  A + N TW LVP   +H TI GC+W+F  K N+DGS+ RYKARLVAKG++Q  
Subjt:  TRSKNGIV--CPKVLLAEYI--EVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMA

Query:  DIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSK
         +DY ETFSPVIK T+IR++L  A+   W I QLD+NNAFL G LT+DV+M QPPGF        VCKL KALYGLKQAPRAW+  L ++LL +GF  S 
Subjt:  DIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSK

Query:  ADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGS
        +DTSL     G S  Y+L+YVDDI+I G+  + +   +  L+ +FS+KD   L+YFLGIE       GL LSQ +YI DLL +  M  A P+TTPM    
Subjt:  ADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGS

Query:  VVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSG
         +S ++G K +D   YR IVG+LQY   TRP+I+Y+VN++ QFMH PT+ H QA+KRILRYL G+   G+ L+K + L L+ Y+DADWA D DD  ST+G
Subjt:  VVSAFNGEKFSDVHFYRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSG

Query:  FCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMI
        + ++ G + ++W SKKQ  + RSSTEAE+RS+ANTS+E+ W+ +LL EL I  +RPP+++CDN+GA +L ANPVFHSR KH+ +D +F+R+ V    L +
Subjt:  FCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMI

Query:  QHLPAFAQLADIFTKPLSATSFLHIRSKLNV
         H+    QLAD  TKPLS T+F +  SK+ V
Subjt:  QHLPAFAQLADIFTKPLSATSFLHIRSKLNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-25837.72Show/hide
Query:  LGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETS
        L +N  + TK   LT +NYL+W  Q+     GY L  ++  D  TP    +   D     V   NP +  W RQD+LI S +L +++  V   V    T+
Subjt:  LGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETS

Query:  REIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAH
         +IW+TL ++Y       N SY         G +T   + ++     D L  +G+ +   + +  +L  L  +Y  ++  I AK  P +L E++  L   
Subjt:  REIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAH

Query:  ESR----SERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNN
        ES+    +    V I  +V T   T  + ++   G   +  N    +NN  NS++     +   NR       +CQ+C   GH+A +C     P  H   
Subjt:  ESR----SERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNN

Query:  GGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLY
           N Q +     P+Q            N ++ S P  A           NW  DSGA++H+T+DF NL+   P T  + V + +G+ + I H G + L 
Subjt:  GGFNHQGNQMVQQPFQQSFSGNNSVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLY

Query:  SSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTS
        +S ++S  LN +L+VP+I KNL+SV +    N V  EF P    VKD  TG  LLQG   + LY++                 P++SS + +     C+ 
Subjt:  SSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTS

Query:  ENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRF
            +     WH RLGH +  I++ ++   ++   N S     CS C I KSH +PF  S    S PL  I +D+W   + S  N +RYY+ FVD ++R+
Subjt:  ENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRF

Query:  TWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFS
        TW+Y L+ KS+   TF+ FK  VE      I  L +D GGEF  L  YL   GI H  + P+T + NG+ ERKHRHIV+MGLTLLS AS+P  +W   FS
Subjt:  TWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGI-HRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFS

Query:  AAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISRHVQFNESSFPYLQSF
         AVY INRLPT +L   SP +KLFG+ P+Y   K FGC C+P LRPYN HKL+ +S  C F+GYS     Y CL   TGR++ SRHVQF+E  FP+  + 
Subjt:  AAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISRHVQFNESSFPYLQSF

Query:  L--------HSSSVKPLPIHSSINSFLPVL-------------------------------------ISSPASSQFTSASQPSTIVPTSQPLDPATEVAI
                  S S    P H+++ +   VL                                     ISSP+SS+ T+ S      PT+QP       + 
Subjt:  L--------HSSSVKPLPIHSSINSFLPVL-------------------------------------ISSPASSQFTSASQPSTIVPTSQPLDPATEVAI

Query:  A-----------SPSASTSHSPLTNIDLS--HIPEPNL------------TSTPIV-----------------TNTHPMVTRSKNGIVCP--KVLLAEYI
        +           SP++   +SPL    +S  HIP P+             TSTP +                  NTH M TR+K+GI  P  K   A  +
Subjt:  A-----------SPSASTSHSPLTNIDLS--HIPEPNL------------TSTPIV-----------------TNTHPMVTRSKNGIVCP--KVLLAEYI

Query:  EV--EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVL
            EP T  +A++   W QAM  E  A + N TW LVP      TI GC+W+F  K N+DGS+ RYKARLVAKG++Q   +DY ETFSPVIK T+IR++
Subjt:  EV--EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVL

Query:  LTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIY
        L  A+   W I QLD+NNAFL G LT++V+M QPPGF        VC+L KA+YGLKQAPRAW+  L ++LL +GF  S +DTSL     G S  Y+L+Y
Subjt:  LTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIY

Query:  VDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIV
        VDDI+I G+ +  +   +  L+ +FS+K+   L+YFLGIE       GL LSQ +Y  DLL +  M  A P+ TPM +   ++  +G K  D   YR IV
Subjt:  VDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIV

Query:  GALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSII
        G+LQY   TRP+++Y+VN++ Q+MH PT  HW A+KR+LRYL G+   G+ L+K + L L+ Y+DADWA D DD  ST+G+ ++ G + ++W SKKQ  +
Subjt:  GALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSII

Query:  SRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSAT
         RSSTEAE+RS+ANTS+EL W+ +LL EL I  S PP+++CDN+GA +L ANPVFHSR KH+ LD +F+R+ V    L + H+    QLAD  TKPLS  
Subjt:  SRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSAT

Query:  SFLHIRSKLNV
        +F +   K+ V
Subjt:  SFLHIRSKLNV

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.9e-1124.6Show/hide
Query:  LTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE-DVLDCETSREIWKTLEEMYV
        + ESNY  W+   L     + +  +I   DGT  L  ++ ND             + W ++D ++   L  ++T    +   +   TSR+IW  ++  + 
Subjt:  LTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE-DVLDCETSREIWKTLEEMYV

Query:  TSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNID
         +  A+ +   ++++    G M + +Y+ KMKKLADSL+ +   V+ ++ ++Y+L+GL  ++D I++VI  +    +  +   +L   E R +R+     
Subjt:  TSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNID

Query:  GSVPTVNLTQQSSSKKGTGSTNDQKNGSSY-----HNNGPNSFRGRGG
          V   + +   +  +    TN Q++G +         G N FRGRGG
Subjt:  GSVPTVNLTQQSSSKKGTGSTNDQKNGSSY-----HNNGPNSFRGRGG

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.1e-11241.45Show/hide
Query:  EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYAL
        EP+T  EA     W  AM DE  A+    TW +       K IGCKWV+KIK N+DG+I RYKARLVAKG+ Q   ID+ ETFSPV K T+++++L  + 
Subjt:  EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYAL

Query:  ANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPL----VCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV
           + +HQLDI+NAFL+G L E+++M+ PPG++      L    VC L K++YGLKQA R WF + S  L+  GF  S +D +   +   +    +L+YV
Subjt:  ANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPL----VCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV

Query:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVG
        DDI+I  ++ + + +L S L   F L+DLG L YFLG+E++     G+ + Q KY  DLL +  +    P + PM      SA +G  F D   YR ++G
Subjt:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVG

Query:  ALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIIS
         L Y  ITR +I+++VNK+ QF  +P   H QAV +IL Y+KG+   GL     + + L  ++DA + S  D R+ST+G+C+F G +L++W SKKQ ++S
Subjt:  ALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIIS

Query:  RSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLS
        +SS EAE+R+L+  + E++WL     ELQ+P S+P +L+CDN  A+H++ N VFH RTKH+E D + VR+  + +  +     A+ +  D FT+ LS
Subjt:  RSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLS

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.7e-1224.18Show/hide
Query:  LTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCE-TSREIWKTLEEMYV
        L + NY VW+         +G+  +I               DGSS+           W  +D L+  W+  ++T+ +L+ ++    T+R++W +LE ++ 
Subjt:  LTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCE-TSREIWKTLEEMYV

Query:  TSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNID
         +  A+ + ++N+++      +++ EY  K+K L+D L  +   +S +  ++++L+GL  +YD I++VI  KS   +  E   +L   ESR    +    
Subjt:  TSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNID

Query:  GSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRG----------NRGWNGNKPQCQLCG
         S+   N     S+   T     ++    YHNN  N  RGR     RG          N  W  N+P   + G
Subjt:  GSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRG----------NRGWNGNKPQCQLCG

ATMG00810.1 DNA/RNA polymerases superfamily protein8.2e-5747.6Show/hide
Query:  YILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHF
        Y+L+YVDDI++ GSS++ +  LI  L+  FS+KDLG ++YFLGI++      GLFLSQTKY   +L+ A M +  P++TP+    + S+ +  K+ D   
Subjt:  YILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHF

Query:  YRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSK
        +RSIVGALQY  +TRP+I+Y+VN VCQ MH PT   +  +KR+LRY+KG+   GL + K S L +  + D+DWA     R+ST+GFC F G N+++W +K
Subjt:  YRSIVGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSK

Query:  KQSIISRSSTEAEFRSLANTSAELIWLQA
        +Q  +SRSSTE E+R+LA T+AEL W  A
Subjt:  KQSIISRSSTEAEFRSLANTSAELIWLQA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.7e-2549.6Show/hide
Query:  MVTRSKNGI--VCPK--VLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQM
        M+TRSK GI  + PK  + +   I+ EP +V  AL+ P W QAM++E  AL +N TW LVP       +GCKWVFK K ++DG++ R KARLVAKGFHQ 
Subjt:  MVTRSKNGI--VCPK--VLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQM

Query:  ADIDYTETFSPVIKPTTIRVLLTYA
          I + ET+SPV++  TIR +L  A
Subjt:  ADIDYTETFSPVIKPTTIRVLLTYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAACAAATACACAGCAATCGGTGACTCTGGGCATTAATCCTGGAAGTCACACCAAGGCAAATACACTTACCGAAAGCAATTATCTTGTTTGGAAACTTCAGAT
TCTCAGAACGCTACAGGGATATGGACTCGAAGATTATATTCTCGACGATGATGGAACCCCGTCTCTGTATCTTAGTTCTACTAATGATGGATCCTCGTCCACAGTGGAAG
AAGCCAATCCCGCACATTTACTTTGGACCCGTCAAGATCGTCTAATCTCTTCGTGGCTTCTCAGTTCGATGACAGAGGGAGTTCTTGAAGATGTGCTTGACTGTGAGACA
TCTCGCGAAATTTGGAAAACATTAGAAGAAATGTATGTTACAAGCAATTTGGCAAAAAATATGAGCTACAAGAATCAGATGCAAAACCTGAAGAAAGGAGGCATGACTCT
AAAAGAATACTTTTCAAAGATGAAGAAACTGGCAGATTCTTTAAAGGCCATTGGAGAAAAGGTATCCACTAAAGACCATATCATTTATATTCTCTCTGGATTGGGTGTCG
AATATGATGCTATAGTATCTGTTATCACTGCCAAATCTAGACCTTTGACACTGCAAGAAGTTTATGGATTACTGTATGCTCATGAGTCAAGGTCAGAAAGAAGCACTGTG
AATATTGATGGTTCTGTGCCCACAGTAAACCTAACTCAACAAAGTTCATCCAAGAAAGGTACTGGTTCTACCAATGATCAAAAGAATGGTTCATCATATCACAACAATGG
TCCTAATTCCTTCAGAGGTCGTGGAGGACTCAATTTCAGAGGAAACAGAGGTTGGAATGGTAATAAACCACAGTGTCAACTATGTGGACGTTTTGGACACACAGCTTTGA
AATGTTATCAGCGATTTGATCCAAACTTCCATGGTAACAATGGTGGTTTTAACCATCAAGGCAATCAAATGGTTCAACAACCTTTTCAGCAGTCTTTCTCTGGTAATAAT
AGCGTGGGCAATCAAAATTATTCTATGCAAAGTCACCCTATGCAGGCTATAATGGTGGCTCCCAATATTAATCTTGATACCAATTGGTATCCCGATTCAGGAGCTTCAAA
TCATGTGACGAATGATTTTGGTAATCTTGCAGTGAGTTCTCCTTGCACTAGTGACAATAGAGTTCATGTCGGTAATGGGGCAGGTTTGTCTATCAATCATATTGGCTATT
CTCATTTGTACTCCTCTAATAACCAATCCTTTTTACTCAATAACCTCTTACATGTTCCCCATATTACTAAGAATCTTTTGAGTGTCAGCCAATTTGCTAAAGATAATGAT
GTCTTTTTTGAATTCCATCCTCTTGTCTGTTTTGTGAAGGACCGTCAGACTGGTACAATTCTGCTCCAAGGACTGATGCATGAAGGACTGTACAAGTTTCATCTCCACCC
TTCCAAAACTCAAGATTTGAAGCAAGCATCCCTGGTTCCTCCTCTGTCCTCTTCCTCCTCTACCACAGCTCATGTTTTAGCTTGTACCTCTGAAAATACTAAAGCCAATG
TAATAGATCTGTGGCATCAAAGACTTGGTCATGCTGCCACTCCAATTGTATCCCAAATTTTAAAGGAATGCAATATTTCATTTACTAATAATTCTACTTCCTTTTGTTCT
GCCTGTGCAATTGGTAAAAGTCATGCCCTTCCATTTTACCCTTCACAGACTATTATTAGTACACCCCTGTCTCTTATCGAAACCGATCTTTGGGGACCAGCTGTTAAAAG
CTCCAAAAATGGTTTCAGATACTATATCAGTTTTGTTGATGTCTACTCTCGTTTCACTTGGGTTTATTTTCTTCAGTCCAAATCCGAGGCTTATTCTACCTTTCTTACTT
TTAAAATACATGTGGAAAAACTTCTTGGTCATTCAATTAAAATGTTACAAACTGATGGGGGTGGTGAGTTTCGTGCTCTTGCCCCATACTTAAAGTCTCAGGGTATTCAT
AGAGTTACTTGTCCCTACACCTCCCAACAAAATGGTATAGTTGAGCGTAAGCATAGGCACATTGTTGATATGGGACTGACCTTACTTTCTCAGGCTTCCCTTCCTCTTGA
GTTTTGGGACGATGTGTTTTCAGCAGCAGTCTATACCATTAATCGGTTGCCTACTACAGTCCTAAGTGGCATCAGTCCTGTGGAGAAATTGTTCGGTAAGAAACCTGATT
ATTCCTTTTTTAAAACCTTTGGCTGCTTATGTTTTCCATGCTTACGACCCTACAATGACCATAAACTCCAATTTCGTTCTGCTCCTTGTGTTTTTCTTGGCTATAGCAAT
ATGCACAGGGGCTACAAATGTTTAGATAGAACTGGACGTGTTTTTATTTCAAGACATGTTCAGTTTAATGAATCTTCTTTTCCATATCTTCAGTCTTTCTTACATTCCTC
CTCTGTCAAGCCTTTGCCTATACACTCATCTATCAACTCTTTCCTACCTGTGTTGATTTCGTCTCCTGCTTCTTCCCAGTTTACATCTGCTTCTCAGCCTTCTACTATTG
TTCCTACCTCCCAACCCTTGGATCCTGCCACTGAGGTTGCTATTGCTTCCCCATCTGCATCCACTTCACATTCTCCTTTGACTAATATTGATCTTTCCCATATACCTGAA
CCAAATCTGACATCTACTCCTATTGTCACTAACACTCACCCTATGGTTACTCGCTCAAAAAATGGTATTGTTTGCCCCAAGGTACTACTTGCAGAATACATTGAGGTTGA
ACCGACTACTGTGAAAGAGGCCTTACGTTGTCCTCATTGGCTTCAAGCAATGAAAGATGAATATGCTGCCCTTATGAAAAATGGAACTTGGTCTCTTGTACCTCATTCTT
CTACCCACAAAACAATCGGTTGTAAGTGGGTTTTTAAAATAAAGCGAAATACGGATGGCTCCATTGCTAGGTATAAGGCACGCTTAGTTGCAAAAGGATTTCATCAAATG
GCAGACATTGACTACACTGAAACTTTTAGTCCAGTTATTAAGCCCACGACAATACGTGTCTTACTTACTTATGCATTGGCTAATGGTTGGCAGATTCATCAACTGGATAT
TAATAATGCTTTTCTTCATGGTGTTCTTACTGAGGATGTTTTCATGGAGCAGCCTCCGGGATTCTCTATTTCTGGCTCTTCACCTCTGGTTTGTAAACTCCATAAAGCAC
TTTATGGACTCAAGCAAGCCCCACGAGCATGGTTTGATAGACTGTCTTCCTTTCTACTTGCTCTCGGTTTCAAATGTTCTAAAGCTGACACCTCTCTTCTTTTTCGCCAT
GTTGGTTCATCCAAATGCTATATACTGATCTATGTCGATGATATAGTCATCATGGGCTCTTCATCTTCTGAAATTACTCAGCTTATATCCTTGCTAAATCATCAGTTTTC
CTTGAAAGATCTTGGTAGGCTGAATTATTTTCTTGGTATTGAGGTCTCTTATCCAAAGGATGGAGGTTTATTCCTCTCTCAAACCAAGTATATTACTGACCTTCTTCATA
AGGCCAAGATGTTTGAAGCTAATCCTATTACTACTCCCATGGTAAGTGGCTCTGTAGTTTCTGCCTTTAATGGAGAAAAATTTTCTGATGTTCATTTTTATAGAAGTATT
GTGGGGGCCTTACAGTATGCTATAATCACTAGACCAGAAATTGCTTATAGTGTGAACAAGGTGTGTCAATTTATGCATTCTCCTACTCAAGTTCACTGGCAGGCGGTCAA
GAGGATTCTTAGATATCTCAAGGGTTCATTCACTTCAGGCCTTTTGCTTCGGAAGCCATCTAATCTTGGACTATATGGCTATGCGGATGCTGATTGGGCCTCAGACCCAG
ATGACCGCAAATCAACCTCTGGTTTCTGCATTTTCTTTGGTGGAAATCTTGTAACCTGGGGCTCGAAGAAGCAGAGCATTATCTCACGCTCCAGCACTGAGGCTGAGTTC
AGAAGCCTGGCAAATACATCAGCAGAGTTAATCTGGTTGCAAGCTCTTTTAGCTGAGTTACAAATCCCTACTTCTCGTCCACCCATCTTATGGTGTGACAACTTGGGAGC
TGTTCATCTCAGTGCCAATCCAGTTTTTCATTCTCGAACAAAACATGTTGAGTTAGACATCTACTTTGTCCGCGATTTGGTTCTCCAAAAGAGACTCATGATTCAACACC
TTCCAGCATTTGCCCAACTTGCTGATATATTTACAAAGCCCCTGTCTGCTACTTCCTTTCTGCATATTCGTTCCAAGCTCAATGTCTGTGATGCTTATGACATTGGCTTG
AGGGGGGGAGAGCTCGACTATGATTTGGTTCGAGGTAAGGTGTCAAGCTATGGTGCTTACCCAACATGCTCTAGACCGACCTTTGTTGAGCTGAGCAAGAGTGATATTGC
GTACCTAGGTACTACTCGTCGTACGGACACTATCTTGATAGAAGCCAAGGTCAAAAATTCCGAGTCAGTTCCACATTCGAGTCAGGTGCTCATGGCCATGTGGCTTTCAG
GCTGGGGGTGCCTTCTCCCTAATTCACGCCCCAAGGTCACTTTGGCCAGCCTAGTTGGCTCATCCTTCATCAGGTTGGCTCGAACTGGCCAGATTGGTGTTATCACATCA
CATCTAGATTGTCCACGTGAGCGGCATTTTCTTACCCACAACATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAACAAATACACAGCAATCGGTGACTCTGGGCATTAATCCTGGAAGTCACACCAAGGCAAATACACTTACCGAAAGCAATTATCTTGTTTGGAAACTTCAGAT
TCTCAGAACGCTACAGGGATATGGACTCGAAGATTATATTCTCGACGATGATGGAACCCCGTCTCTGTATCTTAGTTCTACTAATGATGGATCCTCGTCCACAGTGGAAG
AAGCCAATCCCGCACATTTACTTTGGACCCGTCAAGATCGTCTAATCTCTTCGTGGCTTCTCAGTTCGATGACAGAGGGAGTTCTTGAAGATGTGCTTGACTGTGAGACA
TCTCGCGAAATTTGGAAAACATTAGAAGAAATGTATGTTACAAGCAATTTGGCAAAAAATATGAGCTACAAGAATCAGATGCAAAACCTGAAGAAAGGAGGCATGACTCT
AAAAGAATACTTTTCAAAGATGAAGAAACTGGCAGATTCTTTAAAGGCCATTGGAGAAAAGGTATCCACTAAAGACCATATCATTTATATTCTCTCTGGATTGGGTGTCG
AATATGATGCTATAGTATCTGTTATCACTGCCAAATCTAGACCTTTGACACTGCAAGAAGTTTATGGATTACTGTATGCTCATGAGTCAAGGTCAGAAAGAAGCACTGTG
AATATTGATGGTTCTGTGCCCACAGTAAACCTAACTCAACAAAGTTCATCCAAGAAAGGTACTGGTTCTACCAATGATCAAAAGAATGGTTCATCATATCACAACAATGG
TCCTAATTCCTTCAGAGGTCGTGGAGGACTCAATTTCAGAGGAAACAGAGGTTGGAATGGTAATAAACCACAGTGTCAACTATGTGGACGTTTTGGACACACAGCTTTGA
AATGTTATCAGCGATTTGATCCAAACTTCCATGGTAACAATGGTGGTTTTAACCATCAAGGCAATCAAATGGTTCAACAACCTTTTCAGCAGTCTTTCTCTGGTAATAAT
AGCGTGGGCAATCAAAATTATTCTATGCAAAGTCACCCTATGCAGGCTATAATGGTGGCTCCCAATATTAATCTTGATACCAATTGGTATCCCGATTCAGGAGCTTCAAA
TCATGTGACGAATGATTTTGGTAATCTTGCAGTGAGTTCTCCTTGCACTAGTGACAATAGAGTTCATGTCGGTAATGGGGCAGGTTTGTCTATCAATCATATTGGCTATT
CTCATTTGTACTCCTCTAATAACCAATCCTTTTTACTCAATAACCTCTTACATGTTCCCCATATTACTAAGAATCTTTTGAGTGTCAGCCAATTTGCTAAAGATAATGAT
GTCTTTTTTGAATTCCATCCTCTTGTCTGTTTTGTGAAGGACCGTCAGACTGGTACAATTCTGCTCCAAGGACTGATGCATGAAGGACTGTACAAGTTTCATCTCCACCC
TTCCAAAACTCAAGATTTGAAGCAAGCATCCCTGGTTCCTCCTCTGTCCTCTTCCTCCTCTACCACAGCTCATGTTTTAGCTTGTACCTCTGAAAATACTAAAGCCAATG
TAATAGATCTGTGGCATCAAAGACTTGGTCATGCTGCCACTCCAATTGTATCCCAAATTTTAAAGGAATGCAATATTTCATTTACTAATAATTCTACTTCCTTTTGTTCT
GCCTGTGCAATTGGTAAAAGTCATGCCCTTCCATTTTACCCTTCACAGACTATTATTAGTACACCCCTGTCTCTTATCGAAACCGATCTTTGGGGACCAGCTGTTAAAAG
CTCCAAAAATGGTTTCAGATACTATATCAGTTTTGTTGATGTCTACTCTCGTTTCACTTGGGTTTATTTTCTTCAGTCCAAATCCGAGGCTTATTCTACCTTTCTTACTT
TTAAAATACATGTGGAAAAACTTCTTGGTCATTCAATTAAAATGTTACAAACTGATGGGGGTGGTGAGTTTCGTGCTCTTGCCCCATACTTAAAGTCTCAGGGTATTCAT
AGAGTTACTTGTCCCTACACCTCCCAACAAAATGGTATAGTTGAGCGTAAGCATAGGCACATTGTTGATATGGGACTGACCTTACTTTCTCAGGCTTCCCTTCCTCTTGA
GTTTTGGGACGATGTGTTTTCAGCAGCAGTCTATACCATTAATCGGTTGCCTACTACAGTCCTAAGTGGCATCAGTCCTGTGGAGAAATTGTTCGGTAAGAAACCTGATT
ATTCCTTTTTTAAAACCTTTGGCTGCTTATGTTTTCCATGCTTACGACCCTACAATGACCATAAACTCCAATTTCGTTCTGCTCCTTGTGTTTTTCTTGGCTATAGCAAT
ATGCACAGGGGCTACAAATGTTTAGATAGAACTGGACGTGTTTTTATTTCAAGACATGTTCAGTTTAATGAATCTTCTTTTCCATATCTTCAGTCTTTCTTACATTCCTC
CTCTGTCAAGCCTTTGCCTATACACTCATCTATCAACTCTTTCCTACCTGTGTTGATTTCGTCTCCTGCTTCTTCCCAGTTTACATCTGCTTCTCAGCCTTCTACTATTG
TTCCTACCTCCCAACCCTTGGATCCTGCCACTGAGGTTGCTATTGCTTCCCCATCTGCATCCACTTCACATTCTCCTTTGACTAATATTGATCTTTCCCATATACCTGAA
CCAAATCTGACATCTACTCCTATTGTCACTAACACTCACCCTATGGTTACTCGCTCAAAAAATGGTATTGTTTGCCCCAAGGTACTACTTGCAGAATACATTGAGGTTGA
ACCGACTACTGTGAAAGAGGCCTTACGTTGTCCTCATTGGCTTCAAGCAATGAAAGATGAATATGCTGCCCTTATGAAAAATGGAACTTGGTCTCTTGTACCTCATTCTT
CTACCCACAAAACAATCGGTTGTAAGTGGGTTTTTAAAATAAAGCGAAATACGGATGGCTCCATTGCTAGGTATAAGGCACGCTTAGTTGCAAAAGGATTTCATCAAATG
GCAGACATTGACTACACTGAAACTTTTAGTCCAGTTATTAAGCCCACGACAATACGTGTCTTACTTACTTATGCATTGGCTAATGGTTGGCAGATTCATCAACTGGATAT
TAATAATGCTTTTCTTCATGGTGTTCTTACTGAGGATGTTTTCATGGAGCAGCCTCCGGGATTCTCTATTTCTGGCTCTTCACCTCTGGTTTGTAAACTCCATAAAGCAC
TTTATGGACTCAAGCAAGCCCCACGAGCATGGTTTGATAGACTGTCTTCCTTTCTACTTGCTCTCGGTTTCAAATGTTCTAAAGCTGACACCTCTCTTCTTTTTCGCCAT
GTTGGTTCATCCAAATGCTATATACTGATCTATGTCGATGATATAGTCATCATGGGCTCTTCATCTTCTGAAATTACTCAGCTTATATCCTTGCTAAATCATCAGTTTTC
CTTGAAAGATCTTGGTAGGCTGAATTATTTTCTTGGTATTGAGGTCTCTTATCCAAAGGATGGAGGTTTATTCCTCTCTCAAACCAAGTATATTACTGACCTTCTTCATA
AGGCCAAGATGTTTGAAGCTAATCCTATTACTACTCCCATGGTAAGTGGCTCTGTAGTTTCTGCCTTTAATGGAGAAAAATTTTCTGATGTTCATTTTTATAGAAGTATT
GTGGGGGCCTTACAGTATGCTATAATCACTAGACCAGAAATTGCTTATAGTGTGAACAAGGTGTGTCAATTTATGCATTCTCCTACTCAAGTTCACTGGCAGGCGGTCAA
GAGGATTCTTAGATATCTCAAGGGTTCATTCACTTCAGGCCTTTTGCTTCGGAAGCCATCTAATCTTGGACTATATGGCTATGCGGATGCTGATTGGGCCTCAGACCCAG
ATGACCGCAAATCAACCTCTGGTTTCTGCATTTTCTTTGGTGGAAATCTTGTAACCTGGGGCTCGAAGAAGCAGAGCATTATCTCACGCTCCAGCACTGAGGCTGAGTTC
AGAAGCCTGGCAAATACATCAGCAGAGTTAATCTGGTTGCAAGCTCTTTTAGCTGAGTTACAAATCCCTACTTCTCGTCCACCCATCTTATGGTGTGACAACTTGGGAGC
TGTTCATCTCAGTGCCAATCCAGTTTTTCATTCTCGAACAAAACATGTTGAGTTAGACATCTACTTTGTCCGCGATTTGGTTCTCCAAAAGAGACTCATGATTCAACACC
TTCCAGCATTTGCCCAACTTGCTGATATATTTACAAAGCCCCTGTCTGCTACTTCCTTTCTGCATATTCGTTCCAAGCTCAATGTCTGTGATGCTTATGACATTGGCTTG
AGGGGGGGAGAGCTCGACTATGATTTGGTTCGAGGTAAGGTGTCAAGCTATGGTGCTTACCCAACATGCTCTAGACCGACCTTTGTTGAGCTGAGCAAGAGTGATATTGC
GTACCTAGGTACTACTCGTCGTACGGACACTATCTTGATAGAAGCCAAGGTCAAAAATTCCGAGTCAGTTCCACATTCGAGTCAGGTGCTCATGGCCATGTGGCTTTCAG
GCTGGGGGTGCCTTCTCCCTAATTCACGCCCCAAGGTCACTTTGGCCAGCCTAGTTGGCTCATCCTTCATCAGGTTGGCTCGAACTGGCCAGATTGGTGTTATCACATCA
CATCTAGATTGTCCACGTGAGCGGCATTTTCTTACCCACAACATTTGA
Protein sequenceShow/hide protein sequence
MEETNTQQSVTLGINPGSHTKANTLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSSSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCET
SREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTV
NIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGLNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGFNHQGNQMVQQPFQQSFSGNN
SVGNQNYSMQSHPMQAIMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGYSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDND
VFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHQRLGHAATPIVSQILKECNISFTNNSTSFCS
ACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIH
RVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDVFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSN
MHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIHSSINSFLPVLISSPASSQFTSASQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPE
PNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQM
ADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRH
VGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSI
VGALQYAIITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEF
RSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVFHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGL
RGGELDYDLVRGKVSSYGAYPTCSRPTFVELSKSDIAYLGTTRRTDTILIEAKVKNSESVPHSSQVLMAMWLSGWGCLLPNSRPKVTLASLVGSSFIRLARTGQIGVITS
HLDCPRERHFLTHNI