; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012076 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012076
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153207:80346..88844
RNA-Seq ExpressionSgr012076
SyntenySgr012076
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR000209 - Peptidase S8/S53 domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036852 - Peptidase S8/S53 domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]0.0e+0046.64Show/hide
Query:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+  + L   N+L+WK QI+  ++GYGL+ ++  DD     +       SP  + +          + +L +S   SS    +    L           
Subjt:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ V+ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGRNFRGNRG-------WNGN----K
        +  ++D S P+ ++   ++ +KG        + N Q + S Y+                     N   N    RGG   RGN+G       WN +    K
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGRNFRGNRG-------WNGN----K

Query:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS
        P CQLCG+ GH   +CY RFD  F               Q P  Q+ S  NS     Y   S  +  ++    +  D NWYPDSGASNHVT +  NL  S
Subjt:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS

Query:  SPCTSDNRVHVGNGAGLSINHIGSSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL
        +     N+VHVGNG GLSI HIG S   S  +++  LLN+LLHVP ITKNLLSVS+FAKDN VFFEFH   CFVKD+ T  +L+ G + +GLY F   HL
Subjt:  SPCTSDNRVHVGNGAGLSINHIGSSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL

Query:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS
            TQ L ++  V   S SS        CT+  + ++  DLWHKRLGH +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  + PL 
Subjt:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS

Query:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGI
        LI  DLWGP +  S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI+HRV+CP+T QQNG+
Subjt:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGI

Query:  VERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR
         ERKHR IV+ GLTLL  ASLPL+FWD++F   VY  NRLPT +L    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGYS  H+
Subjt:  VERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR

Query:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAIASPSA
        GYKC+   GRV+IS  V FNE+SFPY ++   S    S+V P   H S ++  PVL    + +PTS  S     S+   IV T  P  P +     +P+ 
Subjt:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAIASPSA

Query:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI
          S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  +  + I
Subjt:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI

Query:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK
        GCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+T+RV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    +  LVC+
Subjt:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK

Query:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG
        LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   +   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+  + G
Subjt:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG

Query:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS
        L LSQTKYI DLL K KM    P  TP+ +G  +   +G+   D+H YRS VGALQY TITRPE+++SVNKVCQFM +PT+ HW+ VKRILRYL+G+   
Subjt:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS

Query:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH
        GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ I+SRSS E E+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDNL  V 
Subjt:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH

Query:  LSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
        LSANPVLH+RTKH+ELD+YFVR+ V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  LSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0047.71Show/hide
Query:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S    + L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E                              G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC
        V   L AHE R      + D S   VN T Q S++  + S N                      GS  HN G    RGRG       R   G KPQCQLC
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC

Query:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV
         +FGHT  +C+ R+DPNFHGN   NG          +     S S   +V    Y  Q +     M+AM+  P    +  W+PDSGA+NHVT+D GNL  
Subjt:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV

Query:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH
         +    ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L 
Subjt:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH

Query:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII
             K   L  ++    L+  +++  H          N+  +V DLWHKRLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ 
Subjt:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS
        + PL L+ +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  GIIHR++CP+TS
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        +QNGI+ERKHRHIV++GLTLL+QASLPL++W DAFS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D
        S+ H+GYKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D

Query:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY
         ++ V I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+
Subjt:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY

Query:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE
         ALMKN TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E
Subjt:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE

Query:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF
        +V+M+QPPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   F
Subjt:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF

Query:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH
        SLKDLG L+YFLGIEV    DGGL LSQ KYI DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY TITRPEIA+SVNKVCQFM 
Subjt:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH

Query:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL
         P   HW+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+L
Subjt:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL

Query:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV
        L+ELQ   +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V
Subjt:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0047.85Show/hide
Query:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S    + L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E+W T+ + + + + AK M YK+QMQ LKK G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC
        V   L AHE R      + D S   VN T Q S++  + S N                      GS  HN G    RGRG       R   G KPQCQLC
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC

Query:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV
         +FGHT  +C+ R+DPNFHGN   NG          +     S S   +V    Y  Q +     M+AM+  P    +  W+PDSGA+NHVT+D GNL  
Subjt:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV

Query:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH
         +    ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L 
Subjt:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH

Query:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII
             K   L  ++    L+  +++  H          N+  +V DLWHKRLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ 
Subjt:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS
        + PL L+ +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  GIIHR++CP+TS
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        +QNGI+ERKHRHIV++GLTLL+QASLPL++W DAFS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D
        S+ H+GYKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D

Query:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY
         ++ V I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+
Subjt:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY

Query:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE
         ALMKN TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E
Subjt:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE

Query:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF
        +V+M+QPPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   F
Subjt:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF

Query:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH
        SLKDLG L+YFLGIE                  DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY TITRPEIA+SVNKVCQFM 
Subjt:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH

Query:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL
         P   HW+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+L
Subjt:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL

Query:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV
        L+ELQ   +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V
Subjt:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0045.18Show/hide
Query:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H   I L  +NY++W+ Q+   +   G ED+I   +G        T+ G      E NP  ++W R DR+I SW+ SS+T  ++  ++  ++S   W  
Subjt:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE ++  S+ A+ M  + + Q  +KG +T+ EY  K+K LAD+L AIGE V+ +D I+ +L GLG +Y++IV+ +TA+   ++L  V+ +L  HE R   
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQM
          ++   SV   N+   + +       N++++      +G N+ RG  G    G    + ++PQCQLCG+FGHT ++CY RFD NF G            
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQM

Query:  VQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLN
                ++ N      N P   + +QAMM +P+   D  W+ D+GA++H++     L+   P   +++V VGNG  L I H G++  + S++++F L 
Subjt:  VQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLN

Query:  NLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVP---PLSSSSSTTAHVLACTSENTKANV
         +LHVP I  NL+SVSQF  DN+ FFEFHP   FVKD+ T  ILLQG +  GLY+F            A  VP      SSS   +  L+ T+  T    
Subjt:  NLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVP---PLSSSSSTTAHVLACTSENTKANV

Query:  IDLWHKRLGHAATPIVSQILKECNISFTNNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSK
          LWH RLGH A  I+  IL  CNIS   +  + C AC   KSH LPF  S +  S PL+L+  DLWGPA   S  G RY+I FVD +SRF+W+Y L SK
Subjt:  IDLWHKRLGHAATPIVSQILKECNISFTNNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSK

Query:  SEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRL
         +A S F+ FK  VE      I+ L++D GGEF+A + YL + GI  + +CPYT +QNG  ERK RHI++ GL LL+ ASLP +FW  AF  A++ INRL
Subjt:  SEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRL

Query:  PTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISRHVQFNESSFPYLQSFLHSSSVKPL
        PT VL+  SP + LFGK P+Y  FK FGCLC+P +RPYN +KL +RS+ CVFLGYS+ H+GY CL+  TGR++++RHV F+E+ FP+  +   SSSV  +
Subjt:  PTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISRHVQFNESSFPYLQSFLHSSSVKPL

Query:  PIHSSINSFLPVLISSPTSSQFTSTSQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAE
        P      +FLP   SSP  S   S + PST  P      P T +    PS++ S   L  +  + I     TS P  TN HPMVTR+KNGI   KV  + 
Subjt:  PIHSSINSFLPVLISSPTSSQFTSTSQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAE

Query:  YIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLL
        +I  EPTT  +A++  +W+ AM+ E++AL +N TW LVP  S    IGCKWV+K+K   DG++ RYKARLVA+GF Q   +DY ETFSPV+K +TIR++L
Subjt:  YIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLL

Query:  TYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV
          AL+  W +HQLD+ NAFLHG L E VFM+QPPGF  S     VCKL+KALYGLKQAPRAW+++LS+ LL  GF+ S+AD+S+   H       +LIYV
Subjt:  TYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV

Query:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLF-LSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIV
        DDI++ GSSS++++  I+ LN  F+L+DLG +NYFLGIEV   + G +F LSQ KY  DLL +  M ++ P TTP + G  +S  +GE FSD   YRS V
Subjt:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLF-LSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIV

Query:  GALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSII
        GALQY T+TRP+I+++VNK CQFM +PT  HW AVKRILRYLKG+ + G+ +++ ++L ++GY DADWAS PDDR+ST G+ IF G NLV+W S KQ ++
Subjt:  GALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSII

Query:  SRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSAT
        SRSS E+E+R+LA+ ++E+IW+Q +L EL + +S PP+LWCDN  A HL+ANPV H+RTKH+E+D++F+RD VL+K+L+IQ+LP+  Q+ADIFTK +S++
Subjt:  SRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSAT

Query:  SFLHIRSKLNVCDAYDIGLRG
         FL  R+KL+V  +  + LRG
Subjt:  SFLHIRSKLNVCDAYDIGLRG

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0045.26Show/hide
Query:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+  + L   N+L+WK QI+  ++GYGL+ ++  DD  P  +L+  +  S    +E     L W +QD+L+ SWLLSS++E +L  ++ C+TS  +W  
Subjt:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ ++ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGRNFRGNRG-------WNG
        +  ++D S P+ ++   ++ +KG        + N Q N S Y+                         NG ++   FRGRGG   RGNRG       WN 
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGRNFRGNRG-------WNG

Query:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDF
        +    KP CQLCG+ GH   +CY RFD  F               Q P  Q+ SG N      Y   S  +  ++    +  D NWYPDSGASNHVT + 
Subjt:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDF

Query:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF
         NL  S      N+VHVGNG G                                                  +P    V           G + +GLY F
Subjt:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF

Query:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII
           HL    TQ L ++  V   S SS      L+ T         DLWHKRLG  +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  
Subjt:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS
        + PL LI +DLWGPA   S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI+HRV+CP+T 
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        QQNG+ ERKHR IV+ GLTLL   SLPL+FWD++F   VY  NRLPT VL    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAI
        S  H+GYKC+   GRV+ISR V FNE+SFPY ++   S    S+V P   H S ++  PVL    + +PTS  S     S+   IV T  P  P +    
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAI

Query:  ASPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSS
         +P+   S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  +
Subjt:  ASPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSS

Query:  THKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSS
          + IGCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+TIRV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    + 
Subjt:  THKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSS

Query:  PLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSY
         LVC+LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   S   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+
Subjt:  PLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSY

Query:  PKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLK
          + GL LSQTKYI DLL K KM    P  TP+ +G  + A +G+   D+H YRS VGALQY TITRPE+++SVNKVCQFM +PT+ HW+AVKRILRYL+
Subjt:  PKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLK

Query:  GSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDN
        G+   GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ  +SRSSTEAE+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDN
Subjt:  GSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDN

Query:  LGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
        L  V LSANPVLH+RTKH+ELD+YFV + V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  LGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

TrEMBL top hitse value%identityAlignment
A0A2N9IMQ9 Integrase catalytic domain-containing protein0.0e+0045.9Show/hide
Query:  HTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTL
        H   I LT  NYL+WK Q++  L+G  L  ++      P   +++++DG+ +T+   NP    W  QD+++ S L+SS++E V+  V+ C TSR++W TL
Subjt:  HTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTL

Query:  EEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERS
        E M+   + A+ M    Q+  L+KG  ++ ++F     LAD+L AI + +     + ++L+GLG EYD+ V+ +  ++ P+TL  +YG L  HE+R E+S
Subjt:  EEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERS

Query:  TVNIDGSVPTVNLTQQSS-SKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQM
           +     + N   + + S+ G G  N   + +    +   SFR   GR  RG       +P CQ+C R GH AL CY RFD NF              
Subjt:  TVNIDGSVPTVNLTQQSS-SKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQM

Query:  VQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV-SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLL
                           Y  +S  MQA         D NWY D+GA+NH+T+D  NL V S      +++ VGNG GLS+ H G+S L S+   SF+L
Subjt:  VQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV-SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLL

Query:  NNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVID
        NN+LHVP ITKNL+SV +F  D D F EFHP    VKDR T  +L +G    GLY F                    ++SST+ + LA   E      ID
Subjt:  NNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVID

Query:  LWHKRLGHAATPIVSQILKECNISFT--NNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSK
         WH RLGH A  +VS+IL + ++     NN    C AC   KS  L F PS T ++ PL LI TD+WGP+   S NGF+YY+SF+D YSR+ W++ +  K
Subjt:  LWHKRLGHAATPIVSQILKECNISFT--NNSTSFCSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSK

Query:  SEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRL
        +E +S F+TF+  VE+L    IK +Q+D GGEFR L  +  S GI HR++CP+T QQNG +ERKHRHIV+ GL LLS A +PL++WDDAFS A Y INRL
Subjt:  SEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRL

Query:  PTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLDR-TGRVFISRHVQFNESSFPYLQSFLHSSSVKPL
        PT +L   +P E LF  KP+Y F K FGC C+P LRPYN HKLQ RS  C+FLGYS +H+GYKCL   +GR++ISR V F E++FP     L +      
Subjt:  PTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLDR-TGRVFISRHVQFNESSFPYLQSFLHSSSVKPL

Query:  PIHSSINSFLPVLISSPTSSQFTSTSQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNL---TSTPIVTNTHPMVTRSKNGIVCPK--
        P   S +S LP+L++   S Q    + P  I+  S P  P +  A    S  TS  P T       P P+L   T TPIV ++HPMVTRSK  I  PK  
Subjt:  PIHSSINSFLPVLISSPTSSQFTSTSQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNL---TSTPIVTNTHPMVTRSKNGIVCPK--

Query:  -----------VLLAEYIE--VEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADID
                    LLAE      EPT    A++ P W +AM  E+ AL+KN TW+LVP +     +G KWVF++KR  DGS+ RYKARLVAKGFHQ   ID
Subjt:  -----------VLLAEYIE--VEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADID

Query:  YTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADT
        YTETFSPV+KPTT+R +L+ AL+  W + QLD+ NAFLHG L+E+V+M QPPGF+       VCKLHKALYGLKQAPRAWF RL+++LL  GF  S++D+
Subjt:  YTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADT

Query:  SLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVS
        SL   H      Y LIYVDDI+I  S +S I  L+  L  +F++KDLG LNYFLGIEV  P   G+ LSQ KYI D+L + KM EA P+++PM S + +S
Subjt:  SLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVS

Query:  AFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCI
           G+   D   YRS VGALQY +ITRP+IA+SVNK+ QFMH+PT +HWQ+VKR+LRYLK +   GL ++  S   L G+ DADWA D DDR+ST G+CI
Subjt:  AFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCI

Query:  FFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHL
        F G NLV+W  KKQ+ ++RSSTEAE+++LAN +AE+ W  ALL EL +    PPILWCDN+GA +LS+NPV H+RTKHVE+D +FVRD+V  + + I+ L
Subjt:  FFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHL

Query:  PAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
         +  QLADIFTKPLS   F  +R+KLNV     +GLRG
Subjt:  PAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0047.71Show/hide
Query:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S    + L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E                              G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC
        V   L AHE R      + D S   VN T Q S++  + S N                      GS  HN G    RGRG       R   G KPQCQLC
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC

Query:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV
         +FGHT  +C+ R+DPNFHGN   NG          +     S S   +V    Y  Q +     M+AM+  P    +  W+PDSGA+NHVT+D GNL  
Subjt:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV

Query:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH
         +    ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L 
Subjt:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH

Query:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII
             K   L  ++    L+  +++  H          N+  +V DLWHKRLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ 
Subjt:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS
        + PL L+ +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  GIIHR++CP+TS
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        +QNGI+ERKHRHIV++GLTLL+QASLPL++W DAFS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D
        S+ H+GYKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D

Query:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY
         ++ V I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+
Subjt:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY

Query:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE
         ALMKN TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E
Subjt:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE

Query:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF
        +V+M+QPPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   F
Subjt:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF

Query:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH
        SLKDLG L+YFLGIEV    DGGL LSQ KYI DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY TITRPEIA+SVNKVCQFM 
Subjt:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH

Query:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL
         P   HW+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+L
Subjt:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL

Query:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV
        L+ELQ   +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V
Subjt:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0047.85Show/hide
Query:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE
        T T +S+ + I+P S    + L + N+L+WK QI   ++GYGLE ++   +  P   ++         V   NP    + RQD L+ SWLLSS+    L 
Subjt:  TNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLE

Query:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE
         V+ C ++ E+W T+ + + + + AK M YK+QMQ LKK G+T+++Y +KMK   D L   G K+S  DHI+ I+ GLG EY+++++VI++K    +LQ 
Subjt:  DVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQE

Query:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC
        V   L AHE R      + D S   VN T Q S++  + S N                      GS  HN G    RGRG       R   G KPQCQLC
Subjt:  VYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQ------------------KNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLC

Query:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV
         +FGHT  +C+ R+DPNFHGN   NG          +     S S   +V    Y  Q +     M+AM+  P    +  W+PDSGA+NHVT+D GNL  
Subjt:  GRFGHTALKCYQRFDPNFHGN---NGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSH----PMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAV

Query:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH
         +    ++++H+GNG GL I+HIG S   SS+  N+   L N+L VP I KNLLSVSQFA+DN+V+FEFHP VCFVKD+   ++LLQG +H+GLY+F+L 
Subjt:  SSPCTSDNRVHVGNGAGLSINHIGSSHLYSSN--NQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLH

Query:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII
             K   L  ++    L+  +++  H          N+  +V DLWHKRLGH A+ IV+Q+L +  I F T + +S CSAC +GKSH LPF  SQT+ 
Subjt:  P---SKTQDLKQASLVPPLSSSSSTTAHVLAC---TSENTKANVIDLWHKRLGHAATPIVSQILKECNISF-TNNSTSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS
        + PL L+ +DLWGPA  +S  GF YY+SFVD YSR+TWVYFL++KS+    FL FK   E   G  +K  QTD GGEFR+L  Y +  GIIHR++CP+TS
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        +QNGI+ERKHRHIV++GLTLL+QASLPL++W DAFS AV+ INRLPT VL    P E LF  KP+YS  K FGCLCFP LRPYN HKL FRS+PC FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D
        S+ H+GYKCL++ GR+FISR V F+E+ FP+         + S S   LP    + +  P+ + SP+ S  TS++Q          S I    Q L   D
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSF-----LHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQP---------STIVPTSQPL---D

Query:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY
         ++ V I + SAS   S          PL TN D    P  ++ + P+      H MVTRSKNGI  PKV   +    EP T +EA+  P W +AM +E+
Subjt:  PATEVAIASPSASTSHS----------PL-TNIDLSHIPEPNLTSTPIV--TNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEY

Query:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE
         ALMKN TWSLV   +   ++GC+WVFK+KRN DGS++RYKARLVAKG+ Q+   D+ ETFSPV+KPTTIRV+L  A++  W I QLD+NNAFL+G L E
Subjt:  AALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTE

Query:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF
        +V+M+QPPGF    +    LVCKLHKALYGLKQAPRAWFD+L   L   GF  +K+D SL  R    S  ++L+YVDDIV+ GSSS EI +LIS L   F
Subjt:  DVFMEQPPGF--SISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQF

Query:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH
        SLKDLG L+YFLGIE                  DLL K KM  A  + TPM+SG  +SA  G+   +V  YRS+VGALQY TITRPEIA+SVNKVCQFM 
Subjt:  SLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMH

Query:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL
         P   HW+AVKRILRYL G+   G++L+    + L G+ DADW SD DDR+STSG C+F G +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+L
Subjt:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQAL

Query:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV
        L+ELQ   +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++++L++ H+P   Q+AD+FTKPLS   F  +R KL V
Subjt:  LAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0045.26Show/hide
Query:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+  + L   N+L+WK QI+  ++GYGL+ ++  DD  P  +L+  +  S    +E     L W +QD+L+ SWLLSS++E +L  ++ C+TS  +W  
Subjt:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ ++ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGRNFRGNRG-------WNG
        +  ++D S P+ ++   ++ +KG        + N Q N S Y+                         NG ++   FRGRGG   RGNRG       WN 
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYHN------------------------NGPNS---FRGRGGRNFRGNRG-------WNG

Query:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDF
        +    KP CQLCG+ GH   +CY RFD  F               Q P  Q+ SG N      Y   S  +  ++    +  D NWYPDSGASNHVT + 
Subjt:  N----KPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDF

Query:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF
         NL  S      N+VHVGNG G                                                  +P    V           G + +GLY F
Subjt:  GNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF

Query:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII
           HL    TQ L ++  V   S SS      L+ T         DLWHKRLG  +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  
Subjt:  ---HLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTII

Query:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS
        + PL LI +DLWGPA   S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI+HRV+CP+T 
Subjt:  STPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY
        QQNG+ ERKHR IV+ GLTLL   SLPL+FWD++F   VY  NRLPT VL    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGY
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGY

Query:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAI
        S  H+GYKC+   GRV+ISR V FNE+SFPY ++   S    S+V P   H S ++  PVL    + +PTS  S     S+   IV T  P  P +    
Subjt:  SNMHRGYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAI

Query:  ASPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSS
         +P+   S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  +
Subjt:  ASPSASTSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSS

Query:  THKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSS
          + IGCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+TIRV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    + 
Subjt:  THKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSS

Query:  PLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSY
         LVC+LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   S   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+
Subjt:  PLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSY

Query:  PKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLK
          + GL LSQTKYI DLL K KM    P  TP+ +G  + A +G+   D+H YRS VGALQY TITRPE+++SVNKVCQFM +PT+ HW+AVKRILRYL+
Subjt:  PKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLK

Query:  GSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDN
        G+   GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ  +SRSSTEAE+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDN
Subjt:  GSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDN

Query:  LGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
        L  V LSANPVLH+RTKH+ELD+YFV + V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  LGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

A5BFT3 Integrase catalytic domain-containing protein0.0e+0046.64Show/hide
Query:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT
        +H+  + L   N+L+WK QI+  ++GYGL+ ++  DD     +       SP  + +          + +L +S   SS    +    L           
Subjt:  SHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKT

Query:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER
        LE+ + +   AK   +K Q+Q+ KKGG T+ EY +K+K   DSL ++G  +STKDH+  IL GL  +Y++ V+ +  ++   +++E+  LL AHESR E+
Subjt:  LEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSER

Query:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGRNFRGNRG-------WNGN----K
        +  ++D S P+ ++   ++ +KG        + N Q + S Y+                     N   N    RGG   RGN+G       WN +    K
Subjt:  STVNIDGSVPTVNLTQQSSSKKGTG------STNDQKNGSSYH---------------------NNGPNSFRGRGGRNFRGNRG-------WNGN----K

Query:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS
        P CQLCG+ GH   +CY RFD  F               Q P  Q+ S  NS     Y   S  +  ++    +  D NWYPDSGASNHVT +  NL  S
Subjt:  PQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVS

Query:  SPCTSDNRVHVGNGAGLSINHIGSSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL
        +     N+VHVGNG GLSI HIG S   S  +++  LLN+LLHVP ITKNLLSVS+FAKDN VFFEFH   CFVKD+ T  +L+ G + +GLY F   HL
Subjt:  SPCTSDNRVHVGNGAGLSINHIGSSHLYSS-NNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKF---HL

Query:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS
            TQ L ++  V   S SS        CT+  + ++  DLWHKRLGH +   +  +L +CN++  N   ++FCS+C +GK H  PF  S T  + PL 
Subjt:  HPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNN-STSFCSACAIGKSHALPFYPSQTIISTPLS

Query:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGI
        LI  DLWGP +  S +G+RYYI FVD +SRF+W++ L++KSEA  TF+ FK  VE      IK LQTD GGEFRA   YL   GI+HRV+CP+T QQNG+
Subjt:  LIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGI

Query:  VERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR
         ERKHR IV+ GLTLL  ASLPL+FWD++F   VY  NRLPT +L    P+E LF   PDYSF K FGC CFP LRPYN HKLQ+RS  C FLGYS  H+
Subjt:  VERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHR

Query:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAIASPSA
        GYKC+   GRV+IS  V FNE+SFPY ++   S    S+V P   H S ++  PVL    + +PTS  S     S+   IV T  P  P +     +P+ 
Subjt:  GYKCLDRTGRVFISRHVQFNESSFPYLQSFLHS----SSVKPLPIHSSINSFLPVL----ISSPTS--SQFTSTSQPSTIVPTSQPLDPATEVAIASPSA

Query:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI
          S+   T +   +S I + ++T T      NTHPM+TR+K+GIV PK+ +A     EP++V  AL+   W +AM  EY AL +N TWSLVP  +  + I
Subjt:  STSHSPLTNID--LSHIPEPNLTST--PIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI

Query:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK
        GCKWV+K K N DG++ +YKARLVAKGFHQ A  D+TETFSPV+KP+T+RV+ T AL+  W I QLD+NNAFL+G L E+VFM+QP GF    +  LVC+
Subjt:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK

Query:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG
        LHKALYGLKQAPRAWF++L   LL+ GF  +K+D SL  R   +   Y+L+YVDDI+++GS ++ IT LI+ LN +FSLKDLG ++YFLGI+VS+  + G
Subjt:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGG

Query:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS
        L LSQTKYI DLL K KM    P  TP+ +G  +   +G+   D+H YRS VGALQY TITRPE+++SVNKVCQFM +PT+ HW+ VKRILRYL+G+   
Subjt:  LFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTS

Query:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH
        GL L+K SNL L G+ DADWASD DDR+STSG C+F G NL++W SKKQ I+SRSS E E+RSLA   AE+ WL++LL+ELQ+P ++PP++WCDNL  V 
Subjt:  GLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVH

Query:  LSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG
        LSANPVLH+RTKH+ELD+YFVR+ V++K + ++H+P+  QLAD+ TK +S+T F+  R KL + +   + LRG
Subjt:  LSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.4e-12226.76Show/hide
Query:  YLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAK
        Y +WK +I   L     +D +   DG             P+ V+++      W + +R   S ++  +++  L       T+R+I + L+ +Y   +LA 
Subjt:  YLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAK

Query:  NMSYKNQMQNLK-KGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVI-TAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVP
         ++ + ++ +LK    M+L  +F    +L   L A G K+   D I ++L  L   YD I++ I T     LTL  V   L             +D  + 
Subjt:  NMSYKNQMQNLK-KGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVI-TAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVP

Query:  TVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFS
          N    +S K      ++  N ++Y NN   +   +  + F+GN  +   K +C  CGR GH    C+     ++       N +  + VQ        
Subjt:  TVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFS

Query:  GNNSVGNQNYPMQSHPMQAMMVAPN---INLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPH
                     SH +  M+   N   +  +  +  DSGAS+H+ ND      S       ++ V    G  I       +   N+    L ++L    
Subjt:  GNNSVGNQNYPMQSHPMQAMMVAPN---INLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPH

Query:  ITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTK-ANVIDLWHKRLG
           NL+SV +  ++  +  EF        D+   TI   GLM              ++    + VP ++             S N K  N   LWH+R G
Subjt:  ITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTK-ANVIDLWHKRLG

Query:  HAATPIVSQILKE---CNISFTNN---STSFCSACAIGKSHALPF--YPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKS
        H +   + +I ++    + S  NN   S   C  C  GK   LPF     +T I  PL ++ +D+ GP    + +   Y++ FVD ++ +   Y ++ KS
Subjt:  HAATPIVSQILKE---CNISFTNN---STSFCSACAIGKSHALPF--YPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKS

Query:  EAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINR
        + +S F  F    E      +  L  D G E+    +  +   +GI + +T P+T Q NG+ ER  R I +   T++S A L   FW +A   A Y INR
Subjt:  EAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINR

Query:  LPTTVL--SGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLDRTGRVFI--------------SRHVQFN---
        +P+  L  S  +P E    KKP     + FG   +  ++     K   +S   +F+GY     G+K  D     FI              SR V+F    
Subjt:  LPTTVL--SGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLDRTGRVFI--------------SRHVQFN---

Query:  --------------------ESSFP----------YLQSFLHSSSVK-PLPIHSSINSFLPVLISSPTSSQFTSTSQPSTIV----PTSQPLDPATEVAI
                            ++ FP          +L+    S +   P      I +  P       + QF   S+ S          +  D     + 
Subjt:  --------------------ESSFP----------YLQSFLHSSSVK-PLPIHSSINSFLPVLISSPTSSQFTSTSQPSTIV----PTSQPLDPATEVAI

Query:  ASPSASTSHSPLTNIDLSHIPEPNLTSTPIV---------TNTHPMVTRSKNGIVCPKVLL----------AEYIEVEPTTVKEALRCPHWLQAMKDEYA
         S + + S    T   L  I   N T    +           T P ++ ++      KV+L            + E++    K +     W +A+  E  
Subjt:  ASPSASTSHSPLTNIDLSHIPEPNLTSTPIV---------TNTHPMVTRSKNGIVCPKVLL----------AEYIEVEPTTVKEALRCPHWLQAMKDEYA

Query:  ALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTED
        A   N TW++         +  +WVF +K N  G+  RYKARLVA+GF Q   IDY ETF+PV + ++ R +L+  +    ++HQ+D+  AFL+G L E+
Subjt:  ALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTED

Query:  VFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVG--SSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFS
        ++M  P G  IS +S  VCKL+KA+YGLKQA R WF+     L    F  S  D  +     G  +   Y+L+YVDD+VI     + +      L  +F 
Subjt:  VFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVG--SSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFS

Query:  LKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATI-TRPEIAYSVNKVCQFMH
        + DL  + +F+GI +   +D  ++LSQ+ Y+  +L K  M   N ++TP+ S       N ++  +    RS++G L Y  + TRP++  +VN + ++  
Subjt:  LKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYATI-TRPEIAYSVNKVCQFMH

Query:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLG----LYGYADADWASDPDDRKSTSGFCI-FFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELI
              WQ +KR+LRYLKG+    L+ +K  NL     + GY D+DWA    DRKST+G+    F  NL+ W +K+Q+ ++ SSTEAE+ +L     E +
Subjt:  SPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLG----LYGYADADWASDPDDRKSTSGFCI-FFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELI

Query:  WLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV
        WL+ LL  + I    P  ++ DN G + ++ NP  H R KH+++  +F R+ V    + ++++P   QLADIFTKPL A  F+ +R KL +
Subjt:  WLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-15730.15Show/hide
Query:  WTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKG-GMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSG
        W   D   +S +   +++ V+ +++D +T+R IW  LE +Y++  L   +  K Q+  L    G     + +    L   L  +G K+  +D  I +L+ 
Subjt:  WTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKG-GMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSG

Query:  LGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKP
        L   YD + + I      + L++V   L  +E                    ++    +G     + + G SY  +  N   GR G   +          
Subjt:  LGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKP

Query:  QCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSS
         C  C + GH     ++R  PN     G  + Q N        Q+        N N  +  +  +  M       ++ W  D+ AS+H T    +L    
Subjt:  QCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSS

Query:  PCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDR----QTGTILLQGLMHEGLYKFHLH
               V +GN +   I  IG   + ++   + +L ++ HVP +  NL  +S  A D D +  +     F   +    +   ++ +G+    LY+    
Subjt:  PCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDR----QTGTILLQGLMHEGLYKFHLH

Query:  PSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKAN---VIDLWHKRLGHAATPIVSQILKECNISFTNNST-SFCSACAIGKSHALPFYPSQTIISTP
                             T A +  C  E   A     +DLWHKR+GH +   +  + K+  IS+   +T   C  C  GK H + F  S       
Subjt:  PSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKAN---VIDLWHKRLGHAATPIVSQILKECNISFTNNST-SFCSACAIGKSHALPFYPSQTIISTP

Query:  LSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGIIHRVTCPYTSQ
        L L+ +D+ GP    S  G +Y+++F+D  SR  WVY L++K + +  F  F   VE+  G  +K L++D GGE+  R    Y  S GI H  T P T Q
Subjt:  LSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEF--RALAPYLKSQGIIHRVTCPYTSQ

Query:  QNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYS
         NG+ ER +R IV+   ++L  A LP  FW +A   A Y INR P+  L+   P      K+  YS  K FGC  F  +      KL  +S PC+F+GY 
Subjt:  QNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYS

Query:  NMHRGYKCLDRT-GRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIHSSINSFLPVLISSPTSSQFTST------SQPSTIVPTSQPLDPATEVAIASPS
        +   GY+  D    +V  SR V F ES          S  VK   I + +   +P   ++PTS++ T+        QP  ++   + LD   E       
Subjt:  NMHRGYKCLDRT-GRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIHSSINSFLPVLISSPTSSQFTST------SQPSTIVPTSQPLDPATEVAIASPS

Query:  ASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCP---HWLQAMKDEYAALMKNGTWSLVPHSSTHKTI
            H PL   +   +      ST                     VL+++  + EP ++KE L  P     ++AM++E  +L KNGT+ LV      + +
Subjt:  ASTSHSPLTNIDLSHIPEPNLTSTPIVTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCP---HWLQAMKDEYAALMKNGTWSLVPHSSTHKTI

Query:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK
         CKWVFK+K++ D  + RYKARLV KGF Q   ID+ E FSPV+K T+IR +L+ A +   ++ QLD+  AFLHG L E+++MEQP GF ++G   +VCK
Subjt:  GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCK

Query:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYI-LIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDG
        L+K+LYGLKQAPR W+ +  SF+ +  +  + +D  + F+    +   I L+YVDD++I+G     I +L   L+  F +KDLG     LG+++   +  
Subjt:  LHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYI-LIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDG

Query:  -GLFLSQTKYITDLLHKAKMFEANPITTPM-----VSGSVVSAFNGEKFSDVHF-YRSIVGALQYATI-TRPEIAYSVNKVCQFMHSPTQVHWQAVKRIL
          L+LSQ KYI  +L +  M  A P++TP+     +S  +      EK +     Y S VG+L YA + TRP+IA++V  V +F+ +P + HW+AVK IL
Subjt:  -GLFLSQTKYITDLLHKAKMFEANPITTPM-----VSGSVVSAFNGEKFSDVHF-YRSIVGALQYATI-TRPEIAYSVNKVCQFMHSPTQVHWQAVKRIL

Query:  RYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPIL
        RYL+G+ T   L    S+  L GY DAD A D D+RKS++G+   F G  ++W SK Q  ++ S+TEAE+ +   T  E+IWL+  L EL +   +  ++
Subjt:  RYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPIL

Query:  WCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSF
        +CD+  A+ LS N + H+RTKH+++  +++R++V  + L +  +      AD+ TK +    F
Subjt:  WCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSF

Q39547 Cucumisin3.7e-6057.34Show/hide
Query:  HIAAAVPPGDVRSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWSDGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLE
        HI   + PGDV  PRDTNGHGTHTAS                  TARGGVP ARIA YK+CW+DGC D DILAA+DD IAD VDIISLSVG   P+ Y  
Subjt:  HIAAAVPPGDVRSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWSDGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLE

Query:  DSIAIGTFHAMKHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGDAPNVDGGFSKYTSR--
        D+IAIG+FHA++ GILTSNSAGN GP ++TTA+ +PW LSVAAS++DRKF  QVQ+GNG  +QGV+INTFD   + YPL+   D PN   GF K TSR  
Subjt:  DSIAIGTFHAMKHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGDAPNVDGGFSKYTSR--

Query:  ANRLATPKILQIKEQVME
         ++   P +L+ K  V E
Subjt:  ANRLATPKILQIKEQVME

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.7e-27338.79Show/hide
Query:  EETNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGV
        EE     +  L +N  + TK   LT +NYL+W  Q+     GY L  ++  D  T     +   D +P      NP +  W RQD+LI S +L +++  V
Subjt:  EETNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGV

Query:  LEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTL
           V    T+ +IW+TL ++Y   +       + Q++   KG  T+ +Y   +    D L  +G+ +   + +  +L  L  EY  ++  I AK  P TL
Subjt:  LEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTL

Query:  QEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGS------SYHNNGPNSFRGRGGRNFRGNRGWNGNKP---QCQLCGRFGHTA
         E++  L  HES+     +    S   + +T  + S + T +TN+  NG+      + +NN  +    +   NF  N   N +KP   +CQ+CG  GH+A
Subjt:  QEVYGLLYAHESRSERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGS------SYHNNGPNSFRGRGGRNFRGNRGWNGNKP---QCQLCGRFGHTA

Query:  LKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQ--AMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHV
         +C Q            L H                 +SV +Q  P    P Q  A +   +     NW  DSGA++H+T+DF NL++  P T  + V V
Subjt:  LKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQ--AMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHV

Query:  GNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVP
         +G+ + I+H GS+ L S+ ++   L+N+L+VP+I KNL+SV +    N V  EF P    VKD  TG  LLQG   + LY++ +  S+   L       
Subjt:  GNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVP

Query:  PLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSS
          +S SS   H                WH RLGH A  I++ ++   ++S  N S  F  CS C I KS+ +PF  S    + PL  I +D+W   + S 
Subjt:  PLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSS

Query:  KNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLT
         N +RYY+ FVD ++R+TW+Y L+ KS+   TF+TFK  +E      I    +D GGEF AL  Y    GI H  + P+T + NG+ ERKHRHIV+ GLT
Subjt:  KNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLT

Query:  LLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFI
        LLS AS+P  +W  AF+ AVY INRLPT +L   SP +KLFG  P+Y   + FGC C+P LRPYN HKL  +S  CVFLGYS     Y CL  +T R++I
Subjt:  LLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFI

Query:  SRHVQFNESSFPY---------LQSFLHSSSVKPLPIHSSINSFLPVL-------------------------------ISSPTSSQFTSTSQPST----
        SRHV+F+E+ FP+         +Q     SS    P H+++ +  PVL                               + S  SS F S+ +P+     
Subjt:  SRHVQFNESSFPY---------LQSFLHSSSVKPLPIHSSINSFLPVL-------------------------------ISSPTSSQFTSTSQPST----

Query:  -IVPTSQPLDPATEV---------------------AIASP--SASTSHSPLTNID-----------LSHIPEP------NLTSTPIVTNTHPMVTRSKN
           PT+QP    T+                      ++++P  S+S+S SP T+             L H P P      N    P+  NTH M TR+K 
Subjt:  -IVPTSQPLDPATEV---------------------AIASP--SASTSHSPLTNID-----------LSHIPEP------NLTSTPIVTNTHPMVTRSKN

Query:  GIV--CPKVLLAEYI--EVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYT
        GI+   PK  LA  +  E EP T  +AL+   W  AM  E  A + N TW LVP   +H TI GC+W+F  K N+DGS+ RYKARLVAKG++Q   +DY 
Subjt:  GIV--CPKVLLAEYI--EVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYT

Query:  ETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSL
        ETFSPVIK T+IR++L  A+   W I QLD+NNAFL G LT+DV+M QPPGF        VCKL KALYGLKQAPRAW+  L ++LL +GF  S +DTSL
Subjt:  ETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSL

Query:  LFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAF
             G S  Y+L+YVDDI+I G+  + +   +  L+ +FS+KD   L+YFLGIE       GL LSQ +YI DLL +  M  A P+TTPM     +S +
Subjt:  LFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAF

Query:  NGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFF
        +G K +D   YR IVG+LQY   TRP+I+Y+VN++ QFMH PT+ H QA+KRILRYL G+   G+ L+K + L L+ Y+DADWA D DD  ST+G+ ++ 
Subjt:  NGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFF

Query:  GGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPA
        G + ++W SKKQ  + RSSTEAE+RS+ANTS+E+ W+ +LL EL I  +RPP+++CDN+GA +L ANPV HSR KH+ +D +F+R+ V    L + H+  
Subjt:  GGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPA

Query:  FAQLADIFTKPLSATSFLHIRSKLNV
          QLAD  TKPLS T+F +  SK+ V
Subjt:  FAQLADIFTKPLSATSFLHIRSKLNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.5e-26037.61Show/hide
Query:  KEQVMEETNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSS
        +E V+  TN      L +N  + TK   LT +NYL+W  Q+     GY L  ++  D  TP    +   D  P      NP +  W RQD+LI S +L +
Subjt:  KEQVMEETNTQQSVTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSS

Query:  MTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKS
        ++  V   V    T+ +IW+TL ++Y       N SY         G +T   + ++     D L  +G+ +   + +  +L  L  +Y  ++  I AK 
Subjt:  MTEGVLEDVLDCETSREIWKTLEEMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKS

Query:  RPLTLQEVYGLLYAHESR----SERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTA
         P +L E++  L   ES+    +    V I  +V T   T  + ++   G   +  N    +NN  NS++     +   NR       +CQ+C   GH+A
Subjt:  RPLTLQEVYGLLYAHESR----SERSTVNIDGSVPTVNLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTA

Query:  LKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGN
         +C     P  H      N Q +     P+Q                   P   + V    N + NW  DSGA++H+T+DF NL+   P T  + V + +
Subjt:  LKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPMQSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGN

Query:  GAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPL
        G+ + I H GS+ L +S ++S  LN +L+VP+I KNL+SV +    N V  EF P    VKD  TG  LLQG   + LY++                 P+
Subjt:  GAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLVCFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPL

Query:  SSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKN
        +SS + +     C+     +     WH RLGH +  I++ ++   ++   N S     CS C I KSH +PF  S    S PL  I +D+W   + S  N
Subjt:  SSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNNSTSF--CSACAIGKSHALPFYPSQTIISTPLSLIETDLWGPAVKSSKN

Query:  GFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLL
         +RYY+ FVD ++R+TW+Y L+ KS+   TF+ FK  VE      I  L +D GGEF  L  YL   GI H  + P+T + NG+ ERKHRHIV+MGLTLL
Subjt:  GFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTSQQNGIVERKHRHIVDMGLTLL

Query:  SQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISR
        S AS+P  +W  AFS AVY INRLPT +L   SP +KLFG+ P+Y   K FGC C+P LRPYN HKL+ +S  C F+GYS     Y CL   TGR++ SR
Subjt:  SQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCLD-RTGRVFISR

Query:  HVQFNESSFPYLQSFL--------HSSSVKPLPIHSSINSFLPVL-------------------------------------ISSPTSSQFTSTSQPSTI
        HVQF+E  FP+  +           S S    P H+++ +   VL                                     ISSP+SS+ T+ S     
Subjt:  HVQFNESSFPYLQSFL--------HSSSVKPLPIHSSINSFLPVL-------------------------------------ISSPTSSQFTSTSQPSTI

Query:  VPTSQPLDPATEVAIA-----------SPSASTSHSPLTNIDLS--HIPEPNL------------TSTPIV-----------------TNTHPMVTRSKN
         PT+QP       + +           SP++   +SPL    +S  HIP P+             TSTP +                  NTH M TR+K+
Subjt:  VPTSQPLDPATEVAIA-----------SPSASTSHSPLTNIDLS--HIPEPNL------------TSTPIV-----------------TNTHPMVTRSKN

Query:  GIVCP--KVLLAEYIEV--EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYT
        GI  P  K   A  +    EP T  +A++   W QAM  E  A + N TW LVP      TI GC+W+F  K N+DGS+ RYKARLVAKG++Q   +DY 
Subjt:  GIVCP--KVLLAEYIEV--EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTI-GCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYT

Query:  ETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSL
        ETFSPVIK T+IR++L  A+   W I QLD+NNAFL G LT++V+M QPPGF        VC+L KA+YGLKQAPRAW+  L ++LL +GF  S +DTSL
Subjt:  ETFSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSL

Query:  LFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAF
             G S  Y+L+YVDDI+I G+ +  +   +  L+ +FS+K+   L+YFLGIE       GL LSQ +Y  DLL +  M  A P+ TPM +   ++  
Subjt:  LFRHVGSSKCYILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAF

Query:  NGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFF
        +G K  D   YR IVG+LQY   TRP+++Y+VN++ Q+MH PT  HW A+KR+LRYL G+   G+ L+K + L L+ Y+DADWA D DD  ST+G+ ++ 
Subjt:  NGEKFSDVHFYRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFF

Query:  GGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPA
        G + ++W SKKQ  + RSSTEAE+RS+ANTS+EL W+ +LL EL I  S PP+++CDN+GA +L ANPV HSR KH+ LD +F+R+ V    L + H+  
Subjt:  GGNLVTWGSKKQSIISRSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPA

Query:  FAQLADIFTKPLSATSFLHIRSKLNV
          QLAD  TKPLS  +F +   K+ V
Subjt:  FAQLADIFTKPLSATSFLHIRSKLNV

Arabidopsis top hitse value%identityAlignment
AT4G00230.1 xylem serine peptidase 11.8e-4148.72Show/hide
Query:  VPPGDVRSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWS-DGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLEDSIA
        VP G+VRSP D +GHGTHT+S                  TARG VPSAR+A+YK+CW+  GC D DILA F+  I D V+IIS+S+G      Y  DSI+
Subjt:  VPPGDVRSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWS-DGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLEDSIA

Query:  IGTFHAMKHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGD-APNVDGGFSKYTSR
        +G+FHAM+ GILT  SAGN+GP   T  N  PW L+VAAS IDR FK+++ LGNG  + G+ I+ F    + YPL+   D A N D    KY +R
Subjt:  IGTFHAMKHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGD-APNVDGGFSKYTSR

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-11141.25Show/hide
Query:  EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYAL
        EP+T  EA     W  AM DE  A+    TW +       K IGCKWV+KIK N+DG+I RYKARLVAKG+ Q   ID+ ETFSPV K T+++++L  + 
Subjt:  EPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTETFSPVIKPTTIRVLLTYAL

Query:  ANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPL----VCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV
           + +HQLDI+NAFL+G L E+++M+ PPG++      L    VC L K++YGLKQA R WF + S  L+  GF  S +D +   +   +    +L+YV
Subjt:  ANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPL----VCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYILIYV

Query:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVG
        DDI+I  ++ + + +L S L   F L+DLG L YFLG+E++     G+ + Q KY  DLL +  +    P + PM      SA +G  F D   YR ++G
Subjt:  DDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVG

Query:  ALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIIS
         L Y  ITR +I+++VNK+ QF  +P   H QAV +IL Y+KG+   GL     + + L  ++DA + S  D R+ST+G+C+F G +L++W SKKQ ++S
Subjt:  ALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIIS

Query:  RSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLS
        +SS EAE+R+L+  + E++WL     ELQ+P S+P +L+CDN  A+H++ N V H RTKH+E D + VR+  + +  +     A+ +  D FT+ LS
Subjt:  RSSTEAEFRSLANTSAELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLS

AT5G03620.1 Subtilisin-like serine endopeptidase family protein5.8e-4550.54Show/hide
Query:  VPPGDVRSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWSDGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLEDSIAI
        +P G+  +  D +GHGTHT+S                  TARGGVPSARIA YK+CW  GC D D+LAAFD+ I+D VDIIS+S+G     P+ ED IAI
Subjt:  VPPGDVRSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWSDGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLEDSIAI

Query:  GTFHAMKHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGDAPNVDGG
        G FHAMK GILT+ SAGNNGP  +T +N APW ++VAA+S+DRKF+  V+LGNG    G+++N F+   + YPL     A N+  G
Subjt:  GTFHAMKHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGDAPNVDGG

AT5G59100.1 Subtilisin-like serine endopeptidase family protein2.3e-4147.5Show/hide
Query:  RSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWSDGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLEDSIAIGTFHAM
        ++ RD +GHGTHTAS                  TARGGVP+ARIAVYK+C ++GC    +++AFDD IAD VD+IS+S+      P+ ED IAIG FHAM
Subjt:  RSPRDTNGHGTHTAS------------------TARGGVPSARIAVYKICWSDGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLEDSIAIGTFHAM

Query:  KHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGDAPNVDGGFSKYTSRANRLATPKILQIK
          G+LT N+AGNNGPK  T  + APW  SVAAS  +R F A+V LG+G I  G ++NT+D+ G  YPL+Y   A       S  +    RL  PK L  K
Subjt:  KHGILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGDAPNVDGGFSKYTSRANRLATPKILQIK

ATMG00810.1 DNA/RNA polymerases superfamily protein1.7e-5748.03Show/hide
Query:  YILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHF
        Y+L+YVDDI++ GSS++ +  LI  L+  FS+KDLG ++YFLGI++      GLFLSQTKY   +L+ A M +  P++TP+    + S+ +  K+ D   
Subjt:  YILIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHF

Query:  YRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSK
        +RSIVGALQY T+TRP+I+Y+VN VCQ MH PT   +  +KR+LRY+KG+   GL + K S L +  + D+DWA     R+ST+GFC F G N+++W +K
Subjt:  YRSIVGALQYATITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSK

Query:  KQSIISRSSTEAEFRSLANTSAELIWLQA
        +Q  +SRSSTE E+R+LA T+AEL W  A
Subjt:  KQSIISRSSTEAEFRSLANTSAELIWLQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTACGATGTCTTTATTGTTAGAAAAATCATCGGAGCTCGAGCATATCGCAGCCGCCGTTCCACCTGGTGATGTTAGAAGTCCGAGAGATACAAACGGCCATGGGAC
ACACACTGCGTCGACGGCGAGAGGAGGGGTTCCCTCTGCGCGCATTGCTGTGTACAAGATCTGTTGGTCCGATGGGTGCTTCGACGCCGACATTCTTGCAGCGTTCGACG
ACATAATTGCCGACAGCGTGGATATTATATCTCTTTCAGTGGGGCCGAAGAAACCGAAGCCTTACTTGGAGGATTCCATTGCCATCGGAACTTTCCACGCCATGAAACAT
GGAATATTGACGTCCAACTCCGCCGGAAATAATGGCCCCAAATACTACACCACCGCTAACGGCGCTCCGTGGTCTCTTTCTGTGGCCGCAAGCTCCATTGACAGAAAGTT
CAAGGCACAAGTGCAGCTTGGCAACGGAAATATCTATCAGGGAGTTGCAATTAACACATTTGATCTTATGGGAAGACAATATCCCTTAATTTATGCTGGAGATGCACCTA
ACGTTGATGGAGGTTTCTCTAAATATACCTCCAGAGCTAATCGTCTAGCAACACCCAAGATCCTGCAAATTAAAGAGCAAGTTATGGAAGAAACAAATACACAGCAATCG
GTGACTCTGGGCATTAATCCTGGAAGTCACACCAAGGTAATTACACTTACCGAAAGCAATTATCTTGTTTGGAAACTTCAGATTCTCAGAACGCTACAGGGATATGGACT
CGAAGATTATATTCTCGACGATGATGGAACCCCGTCTCTGTATCTTAGTTCTACTAATGATGGATCCCCGTCCACAGTGGAAGAAGCCAATCCCGCACATTTACTTTGGA
CCCGTCAAGATCGTCTAATCTCTTCGTGGCTTCTCAGTTCGATGACAGAGGGAGTTCTTGAAGATGTGCTTGACTGTGAGACATCTCGCGAAATTTGGAAAACATTAGAA
GAAATGTATGTTACAAGCAATTTGGCAAAAAATATGAGCTACAAGAATCAGATGCAAAACCTGAAGAAAGGAGGCATGACTCTAAAAGAATACTTTTCAAAGATGAAGAA
ACTGGCAGATTCTTTAAAGGCCATTGGAGAAAAGGTATCCACTAAAGACCATATCATTTATATTCTCTCTGGATTGGGTGTCGAATATGATGCTATAGTATCTGTTATCA
CTGCCAAATCTAGACCTTTGACACTACAAGAAGTTTATGGATTACTGTATGCTCATGAGTCAAGGTCAGAAAGAAGCACTGTGAATATTGATGGTTCTGTGCCCACAGTA
AACCTAACTCAACAAAGTTCATCCAAGAAAGGTACTGGTTCTACCAATGATCAAAAGAATGGTTCATCATATCACAACAATGGTCCTAATTCCTTCAGAGGTCGTGGAGG
ACGCAATTTCAGAGGAAACAGAGGTTGGAATGGTAATAAACCACAGTGTCAACTATGTGGACGTTTTGGACACACAGCTTTGAAATGTTATCAGCGATTTGATCCAAACT
TCCATGGAAACAATGGTGGTCTTAACCATCAAGGCAATCAAATGGTTCAACAACCTTTTCAGCAGTCTTTCTCTGGTAATAATAGCGTGGGCAATCAAAATTATCCTATG
CAAAGTCACCCTATGCAGGCTATGATGGTGGCTCCCAATATTAATCTTGATACCAATTGGTATCCCGATTCAGGAGCTTCAAATCATGTGACGAATGATTTTGGTAATCT
TGCAGTGAGTTCTCCTTGCACTAGTGACAATAGAGTTCATGTCGGTAATGGGGCAGGTTTGTCTATCAATCATATTGGCTCTTCTCATTTGTACTCCTCTAATAACCAAT
CCTTTTTACTCAATAACCTCTTACATGTTCCCCATATTACTAAGAATCTTTTGAGTGTCAGCCAATTTGCTAAAGATAATGATGTCTTTTTTGAATTCCATCCTCTTGTC
TGTTTTGTGAAGGACCGTCAGACTGGTACAATTCTGCTCCAAGGACTGATGCATGAAGGACTGTACAAGTTTCATCTCCACCCTTCCAAAACTCAAGATTTGAAGCAAGC
ATCCCTGGTTCCTCCTCTGTCTTCTTCCTCCTCTACCACAGCTCATGTTTTAGCTTGTACCTCTGAAAATACTAAAGCCAATGTAATAGATCTGTGGCATAAAAGACTTG
GTCATGCTGCCACTCCAATTGTATCCCAAATTTTAAAGGAATGCAATATTTCTTTTACTAATAATTCTACTTCCTTTTGTTCTGCCTGTGCAATTGGTAAAAGTCATGCC
CTTCCATTTTACCCTTCACAGACTATTATTAGTACACCCCTGTCTCTTATCGAAACCGATCTTTGGGGACCAGCTGTTAAAAGCTCCAAAAATGGTTTCAGATACTATAT
CAGTTTTGTTGATGTCTACTCTCGTTTCACTTGGGTTTATTTTCTTCAGTCCAAATCCGAGGCTTATTCTACCTTTCTTACTTTTAAAATACATGTGGAAAAACTTCTTG
GTCATTCAATTAAAATGTTACAAACTGATGGGGGTGGTGAGTTTCGTGCTCTTGCCCCATACTTAAAGTCTCAGGGTATTATTCATAGAGTTACTTGTCCCTACACCTCC
CAACAAAATGGTATAGTTGAGCGTAAGCATAGGCACATTGTTGATATGGGACTGACCTTACTTTCTCAGGCTTCCCTTCCTCTTGAGTTTTGGGACGATGCGTTTTCAGC
AGCAGTCTATACCATTAATCGGTTGCCTACTACAGTCCTAAGTGGCATCAGTCCTGTGGAGAAATTGTTCGGTAAGAAACCTGATTATTCCTTTTTTAAAACCTTTGGCT
GCTTATGTTTTCCATGCTTACGACCCTACAATGACCATAAACTCCAATTTCGTTCTGCTCCTTGTGTTTTTCTTGGCTATAGCAATATGCACAGGGGCTACAAATGTTTA
GATAGAACTGGACGTGTTTTTATTTCAAGACATGTTCAGTTTAATGAATCTTCTTTTCCATATCTTCAGTCTTTCTTACATTCCTCCTCTGTCAAGCCTTTGCCTATACA
CTCATCTATCAACTCTTTCCTACCTGTGTTGATTTCGTCTCCTACTTCTTCCCAGTTTACATCTACTTCTCAGCCTTCTACTATTGTTCCTACCTCCCAACCCTTGGATC
CTGCCACTGAGGTTGCTATTGCTTCCCCATCTGCATCCACTTCACATTCTCCTTTGACTAATATTGATCTTTCCCATATACCTGAACCAAATCTGACATCTACTCCTATT
GTCACTAACACTCACCCTATGGTTACTCGCTCAAAAAATGGTATTGTTTGCCCCAAGGTACTACTTGCAGAATACATTGAGGTTGAACCGACTACTGTGAAAGAGGCCTT
ACGTTGTCCTCATTGGCTTCAAGCAATGAAAGATGAATATGCTGCCCTTATGAAAAATGGAACTTGGTCTCTTGTACCTCATTCTTCTACCCACAAAACAATCGGTTGTA
AGTGGGTTTTTAAAATAAAGCGAAATACGGATGGCTCCATTGCTAGGTATAAGGCACGCTTAGTTGCAAAAGGATTTCATCAAATGGCAGACATTGACTACACTGAAACT
TTTAGTCCAGTTATTAAGCCCACGACAATACGTGTCTTACTTACTTATGCATTGGCTAATGGTTGGCAGATTCATCAACTGGATATTAATAATGCTTTTCTTCATGGTGT
TCTTACTGAGGATGTTTTCATGGAGCAGCCTCCGGGATTCTCTATCTCTGGCTCTTCACCTCTGGTTTGTAAACTCCATAAAGCACTTTATGGACTCAAGCAAGCCCCAC
GAGCATGGTTTGATAGACTGTCTTCCTTTTTACTTGCTCTCGGTTTCAAATGTTCTAAAGCTGACACCTCTCTTCTTTTTCGCCATGTTGGTTCATCCAAATGCTATATA
CTGATCTATGTCGATGATATAGTCATCATGGGCTCTTCATCTTCTGAAATTACTCAGCTTATATCCTTGCTAAATCATCAGTTTTCCTTGAAAGATCTTGGTAGGCTGAA
TTATTTTCTTGGTATTGAGGTCTCTTATCCAAAGGATGGAGGTTTATTCCTCTCTCAAACCAAGTATATTACTGATCTTCTTCATAAGGCCAAGATGTTTGAAGCTAATC
CTATTACTACTCCCATGGTAAGTGGCTCTGTAGTTTCTGCCTTTAATGGAGAAAAATTTTCTGATGTTCATTTTTATAGAAGTATTGTGGGGGCCTTACAGTATGCTACA
ATCACTAGACCAGAAATTGCTTATAGTGTGAACAAGGTGTGTCAATTTATGCATTCTCCTACTCAAGTTCACTGGCAGGCGGTCAAGAGGATTCTTAGATATCTCAAGGG
TTCATTCACTTCAGGCCTTTTGCTTCGGAAGCCATCTAATCTTGGACTATATGGCTATGCGGATGCTGATTGGGCCTCAGACCCAGATGACCGCAAATCAACCTCTGGTT
TCTGCATTTTCTTTGGTGGAAATCTTGTAACCTGGGGCTCGAAGAAGCAGAGCATTATCTCACGCTCCAGCACTGAGGCTGAGTTCAGAAGCCTGGCAAATACATCAGCA
GAGTTAATCTGGTTGCAAGCTCTTTTAGCTGAGTTACAAATCCCTACTTCTCGTCCACCCATCTTATGGTGTGACAACTTGGGAGCTGTTCATCTCAGTGCTAATCCAGT
TTTGCATTCTCGAACAAAACATGTTGAGTTAGACATCTACTTTGTCCGCGATTTGGTTCTCCAAAAGAGACTCATGATTCAACACCTTCCAGCATTTGCCCAACTTGCTG
ATATATTTACAAAGCCCCTGTCTGCTACTTCCTTTCTGCATATTCGTTCCAAGCTCAATGTCTGTGATGCTTATGACATTGGCTTGAGGGGGGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTACGATGTCTTTATTGTTAGAAAAATCATCGGAGCTCGAGCATATCGCAGCCGCCGTTCCACCTGGTGATGTTAGAAGTCCGAGAGATACAAACGGCCATGGGAC
ACACACTGCGTCGACGGCGAGAGGAGGGGTTCCCTCTGCGCGCATTGCTGTGTACAAGATCTGTTGGTCCGATGGGTGCTTCGACGCCGACATTCTTGCAGCGTTCGACG
ACATAATTGCCGACAGCGTGGATATTATATCTCTTTCAGTGGGGCCGAAGAAACCGAAGCCTTACTTGGAGGATTCCATTGCCATCGGAACTTTCCACGCCATGAAACAT
GGAATATTGACGTCCAACTCCGCCGGAAATAATGGCCCCAAATACTACACCACCGCTAACGGCGCTCCGTGGTCTCTTTCTGTGGCCGCAAGCTCCATTGACAGAAAGTT
CAAGGCACAAGTGCAGCTTGGCAACGGAAATATCTATCAGGGAGTTGCAATTAACACATTTGATCTTATGGGAAGACAATATCCCTTAATTTATGCTGGAGATGCACCTA
ACGTTGATGGAGGTTTCTCTAAATATACCTCCAGAGCTAATCGTCTAGCAACACCCAAGATCCTGCAAATTAAAGAGCAAGTTATGGAAGAAACAAATACACAGCAATCG
GTGACTCTGGGCATTAATCCTGGAAGTCACACCAAGGTAATTACACTTACCGAAAGCAATTATCTTGTTTGGAAACTTCAGATTCTCAGAACGCTACAGGGATATGGACT
CGAAGATTATATTCTCGACGATGATGGAACCCCGTCTCTGTATCTTAGTTCTACTAATGATGGATCCCCGTCCACAGTGGAAGAAGCCAATCCCGCACATTTACTTTGGA
CCCGTCAAGATCGTCTAATCTCTTCGTGGCTTCTCAGTTCGATGACAGAGGGAGTTCTTGAAGATGTGCTTGACTGTGAGACATCTCGCGAAATTTGGAAAACATTAGAA
GAAATGTATGTTACAAGCAATTTGGCAAAAAATATGAGCTACAAGAATCAGATGCAAAACCTGAAGAAAGGAGGCATGACTCTAAAAGAATACTTTTCAAAGATGAAGAA
ACTGGCAGATTCTTTAAAGGCCATTGGAGAAAAGGTATCCACTAAAGACCATATCATTTATATTCTCTCTGGATTGGGTGTCGAATATGATGCTATAGTATCTGTTATCA
CTGCCAAATCTAGACCTTTGACACTACAAGAAGTTTATGGATTACTGTATGCTCATGAGTCAAGGTCAGAAAGAAGCACTGTGAATATTGATGGTTCTGTGCCCACAGTA
AACCTAACTCAACAAAGTTCATCCAAGAAAGGTACTGGTTCTACCAATGATCAAAAGAATGGTTCATCATATCACAACAATGGTCCTAATTCCTTCAGAGGTCGTGGAGG
ACGCAATTTCAGAGGAAACAGAGGTTGGAATGGTAATAAACCACAGTGTCAACTATGTGGACGTTTTGGACACACAGCTTTGAAATGTTATCAGCGATTTGATCCAAACT
TCCATGGAAACAATGGTGGTCTTAACCATCAAGGCAATCAAATGGTTCAACAACCTTTTCAGCAGTCTTTCTCTGGTAATAATAGCGTGGGCAATCAAAATTATCCTATG
CAAAGTCACCCTATGCAGGCTATGATGGTGGCTCCCAATATTAATCTTGATACCAATTGGTATCCCGATTCAGGAGCTTCAAATCATGTGACGAATGATTTTGGTAATCT
TGCAGTGAGTTCTCCTTGCACTAGTGACAATAGAGTTCATGTCGGTAATGGGGCAGGTTTGTCTATCAATCATATTGGCTCTTCTCATTTGTACTCCTCTAATAACCAAT
CCTTTTTACTCAATAACCTCTTACATGTTCCCCATATTACTAAGAATCTTTTGAGTGTCAGCCAATTTGCTAAAGATAATGATGTCTTTTTTGAATTCCATCCTCTTGTC
TGTTTTGTGAAGGACCGTCAGACTGGTACAATTCTGCTCCAAGGACTGATGCATGAAGGACTGTACAAGTTTCATCTCCACCCTTCCAAAACTCAAGATTTGAAGCAAGC
ATCCCTGGTTCCTCCTCTGTCTTCTTCCTCCTCTACCACAGCTCATGTTTTAGCTTGTACCTCTGAAAATACTAAAGCCAATGTAATAGATCTGTGGCATAAAAGACTTG
GTCATGCTGCCACTCCAATTGTATCCCAAATTTTAAAGGAATGCAATATTTCTTTTACTAATAATTCTACTTCCTTTTGTTCTGCCTGTGCAATTGGTAAAAGTCATGCC
CTTCCATTTTACCCTTCACAGACTATTATTAGTACACCCCTGTCTCTTATCGAAACCGATCTTTGGGGACCAGCTGTTAAAAGCTCCAAAAATGGTTTCAGATACTATAT
CAGTTTTGTTGATGTCTACTCTCGTTTCACTTGGGTTTATTTTCTTCAGTCCAAATCCGAGGCTTATTCTACCTTTCTTACTTTTAAAATACATGTGGAAAAACTTCTTG
GTCATTCAATTAAAATGTTACAAACTGATGGGGGTGGTGAGTTTCGTGCTCTTGCCCCATACTTAAAGTCTCAGGGTATTATTCATAGAGTTACTTGTCCCTACACCTCC
CAACAAAATGGTATAGTTGAGCGTAAGCATAGGCACATTGTTGATATGGGACTGACCTTACTTTCTCAGGCTTCCCTTCCTCTTGAGTTTTGGGACGATGCGTTTTCAGC
AGCAGTCTATACCATTAATCGGTTGCCTACTACAGTCCTAAGTGGCATCAGTCCTGTGGAGAAATTGTTCGGTAAGAAACCTGATTATTCCTTTTTTAAAACCTTTGGCT
GCTTATGTTTTCCATGCTTACGACCCTACAATGACCATAAACTCCAATTTCGTTCTGCTCCTTGTGTTTTTCTTGGCTATAGCAATATGCACAGGGGCTACAAATGTTTA
GATAGAACTGGACGTGTTTTTATTTCAAGACATGTTCAGTTTAATGAATCTTCTTTTCCATATCTTCAGTCTTTCTTACATTCCTCCTCTGTCAAGCCTTTGCCTATACA
CTCATCTATCAACTCTTTCCTACCTGTGTTGATTTCGTCTCCTACTTCTTCCCAGTTTACATCTACTTCTCAGCCTTCTACTATTGTTCCTACCTCCCAACCCTTGGATC
CTGCCACTGAGGTTGCTATTGCTTCCCCATCTGCATCCACTTCACATTCTCCTTTGACTAATATTGATCTTTCCCATATACCTGAACCAAATCTGACATCTACTCCTATT
GTCACTAACACTCACCCTATGGTTACTCGCTCAAAAAATGGTATTGTTTGCCCCAAGGTACTACTTGCAGAATACATTGAGGTTGAACCGACTACTGTGAAAGAGGCCTT
ACGTTGTCCTCATTGGCTTCAAGCAATGAAAGATGAATATGCTGCCCTTATGAAAAATGGAACTTGGTCTCTTGTACCTCATTCTTCTACCCACAAAACAATCGGTTGTA
AGTGGGTTTTTAAAATAAAGCGAAATACGGATGGCTCCATTGCTAGGTATAAGGCACGCTTAGTTGCAAAAGGATTTCATCAAATGGCAGACATTGACTACACTGAAACT
TTTAGTCCAGTTATTAAGCCCACGACAATACGTGTCTTACTTACTTATGCATTGGCTAATGGTTGGCAGATTCATCAACTGGATATTAATAATGCTTTTCTTCATGGTGT
TCTTACTGAGGATGTTTTCATGGAGCAGCCTCCGGGATTCTCTATCTCTGGCTCTTCACCTCTGGTTTGTAAACTCCATAAAGCACTTTATGGACTCAAGCAAGCCCCAC
GAGCATGGTTTGATAGACTGTCTTCCTTTTTACTTGCTCTCGGTTTCAAATGTTCTAAAGCTGACACCTCTCTTCTTTTTCGCCATGTTGGTTCATCCAAATGCTATATA
CTGATCTATGTCGATGATATAGTCATCATGGGCTCTTCATCTTCTGAAATTACTCAGCTTATATCCTTGCTAAATCATCAGTTTTCCTTGAAAGATCTTGGTAGGCTGAA
TTATTTTCTTGGTATTGAGGTCTCTTATCCAAAGGATGGAGGTTTATTCCTCTCTCAAACCAAGTATATTACTGATCTTCTTCATAAGGCCAAGATGTTTGAAGCTAATC
CTATTACTACTCCCATGGTAAGTGGCTCTGTAGTTTCTGCCTTTAATGGAGAAAAATTTTCTGATGTTCATTTTTATAGAAGTATTGTGGGGGCCTTACAGTATGCTACA
ATCACTAGACCAGAAATTGCTTATAGTGTGAACAAGGTGTGTCAATTTATGCATTCTCCTACTCAAGTTCACTGGCAGGCGGTCAAGAGGATTCTTAGATATCTCAAGGG
TTCATTCACTTCAGGCCTTTTGCTTCGGAAGCCATCTAATCTTGGACTATATGGCTATGCGGATGCTGATTGGGCCTCAGACCCAGATGACCGCAAATCAACCTCTGGTT
TCTGCATTTTCTTTGGTGGAAATCTTGTAACCTGGGGCTCGAAGAAGCAGAGCATTATCTCACGCTCCAGCACTGAGGCTGAGTTCAGAAGCCTGGCAAATACATCAGCA
GAGTTAATCTGGTTGCAAGCTCTTTTAGCTGAGTTACAAATCCCTACTTCTCGTCCACCCATCTTATGGTGTGACAACTTGGGAGCTGTTCATCTCAGTGCTAATCCAGT
TTTGCATTCTCGAACAAAACATGTTGAGTTAGACATCTACTTTGTCCGCGATTTGGTTCTCCAAAAGAGACTCATGATTCAACACCTTCCAGCATTTGCCCAACTTGCTG
ATATATTTACAAAGCCCCTGTCTGCTACTTCCTTTCTGCATATTCGTTCCAAGCTCAATGTCTGTGATGCTTATGACATTGGCTTGAGGGGGGTGTGA
Protein sequenceShow/hide protein sequence
MLTMSLLLEKSSELEHIAAAVPPGDVRSPRDTNGHGTHTASTARGGVPSARIAVYKICWSDGCFDADILAAFDDIIADSVDIISLSVGPKKPKPYLEDSIAIGTFHAMKH
GILTSNSAGNNGPKYYTTANGAPWSLSVAASSIDRKFKAQVQLGNGNIYQGVAINTFDLMGRQYPLIYAGDAPNVDGGFSKYTSRANRLATPKILQIKEQVMEETNTQQS
VTLGINPGSHTKVITLTESNYLVWKLQILRTLQGYGLEDYILDDDGTPSLYLSSTNDGSPSTVEEANPAHLLWTRQDRLISSWLLSSMTEGVLEDVLDCETSREIWKTLE
EMYVTSNLAKNMSYKNQMQNLKKGGMTLKEYFSKMKKLADSLKAIGEKVSTKDHIIYILSGLGVEYDAIVSVITAKSRPLTLQEVYGLLYAHESRSERSTVNIDGSVPTV
NLTQQSSSKKGTGSTNDQKNGSSYHNNGPNSFRGRGGRNFRGNRGWNGNKPQCQLCGRFGHTALKCYQRFDPNFHGNNGGLNHQGNQMVQQPFQQSFSGNNSVGNQNYPM
QSHPMQAMMVAPNINLDTNWYPDSGASNHVTNDFGNLAVSSPCTSDNRVHVGNGAGLSINHIGSSHLYSSNNQSFLLNNLLHVPHITKNLLSVSQFAKDNDVFFEFHPLV
CFVKDRQTGTILLQGLMHEGLYKFHLHPSKTQDLKQASLVPPLSSSSSTTAHVLACTSENTKANVIDLWHKRLGHAATPIVSQILKECNISFTNNSTSFCSACAIGKSHA
LPFYPSQTIISTPLSLIETDLWGPAVKSSKNGFRYYISFVDVYSRFTWVYFLQSKSEAYSTFLTFKIHVEKLLGHSIKMLQTDGGGEFRALAPYLKSQGIIHRVTCPYTS
QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSAAVYTINRLPTTVLSGISPVEKLFGKKPDYSFFKTFGCLCFPCLRPYNDHKLQFRSAPCVFLGYSNMHRGYKCL
DRTGRVFISRHVQFNESSFPYLQSFLHSSSVKPLPIHSSINSFLPVLISSPTSSQFTSTSQPSTIVPTSQPLDPATEVAIASPSASTSHSPLTNIDLSHIPEPNLTSTPI
VTNTHPMVTRSKNGIVCPKVLLAEYIEVEPTTVKEALRCPHWLQAMKDEYAALMKNGTWSLVPHSSTHKTIGCKWVFKIKRNTDGSIARYKARLVAKGFHQMADIDYTET
FSPVIKPTTIRVLLTYALANGWQIHQLDINNAFLHGVLTEDVFMEQPPGFSISGSSPLVCKLHKALYGLKQAPRAWFDRLSSFLLALGFKCSKADTSLLFRHVGSSKCYI
LIYVDDIVIMGSSSSEITQLISLLNHQFSLKDLGRLNYFLGIEVSYPKDGGLFLSQTKYITDLLHKAKMFEANPITTPMVSGSVVSAFNGEKFSDVHFYRSIVGALQYAT
ITRPEIAYSVNKVCQFMHSPTQVHWQAVKRILRYLKGSFTSGLLLRKPSNLGLYGYADADWASDPDDRKSTSGFCIFFGGNLVTWGSKKQSIISRSSTEAEFRSLANTSA
ELIWLQALLAELQIPTSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQKRLMIQHLPAFAQLADIFTKPLSATSFLHIRSKLNVCDAYDIGLRGV