; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036407 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036407
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:45994006..45999730
RNA-Seq ExpressionLag0036407
SyntenyLag0036407
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG96906.1 hypothetical protein Prudu_005862 [Prunus dulcis]1.9e-11936.97Show/hide
Query:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---
        R  VD S+F+ VSQE   Y LW KL  +YERKTA NKAS+I+RLVNLK + G+S++EHLSDFQ +IN LT MK+VLDDELQAL LLSSLPDSW TLV   
Subjt:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---

Query:  ------------------------KKEQG----------------------------------------------------------------KKQDSQK
                                +KEQG                                                                ++QD   
Subjt:  ------------------------KKEQG----------------------------------------------------------------KKQDSQK

Query:  EGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIP
        E  D      T+   ++V ++C      ++ S    W+VDSGAS+H   +R++F +Y +GDFG+V+MGN  L+ I+G GDI ++T+  C L+L++VRH+P
Subjt:  EGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIP

Query:  DLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR---------------------------------------------------------------
        D+RLNL+S  +LD EG+ + F +GKWKLSK S+ +AR                                                               
Subjt:  DLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR---------------------------------------------------------------

Query:  -----GKHHRVSFP-GQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------
             GK HR SF  G +  +   L++V+SDVCG +   +LGG RYFVTFIDD +R+ W Y L+TK+QV+E+F ++                        
Subjt:  -----GKHHRVSFP-GQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------

Query:  -----YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYM
             YC  +GIRHE++ P TPQHN +AERMNRTIVE++R ML  + LPK+FW EA+     LIN SPS PL  D+P +   G+D +Y+HL+VFG + ++
Subjt:  -----YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYM

Query:  HVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDD----DNLDESSS
        H+PK++RSKLD+K+  C+FVGYG+EE+GYRL+DP  +K++RS++VVFFE +   D+       D  ++  +    + P  +  +H D     ++ D+ +S
Subjt:  HVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDD----DNLDESSS

Query:  DEDIHE
        D  I+E
Subjt:  DEDIHE

KAG5549868.1 hypothetical protein RHGRI_014986 [Rhododendron griersonianum]5.0e-11737.12Show/hide
Query:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---
        R  +D S+F+ VS E   Y LWKKLE LY+RK+A NKA L K+LVNLK K GKSI+EHL++   I+N+L +MKIV DDELQAL LLSSLP++W+TLV   
Subjt:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---

Query:  ------------------------KKEQG--------------------------------------------------------------KKQDSQKEG
                                +K  G                                                              K+++  K G
Subjt:  ------------------------KKEQG--------------------------------------------------------------KKQDSQKEG

Query:  DDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDL
        D +        SD ++ ++C  S C ++ S    W++DSGAS+H   + ++F +Y  GDFG V+MGN+ L+ I+G+G+I ++TSVGC L+L++VRH+PD+
Subjt:  DDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDL

Query:  RLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-----------------------------------------------------------------
        RLNL+S   LD EG+ + F +GKWKL+KGS+ VA+                                                                 
Subjt:  RLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-----------------------------------------------------------------

Query:  ---GKHHRVSFPGQS-TGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY--------------------------
           GK +RV+F  +S T +   L++VHSDVCG +KV S  G  YFVTFIDD +R+ W Y LKTK+QV ++F  +                          
Subjt:  ---GKHHRVSFPGQS-TGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY--------------------------

Query:  ---YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHV
           YC   GI+H+KT   TPQ N VAERMNRT+VE++RCML  S LP++FWAEA++    LIN SPSVPL  D+PE+   G+D TY HLRVFG + ++H+
Subjt:  ---YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHV

Query:  PKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFE----HEKGADLLTNKNVEDVLD-------DVHDNVGE-IQPDGNVPDHVDDDN
        PK +RSKLD K   C+F+GYG EE+GYRL+DP  KK++RS++VVF E     E   D    +N  D++D        VH  + E +QP   V ++VD  N
Subjt:  PKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFE----HEKGADLLTNKNVEDVLD-------DVHDNVGE-IQPDGNVPDHVDDDN

Query:  LDESSSDEDIHEEDEHEEVIHE
                ++  E+E E+ I E
Subjt:  LDESSSDEDIHEEDEHEEVIHE

KYP66486.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]3.2e-14842.56Show/hide
Query:  KRLFKK---SGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSL
        K+L KK   + R  VDI+++NQVS+E  P  LWKKLE +YE K A +K  ++++L+NLKL+ G++++EHL+DF+ ++ +L +  + L+DE+QAL LL SL
Subjt:  KRLFKK---SGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSL

Query:  PDSWKTLV---------------------KKEQGKKQDSQK----------------------------------------------------------E
        PDSW TLV                       E+ +++D  K                                                          +
Subjt:  PDSWKTLV---------------------KKEQGKKQDSQK----------------------------------------------------------E

Query:  GDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPD
         +DR+T A TS SDD++TL+C   EC H+     EW++DS ASYHCVPKREYF  Y+ GDFG V MGN+S + I+G+GDI ++T VGC LIL++VRHIPD
Subjt:  GDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPD

Query:  LRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR----------------------------------------------------------------
        +RLNL+S NVLD+EG+ H+ + G+WKL+KGS+ VA+                                                                
Subjt:  LRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR----------------------------------------------------------------

Query:  ----GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIF-----------------------------
            GKHHRVSF   S  ++ KLELVHSDVCG ++VESLGGN+YFVTFIDDA+R+TWVY+L+ K+QVF+ F                             
Subjt:  ----GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIF-----------------------------

Query:  -IKYYCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH
          K YCS++GIRHEKT P TPQHN +AERMNRTIVEKVRCMLRM+ LPK FW EAVQ   YLINR PSVPLG DIPER   G++ +YSHL+VFG K +MH
Subjt:  -IKYYCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH

Query:  VPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLL---TNKNVEDVLDDVHDNVGEIQPDG---NVPDHVDDDNL--DE
        VPKEQRSKLD K  PCVFVGYG+EE+GY+L+DPE+K++VRS++V+F EHE   DL      K +ED ++       E   DG     P+H  ++ +  DE
Subjt:  VPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLL---TNKNVEDVLDDVHDNVGEIQPDG---NVPDHVDDDNL--DE

Query:  SSSDEDIHEEDEHEEVIHEEVYEQAE
         S DE++   D       E  Y   +
Subjt:  SSSDEDIHEEDEHEEVIHEEVYEQAE

PRQ60431.1 putative RNA-directed DNA polymerase [Rosa chinensis]1.9e-11938.86Show/hide
Query:  VDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV------
        VD S+F+ VS E     LW KL  L+E+KTA  KA LIK LVN+K K G  ++EHL++FQ  IN+L TM + +DDELQAL LL SLPD+W+T V      
Subjt:  VDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV------

Query:  -------------------------------------------------------------------------KKEQGKKQDSQKEG----------DDR
                                                                                 KK   K +   KEG          D R
Subjt:  -------------------------------------------------------------------------KKEQGKKQDSQKEG----------DDR

Query:  DTVAITSKSDDDVTLICATSECHHVES-SYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRL
        +T A  S S D+   +C   EC HV+  S + W+VDSGAS+H  P  EYF  YQ GDFG VKMGN   + I G+GDI  +T +G  L+L++VR++P LRL
Subjt:  DTVAITSKSDDDVTLICATSECHHVES-SYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRL

Query:  NLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-------------------------------------------------------------------
        NL+S  VLD++GF H   D KW+L+KGS+ +AR                                                                   
Subjt:  NLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-------------------------------------------------------------------

Query:  GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------
        GK H+VSF    T ++  L+LVHSDVCG ++VES+G N+YFVT+IDDA+R+ WVY+LKTK+QVF+ F ++                              
Subjt:  GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------

Query:  YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKE
        YCS+HGI+H KT P TPQHN VAERMNRTI+EKVR ML+ ++L K FW EA+    YLINR+P VPLGL+ PE    GR  +YSHLRVFG K + HVPKE
Subjt:  YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKE

Query:  QRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQ--PDGNVPDHVDDDNLDESSSDEDIHEE
        QRSKLD K  PC+F+GYG+EE GYRL++P+ KK+ RS++VVF E +  AD   +KN  +  D +  +   +Q  P     D V++D  D + ++    E+
Subjt:  QRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQ--PDGNVPDHVDDDNLDESSSDEDIHEE

Query:  DEHEEVIHEEVYEQAERP
        ++  E    E  E A+ P
Subjt:  DEHEEVIHEEVYEQAERP

RVW91307.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.3e-11737.63Show/hide
Query:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---
        R  +D S+F+ VS E   + LW KLE LY+RK A NKA L ++LVN K K G  I+EHL++ + I+N+L  MKI  DDELQAL LLSSLP+SW+TLV   
Subjt:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---

Query:  ----------------------------------------------KKE-----------QGKKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVE
                                                      KK            QG +Q    E    DT  +T    D   +I       ++ 
Subjt:  ----------------------------------------------KKE-----------QGKKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVE

Query:  SSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKG
            +W++DSGAS+H   + ++F +Y  GDFG+V+MGN++++ I+G+GDI ++T+ GC L+LR+VRH+PD+RLNL+SA  LD EG+ + F DGKWKLSKG
Subjt:  SSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKG

Query:  SITVAR--------------------------------------------------------------------GKHHRVSFPGQSTGRQ-RKLELVHSD
        S+ VA+                                                                    GK +R+SF      R+   L+L+HSD
Subjt:  SITVAR--------------------------------------------------------------------GKHHRVSFPGQSTGRQ-RKLELVHSD

Query:  VCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY-----------------------------YCSQHGIRHEKTEPNTPQHNRVAERM
        VCG +KV +LGG  YFVTFIDD +R+ W Y LKTK+QV ++F  +                             YC+ HGIRHEKT   TPQ N VAERM
Subjt:  VCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY-----------------------------YCSQHGIRHEKTEPNTPQHNRVAERM

Query:  NRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRL
        NRTI+E+VRC+L  S LP++FW EA++    LIN SPSVPL  D+PE+   G++ +Y HLRVFG + ++H+PK++RSKLD K   C+F+GYG EE+GYRL
Subjt:  NRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRL

Query:  YDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDES----------SSDEDIHEEDEHEEVIHEEVYEQAER
        YDP  KKVVRS++VVF E +   D+   ++ E   DDV     EI+ + ++ + +    L  S          SS+E +   D+ E   ++EV +   +
Subjt:  YDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDES----------SSDEDIHEEDEHEEVIHEEVYEQAER

TrEMBL top hitse value%identityAlignment
A0A151THM1 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-14842.56Show/hide
Query:  KRLFKK---SGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSL
        K+L KK   + R  VDI+++NQVS+E  P  LWKKLE +YE K A +K  ++++L+NLKL+ G++++EHL+DF+ ++ +L +  + L+DE+QAL LL SL
Subjt:  KRLFKK---SGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSL

Query:  PDSWKTLV---------------------KKEQGKKQDSQK----------------------------------------------------------E
        PDSW TLV                       E+ +++D  K                                                          +
Subjt:  PDSWKTLV---------------------KKEQGKKQDSQK----------------------------------------------------------E

Query:  GDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPD
         +DR+T A TS SDD++TL+C   EC H+     EW++DS ASYHCVPKREYF  Y+ GDFG V MGN+S + I+G+GDI ++T VGC LIL++VRHIPD
Subjt:  GDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPD

Query:  LRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR----------------------------------------------------------------
        +RLNL+S NVLD+EG+ H+ + G+WKL+KGS+ VA+                                                                
Subjt:  LRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR----------------------------------------------------------------

Query:  ----GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIF-----------------------------
            GKHHRVSF   S  ++ KLELVHSDVCG ++VESLGGN+YFVTFIDDA+R+TWVY+L+ K+QVF+ F                             
Subjt:  ----GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIF-----------------------------

Query:  -IKYYCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH
          K YCS++GIRHEKT P TPQHN +AERMNRTIVEKVRCMLRM+ LPK FW EAVQ   YLINR PSVPLG DIPER   G++ +YSHL+VFG K +MH
Subjt:  -IKYYCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH

Query:  VPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLL---TNKNVEDVLDDVHDNVGEIQPDG---NVPDHVDDDNL--DE
        VPKEQRSKLD K  PCVFVGYG+EE+GY+L+DPE+K++VRS++V+F EHE   DL      K +ED ++       E   DG     P+H  ++ +  DE
Subjt:  VPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLL---TNKNVEDVLDDVHDNVGEIQPDG---NVPDHVDDDNL--DE

Query:  SSSDEDIHEEDEHEEVIHEEVYEQAE
         S DE++   D       E  Y   +
Subjt:  SSSDEDIHEEDEHEEVIHEEVYEQAE

A0A2N9EW40 Integrase catalytic domain-containing protein2.7e-14045.24Show/hide
Query:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLVK--
        R  +++S+F+ VSQE     LWKKLE LYERKTA NKA  I++L +LKLK G+S++EHLS+FQD++N+LT M +V+DD+LQAL LLSSLPDSW+TLV   
Subjt:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLVK--

Query:  -------------------KEQGKKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQ
                            E+ +++D  K+           +S    +      EC HV   Y EW++DS ASYH  P+RE+F +Y++G+ G VKMGN+
Subjt:  -------------------KEQGKKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQ

Query:  SLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR---------------------------------
        S   I+ +GDI V+T+ G  L L++VRHIPD+RLNL+S +VLD+EG+     +GKWKL KGS+  AR                                 
Subjt:  SLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR---------------------------------

Query:  -----------------------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFE
                                           GK HRVSF   ST +   L+LV+SDVCG I+VESLGGNRYFVTFIDDA+R+ WVY+LKTK+QVF+
Subjt:  -----------------------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFE

Query:  IFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSV
        +F K+                              YCS++GIRHEKT P TPQHN VAER+NRTIVEKVRCMLRM+ LPK+FWAEAVQ   YLINRSPSV
Subjt:  IFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSV

Query:  PLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVH
        PL  DIPER   G D +Y+HL+VFG KT+ HVPKEQR KLD K  PC+FVGYGD E+GY+L+DP+KKK++RS           +DL    +  D+  D+ 
Subjt:  PLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVH

Query:  DNVGEIQPDGNVPDHVD-DDNLDESSSDED
        +   +++  G+ P  VD DD +D    +++
Subjt:  DNVGEIQPDGNVPDHVD-DDNLDESSSDED

A0A2N9EZ52 Uncharacterized protein7.8e-14037.39Show/hide
Query:  MMEDLLYCKDLFDPIEATTGADGAIIKLAKPKKMEEKDWGKLKRKTLGTIWQWVCNVLNVSSSDGVSYDPPTNFTAARDLSGEQQSATVHCVRRRVDKVA
        MMED+LYCKDL DPIE  +         AKP  M +K+W K+ RKT+G                                          C+R+      
Subjt:  MMEDLLYCKDLFDPIEATTGADGAIIKLAKPKKMEEKDWGKLKRKTLGTIWQWVCNVLNVSSSDGVSYDPPTNFTAARDLSGEQQSATVHCVRRRVDKVA

Query:  AASSGSVRVFRQRRQFFWEITSTYSGRPTIVFSGFFSVERFRQVSKACIVVTVPKRLFKKSGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKA
                                                       CI                 ++S+F+ VSQE     LWKKLE LYERKTA NKA
Subjt:  AASSGSVRVFRQRRQFFWEITSTYSGRPTIVFSGFFSVERFRQVSKACIVVTVPKRLFKKSGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKA

Query:  SLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---------------------------KKEQG---------
          I++L +LKLK G+S++EHLS+FQD++N+LT M +V+DDELQAL LLSSLPDSW+TLV                           +K+ G         
Subjt:  SLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---------------------------KKEQG---------

Query:  -------------------------------------------------KKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGAS
                                                         K++ +QK+ DD +T A++S  +D V L     EC HV   Y EW++DS AS
Subjt:  -------------------------------------------------KKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGAS

Query:  YHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLS---------------
        YH  P+RE+F +Y+  + G VKMGN+S   I+G+GDI V+T+ G  L L++VRHIPD+RLNL+S +VLD+EG+     +GKWK S               
Subjt:  YHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLS---------------

Query:  ----------------------------------KGSITVAR--------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGN
                                          KG   +A+                    GK HRVSF   ST +   L+LV+SDVCG I+V+SLGGN
Subjt:  ----------------------------------KGSITVAR--------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGN

Query:  RYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCML
        RYFVTFIDDA+R+ WVY+LKTK+QVF++F K+                              YCS++GIRHEKT P TPQHN VAER+NRTIVEKVRCML
Subjt:  RYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCML

Query:  RMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSK
        RM+ LPK+FWAEAVQ   YLINRSPSVPL  DIPER   G D +Y+HL+VFG KT+ HVPKEQR KLD K  PC+FVGYGD E+GY+L+DP+KKK++RS+
Subjt:  RMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSK

Query:  NVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDESSSDEDIHEEDEHEEVIHEEVYEQAERPTP
        +VVF E+E   D   ++  +  ++ V D    + P  +  D   D    +  +  D   E + ++ I  E  EQ E+P P
Subjt:  NVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDESSSDEDIHEEDEHEEVIHEEVYEQAERPTP

A0A2N9FXG4 Uncharacterized protein1.8e-14438.57Show/hide
Query:  MMEDLLYCKDLFDPIEATTGADGAIIKLAKPKKMEEKDWGKLKRKTLGTIWQWVCNVLNVSSSDGVSYDPPTNFTAARDLSGEQQSATVHCVRRRVDKVA
        MMED+LYCKDL DPIE  +         AKP  M +K+W K+ RKT+G                                          C+R+      
Subjt:  MMEDLLYCKDLFDPIEATTGADGAIIKLAKPKKMEEKDWGKLKRKTLGTIWQWVCNVLNVSSSDGVSYDPPTNFTAARDLSGEQQSATVHCVRRRVDKVA

Query:  AASSGSVRVFRQRRQFFWEITSTYSGRPTIVFSGFFSVERFRQVSKACIVVTVPKRLFKKSGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKA
                                                       CI                 ++S+F+ VSQE     LWKKLE LYERKTA NKA
Subjt:  AASSGSVRVFRQRRQFFWEITSTYSGRPTIVFSGFFSVERFRQVSKACIVVTVPKRLFKKSGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKA

Query:  SLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---------------------------KKEQG---------
          I++L +LKLK G+S++EHLS+FQD++N+LT M +V+DDELQAL LLSSLPDSW+TLV                           +K+ G         
Subjt:  SLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---------------------------KKEQG---------

Query:  -------------------------------------------------KKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGAS
                                                         K++ +QK+ DD ++ A+ S  +D V L     EC HV   Y EW++DS AS
Subjt:  -------------------------------------------------KKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGAS

Query:  YHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-------
        YH  P+RE+F +Y++G+ G VKMGN+S   I+G+GDI V+T+ G  L L++VRHIPD+RLNL+S +VLD+EG+     +GKWKL KGS+  AR       
Subjt:  YHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-------

Query:  -------------------------------------------------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNR
                                                                     GK HRVSF   ST +   L+LV+SDVCG I+VESLGGNR
Subjt:  -------------------------------------------------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNR

Query:  YFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLR
        YFVTFIDDA+R+ WVY+LKTK+QVF++F K+                              YCS++GIRHEKT P TPQHN VAER+NRTIVEKVRCMLR
Subjt:  YFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLR

Query:  MSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKN
        M+ LPK+FWAEAVQ   YLINRSPSVPL  DIPER   G D +Y+HL+VF  KT+ HVPKEQR KLD K  PC+FVGYGD E+GY+L+DP+KKK++RS++
Subjt:  MSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKN

Query:  VVFFEHEKGADLLTNKNVEDVLDDVHD
        VVF E+E   D   ++  +  ++ V D
Subjt:  VVFFEHEKGADLLTNKNVEDVLDDVHD

A0A2N9IWX1 Uncharacterized protein2.0e-14337.77Show/hide
Query:  MMEDLLYCKDLFDPIEATTGADGAIIKLAKPKKMEEKDWGKLKRKTLGTIWQWVCNVLNVSSSDGVSYDPPTNFTAARDLSGEQQSATVHCVRRRVDKVA
        MMED+LYCKDL DPIE  +         AKP  M +K+W K+ RKT+G                                          C+R+      
Subjt:  MMEDLLYCKDLFDPIEATTGADGAIIKLAKPKKMEEKDWGKLKRKTLGTIWQWVCNVLNVSSSDGVSYDPPTNFTAARDLSGEQQSATVHCVRRRVDKVA

Query:  AASSGSVRVFRQRRQFFWEITSTYSGRPTIVFSGFFSVERFRQVSKACIVVTVPKRLFKKSGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKA
                                                       CI                 ++S+F+ VSQE     LWKKLE LYERKTA NKA
Subjt:  AASSGSVRVFRQRRQFFWEITSTYSGRPTIVFSGFFSVERFRQVSKACIVVTVPKRLFKKSGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKA

Query:  SLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---------------------------KKEQG---------
          I++L +LKLK G+S++EHLS+FQD++N+LT M +V+DDELQAL LLSSLPDSW+TLV                           +K+ G         
Subjt:  SLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---------------------------KKEQG---------

Query:  -------------------------------------------------KKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGAS
                                                         K++ +QK+ DD +T A++S  +D V L     EC HV   Y EW++DS AS
Subjt:  -------------------------------------------------KKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGAS

Query:  YHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-------
        YH  P+RE+F +Y++G+ G VKMGN+S   I+G+GDI V+T+ G  L L++VRHIPD+RLNL+S +VLD+EG+     +GKWKL KGS+  AR       
Subjt:  YHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVAR-------

Query:  -------------------------------------------------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNR
                                                                     GK HRVSF   ST +   L+LV+SDVCG I+VESLGGNR
Subjt:  -------------------------------------------------------------GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNR

Query:  YFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLR
        YFVTFIDDA+R+ WVY+LKTK+QVF++F K+                              YCS++GIRHEKT P TPQHN VAER+NRTIVEKVRCMLR
Subjt:  YFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLR

Query:  MSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKN
        M+ LPK+FWAEAVQ   YLINRSPSVPL  DIPER   G D +Y+HL+VFG KT+ HVPKEQR KLD K  PC+FVGYGD E+GY+L+DP+KKK++RS++
Subjt:  MSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKN

Query:  VVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDESSSDEDIHEEDEHEEVIHEEVYEQAERPTP
        VVF E+E   D   ++  +  ++ V D    + P  +  D   D    +  +  D     + ++ I  E  EQ E+P P
Subjt:  VVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDESSSDEDIHEEDEHEEVIHEEVYEQAERPTP

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.0e-3235.89Show/hide
Query:  GKHHRVSFP--GQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIF------------------------------I
        GK  R+ F      T  +R L +VHSDVCG I   +L    YFV F+D  T     Y++K K+ VF +F                              +
Subjt:  GKHHRVSFP--GQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIF------------------------------I

Query:  KYYCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPL--GLDIPERAGKGRDPTYSHLRVFGSKTYMH
        + +C + GI +  T P+TPQ N V+ERM RTI EK R M+  + L K+FW EAV    YLINR PS  L      P      + P   HLRVFG+  Y+H
Subjt:  KYYCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPL--GLDIPERAGKGRDPTYSHLRVFGSKTYMH

Query:  VPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFE
        + K ++ K D K+   +FVGY  E  G++L+D   +K + +++VV  E
Subjt:  VPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-11536.58Show/hide
Query:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---
        R  +   + N +  E     +W +LE LY  KT  NK  L K+L  L +  G +   HL+ F  +I +L  + + +++E +A+ LL+SLP S+  L    
Subjt:  RYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEHLSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLV---

Query:  -----------------------------------------------------------------------------------KKEQGKKQDSQKEGDDR
                                                                                              +GK + S ++ DD 
Subjt:  -----------------------------------------------------------------------------------KKEQGKKQDSQKEGDDR

Query:  DTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLN
        +T A+   +D+ V  I    EC H+     EW+VD+ AS+H  P R+ F  Y +GDFG+VKMGN S + I G+GDI +KT+VGC L+L++VRH+PDLR+N
Subjt:  DTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQSGDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLN

Query:  LLSANVLDQEGFRHTFEDGKWKLSKGSITVAR--------------------------------------------------------------------
        L+S   LD++G+   F + KW+L+KGS+ +A+                                                                    
Subjt:  LLSANVLDQEGFRHTFEDGKWKLSKGSITVAR--------------------------------------------------------------------

Query:  GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------
        GK HRVSF   S  +   L+LV+SDVCG +++ES+GGN+YFVTFIDDA+R+ WVY+LKTK+QVF++F K+                              
Subjt:  GKHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY------------------------------

Query:  YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKE
        YCS HGIRHEKT P TPQHN VAERMNRTIVEKVR MLRM+ LPK+FW EAVQ   YLINRSPSVPL  +IPER    ++ +YSHL+VFG + + HVPKE
Subjt:  YCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKE

Query:  QRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHE--KGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDESSSDEDIHEE
        QR+KLD K+ PC+F+GYGDEE+GYRL+DP KKKV+RS++VVF E E    AD+      E V + +  N   I    N P   +    + S   E   E 
Subjt:  QRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHE--KGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVDDDNLDESSSDEDIHEE

Query:  DEHEEVIHEEVYEQAERPT
         E  E + E V E+ E PT
Subjt:  DEHEEVIHEEVYEQAERPT

P92512 Uncharacterized mitochondrial protein AtMg007103.3e-1046.48Show/hide
Query:  MNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH
        MNRTI+EKVR ML    LPKTF A+A     ++IN+ PS  +   +P+       PTYS+LR FG   Y+H
Subjt:  MNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-3135.12Show/hide
Query:  KHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY----------------------------YCS
        K ++V F   +    R LE ++SDV  S  + S    RY+V F+D  TR TW+Y LK K+QV E FI +                            Y S
Subjt:  KHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY----------------------------YCS

Query:  QHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRS
        QHGI H  + P+TP+HN ++ER +R IVE    +L  + +PKT+W  A     YLINR P+  L L+ P +   G  P Y  LRVFG   Y  +    + 
Subjt:  QHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRS

Query:  KLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEH
        KLD K+  CVF+GY   +  Y     +  ++  S++V F E+
Subjt:  KLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.4e-3134.02Show/hide
Query:  KHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY----------------------------YCS
        K H+V F   +    + LE ++SDV  S  + S+   RY+V F+D  TR TW+Y LK K+QV + FI +                            Y S
Subjt:  KHHRVSFPGQSTGRQRKLELVHSDVCGSIKVESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKY----------------------------YCS

Query:  QHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRS
        QHGI H  + P+TP+HN ++ER +R IVE    +L  + +PKT+W  A     YLINR P+  L L  P +   G+ P Y  L+VFG   Y  +    R 
Subjt:  QHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMHVPKEQRS

Query:  KLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFE
        KL+ K+  C F+GY   +  Y        ++  S++V F E
Subjt:  KLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFE

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-1146.48Show/hide
Query:  MNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH
        MNRTI+EKVR ML    LPKTF A+A     ++IN+ PS  +   +P+       PTYS+LR FG   Y+H
Subjt:  MNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIPERAGKGRDPTYSHLRVFGSKTYMH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAGGATTTGTTATATTGTAAAGATTTGTTTGATCCAATTGAAGCAACCACAGGAGCTGATGGTGCCATTATTAAACTTGCAAAACCAAAGAAAATGGAAGAGAA
AGATTGGGGAAAGTTAAAGAGGAAAACTTTGGGGACTATTTGGCAGTGGGTTTGTAACGTCCTTAATGTAAGCAGCTCCGACGGCGTCTCCTACGATCCTCCGACAAACT
TCACGGCGGCGCGCGACCTCTCCGGCGAGCAGCAGTCTGCGACAGTTCATTGCGTGCGGCGGCGCGTCGACAAGGTTGCAGCAGCAAGTTCAGGCTCTGTGCGTGTTTTT
CGGCAAAGGCGGCAGTTTTTTTGGGAGATTACCAGCACCTACAGTGGGCGGCCAACAATTGTTTTTAGTGGGTTTTTCTCCGTTGAGCGTTTCCGGCAAGTCTCCAAAGC
ATGCATTGTCGTAACGGTCCCGAAAAGACTATTTAAGAAGTCGGGTCGTTACAAGGTTGATATTAGCATTTTCAACCAGGTGTCTCAAGAGTTCGGCCCATATGCGTTGT
GGAAGAAATTGGAAGGTCTTTATGAAAGGAAGACCGCGCATAACAAAGCTTCATTAATCAAGAGGCTTGTTAATTTGAAATTGAAGGGTGGGAAGTCTATTTCTGAGCAT
TTGAGTGACTTCCAGGATATTATTAACAAGTTGACTACCATGAAAATTGTTCTTGATGATGAATTGCAAGCTTTGTTCTTGTTAAGTTCTTTGCCGGACAGTTGGAAAAC
TTTGGTTAAGAAAGAACAAGGTAAGAAGCAAGACTCACAAAAAGAGGGAGATGATAGAGATACTGTAGCTATCACTTCTAAGAGTGATGATGATGTCACTTTGATTTGTG
CAACTAGTGAGTGTCATCATGTAGAGAGTTCATATATAGAGTGGCTGGTAGATTCAGGTGCTTCATACCATTGTGTTCCCAAAAGGGAGTATTTCATGAACTATCAATCT
GGTGATTTTGGTAGTGTGAAGATGGGAAATCAGAGCTTAACTACCATTATTGGGTTAGGTGATATTCGTGTGAAGACAAGTGTTGGGTGCATTCTTATATTAAGGAATGT
GCGCCATATTCCTGATTTACGCCTTAACTTGTTGTCTGCAAATGTTCTTGATCAAGAAGGATTTCGACATACCTTTGAAGATGGTAAATGGAAATTGTCTAAGGGTTCAA
TAACAGTTGCTCGTGGCAAGCATCATAGAGTTTCTTTTCCTGGGCAGTCCACTGGAAGGCAAAGAAAGTTGGAGTTAGTTCACTCTGATGTTTGTGGTTCTATTAAGGTT
GAGTCTCTTGGTGGCAATAGATATTTTGTCACTTTCATTGATGATGCTACTCGAAGAACTTGGGTGTATATGTTGAAGACAAAAAACCAAGTGTTCGAGATTTTCATAAA
GTATTATTGTTCACAACATGGCATTAGACATGAGAAGACAGAGCCCAACACACCGCAACACAATAGAGTTGCTGAGAGGATGAACAGAACCATTGTAGAGAAGGTGAGGT
GCATGCTTAGAATGTCTGATCTTCCTAAGACATTTTGGGCTGAAGCAGTGCAATGTGTCGAGTATTTGATTAATCGATCTCCATCGGTTCCTCTTGGTTTAGACATTCCA
GAGAGAGCAGGGAAGGGACGTGATCCTACTTATTCGCATCTGAGAGTCTTTGGCAGCAAGACATACATGCATGTGCCTAAAGAACAGAGGTCAAAGCTTGATTCGAAGAC
CAATCCGTGCGTATTTGTTGGGTATGGGGATGAAGAGTATGGTTATAGGCTCTACGATCCAGAAAAGAAGAAGGTAGTGAGAAGCAAAAATGTAGTATTCTTTGAGCACG
AGAAGGGTGCAGATCTTCTGACAAACAAGAATGTTGAAGATGTACTTGATGATGTACACGATAATGTTGGTGAGATACAACCTGATGGTAATGTACCAGATCATGTTGAT
GATGATAATCTTGATGAGTCTTCATCTGATGAAGATATCCATGAGGAAGATGAACATGAGGAAGTGATTCATGAAGAAGTTTATGAGCAGGCGGAGCGACCTACCCCTTA
A
mRNA sequenceShow/hide mRNA sequence
ATGATGGAGGATTTGTTATATTGTAAAGATTTGTTTGATCCAATTGAAGCAACCACAGGAGCTGATGGTGCCATTATTAAACTTGCAAAACCAAAGAAAATGGAAGAGAA
AGATTGGGGAAAGTTAAAGAGGAAAACTTTGGGGACTATTTGGCAGTGGGTTTGTAACGTCCTTAATGTAAGCAGCTCCGACGGCGTCTCCTACGATCCTCCGACAAACT
TCACGGCGGCGCGCGACCTCTCCGGCGAGCAGCAGTCTGCGACAGTTCATTGCGTGCGGCGGCGCGTCGACAAGGTTGCAGCAGCAAGTTCAGGCTCTGTGCGTGTTTTT
CGGCAAAGGCGGCAGTTTTTTTGGGAGATTACCAGCACCTACAGTGGGCGGCCAACAATTGTTTTTAGTGGGTTTTTCTCCGTTGAGCGTTTCCGGCAAGTCTCCAAAGC
ATGCATTGTCGTAACGGTCCCGAAAAGACTATTTAAGAAGTCGGGTCGTTACAAGGTTGATATTAGCATTTTCAACCAGGTGTCTCAAGAGTTCGGCCCATATGCGTTGT
GGAAGAAATTGGAAGGTCTTTATGAAAGGAAGACCGCGCATAACAAAGCTTCATTAATCAAGAGGCTTGTTAATTTGAAATTGAAGGGTGGGAAGTCTATTTCTGAGCAT
TTGAGTGACTTCCAGGATATTATTAACAAGTTGACTACCATGAAAATTGTTCTTGATGATGAATTGCAAGCTTTGTTCTTGTTAAGTTCTTTGCCGGACAGTTGGAAAAC
TTTGGTTAAGAAAGAACAAGGTAAGAAGCAAGACTCACAAAAAGAGGGAGATGATAGAGATACTGTAGCTATCACTTCTAAGAGTGATGATGATGTCACTTTGATTTGTG
CAACTAGTGAGTGTCATCATGTAGAGAGTTCATATATAGAGTGGCTGGTAGATTCAGGTGCTTCATACCATTGTGTTCCCAAAAGGGAGTATTTCATGAACTATCAATCT
GGTGATTTTGGTAGTGTGAAGATGGGAAATCAGAGCTTAACTACCATTATTGGGTTAGGTGATATTCGTGTGAAGACAAGTGTTGGGTGCATTCTTATATTAAGGAATGT
GCGCCATATTCCTGATTTACGCCTTAACTTGTTGTCTGCAAATGTTCTTGATCAAGAAGGATTTCGACATACCTTTGAAGATGGTAAATGGAAATTGTCTAAGGGTTCAA
TAACAGTTGCTCGTGGCAAGCATCATAGAGTTTCTTTTCCTGGGCAGTCCACTGGAAGGCAAAGAAAGTTGGAGTTAGTTCACTCTGATGTTTGTGGTTCTATTAAGGTT
GAGTCTCTTGGTGGCAATAGATATTTTGTCACTTTCATTGATGATGCTACTCGAAGAACTTGGGTGTATATGTTGAAGACAAAAAACCAAGTGTTCGAGATTTTCATAAA
GTATTATTGTTCACAACATGGCATTAGACATGAGAAGACAGAGCCCAACACACCGCAACACAATAGAGTTGCTGAGAGGATGAACAGAACCATTGTAGAGAAGGTGAGGT
GCATGCTTAGAATGTCTGATCTTCCTAAGACATTTTGGGCTGAAGCAGTGCAATGTGTCGAGTATTTGATTAATCGATCTCCATCGGTTCCTCTTGGTTTAGACATTCCA
GAGAGAGCAGGGAAGGGACGTGATCCTACTTATTCGCATCTGAGAGTCTTTGGCAGCAAGACATACATGCATGTGCCTAAAGAACAGAGGTCAAAGCTTGATTCGAAGAC
CAATCCGTGCGTATTTGTTGGGTATGGGGATGAAGAGTATGGTTATAGGCTCTACGATCCAGAAAAGAAGAAGGTAGTGAGAAGCAAAAATGTAGTATTCTTTGAGCACG
AGAAGGGTGCAGATCTTCTGACAAACAAGAATGTTGAAGATGTACTTGATGATGTACACGATAATGTTGGTGAGATACAACCTGATGGTAATGTACCAGATCATGTTGAT
GATGATAATCTTGATGAGTCTTCATCTGATGAAGATATCCATGAGGAAGATGAACATGAGGAAGTGATTCATGAAGAAGTTTATGAGCAGGCGGAGCGACCTACCCCTTA
A
Protein sequenceShow/hide protein sequence
MMEDLLYCKDLFDPIEATTGADGAIIKLAKPKKMEEKDWGKLKRKTLGTIWQWVCNVLNVSSSDGVSYDPPTNFTAARDLSGEQQSATVHCVRRRVDKVAAASSGSVRVF
RQRRQFFWEITSTYSGRPTIVFSGFFSVERFRQVSKACIVVTVPKRLFKKSGRYKVDISIFNQVSQEFGPYALWKKLEGLYERKTAHNKASLIKRLVNLKLKGGKSISEH
LSDFQDIINKLTTMKIVLDDELQALFLLSSLPDSWKTLVKKEQGKKQDSQKEGDDRDTVAITSKSDDDVTLICATSECHHVESSYIEWLVDSGASYHCVPKREYFMNYQS
GDFGSVKMGNQSLTTIIGLGDIRVKTSVGCILILRNVRHIPDLRLNLLSANVLDQEGFRHTFEDGKWKLSKGSITVARGKHHRVSFPGQSTGRQRKLELVHSDVCGSIKV
ESLGGNRYFVTFIDDATRRTWVYMLKTKNQVFEIFIKYYCSQHGIRHEKTEPNTPQHNRVAERMNRTIVEKVRCMLRMSDLPKTFWAEAVQCVEYLINRSPSVPLGLDIP
ERAGKGRDPTYSHLRVFGSKTYMHVPKEQRSKLDSKTNPCVFVGYGDEEYGYRLYDPEKKKVVRSKNVVFFEHEKGADLLTNKNVEDVLDDVHDNVGEIQPDGNVPDHVD
DDNLDESSSDEDIHEEDEHEEVIHEEVYEQAERPTP