CuGenDBv2

Lag0028603 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028603
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr8:26115317..26117444
RNA-Seq Expression: Lag0028603
Synteny: Lag0028603
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
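The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts and the span length computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr8:26115317..26117444'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("chr8:26115317..26117444")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene is 2128 bp.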


Homology
GenBank top hits (e value, % identity, alignment)
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]; e value: 2.7e-104; % identity: 38.78
Query:  SIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWLYNSMT
        S+KLDR N+ LWK+L +P++R  KL+G++L T  CP EF+                            +SS+S +  N  +  W A DQ LLGW+ NSMT
Subjt:  SIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWLYNSMT

Query:  PEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQGRASV
         E+ATQ++  + +K+ W+  + L G  +R++  +L+  F   RKG MKM DYL  MKN  D L  +G+P+S  +L  Q L GLD EYN VV  +  + ++
Subjt:  PEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQGRASV

Query:  FWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR--YRGRGRGYNNYNNRTTCQVCGKVGHSAMV
         W +LQA+LL FE R+E  N   +  N   N+TAN+            N ++  G+   NNN RG+ +R    GRGRG +  N    CQVCG   H A+ 
Subjt:  FWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR--YRGRGRGYNNYNNRTTCQVCGKVGHSAMV

Query:  CYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGS
        C+HRFDK +S    R N +         G   Q   N F+ +Q      ++ D +WY DSG SNHVT   E   + T++ G   + +GNG KL I   GS
Subjt:  CYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGS

Query:  SSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSH------------GHSSTLQSNK
        S L +    L+L ++L VP I KNL+S+SKLA DN++ +E   N C V DK T +V+LKG LKDGLYQL G + + S             GH +    +K
Subjt:  SSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSH------------GHSSTLQSNK

Query:  NLESVFVVSNVMPSVNVVVSKQVWH--------RCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFLNVVK
         LES  V   V PS N    +   +        +     A + L+L         PI+           +DDFSRF W+YPLK KS+T  AF  F N+ +
Subjt:  NLESVFVVSNVMPSVNVVVSKQVWH--------RCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFLNVVK

Query:  TQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
         QF   IK +Q D GGE+  V KL  + GI+ R SCPYTS QNGRAERK+RH+ E
Subjt:  TQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]; e value: 1.7e-103; % identity: 38.24
Query:  LNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWL
        L    S+KLDR N+ LW+++ +PI+R  +L+G++L    CP EF+                            ++++S +  NP++E W A DQ LLGWL
Subjt:  LNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWL

Query:  YNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQ
         NSMT  +ATQ++  + + + W+  + L G  +R++  +L+  F  TRKG MKM DYL  MKN AD L  +G+PIS  +L  Q L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQ

Query:  GRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR----YRGRGRGYNNYNNRTTCQVCGK
         + ++ W +LQA+LL FE R+E  N   S  N   N+TAN V  K  +   + N NN     G NNN RG+  R     RGRGR +     +TTCQVCG 
Subjt:  GRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR----YRGRGRGYNNYNNRTTCQVCGK

Query:  VGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKL
          H A+ C++RFDK +S    R N + N           Q   N F+ +Q      +I D +WY DSG SNHVT   +   N +++ G   + +GNG KL
Subjt:  VGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKL

Query:  PITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSSTLQSNKNLESV
         I   GSS L +    L+L ++L VP+I KNL+S+SKLA DN++ +E   N C V DK T + +L+G LKDGLYQL     S       +        + 
Subjt:  PITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSSTLQSNKNLESV

Query:  FVVSNVMPSVNVVVSKQ-----------------VWHRCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFL
         V+  V+ S NV +S                    +     H A ++L+L         PI+           +DDF+RF W+YPLK KSDT  AF  F 
Subjt:  FVVSNVMPSVNVVVSKQ-----------------VWHRCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFL

Query:  NVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
        N+V+ QF   IK +Q D GGE+  V K   + GI+ R SCPYTS QNGRAERK+RH+ E
Subjt:  NVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]; e value: 1.5e-99; % identity: 36.97
Query:  NQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLL
        N L ++I S+ LDR NF LWK+L +PI+R  +L+G++L T  CP +F+                            S+  S + +NP +  W A DQ +L
Subjt:  NQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLL

Query:  GWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVA
        GWL N+MT   A+Q++  + +K+ WE  + L    +R++  +LR  F  TRKG  KM DYL  MK+ AD L  +GSPI+N +L  Q L GLD +YN +V 
Subjt:  GWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVA

Query:  IVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRTTCQVCGKV
         +  + ++ W +LQA+LL FE RL+  NS     N  +N+T N V +K        NH        F N + G     RG+GR  N+      CQVC K 
Subjt:  IVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRTTCQVCGKV

Query:  GHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKLP
        GH+A+ C HR+DK ++     G+   N    R      Q+  N F+ ++  +      D  WY DSG SNHVT   +     T+  G   + +GNGAKL 
Subjt:  GHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKLP

Query:  ITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSSTLQSNKNLESVF
        I   GSS L N    L+L +VL VP+I KNL+S+SKL  DN++ +E   + C V DK T +V+L+G LKDGLYQL       S+G S   Q+NK+     
Subjt:  ITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSSTLQSNKNLESVF

Query:  VVSNVMPSVNVVVSKQVWHRCLGHPAAKMLDLYSIPILCVV-----------------------------------------------------------
              P V + V K+ WHR LGHP+  +LD   +  +C V                                                           
Subjt:  VVSNVMPSVNVVVSKQVWHRCLGHPAAKMLDLYSIPILCVV-----------------------------------------------------------

Query:  -LDDFSRFVWLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
         +DD SRF W+YPLK KSDT  AF  F N+V+ QF   IK +Q D GGEF  V K+  + GI+ R SCPYTS QNGRAERK+RHV E
Subjt:  -LDDFSRFVWLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]; e value: 1.6e-101; % identity: 45.1
Query:  MENVYFTRIPYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGAS
        M N   T  P S S+  FS+PPLNQ+LNQ+T++KLDR N+LLWK L +PIL+ YKLEGHL A + CP  F+     +N++ T E   GA        GAS
Subjt:  MENVYFTRIPYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGAS

Query:  SSESRRTVNPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSP
        SS + R VNP +E WV  D LLLGWLYNSMTP+VA Q+MG  N ++ W+A ++ FGVQSRA+EDFLRQ  Q TRK                         
Subjt:  SSESRRTVNPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSP

Query:  ISNRNLFSQVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSE-KSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGN
                    GLDE YN+V+ ++QG+  + W ++Q++LL+FEKRL+ QN++ K+  N  Q+   NM      N  + Q++    G     N Q  +G 
Subjt:  ISNRNLFSQVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSE-KSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGN

Query:  RYRGRGRGYNNYNNRTTCQVCGKVGHSAMVCYHRFDKEF-SPVHKRGN---GNGNYPPSRGNGQQLQQQSNVFMTAQQT---ATPETIADPNWYADSGTS
        R         N NN  TCQ+CGK GHSA+VCY+RF+KEF SP+ +  N    NG+  P+            VF++ Q     ATP+T+ DPNWY DSG +
Subjt:  RYRGRGRGYNNYNNRTTCQVCGKVGHSAMVCYHRFDKEF-SPVHKRGN---GNGNYPPSRGNGQQLQQQSNVFMTAQQT---ATPETIADPNWYADSGTS

Query:  NHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKST
        NHVT+   N+TNPT+Y G E VT+GNG +L I+ VG++ L++G   L L+N+LCVP+IAKNL+S+SKLA+DN ++IE HG  C + DKST
Subjt:  NHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKST

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]; e value: 7.8e-112; % identity: 41.25
Query:  PYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFL-QIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTV
        P   S   F+SPPLNQLLNQITSIK+DRGNFLLW+NL +PILRSYKL  +L     CPP  L   + PTN                 + G++SS+S  T+
Subjt:  PYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFL-QIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTV

Query:  NPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFS
        NP YEAW+ VD+LLLGWLYNSM  +VA QVMG   ++E W A++ELFGVQSRA+ D+L+Q FQQT KG+++M +YL+ MK+HADNL  +GS +S R+L S
Subjt:  NPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFS

Query:  QVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGY
        QVL GLDEEYN +V  VQG+ ++ W E+ AELL +EKRLE QNS KS +   Q  T ++    G +    Q  NN        NN  G+      RG GY
Subjt:  QVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGY

Query:  NNYNNRTTCQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFM---TAQQTATPETIADPNWYADSGTSNHVTKNYENITNP
                    G  G                  +R  G G  P    N        NVF    T+    TPET+ DP+WYADSG ++HVT N  N+   
Subjt:  NNYNNRTTCQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFM---TAQQTATPETIADPNWYADSGTSNHVTKNYENITNP

Query:  TDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQL---HGAQ
         DY G E V + NG KL I+ +GS+++      L L++VL VP+IAKNL                        DK++ R +LKGTLKD LY+L   H + 
Subjt:  TDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQL---HGAQ

Query:  PSTS------HGHSSTLQSNKNLESVFVVSN--VMPSVNVVVSKQVWHRCLGHPAAKMLDLYSIPILCVVLDDFSRFVWLYPLKLKSDTQAAFSHFLNVV
        P+T         H+    SN  L S     +      +NVVVS  VWH+ LGHP+ ++L L                                       
Subjt:  PSTS------HGHSSTLQSNKNLESVFVVSN--VMPSVNVVVSKQVWHRCLGHPAAKMLDLYSIPILCVVLDDFSRFVWLYPLKLKSDTQAAFSHFLNVV

Query:  KTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVET
                   QSD GGE+  +H LC  LGI+ R S PYTSAQNGRAERK+RH+VET
Subjt:  KTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVET

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X1; e value: 7.9e-102; % identity: 45.1
Query:  MENVYFTRIPYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGAS
        M N   T  P S S+  FS+PPLNQ+LNQ+T++KLDR N+LLWK L +PIL+ YKLEGHL A + CP  F+     +N++ T E   GA        GAS
Subjt:  MENVYFTRIPYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGAS

Query:  SSESRRTVNPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSP
        SS + R VNP +E WV  D LLLGWLYNSMTP+VA Q+MG  N ++ W+A ++ FGVQSRA+EDFLRQ  Q TRK                         
Subjt:  SSESRRTVNPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSP

Query:  ISNRNLFSQVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSE-KSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGN
                    GLDE YN+V+ ++QG+  + W ++Q++LL+FEKRL+ QN++ K+  N  Q+   NM      N  + Q++    G     N Q  +G 
Subjt:  ISNRNLFSQVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSE-KSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGN

Query:  RYRGRGRGYNNYNNRTTCQVCGKVGHSAMVCYHRFDKEF-SPVHKRGN---GNGNYPPSRGNGQQLQQQSNVFMTAQQT---ATPETIADPNWYADSGTS
        R         N NN  TCQ+CGK GHSA+VCY+RF+KEF SP+ +  N    NG+  P+            VF++ Q     ATP+T+ DPNWY DSG +
Subjt:  RYRGRGRGYNNYNNRTTCQVCGKVGHSAMVCYHRFDKEF-SPVHKRGN---GNGNYPPSRGNGQQLQQQSNVFMTAQQT---ATPETIADPNWYADSGTS

Query:  NHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKST
        NHVT+   N+TNPT+Y G E VT+GNG +L I+ VG++ L++G   L L+N+LCVP+IAKNL+S+SKLA+DN ++IE HG  C + DKST
Subjt:  NHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKST

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment); e value: 8.4e-104; % identity: 38.24
Query:  LNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWL
        L    S+KLDR N+ LW+++ +PI+R  +L+G++L    CP EF+                            ++++S +  NP++E W A DQ LLGWL
Subjt:  LNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWL

Query:  YNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQ
         NSMT  +ATQ++  + + + W+  + L G  +R++  +L+  F  TRKG MKM DYL  MKN AD L  +G+PIS  +L  Q L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQ

Query:  GRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR----YRGRGRGYNNYNNRTTCQVCGK
         + ++ W +LQA+LL FE R+E  N   S  N   N+TAN V  K  +   + N NN     G NNN RG+  R     RGRGR +     +TTCQVCG 
Subjt:  GRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR----YRGRGRGYNNYNNRTTCQVCGK

Query:  VGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKL
          H A+ C++RFDK +S    R N + N           Q   N F+ +Q      +I D +WY DSG SNHVT   +   N +++ G   + +GNG KL
Subjt:  VGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKL

Query:  PITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSSTLQSNKNLESV
         I   GSS L +    L+L ++L VP+I KNL+S+SKLA DN++ +E   N C V DK T + +L+G LKDGLYQL     S       +        + 
Subjt:  PITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSSTLQSNKNLESV

Query:  FVVSNVMPSVNVVVSKQ-----------------VWHRCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFL
         V+  V+ S NV +S                    +     H A ++L+L         PI+           +DDF+RF W+YPLK KSDT  AF  F 
Subjt:  FVVSNVMPSVNVVVSKQ-----------------VWHRCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFL

Query:  NVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
        N+V+ QF   IK +Q D GGE+  V K   + GI+ R SCPYTS QNGRAERK+RH+ E
Subjt:  NVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

A0A2Z6MBG6 Integrase catalytic domain-containing protein; e value: 1.3e-104; % identity: 38.78
Query:  SIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWLYNSMT
        S+KLDR N+ LWK+L +P++R  KL+G++L T  CP EF+                            +SS+S +  N  +  W A DQ LLGW+ NSMT
Subjt:  SIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWLYNSMT

Query:  PEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQGRASV
         E+ATQ++  + +K+ W+  + L G  +R++  +L+  F   RKG MKM DYL  MKN  D L  +G+P+S  +L  Q L GLD EYN VV  +  + ++
Subjt:  PEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQGRASV

Query:  FWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR--YRGRGRGYNNYNNRTTCQVCGKVGHSAMV
         W +LQA+LL FE R+E  N   +  N   N+TAN+            N ++  G+   NNN RG+ +R    GRGRG +  N    CQVCG   H A+ 
Subjt:  FWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR--YRGRGRGYNNYNNRTTCQVCGKVGHSAMV

Query:  CYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGS
        C+HRFDK +S    R N +         G   Q   N F+ +Q      ++ D +WY DSG SNHVT   E   + T++ G   + +GNG KL I   GS
Subjt:  CYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGS

Query:  SSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSH------------GHSSTLQSNK
        S L +    L+L ++L VP I KNL+S+SKLA DN++ +E   N C V DK T +V+LKG LKDGLYQL G + + S             GH +    +K
Subjt:  SSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSH------------GHSSTLQSNK

Query:  NLESVFVVSNVMPSVNVVVSKQVWH--------RCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFLNVVK
         LES  V   V PS N    +   +        +     A + L+L         PI+           +DDFSRF W+YPLK KS+T  AF  F N+ +
Subjt:  NLESVFVVSNVMPSVNVVVSKQVWH--------RCLGHPAAKMLDLY------SIPILCV--------VLDDFSRFVWLYPLKLKSDTQAAFSHFLNVVK

Query:  TQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
         QF   IK +Q D GGE+  V KL  + GI+ R SCPYTS QNGRAERK+RH+ E
Subjt:  TQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

A0A6J1DCW4 uncharacterized protein LOC111019598; e value: 3.8e-112; % identity: 41.25
Query:  PYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFL-QIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTV
        P   S   F+SPPLNQLLNQITSIK+DRGNFLLW+NL +PILRSYKL  +L     CPP  L   + PTN                 + G++SS+S  T+
Subjt:  PYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFL-QIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTV

Query:  NPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFS
        NP YEAW+ VD+LLLGWLYNSM  +VA QVMG   ++E W A++ELFGVQSRA+ D+L+Q FQQT KG+++M +YL+ MK+HADNL  +GS +S R+L S
Subjt:  NPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFS

Query:  QVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGY
        QVL GLDEEYN +V  VQG+ ++ W E+ AELL +EKRLE QNS KS +   Q  T ++    G +    Q  NN        NN  G+      RG GY
Subjt:  QVLLGLDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGY

Query:  NNYNNRTTCQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFM---TAQQTATPETIADPNWYADSGTSNHVTKNYENITNP
                    G  G                  +R  G G  P    N        NVF    T+    TPET+ DP+WYADSG ++HVT N  N+   
Subjt:  NNYNNRTTCQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFM---TAQQTATPETIADPNWYADSGTSNHVTKNYENITNP

Query:  TDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQL---HGAQ
         DY G E V + NG KL I+ +GS+++      L L++VL VP+IAKNL                        DK++ R +LKGTLKD LY+L   H + 
Subjt:  TDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQL---HGAQ

Query:  PSTS------HGHSSTLQSNKNLESVFVVSN--VMPSVNVVVSKQVWHRCLGHPAAKMLDLYSIPILCVVLDDFSRFVWLYPLKLKSDTQAAFSHFLNVV
        P+T         H+    SN  L S     +      +NVVVS  VWH+ LGHP+ ++L L                                       
Subjt:  PSTS------HGHSSTLQSNKNLESVFVVSN--VMPSVNVVVSKQVWHRCLGHPAAKMLDLYSIPILCVVLDDFSRFVWLYPLKLKSDTQAAFSHFLNVV

Query:  KTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVET
                   QSD GGE+  +H LC  LGI+ R S PYTSAQNGRAERK+RH+VET
Subjt:  KTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVET

A0A803PEH4 Uncharacterized protein; e value: 4.6e-110; % identity: 39.19
Query:  LNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWL
        LNQ  S+KLDR N+ LWK +   I+R ++L G+L  T  CPPEF+ + +       T+VT                      NP+YE W+  DQLL+GWL
Subjt:  LNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLLLGWL

Query:  YNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQ
        Y+SMT  +AT+VMG  +A      +E L+G  S++K D  R   Q TRKG+  MS+YLR  KN ++ L  +G P    +L + VL GLD EY  +V  ++
Subjt:  YNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVVAIVQ

Query:  GRASVFWFELQAELLVFEKRLE-LQNSEKSAVNFGQNS-TANMVVSKGGNSPKQ--QNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRTTCQVCGK
         R++  W ELQ  LL F+ ++E LQN   ++     +S  ANM      N   +  Q+ N  T   G  +N RG  NR+RGRGRG  +  +R TCQV GK
Subjt:  GRASVFWFELQAELLVFEKRLE-LQNSEKSAVNFGQNS-TANMVVSKGGNSPKQ--QNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRTTCQVCGK

Query:  VGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKL
         GH+A VCY+RFD+ +         + N P ++    Q     + F+     ATPE +    W+ADSG SNH+T +  N+T   DY G E V +GNG+KL
Subjt:  VGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKL

Query:  PITCVGSSSLS--NGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSS---------
         IT +G+  L+  +G ++L L+++L VP+IAKNLVS+SKLA DN+V IE + NFCLV DK T++V+L G LKD LYQL      +SH +           
Subjt:  PITCVGSSSLS--NGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSS---------

Query:  TLQSNKNLE--------------------SVFVVSNVMPSVNVVVSKQVW----------------HRCLGHPAAKMLDLY------SIPILCVV-----
        ++ SN N                      S+ V+++V+ SVNV VSK                    R     A  +LDL         PI   +     
Subjt:  TLQSNKNLE--------------------SVFVVSNVMPSVNVVVSKQVW----------------HRCLGHPAAKMLDLY------SIPILCVV-----

Query:  ---LDDFSRFVWLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
           +DD+SR+ WLYPLKLKSD  AAF  F  +V+ QF   IK+++SD+GGE+     L    GIE +H CP+TS QNGRA+RK+RH VE
Subjt:  ---LDDFSRFVWLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.2e-16; % identity: 22.86
Query:  ESRRTVNPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFL-RQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPI
        +S++    + E W  +D+     +   ++ +V   ++  D A+  W  +E L+  ++   + +L +Q +            +L         L   G  I
Subjt:  ESRRTVNPQYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFL-RQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPI

Query:  SNRNLFSQVLLGLDEEY-NVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR
           +    +L  L   Y N+   I+ G+ ++   ++ + LL+ EK        K   N GQ      ++++G     Q++ NN  GR G           
Subjt:  SNRNLFSQVLLGLDEEY-NVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNR

Query:  YRGRGRGYNNYNNRT-TCQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSN---VFMTAQQTATPETIADPNWYADSGTSNHVT
           RG+  N   +R   C  C + GH    C         P  ++G G  +   +  N   + Q ++   +F+  ++     +  +  W  D+  S+H T
Subjt:  YRGRGRGYNNYNNRT-TCQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSN---VFMTAQQTATPETIADPNWYADSGTSNHVT

Query:  KNYENITNPTDYGGNEFVTI--GNGAKLPITCVGSSSL-SNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKD
           +       Y   +F T+  GN +   I  +G   + +N    L L++V  VP++  NL+  S +A D D +     N    + K +  V+ KG  + 
Subjt:  KNYENITNPTDYGGNEFVTI--GNGAKLPITCVGSSSL-SNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKD

Query:  GLY---------QLHGAQPSTS-------HGHSS----TLQSNKNLESVFVVSNVMPSVNVVVSKQ--VWHRCLGHPAAKMLDLYSIPI-----------
         LY         +L+ AQ   S        GH S     + + K+L S    + V P    +  KQ  V  +        +LDL    +           
Subjt:  GLY---------QLHGAQPSTS-------HGHSS----TLQSNKNLESVFVVSNVMPSVNVVVSKQ--VWHRCLGHPAAKMLDLYSIPI-----------

Query:  ---LCVVLDDFSRFVWLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFV--KVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
               +DD SR +W+Y LK K      F  F  +V+ + G  +K ++SDNGGE+   +  + CS  GI    + P T   NG AER NR +VE
Subjt:  ---LCVVLDDFSRFVWLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFV--KVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 2.6e-49; % identity: 26.44
Query:  LNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLL
        LN  ++ +T  KL   N+L+W      +   Y+L G L  +++ PP  +                              +++   VNP Y  W   D+L+
Subjt:  LNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLL

Query:  LGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVV
           +  +++  V   V     A + WE + +++   S      LR   +Q  KG   + DY++ +    D L   G P+ +     +VL  L EEY  V+
Subjt:  LGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVV

Query:  AIVQGR-ASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRT-----T
          +  +       E+   LL  E ++ L  S  + +      TAN V  +   +    N+ N   R   N N   N   ++     ++  NN++      
Subjt:  AIVQGR-ASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRT-----T

Query:  CQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTI
        CQ+CG  GHSA  C           H   + N   PPS       Q ++N+ + +  ++        NW  DSG ++H+T ++ N++    Y G + V +
Subjt:  CQVCGKVGHSAMVCYHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTI

Query:  GNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQ--LHGAQPSTSHGHSSTLQ
         +G+ +PI+  GS+SLS     L+L N+L VP I KNL+S+ +L   N V +E       V D +T   +L+G  KD LY+  +  +QP +     S+  
Subjt:  GNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQ--LHGAQPSTSHGHSSTLQ

Query:  SNKNLESVFVVSNVMPSV-NVVVSKQVWHRCLGHPAAKML-------------------------------DLYSIPIL--------CVVLDDFSRFVWL
        ++ +  +   + +  PS+ N V+S   +   + +P+ K L                               D++S PIL         + +D F+R+ WL
Subjt:  SNKNLESVFVVSNVMPSV-NVVVSKQVWHRCLGHPAAKML-------------------------------DLYSIPIL--------CVVLDDFSRFVWL

Query:  YPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVET
        YPLK KS  +  F  F N+++ +F + I    SDNGGEFV + +  SQ GI    S P+T   NG +ERK+RH+VET
Subjt:  YPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVET

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 2.0e-41; % identity: 26.25
Query:  LNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLL
        LN  ++ +T  KL   N+L+W      +   Y+L G L  ++  PP  +                              +++   VNP Y  W   D+L+
Subjt:  LNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNPQYEAWVAVDQLL

Query:  LGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVV
           +  +++  V   V     A + WE + +++   S      LR                        D L   G P+ +     +VL  L ++Y  V+
Subjt:  LGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNVVV

Query:  AIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQ--NSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRT---TC
          +  +      +    L    +RL  + S+  A+N  +    TAN+V  +  N+ + QN N    R   NNN R N  +    G   +N   +     C
Subjt:  AIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQ--NSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRT---TC

Query:  QVCGKVGHSAMVC--YHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADP----NWYADSGTSNHVTKNYENITNPTDYGGN
        Q+C   GHSA  C   H+F    +                      QQQS    T  Q      +  P    NW  DSG ++H+T ++ N++    Y G 
Subjt:  QVCGKVGHSAMVC--YHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADP----NWYADSGTSNHVTKNYENITNPTDYGGN

Query:  EFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQ---------LHGAQ
        + V I +G+ +PIT  GS+SL      L L  VL VP I KNL+S+ +L   N V +E       V D +T   +L+G  KD LY+            A 
Subjt:  EFVTIGNGAKLPITCVGSSSLSNGYHVLHLENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQ---------LHGAQ

Query:  PSTSHGHSSTLQSNKNLESVFVVSNV-----MPSVNVVVSKQVWHRCLGHPAAKM-----------------LDLYSIPIL--------CVVLDDFSRFV
        P +   HSS   S     S+ ++++V     +P +N          C  + + K+                  D++S PIL         + +D F+R+ 
Subjt:  PSTSHGHSSTLQSNKNLESVFVVSNV-----MPSVNVVVSKQVWHRCLGHPAAKM-----------------LDLYSIPIL--------CVVLDDFSRFV

Query:  WLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE
        WLYPLK KS  +  F  F ++V+ +F + I  + SDNGGEFV +    SQ GI    S P+T   NG +ERK+RH+VE
Subjt:  WLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVE

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value: 1.4e-05; % identity: 25.25
Query:  WVAVDQLLLGWLYNSMTP-EVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLG
        W   D ++   LY ++TP +     +    +++ W  I+  F     A+   L    +    G+M+++DY R MK  AD+L     P+++RNL   VL G
Subjt:  WVAVDQLLLGWLYNSMTP-EVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLG

Query:  LDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNN
        L+ +++ ++ +++ R     F+  A  ++ E+   L+ + K       +S+++ V++     P      +   +MG+    RGN N +RGRG  ++ YN 
Subjt:  LDEEYNVVVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNN

Query:  RT
         T
Subjt:  RT

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value: 4.7e-06; % identity: 27.23
Query:  WVAVDQLLLGWLYNSMTPEVATQVMGVD-NAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLG
        W   D L+  W+Y ++T  +   ++ V   A++ W ++E LF     A+        + T   ++ + +Y + +K+ +D L    SPIS+R L   +L G
Subjt:  WVAVDQLLLGWLYNSMTPEVATQVMGVD-NAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLG

Query:  LDEEYNVVVAIVQGRASVFWF-ELQAELLVFEKRLELQN-SEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNY
        L E+Y+ ++ +++ ++    F E ++ LL+ E RL  ++ S  S  N    S     V +      Q+ HNN    MG   +++ N       GR YNN 
Subjt:  LDEEYNVVVAIVQGRASVFWF-ELQAELLVFEKRLELQN-SEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNY

Query:  NN
        NN
Subjt:  NN
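In the alignments above, the unlabelled middle line marks identical residues with the residue letter and conservative substitutions with `+`. As a rough sketch, identity and positive counts for one alignment block can be tallied from that line alone. The function name `midline_stats` is a hypothetical helper, and the example uses only the first ten columns of the first GenBank alignment, so its percentage is not expected to match the whole-alignment % identity reported for that hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, float]:
    """Count identities and positives from a BLAST-style match line.

    Assumption: letters in the midline mark identical residues and '+'
    marks conservative substitutions; both strings cover the same columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent_identity = 100.0 * identities / len(query)
    return identities, positives, percent_identity


# First ten columns of the alignment to GAU19483.1 shown above.
query = "SIKLDRGNFL"
midline = "S+KLDR N+ "
identities, positives, pct = midline_stats(query, midline)
print(identities, positives, pct)
```

For a full alignment, the per-block counts would be summed before dividing by the total alignment length, gap columns included.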


Sequences
CDS sequence
ATGGAAAACGTCTATTTCACTAGAATCCCTTATTCTGCCAGCACCGGTACGTTTAGTAGCCCACCTCTGAATCAACTGTTGAACCAAATCACCTCCATCAAGCTTGATCG
AGGAAACTTCCTTCTATGGAAGAATCTAACAATGCCAATACTGAGGAGCTATAAGCTCGAGGGACATTTATTGGCCACTAGTTCATGCCCTCCTGAGTTTCTGCAAATTG
AAGAACCAACCAATTCTTCCGCAACCACAGAAGTGACAGCAGGAGCATCAAACTTAGAAACTGTTGTAAGTGGTGCCTCCTCCTCCGAATCAAGAAGAACCGTAAATCCT
CAATATGAAGCCTGGGTAGCAGTTGATCAATTGTTACTTGGATGGTTGTACAATTCAATGACTCCCGAAGTAGCAACTCAAGTAATGGGAGTCGACAACGCAAAGGAACA
GTGGGAAGCGATTGAAGAATTATTCGGAGTGCAGTCGAGAGCTAAAGAAGATTTTCTTCGACAAACCTTCCAACAAACAAGGAAAGGTAATATGAAAATGTCCGATTACT
TAAGAACTATGAAGAATCATGCTGATAATTTAGGTTTTTCTGGTAGTCCCATCTCAAATAGGAATTTATTTTCACAAGTTTTATTGGGCCTTGATGAGGAATATAATGTT
GTTGTTGCTATAGTGCAAGGAAGAGCTAGTGTCTTTTGGTTTGAATTGCAGGCAGAATTGTTGGTGTTTGAAAAACGATTGGAACTTCAGAACTCTGAGAAAAGTGCTGT
GAATTTTGGGCAAAACTCAACTGCAAACATGGTCGTGAGCAAGGGAGGAAACTCTCCTAAACAACAGAACCACAATAATATGACAGGAAGAATGGGATTCAATAACAACC
AACGAGGGAATGGAAATCGCTACAGAGGCAGGGGCCGAGGTTACAATAATTATAACAATAGAACAACGTGTCAAGTCTGTGGCAAGGTAGGTCATTCTGCTATGGTATGT
TATCATAGGTTTGACAAAGAATTTTCACCTGTTCATAAAAGGGGCAATGGCAATGGAAATTATCCTCCAAGTCGTGGGAATGGTCAGCAACTGCAACAACAATCAAATGT
CTTCATGACTGCTCAACAAACTGCCACACCAGAAACAATAGCTGACCCAAATTGGTATGCAGACAGTGGAACATCCAATCATGTCACCAAAAATTATGAAAACATCACTA
ATCCCACTGACTATGGAGGTAATGAGTTTGTAACTATAGGAAATGGTGCTAAACTGCCTATAACCTGTGTTGGATCATCTAGTTTGAGTAATGGATATCATGTTCTGCAT
TTAGAGAATGTTCTGTGTGTGCCTGAAATAGCAAAGAACTTAGTGAGCATGTCCAAGTTAGCCGAAGATAATGATGTATTTATTGAACTCCATGGAAATTTTTGTCTTGT
TATGGACAAGAGTACGAGACGTGTGGTGCTGAAAGGAACACTTAAGGACGGACTTTATCAACTCCATGGTGCTCAACCAAGTACTTCTCATGGTCACAGTTCAACGTTAC
AGTCAAATAAAAATCTAGAATCTGTTTTTGTTGTTTCAAATGTAATGCCCAGTGTTAATGTTGTGGTTTCTAAGCAAGTTTGGCATAGATGCTTGGGTCATCCAGCAGCC
AAGATGTTAGATTTATATTCAATACCTATACTATGTGTTGTTTTGGATGACTTTAGTCGTTTTGTGTGGCTATATCCCTTGAAGTTGAAAAGTGATACACAAGCAGCCTT
CAGTCACTTTCTGAATGTAGTAAAAACTCAGTTTGGTAGTATGATTAAAGCAGTTCAATCTGATAATGGTGGGGAATTTGTCAAAGTTCATAAGCTTTGCTCCCAGTTAG
GAATTGAATCTCGCCACTCTTGTCCTTACACTTCAGCACAAAATGGGCGAGCAGAGAGAAAAAACAGACACGTAGTTGAAACATGA
mRNA sequence
ATGGAAAACGTCTATTTCACTAGAATCCCTTATTCTGCCAGCACCGGTACGTTTAGTAGCCCACCTCTGAATCAACTGTTGAACCAAATCACCTCCATCAAGCTTGATCG
AGGAAACTTCCTTCTATGGAAGAATCTAACAATGCCAATACTGAGGAGCTATAAGCTCGAGGGACATTTATTGGCCACTAGTTCATGCCCTCCTGAGTTTCTGCAAATTG
AAGAACCAACCAATTCTTCCGCAACCACAGAAGTGACAGCAGGAGCATCAAACTTAGAAACTGTTGTAAGTGGTGCCTCCTCCTCCGAATCAAGAAGAACCGTAAATCCT
CAATATGAAGCCTGGGTAGCAGTTGATCAATTGTTACTTGGATGGTTGTACAATTCAATGACTCCCGAAGTAGCAACTCAAGTAATGGGAGTCGACAACGCAAAGGAACA
GTGGGAAGCGATTGAAGAATTATTCGGAGTGCAGTCGAGAGCTAAAGAAGATTTTCTTCGACAAACCTTCCAACAAACAAGGAAAGGTAATATGAAAATGTCCGATTACT
TAAGAACTATGAAGAATCATGCTGATAATTTAGGTTTTTCTGGTAGTCCCATCTCAAATAGGAATTTATTTTCACAAGTTTTATTGGGCCTTGATGAGGAATATAATGTT
GTTGTTGCTATAGTGCAAGGAAGAGCTAGTGTCTTTTGGTTTGAATTGCAGGCAGAATTGTTGGTGTTTGAAAAACGATTGGAACTTCAGAACTCTGAGAAAAGTGCTGT
GAATTTTGGGCAAAACTCAACTGCAAACATGGTCGTGAGCAAGGGAGGAAACTCTCCTAAACAACAGAACCACAATAATATGACAGGAAGAATGGGATTCAATAACAACC
AACGAGGGAATGGAAATCGCTACAGAGGCAGGGGCCGAGGTTACAATAATTATAACAATAGAACAACGTGTCAAGTCTGTGGCAAGGTAGGTCATTCTGCTATGGTATGT
TATCATAGGTTTGACAAAGAATTTTCACCTGTTCATAAAAGGGGCAATGGCAATGGAAATTATCCTCCAAGTCGTGGGAATGGTCAGCAACTGCAACAACAATCAAATGT
CTTCATGACTGCTCAACAAACTGCCACACCAGAAACAATAGCTGACCCAAATTGGTATGCAGACAGTGGAACATCCAATCATGTCACCAAAAATTATGAAAACATCACTA
ATCCCACTGACTATGGAGGTAATGAGTTTGTAACTATAGGAAATGGTGCTAAACTGCCTATAACCTGTGTTGGATCATCTAGTTTGAGTAATGGATATCATGTTCTGCAT
TTAGAGAATGTTCTGTGTGTGCCTGAAATAGCAAAGAACTTAGTGAGCATGTCCAAGTTAGCCGAAGATAATGATGTATTTATTGAACTCCATGGAAATTTTTGTCTTGT
TATGGACAAGAGTACGAGACGTGTGGTGCTGAAAGGAACACTTAAGGACGGACTTTATCAACTCCATGGTGCTCAACCAAGTACTTCTCATGGTCACAGTTCAACGTTAC
AGTCAAATAAAAATCTAGAATCTGTTTTTGTTGTTTCAAATGTAATGCCCAGTGTTAATGTTGTGGTTTCTAAGCAAGTTTGGCATAGATGCTTGGGTCATCCAGCAGCC
AAGATGTTAGATTTATATTCAATACCTATACTATGTGTTGTTTTGGATGACTTTAGTCGTTTTGTGTGGCTATATCCCTTGAAGTTGAAAAGTGATACACAAGCAGCCTT
CAGTCACTTTCTGAATGTAGTAAAAACTCAGTTTGGTAGTATGATTAAAGCAGTTCAATCTGATAATGGTGGGGAATTTGTCAAAGTTCATAAGCTTTGCTCCCAGTTAG
GAATTGAATCTCGCCACTCTTGTCCTTACACTTCAGCACAAAATGGGCGAGCAGAGAGAAAAAACAGACACGTAGTTGAAACATGA
Protein sequence
MENVYFTRIPYSASTGTFSSPPLNQLLNQITSIKLDRGNFLLWKNLTMPILRSYKLEGHLLATSSCPPEFLQIEEPTNSSATTEVTAGASNLETVVSGASSSESRRTVNP
QYEAWVAVDQLLLGWLYNSMTPEVATQVMGVDNAKEQWEAIEELFGVQSRAKEDFLRQTFQQTRKGNMKMSDYLRTMKNHADNLGFSGSPISNRNLFSQVLLGLDEEYNV
VVAIVQGRASVFWFELQAELLVFEKRLELQNSEKSAVNFGQNSTANMVVSKGGNSPKQQNHNNMTGRMGFNNNQRGNGNRYRGRGRGYNNYNNRTTCQVCGKVGHSAMVC
YHRFDKEFSPVHKRGNGNGNYPPSRGNGQQLQQQSNVFMTAQQTATPETIADPNWYADSGTSNHVTKNYENITNPTDYGGNEFVTIGNGAKLPITCVGSSSLSNGYHVLH
LENVLCVPEIAKNLVSMSKLAEDNDVFIELHGNFCLVMDKSTRRVVLKGTLKDGLYQLHGAQPSTSHGHSSTLQSNKNLESVFVVSNVMPSVNVVVSKQVWHRCLGHPAA
KMLDLYSIPILCVVLDDFSRFVWLYPLKLKSDTQAAFSHFLNVVKTQFGSMIKAVQSDNGGEFVKVHKLCSQLGIESRHSCPYTSAQNGRAERKNRHVVET
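The CDS and protein sequences above should be related by translation. A minimal sketch for checking this is shown below, assuming the standard nuclear genetic code, which the page does not state. The function name `translate` and the compact codon-table construction are illustrative choices, not part of CuGenDB. The example uses only the first and last few codons of the CDS.

```python
from itertools import product

# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS shown above.
print(translate("ATGGAAAACGTCTATTTCACT"))
print(translate("GTTGAAACATGA"))
```

Joining the CDS lines into one string and passing it to `translate` would allow a full comparison against the protein sequence listed on the page.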