; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005510 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005510
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:19882444..19883778
RNA-Seq ExpressionLag0005510
SyntenyLag0005510
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]8.2e-3934.71Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ
        MK+ VD L  AG+PVST  LI Q L GLD E+NPV+  +  +  ++W ++QA+LL +E R+E  N   +   L+   + N+AN  D+   K+  NN+   
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ

Query:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQPSAFMA-NQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNLTNPVDYEGKERVT
        +  G    RG G  G++  +  G  N+    C      +   S   A +    S+N F+AS  +V D +WY DSGASNHVT       +  ++ GK  + 
Subjt:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQPSAFMA-NQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNLTNPVDYEGKERVT

Query:  VGDGNQLQITCVGSS---SLNAGRQSYGQANAERVLSEG--------LYRFDDGRTAAVEKSKSVICSSGTLSRKNNVDLSVLTLSSSVNVVVSKVTWHR
        VG+G +L I   GSS   SLN     Y     + +LS          L  FD+      +K    +   G L +     LS    + S  V V K +WHR
Subjt:  VGDGNQLQITCVGSS---SLNAGRQSYGQANAERVLSEG--------LYRFDDGRTAAVEKSKSVICSSGTLSRKNNVDLSVLTLSSSVNVVVSKVTWHR

Query:  RVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG
        R+GHP+ KV D +++ C +    ++  +FCEA Q+GK H LPF  S S A    EL+H D+WG
Subjt:  RVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]9.4e-4337.24Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ
        MK   D+L  AGS VST  L++Q L GLD E+NP++  +  +  +TW EMQA+LL YE RLE  N  +S+L L+  PS N++ +  N   K+        
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ

Query:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQ---------------FN-----LHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVT
         G GRG Q   G RG   GRG G     +  CQ               FN      ++  Q S     QNYN +N +VAS  TV D +WY DSGASNHVT
Subjt:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQ---------------FN-----LHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVT

Query:  VDYTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQS-------YGQANAERVLSEGLYRFDDG------RTAAVEKSKSVICSSGTLSRKNNVD
         D   +    + +GK  +TVG+G  L+I   G SSL+  ++S       Y     + +LS     FD+         A   K K     +G +  +  + 
Subjt:  VDYTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQS-------YGQANAERVLSEGLYRFDDG------RTAAVEKSKSVICSSGTLSRKNNVD

Query:  LSVLTL---SSSVN----VVVS-KVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG
          +  L   S+S N    V  S K TWHR++GHP+ KV + ++K CN+     E   FCEA QFGKAH LPF +SVS A    +L+H+D+WG
Subjt:  LSVLTL---SSSVN----VVVS-KVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]2.1e-4233.6Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDN-NNQKNRGNNYNR
        MK+  D L  AG+P+ST  LI Q L GLD E+NPV+  +  +  ++W ++QA+LL +E R+E  N+  +   L+   + N+A   D+  N+ N  NN+  
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDN-NNQKNRGNNYNR

Query:  QSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQF-NLHNSVQPSAFMA-------------NQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTN
         +   RG+   G   GR +GR +      K TCQ   L N +    F               N    S+N F+AS  ++ D +WY DSGASNHVT     
Subjt:  QSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQF-NLHNSVQPSAFMA-------------NQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTN

Query:  LTNPVDYEGKERVTVGDGNQLQITCVGSS---SLNAGRQSYGQANAERVLSEGLYRFDDGRTAAVEKS----KSVICSSGTLSRKNNVDLSVLTLSSSVN
          N  ++ GK  + VG+G +L+I   GSS   SLN     Y     + +LS      D+      +++    K  +     L       L  L+   S  
Subjt:  LTNPVDYEGKERVTVGDGNQLQITCVGSS---SLNAGRQSYGQANAERVLSEGLYRFDDGRTAAVEKS----KSVICSSGTLSRKNNVDLSVLTLSSSVN

Query:  VVVSKVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWGQHQLI
         V  K +WHR++GHP+ KV D+++K CN+    +++ +FCEA Q+GK H LPF  S S A  + EL+H D+WG   +I
Subjt:  VVVSKVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWGQHQLI

PNX81106.1 glutamate receptor [Trifolium pratense]5.7e-4033.6Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ
        MK   D L  AGS +S   L+ Q L GLD ++NPV+  +  +  ++W E+QA+LL +E RLE  N   +   L+   + N+A     N  K  GN +N  
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ

Query:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQPSAF--------------MANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNL
            RGN RG   RG   GRG G F+N KP CQ    +    + F                N+   ++N FVAS     D  WY DSGASNHVT      
Subjt:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQPSAF--------------MANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNL

Query:  TNPVDYEGKERVTVGDGNQLQITCVGSSSLNA---GRQSYGQANAERVLS--------EGLYRFDDGRTAAVEKSKSVICSSGTLSRKNNVDLSVLTLSS
         +  ++ GK  + VG+G +L+I   GSS +N        Y     + +LS          +  FD       +K    +   G L +     LS  T ++
Subjt:  TNPVDYEGKERVTVGDGNQLQITCVGSSSLNA---GRQSYGQANAERVLS--------EGLYRFDDGRTAAVEKSKSVICSSGTLSRKNNVDLSVLTLSS

Query:  SVNVVVSKV--TWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG
          + V   V  +WHR++GHP+ K  D ++K CN+    +++  FCEA Q GK+H LPF  S S A    EL+H D+WG
Subjt:  SVNVVVSKV--TWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]1.0e-4139.05Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLML--SQGPSVN-------MANVKDNNNQK
        MKSH DNL  AGS VS R L+SQVL GLDEE+NP++  +QG+  ++WSEM AELL YEKRLE QN+ KS + +  +Q PSVN         N + NN   
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLML--SQGPSVN-------MANVKDNNNQK

Query:  NRGNNYNRQSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQ-PSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNLTNPV
        + G+N +R    G G QRG  G+ R++GRG       +PT   N   S   P+ F A+    +    V + ETV+DP+WYADSGA++HVT +  N+   V
Subjt:  NRGNNYNRQSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQ-PSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNLTNPV

Query:  DYEGKERVTVGDGNQLQITCVGSSSLNAGRQSY-------------------GQANAERVLSEGLYRFDDGR---------TAAVEKSKSVICSSGTLSR
        DY G E V V +GN+L I+ +GS++++A   S                    G+   +  L + LYR D            TA +     V  S+ TLS 
Subjt:  DYEGKERVTVGDGNQLQITCVGSSSLNAGRQSY-------------------GQANAERVLSEGLYRFDDGR---------TAAVEKSKSVICSSGTLSR

Query:  KNNVDLSVLTLSSSVNVVVSKVTWHRRVGHPSEKVFDL
        +          +  +NVVVS   WH+R+GHPS +V  L
Subjt:  KNNVDLSVLTLSSSVNVVVSKVTWHRRVGHPSEKVFDL

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-944.6e-4337.24Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ
        MK   D+L  AGS VST  L++Q L GLD E+NP++  +  +  +TW EMQA+LL YE RLE  N  +S+L L+  PS N++ +  N   K+        
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ

Query:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQ---------------FN-----LHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVT
         G GRG Q   G RG   GRG G     +  CQ               FN      ++  Q S     QNYN +N +VAS  TV D +WY DSGASNHVT
Subjt:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQ---------------FN-----LHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVT

Query:  VDYTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQS-------YGQANAERVLSEGLYRFDDG------RTAAVEKSKSVICSSGTLSRKNNVD
         D   +    + +GK  +TVG+G  L+I   G SSL+  ++S       Y     + +LS     FD+         A   K K     +G +  +  + 
Subjt:  VDYTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQS-------YGQANAERVLSEGLYRFDDG------RTAAVEKSKSVICSSGTLSRKNNVD

Query:  LSVLTL---SSSVN----VVVS-KVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG
          +  L   S+S N    V  S K TWHR++GHP+ KV + ++K CN+     E   FCEA QFGKAH LPF +SVS A    +L+H+D+WG
Subjt:  LSVLTL---SSSVN----VVVS-KVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)1.0e-4233.6Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDN-NNQKNRGNNYNR
        MK+  D L  AG+P+ST  LI Q L GLD E+NPV+  +  +  ++W ++QA+LL +E R+E  N+  +   L+   + N+A   D+  N+ N  NN+  
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDN-NNQKNRGNNYNR

Query:  QSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQF-NLHNSVQPSAFMA-------------NQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTN
         +   RG+   G   GR +GR +      K TCQ   L N +    F               N    S+N F+AS  ++ D +WY DSGASNHVT     
Subjt:  QSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQF-NLHNSVQPSAFMA-------------NQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTN

Query:  LTNPVDYEGKERVTVGDGNQLQITCVGSS---SLNAGRQSYGQANAERVLSEGLYRFDDGRTAAVEKS----KSVICSSGTLSRKNNVDLSVLTLSSSVN
          N  ++ GK  + VG+G +L+I   GSS   SLN     Y     + +LS      D+      +++    K  +     L       L  L+   S  
Subjt:  LTNPVDYEGKERVTVGDGNQLQITCVGSS---SLNAGRQSYGQANAERVLSEGLYRFDDGRTAAVEKS----KSVICSSGTLSRKNNVDLSVLTLSSSVN

Query:  VVVSKVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWGQHQLI
         V  K +WHR++GHP+ KV D+++K CN+    +++ +FCEA Q+GK H LPF  S S A  + EL+H D+WG   +I
Subjt:  VVVSKVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWGQHQLI

A0A2K3LRE0 Glutamate receptor2.8e-4033.6Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ
        MK   D L  AGS +S   L+ Q L GLD ++NPV+  +  +  ++W E+QA+LL +E RLE  N   +   L+   + N+A     N  K  GN +N  
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQ

Query:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQPSAF--------------MANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNL
            RGN RG   RG   GRG G F+N KP CQ    +    + F                N+   ++N FVAS     D  WY DSGASNHVT      
Subjt:  SGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQPSAF--------------MANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNL

Query:  TNPVDYEGKERVTVGDGNQLQITCVGSSSLNA---GRQSYGQANAERVLS--------EGLYRFDDGRTAAVEKSKSVICSSGTLSRKNNVDLSVLTLSS
         +  ++ GK  + VG+G +L+I   GSS +N        Y     + +LS          +  FD       +K    +   G L +     LS  T ++
Subjt:  TNPVDYEGKERVTVGDGNQLQITCVGSSSLNA---GRQSYGQANAERVLS--------EGLYRFDDGRTAAVEKSKSVICSSGTLSRKNNVDLSVLTLSS

Query:  SVNVVVSKV--TWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG
          + V   V  +WHR++GHP+ K  D ++K CN+    +++  FCEA Q GK+H LPF  S S A    EL+H D+WG
Subjt:  SVNVVVSKV--TWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG

A0A6J1DCW4 uncharacterized protein LOC1110195985.0e-4239.05Show/hide
Query:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLML--SQGPSVN-------MANVKDNNNQK
        MKSH DNL  AGS VS R L+SQVL GLDEE+NP++  +QG+  ++WSEM AELL YEKRLE QN+ KS + +  +Q PSVN         N + NN   
Subjt:  MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLML--SQGPSVN-------MANVKDNNNQK

Query:  NRGNNYNRQSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQ-PSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNLTNPV
        + G+N +R    G G QRG  G+ R++GRG       +PT   N   S   P+ F A+    +    V + ETV+DP+WYADSGA++HVT +  N+   V
Subjt:  NRGNNYNRQSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQ-PSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNLTNPV

Query:  DYEGKERVTVGDGNQLQITCVGSSSLNAGRQSY-------------------GQANAERVLSEGLYRFDDGR---------TAAVEKSKSVICSSGTLSR
        DY G E V V +GN+L I+ +GS++++A   S                    G+   +  L + LYR D            TA +     V  S+ TLS 
Subjt:  DYEGKERVTVGDGNQLQITCVGSSSLNAGRQSY-------------------GQANAERVLSEGLYRFDDGR---------TAAVEKSKSVICSSGTLSR

Query:  KNNVDLSVLTLSSSVNVVVSKVTWHRRVGHPSEKVFDL
        +          +  +NVVVS   WH+R+GHPS +V  L
Subjt:  KNNVDLSVLTLSSSVNVVVSKVTWHRRVGHPSEKVFDL

A0A803PEH4 Uncharacterized protein1.6e-4033.57Show/hide
Query:  KSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLE-LQN-TFKSSLMLSQGPSVNMANVKDNNNQKNRG---NN
        K+  + L  AG P     L++ VL GLD E+  ++  I+ R   TW E+Q  LL ++ ++E LQN T  S+   S  P  NMA  K NNN + RG    N
Subjt:  KSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLE-LQN-TFKSSLMLSQGPSVNMANVKDNNNQKNRG---NN

Query:  YNRQSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQF-------------------------NLHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYA
         +  SG    N RG   R R +GRG GS    +PTCQ                          N HN  +     A Q  N+++ FVA+ E +    W+A
Subjt:  YNRQSGCGRGNQRGGGGRGRSKGRGYGSFNNGKPTCQF-------------------------NLHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYA

Query:  DSGASNHVTVDYTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQSY--------------------------------------------GQAN
        DSGASNH+T D  NLT   DY GKE V VG+G++L+IT +G+  LN    +Y                                             +  
Subjt:  DSGASNHVTVDYTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQSY--------------------------------------------GQAN

Query:  AERVLSEGLYRFDDGRTAAVEK-SKSVICSSGTLSRKNNVDLSVLTLSSSVNVVVSKV-TWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGK
           VL + LY+ D   T +     +S   S+ T+S  +NV+      S + ++++S++   HRR+GHPS KV + +++  N+    N   T C+A Q+GK
Subjt:  AERVLSEGLYRFDDGRTAAVEK-SKSVICSSGTLSRKNNVDLSVLTLSSSVNVVVSKV-TWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGK

Query:  AHALPFPHSVSRASSVFELIHADLWG
        AHALPF  S +RA SV +LIH DLWG
Subjt:  AHALPFPHSVSRASSVFELIHADLWG

SwissProt top hitse value%identityAlignment
P93293 Uncharacterized mitochondrial protein AtMg003006.2e-0531.82Show/hide
Query:  WHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG
        WH R+ H S++  +L+VK+  L         FCE   +GK H + F        +  + +H+DLWG
Subjt:  WHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-1225.19Show/hide
Query:  DNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRG-GITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQSGCG
        D L   G P+     + +VL  L EE+ PVI  I  +    T +E+   LL +E ++   ++     + +   S       +NNN  NR N Y+      
Subjt:  DNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRG-GITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQSGCG

Query:  RGNQRGGGGRGRSKGRGYGSFNNGKP---TCQF-------------------NLHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVD
        R N        +S    + + N  KP    CQ                    ++++   PS F   Q     NL + S  +    NW  DSGA++H+T D
Subjt:  RGNQRGGGGRGRSKGRGYGSFNNGKP---TCQF-------------------NLHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVD

Query:  YTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQSYGQANAERVLS-----EGLYRFDDGRTAAVE------KSKSVICSSGTLSRKNNVDLSVL
        + NL+    Y G + V V DG+ + I+  GS+SL+   +     N   V +       +YR  +    +VE      + K +      L  K   +L   
Subjt:  YTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGRQSYGQANAERVLS-----EGLYRFDDGRTAAVE------KSKSVICSSGTLSRKNNVDLSVL

Query:  TLSSSVNVVV-----SKVT---WHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTF--CEAFQFGKAHALPFPHSVSRASSVFELIHADLW
         ++SS  V +     SK T   WH R+GHP+  + + ++   +L   LN    F  C      K++ +PF  S   ++   E I++D+W
Subjt:  TLSSSVNVVV-----SKVT---WHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTF--CEAFQFGKAHALPFPHSVSRASSVFELIHADLW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.6e-1425.46Show/hide
Query:  DNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRG-GITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQSGCG
        D L   G P+     + +VL  L +++ PVI  I  +    + +E+   L+  E +L   N+  + ++      V   N   N NQ NRG+N N  +   
Subjt:  DNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRG-GITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQSGCG

Query:  RGNQRGGGGRG-RSKGRGYGSFNNGKPTCQFNLHNSVQ-------PSAFMANQNYNSYNLFVASSETVVDP-----NWYADSGASNHVTVDYTNLTNPVD
        R N       G RS  R    +      C    H++ +        S     Q+ + +  +   +   V+      NW  DSGA++H+T D+ NL+    
Subjt:  RGNQRGGGGRG-RSKGRGYGSFNNGKPTCQFNLHNSVQ-------PSAFMANQNYNSYNLFVASSETVVDP-----NWYADSGASNHVTVDYTNLTNPVD

Query:  YEGKERVTVGDGNQLQITCVGSSSLNAGRQS-------YGQANAERVLSEGLYRFDDGRTAAVE------KSKSVICSSGTLSRKNNVDLSVLTLSSSVN
        Y G + V + DG+ + IT  GS+SL    +S       Y     + ++S  +YR  +    +VE      + K +      L  K   +L    ++SS  
Subjt:  YEGKERVTVGDGNQLQITCVGSSSLNAGRQS-------YGQANAERVLSEGLYRFDDGRTAAVE------KSKSVICSSGTLSRKNNVDLSVLTLSSSVN

Query:  VVV-----SKVT---WHRRVGHPSEKVFDLIVKQCNLP-YRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLW
        V +     SK T   WH R+GHPS  + + ++   +LP    + K   C      K+H +PF +S   +S   E I++D+W
Subjt:  VVV-----SKVT---WHRRVGHPSEKVFDLIVKQCNLP-YRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLW

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein4.4e-0631.82Show/hide
Query:  WHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG
        WH R+ H S++  +L+VK+  L         FCE   +GK H + F        +  + +H+DLWG
Subjt:  WHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALPFPHSVSRASSVFELIHADLWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGCCACGTAGATAACTTGGGTCAAGCTGGCAGTCCGGTCAGCACACGATCGTTGATTTCACAAGTCCTGTTGGGTCTTGATGAAGAATTTAACCCTGTCATTGC
TATGATACAAGGGAGAGGAGGAATCACCTGGTCTGAGATGCAGGCCGAATTACTCGTATATGAAAAGCGACTTGAGCTTCAGAACACCTTCAAAAGCTCACTAATGCTGA
GTCAGGGACCTTCGGTGAACATGGCAAATGTCAAAGACAACAACAATCAGAAGAATCGAGGTAACAACTACAACAGACAGAGTGGATGTGGTAGAGGTAATCAAAGAGGA
GGCGGAGGAAGAGGTCGCAGCAAAGGTCGTGGCTATGGCTCTTTCAATAATGGTAAACCGACTTGCCAGTTCAACCTTCATAATTCAGTTCAACCTTCAGCATTTATGGC
AAACCAGAATTATAACTCCTATAATCTGTTTGTTGCATCCTCTGAGACCGTAGTTGACCCCAATTGGTATGCCGATAGTGGGGCATCCAATCATGTAACAGTAGACTATA
CCAACCTGACAAATCCAGTCGATTATGAAGGTAAGGAAAGGGTAACAGTTGGTGACGGTAATCAACTTCAAATTACTTGTGTTGGTAGCTCTAGTTTGAATGCTGGAAGA
CAAAGCTATGGGCAAGCAAATGCTGAAAGGGTACTCAGTGAAGGCTTATATCGTTTTGATGATGGAAGGACTGCTGCAGTTGAAAAGTCTAAGTCAGTAATCTGTAGCAG
TGGAACTCTCTCTCGAAAGAATAATGTTGATCTATCTGTTCTTACTTTATCTAGTTCAGTAAATGTTGTTGTGTCCAAGGTCACGTGGCATAGACGTGTAGGACATCCCT
CTGAGAAAGTTTTTGATCTGATTGTGAAACAATGTAATCTTCCTTATAGATTAAATGAGAAGTCTACCTTTTGTGAAGCTTTTCAGTTTGGTAAAGCTCATGCATTACCC
TTTCCACACTCTGTCTCACGAGCATCTAGTGTATTTGAACTAATTCATGCTGATCTTTGGGGCCAGCACCAATTAATTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGCCACGTAGATAACTTGGGTCAAGCTGGCAGTCCGGTCAGCACACGATCGTTGATTTCACAAGTCCTGTTGGGTCTTGATGAAGAATTTAACCCTGTCATTGC
TATGATACAAGGGAGAGGAGGAATCACCTGGTCTGAGATGCAGGCCGAATTACTCGTATATGAAAAGCGACTTGAGCTTCAGAACACCTTCAAAAGCTCACTAATGCTGA
GTCAGGGACCTTCGGTGAACATGGCAAATGTCAAAGACAACAACAATCAGAAGAATCGAGGTAACAACTACAACAGACAGAGTGGATGTGGTAGAGGTAATCAAAGAGGA
GGCGGAGGAAGAGGTCGCAGCAAAGGTCGTGGCTATGGCTCTTTCAATAATGGTAAACCGACTTGCCAGTTCAACCTTCATAATTCAGTTCAACCTTCAGCATTTATGGC
AAACCAGAATTATAACTCCTATAATCTGTTTGTTGCATCCTCTGAGACCGTAGTTGACCCCAATTGGTATGCCGATAGTGGGGCATCCAATCATGTAACAGTAGACTATA
CCAACCTGACAAATCCAGTCGATTATGAAGGTAAGGAAAGGGTAACAGTTGGTGACGGTAATCAACTTCAAATTACTTGTGTTGGTAGCTCTAGTTTGAATGCTGGAAGA
CAAAGCTATGGGCAAGCAAATGCTGAAAGGGTACTCAGTGAAGGCTTATATCGTTTTGATGATGGAAGGACTGCTGCAGTTGAAAAGTCTAAGTCAGTAATCTGTAGCAG
TGGAACTCTCTCTCGAAAGAATAATGTTGATCTATCTGTTCTTACTTTATCTAGTTCAGTAAATGTTGTTGTGTCCAAGGTCACGTGGCATAGACGTGTAGGACATCCCT
CTGAGAAAGTTTTTGATCTGATTGTGAAACAATGTAATCTTCCTTATAGATTAAATGAGAAGTCTACCTTTTGTGAAGCTTTTCAGTTTGGTAAAGCTCATGCATTACCC
TTTCCACACTCTGTCTCACGAGCATCTAGTGTATTTGAACTAATTCATGCTGATCTTTGGGGCCAGCACCAATTAATTATGTAA
Protein sequenceShow/hide protein sequence
MKSHVDNLGQAGSPVSTRSLISQVLLGLDEEFNPVIAMIQGRGGITWSEMQAELLVYEKRLELQNTFKSSLMLSQGPSVNMANVKDNNNQKNRGNNYNRQSGCGRGNQRG
GGGRGRSKGRGYGSFNNGKPTCQFNLHNSVQPSAFMANQNYNSYNLFVASSETVVDPNWYADSGASNHVTVDYTNLTNPVDYEGKERVTVGDGNQLQITCVGSSSLNAGR
QSYGQANAERVLSEGLYRFDDGRTAAVEKSKSVICSSGTLSRKNNVDLSVLTLSSSVNVVVSKVTWHRRVGHPSEKVFDLIVKQCNLPYRLNEKSTFCEAFQFGKAHALP
FPHSVSRASSVFELIHADLWGQHQLIM