; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008699 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008699
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:28336229..28337764
RNA-Seq ExpressionLag0008699
SyntenyLag0008699
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]1.7e-10152.26Show/hide
Query:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ
        FS+PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHL+GE PCP  ++ +A++   + TE  A +++    G SS      VN  +E W+  + 
Subjt:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ

Query:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP
        LLLGWLYNSM+P+VA Q+MGF   +DLW A QD FGVQSRAEED+LRQ+ Q  RKG+ KM EYL VMK++ DNLGQ GSPV  R L+SQVLLGLDE YN 
Subjt:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP

Query:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAV---SFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFN
        V+ +IQG+  I+W +MQ++LL+FEK L+ QNTQK      + + +  +NMA    + GQR  +    Y YN    S  RGN                   
Subjt:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAV---SFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFN

Query:  NNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEG
        NN P CQ+CGK GH AL CY RFNKEFS P  Q+R E+      S P P  FV++QN  PF A+PDTVVDP+WY+DSGA+NHVT E +++ NPT+Y G
Subjt:  NNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEG

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.9e-9250Show/hide
Query:  VATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMIQGRSGITW
        +A Q+MGF  ++DLW A QDLFGVQSRAEED+LRQ+FQ  RK      +YLR+MK+++D LGQAGSPV  R  +SQ LLGLDE YNPV+A+IQG+  I+W
Subjt:  VATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMIQGRSGITW

Query:  SEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGG-QRGQNYNYNNSQGSYGRGNQRG-NSGRSRGRGRGYDNFNNNKPICQVCGKPGHMALT
         +MQ+ELL FEKRLE Q+TQK+  +   N +VN+A N+N    ++  N+ ++ +  +  +G + G N GR RG+GRG      NKP CQVC K GH AL 
Subjt:  SEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGG-QRGQNYNYNNSQGSYGRGNQRG-NSGRSRGRGRGYDNFNNNKPICQVCGKPGHMALT

Query:  CYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDGNKLKILHVGNS
        CY RFNKEF  P  Q+RG         +      V  Q+ N F A+ DTV++ +WY+DSGA+NH+T EY++++NP++Y G E + VG+G+ L I ++GN+
Subjt:  CYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDGNKLKILHVGNS

Query:  CLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAK
         L+DG+N L+L++VLCVP I KNLVSVSKLAQDN++++EFH  +C +KDK T + LL   + +G+Y  +  +
Subjt:  CLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAK

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]7.2e-11349.15Show/hide
Query:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ
        FS+PPLNQ+LNQ+T++KLDR N+LLWK LALPIL+ YKLEGHL+ E PCP  ++ +A++   + TE  A +++    G SS      VNP +E W+  + 
Subjt:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ

Query:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP
        LLLGWLYNSM+P+VA Q+MGF   +DLW A QD FGVQSRAEED+LRQ+ Q  RK                                     GLDE YN 
Subjt:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP

Query:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQ-KSAVSFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNN
        V+ +IQG+  I+W +MQ++LL+FEKRL+ QNTQ K+  + + +  +NMA    + GQR Q+    Y YN    S  RGN                   NN
Subjt:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQ-KSAVSFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNN

Query:  KPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECV
         P CQ+CGK GH AL CY RFNKEFS P  QNR E+      S P P  FV++QN  PF A+PDTVVDP+WY+DSGA+NHVT E +++ NPT+Y G E V
Subjt:  KPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECV

Query:  TVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQ
        TVG+GN+L I +VGN+CL+DG   L L+++LCVP IAKNL+SVSKLAQDN +++EFH   C +KDK T +
Subjt:  TVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQ

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]2.6e-10247.54Show/hide
Query:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ
        F+SPPLNQLLNQITSIK+DRGNFLLW+NLALPILRSYKL  +L+G+KPCPP +L               + + T + G +S   + T+NP YEAWIVV++
Subjt:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ

Query:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP
        LLLGWLYNSM+ +VA QVMGF TS++LW AVQ+LFGVQSRAE DYL+Q+FQQ  KGSL+M EYL++MKSHADNL  AGS V+ R+LVSQVL GLDEEYNP
Subjt:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP

Query:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTL--VNMASNQNMGGQRGQNYNYNNSQGS---YGRGNQRGNSG-RSRGRGRGYDNFNN
        +V  +QG+  ++WSEM AELL +EKRLE QN+ KS +  +      VN    ++    +  N N NNS GS    G G QRG+ G R+RGRG       N
Subjt:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTL--VNMASNQNMGGQRGQNYNYNNSQGS---YGRGNQRGNSG-RSRGRGRGYDNFNN

Query:  NKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTEC
          P                                        SN  PN F A+ + +  V +P+TV+DPSWY DSGA++HVTA  N++    DY GTE 
Subjt:  NKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTEC

Query:  VTVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENA
        V V +GNKL I H+G++ +      L L+ VL VP IAKNL                        DK + + LLKG L + +YR + +
Subjt:  VTVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENA

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]8.9e-8750.67Show/hide
Query:  ATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQLLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKG
        +T  G +    + SS     SG SS   A  VNPQYE+W+ V+QLLLGWLYNSM+PEVA QVMG E ++DLW ++  LFGVQSR EEDYLR +FQ  RKG
Subjt:  ATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQLLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKG

Query:  SLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSH--NTLVNMASNQNMG
        +LKM EYL+ MK + DNL QAGSP+  R LVSQVLLGLDEEYN +VAMIQGR  ++W +MQ+ELL++E+RLE Q+ QK+ V F+   N  VNM + +++ 
Subjt:  SLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSH--NTLVNMASNQNMG

Query:  GQRGQNYNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGE--NGRQPMQSNPPPNAFVASQNNN
            QN   N+S  S G G QRG  G  RGRGRG    NN KP+CQVCGK GH+A  C+ R++++F     QN+ E     Q   + P P A   +  +N
Subjt:  GQRGQNYNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGE--NGRQPMQSNPPPNAFVASQNNN

Query:  PFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDGNKLKILHVGNSCLSDGLNKLSLESVLC
        PF+   + + D +WY DSGASNHVT+++N++ NP +Y GT       GN L I HVG  CLS     L L  +LC
Subjt:  PFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDGNKLKILHVGNSCLSDGLNKLSLESVLC

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X13.5e-11349.15Show/hide
Query:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ
        FS+PPLNQ+LNQ+T++KLDR N+LLWK LALPIL+ YKLEGHL+ E PCP  ++ +A++   + TE  A +++    G SS      VNP +E W+  + 
Subjt:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ

Query:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP
        LLLGWLYNSM+P+VA Q+MGF   +DLW A QD FGVQSRAEED+LRQ+ Q  RK                                     GLDE YN 
Subjt:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP

Query:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQ-KSAVSFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNN
        V+ +IQG+  I+W +MQ++LL+FEKRL+ QNTQ K+  + + +  +NMA    + GQR Q+    Y YN    S  RGN                   NN
Subjt:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQ-KSAVSFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNN

Query:  KPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECV
         P CQ+CGK GH AL CY RFNKEFS P  QNR E+      S P P  FV++QN  PF A+PDTVVDP+WY+DSGA+NHVT E +++ NPT+Y G E V
Subjt:  KPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECV

Query:  TVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQ
        TVG+GN+L I +VGN+CL+DG   L L+++LCVP IAKNL+SVSKLAQDN +++EFH   C +KDK T +
Subjt:  TVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQ

A0A5A7SIT7 Uncharacterized protein8.1e-10252.26Show/hide
Query:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ
        FS+PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHL+GE PCP  ++ +A++   + TE  A +++    G SS      VN  +E W+  + 
Subjt:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ

Query:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP
        LLLGWLYNSM+P+VA Q+MGF   +DLW A QD FGVQSRAEED+LRQ+ Q  RKG+ KM EYL VMK++ DNLGQ GSPV  R L+SQVLLGLDE YN 
Subjt:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP

Query:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAV---SFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFN
        V+ +IQG+  I+W +MQ++LL+FEK L+ QNTQK      + + +  +NMA    + GQR  +    Y YN    S  RGN                   
Subjt:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAV---SFSHNTLVNMASNQNMGGQRGQN----YNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFN

Query:  NNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEG
        NN P CQ+CGK GH AL CY RFNKEFS P  Q+R E+      S P P  FV++QN  PF A+PDTVVDP+WY+DSGA+NHVT E +++ NPT+Y G
Subjt:  NNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEG

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-9350Show/hide
Query:  VATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMIQGRSGITW
        +A Q+MGF  ++DLW A QDLFGVQSRAEED+LRQ+FQ  RK      +YLR+MK+++D LGQAGSPV  R  +SQ LLGLDE YNPV+A+IQG+  I+W
Subjt:  VATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMIQGRSGITW

Query:  SEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGG-QRGQNYNYNNSQGSYGRGNQRG-NSGRSRGRGRGYDNFNNNKPICQVCGKPGHMALT
         +MQ+ELL FEKRLE Q+TQK+  +   N +VN+A N+N    ++  N+ ++ +  +  +G + G N GR RG+GRG      NKP CQVC K GH AL 
Subjt:  SEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGG-QRGQNYNYNNSQGSYGRGNQRG-NSGRSRGRGRGYDNFNNNKPICQVCGKPGHMALT

Query:  CYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDGNKLKILHVGNS
        CY RFNKEF  P  Q+RG         +      V  Q+ N F A+ DTV++ +WY+DSGA+NH+T EY++++NP++Y G E + VG+G+ L I ++GN+
Subjt:  CYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDGNKLKILHVGNS

Query:  CLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAK
         L+DG+N L+L++VLCVP I KNLVSVSKLAQDN++++EFH  +C +KDK T + LL   + +G+Y  +  +
Subjt:  CLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAK

A0A6J1DCW4 uncharacterized protein LOC1110195981.2e-10247.54Show/hide
Query:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ
        F+SPPLNQLLNQITSIK+DRGNFLLW+NLALPILRSYKL  +L+G+KPCPP +L               + + T + G +S   + T+NP YEAWIVV++
Subjt:  FSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQ

Query:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP
        LLLGWLYNSM+ +VA QVMGF TS++LW AVQ+LFGVQSRAE DYL+Q+FQQ  KGSL+M EYL++MKSHADNL  AGS V+ R+LVSQVL GLDEEYNP
Subjt:  LLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNP

Query:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTL--VNMASNQNMGGQRGQNYNYNNSQGS---YGRGNQRGNSG-RSRGRGRGYDNFNN
        +V  +QG+  ++WSEM AELL +EKRLE QN+ KS +  +      VN    ++    +  N N NNS GS    G G QRG+ G R+RGRG       N
Subjt:  VVAMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTL--VNMASNQNMGGQRGQNYNYNNSQGS---YGRGNQRGNSG-RSRGRGRGYDNFNN

Query:  NKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTEC
          P                                        SN  PN F A+ + +  V +P+TV+DPSWY DSGA++HVTA  N++    DY GTE 
Subjt:  NKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTEC

Query:  VTVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENA
        V V +GNKL I H+G++ +      L L+ VL VP IAKNL                        DK + + LLKG L + +YR + +
Subjt:  VTVGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENA

A0A803PEH4 Uncharacterized protein1.3e-8640.28Show/hide
Query:  MTNVNSTGISTLATGTPNFSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSI
        +   +S+  +  A+  PN  +PP    LNQ  S+KLDR N+ LWK +   I+R ++L G+LSG   CPP++                     V+ G++ +
Subjt:  MTNVNSTGISTLATGTPNFSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSI

Query:  APAATVNPQYEAWIVVNQLLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVT
              NP+YE WI+ +QLL+GWLY+SM+  +AT+VMG  ++ +L   ++ L+G  S+++ D  R + Q  RKGS  M+EYLR  K+ ++ L  AG P  
Subjt:  APAATVNPQYEAWIVVNQLLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVT

Query:  ARNLVSQVLLGLDEEYNPVVAMIQGRSGITWSEMQAELLVFEKRLE-LQN-TQKSAVSFSHNTLVNMASNQNMGGQRGQNYNYNNSQGSYGR--GNQRGN
          +LV+ VL GLD EY  +V  I+ RS  TW E+Q  LL F+ ++E LQN T  S  + S +   NMA+  N  G RG+ +   N+  + G    N RG 
Subjt:  ARNLVSQVLLGLDEEYNPVVAMIQGRSGITWSEMQAELLVFEKRLE-LQN-TQKSAVSFSHNTLVNMASNQNMGGQRGQNYNYNNSQGSYGR--GNQRGN

Query:  SGRSRGRGRGYDNFNNNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTA
        S R RGRGRG    + ++P CQV GK GH A  CY RF++ + G    N   N  +  Q+N          N++ FVA+P+ +   +W+ DSGASNH+T+
Subjt:  SGRSRGRGRGYDNFNNNKPICQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTA

Query:  EYNSIANPTDYEGTECVTVGDGNKLKILHVGNSCLS-DGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIY
        +  ++    DY G E V VG+G+KL+I H+GN  L+ +  N L L+ +L VP+IAKNLVSVSKLA DN++ +EF+ NFCLVKDK T +VLL GVL + +Y
Subjt:  EYNSIANPTDYEGTECVTVGDGNKLKILHVGNSCLS-DGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIY

Query:  RFEN
        + ++
Subjt:  RFEN

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.5e-3327.46Show/hide
Query:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQLLLGW
        LN  ++ +T  KL   N+L+W      +   Y+L G L G    PP     AT G    T+A                 A  VNP Y  W   ++L+   
Subjt:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQLLLGW

Query:  LYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMI
        +  ++S  V   V    T+  +W  ++ ++   S      LR   +Q  KG+  + +Y++ + +  D L   G P+     V +VL  L EEY PV+  I
Subjt:  LYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMI

Query:  QGR-SGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGGQRGQNYNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNF----NNNKPI--
          + +  T +E+   LL         N +   ++ S  T++ + +N  +  +     N NN+     R + R N+  S+   +   NF    N +KP   
Subjt:  QGR-SGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGGQRGQNYNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNF----NNNKPI--

Query:  -CQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQ-NNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVT
         CQ+CG  GH A  C Q               ++    + S  PP+ F   Q   N  + SP +    +W +DSGA++H+T+++N+++    Y G + V 
Subjt:  -CQVCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQ-NNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVT

Query:  VGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAKA
        V DG+ + I H G++ LS     L+L ++L VP I KNL+SV +L   N + VEF      VKD  T   LL+G   + +Y +  A +
Subjt:  VGDGNKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAKA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-3127.78Show/hide
Query:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQLLLGW
        LN  ++ +T  KL   N+L+W      +   Y+L G L G  P PP     AT G    T+A                    VNP Y  W   ++L+   
Subjt:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQYEAWIVVNQLLLGW

Query:  LYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMI
        +  ++S  V   V    T+  +W               + LR+I+     G +    ++    +  D L   G P+     V +VL  L ++Y PV+  I
Subjt:  LYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAMI

Query:  QGR-SGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMG-GQRGQNYNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNNKPI---CQ
          + +  + +E+   L+  E +L   N+ +  V  + N + +  +N N     RG N NYNN+       N R NS +    G   DN    KP    CQ
Subjt:  QGR-SGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMG-GQRGQNYNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNNKPI---CQ

Query:  VCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDG
        +C   GH A  C Q    +F    +Q +  +   P Q    P A +A   N+P+ A+       +W +DSGA++H+T+++N+++    Y G + V + DG
Subjt:  VCGKPGHMALTCYQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDG

Query:  NKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAKATA
        + + I H G++ L      L L  VL VP I KNL+SV +L   N + VEF      VKD  T   LL+G   + +Y +  A + A
Subjt:  NKLKILHVGNSCLSDGLNKLSLESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAKATA

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.0e-0526.61Show/hide
Query:  LYNSMSP-EVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAM
        LY +++P +     +   TS+D+W+ +++ F     A    L    +    G +++A+Y R MK  AD+L     PVT RNLV  VL GL+ +++ ++ +
Subjt:  LYNSMSP-EVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVVAM

Query:  IQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGGQRGQNYNYNNSQGSYGRGNQRGNSGRSRG----RGRGYDNFNNNKPICQ
        I+ R      +  A +L      E ++  K A+  +   + + +S+  +              G    GNQ G  GR RG    RGRG      N P   
Subjt:  IQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGGQRGQNYNYNNSQGSYGRGNQRGNSGRSRG----RGRGYDNFNNNKPICQ

Query:  VCGKPGHMALTCYQRFNKEFSGPQSQN-RGENG
           +P       YQ +N  +  P   N  G NG
Subjt:  VCGKPGHMALTCYQRFNKEFSGPQSQN-RGENG

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.6e-0927.75Show/hide
Query:  AATVNPQYE-AWIVVNQLLLGWLYNSMSPEVATQVMGFE-TSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVT
        ++T  P  E  W   + L+  W+Y +++  +   ++    T++DLW+++++LF     A         +      L + EY + +KS +D L    SP++
Subjt:  AATVNPQYE-AWIVVNQLLLGWLYNSMSPEVATQVMGFE-TSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVT

Query:  ARNLVSQVLLGLDEEYNPVVAMIQGRSGI-TWSEMQAELLVFEKRLELQNTQKSAVSF-SHNTLVNMASNQNMGGQR-GQNYNYNNSQGSYGRGNQRG-N
         R LV  +L GL E+Y+ ++ +I+ +S   +++E ++ LL+ E R  L N  KS++S  +H +L N+        +R  Q Y+ NNS    GR  ++   
Subjt:  ARNLVSQVLLGLDEEYNPVVAMIQGRSGI-TWSEMQAELLVFEKRLELQNTQKSAVSF-SHNTLVNMASNQNMGGQR-GQNYNYNNSQGSYGRGNQRG-N

Query:  SGRSRGRGRGYDNFNNNKPICQVCGKP
         G S GR    +N+  N+P   + G P
Subjt:  SGRSRGRGRGYDNFNNNKPICQVCGKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAACGTCAACTCCACCGGAATTTCAACCTTAGCTACCGGAACACCAAACTTTAGCAGCCCACCTCTAAATCAGTTGTTGAATCAGATCACATCAATCAAACTTGA
TAGGGGAAATTTCCTACTCTGGAAGAACCTGGCACTTCCAATTCTCCGAAGCTACAAATTAGAGGGTCATCTCTCTGGGGAGAAACCGTGTCCTCCTCAATACCTTGCAG
CTGCCACAAATGGTGGAAATTCTAGCACAGAAGCAAGAGCATCAAGCTCCGTAACAGTTGTAAGTGGAGAATCGAGCATTGCTCCAGCGGCAACTGTGAATCCTCAATAT
GAAGCTTGGATAGTAGTCAATCAGCTTCTGTTAGGATGGCTTTACAACTCGATGTCGCCTGAAGTTGCGACACAAGTGATGGGTTTCGAAACATCACAAGACCTATGGGT
TGCGGTACAAGACCTGTTTGGTGTCCAATCTCGTGCTGAAGAAGACTACCTACGACAAATATTTCAACAATGCAGAAAAGGGAGTCTGAAAATGGCCGAATATCTTCGAG
TAATGAAAAGCCATGCCGATAACTTGGGACAAGCTGGAAGCCCAGTGACGGCAAGGAATCTGGTTTCACAAGTTCTCCTAGGACTTGATGAAGAATACAATCCGGTGGTA
GCCATGATCCAAGGCAGGTCTGGAATCACATGGTCTGAAATGCAAGCTGAGCTCCTGGTGTTCGAAAAGAGACTTGAATTGCAGAACACTCAGAAAAGTGCAGTGTCCTT
CAGCCATAATACTTTGGTTAACATGGCAAGCAATCAGAACATGGGAGGACAAAGAGGACAAAACTACAACTACAATAACAGTCAAGGCTCATATGGCAGAGGCAACCAAA
GGGGAAACAGTGGTAGAAGCCGTGGTAGAGGACGAGGTTACGACAATTTCAACAACAACAAACCAATATGCCAGGTATGTGGAAAACCAGGCCATATGGCCTTAACTTGT
TATCAAAGGTTTAACAAAGAGTTTTCTGGTCCTCAAAGTCAGAATAGGGGAGAAAATGGAAGGCAACCTATGCAGAGCAATCCTCCACCGAATGCCTTTGTAGCAAGTCA
GAACAACAACCCGTTTGTAGCCTCTCCTGATACAGTAGTTGATCCAAGCTGGTATGTCGATAGCGGTGCATCAAATCATGTAACAGCTGAGTATAATTCCATTGCCAATC
CAACTGACTATGAAGGTACAGAGTGTGTGACTGTGGGTGATGGAAATAAACTAAAAATTTTGCATGTAGGGAATTCTTGCTTGTCTGATGGTTTGAATAAACTGAGTTTA
GAAAGTGTTCTGTGTGTTCCACAAATAGCAAAAAATCTTGTGAGTGTGTCTAAGCTTGCTCAAGACAATGACTTATTTGTTGAATTTCATGATAACTTTTGCTTGGTAAA
GGACAAGGGTACGAGCCAAGTGCTGCTGAAAGGGGTCCTCAGTGAAGGGATATACCGGTTTGAGAATGCTAAAGCTACTGCCAAGGATATTTCAAAGAGGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAACGTCAACTCCACCGGAATTTCAACCTTAGCTACCGGAACACCAAACTTTAGCAGCCCACCTCTAAATCAGTTGTTGAATCAGATCACATCAATCAAACTTGA
TAGGGGAAATTTCCTACTCTGGAAGAACCTGGCACTTCCAATTCTCCGAAGCTACAAATTAGAGGGTCATCTCTCTGGGGAGAAACCGTGTCCTCCTCAATACCTTGCAG
CTGCCACAAATGGTGGAAATTCTAGCACAGAAGCAAGAGCATCAAGCTCCGTAACAGTTGTAAGTGGAGAATCGAGCATTGCTCCAGCGGCAACTGTGAATCCTCAATAT
GAAGCTTGGATAGTAGTCAATCAGCTTCTGTTAGGATGGCTTTACAACTCGATGTCGCCTGAAGTTGCGACACAAGTGATGGGTTTCGAAACATCACAAGACCTATGGGT
TGCGGTACAAGACCTGTTTGGTGTCCAATCTCGTGCTGAAGAAGACTACCTACGACAAATATTTCAACAATGCAGAAAAGGGAGTCTGAAAATGGCCGAATATCTTCGAG
TAATGAAAAGCCATGCCGATAACTTGGGACAAGCTGGAAGCCCAGTGACGGCAAGGAATCTGGTTTCACAAGTTCTCCTAGGACTTGATGAAGAATACAATCCGGTGGTA
GCCATGATCCAAGGCAGGTCTGGAATCACATGGTCTGAAATGCAAGCTGAGCTCCTGGTGTTCGAAAAGAGACTTGAATTGCAGAACACTCAGAAAAGTGCAGTGTCCTT
CAGCCATAATACTTTGGTTAACATGGCAAGCAATCAGAACATGGGAGGACAAAGAGGACAAAACTACAACTACAATAACAGTCAAGGCTCATATGGCAGAGGCAACCAAA
GGGGAAACAGTGGTAGAAGCCGTGGTAGAGGACGAGGTTACGACAATTTCAACAACAACAAACCAATATGCCAGGTATGTGGAAAACCAGGCCATATGGCCTTAACTTGT
TATCAAAGGTTTAACAAAGAGTTTTCTGGTCCTCAAAGTCAGAATAGGGGAGAAAATGGAAGGCAACCTATGCAGAGCAATCCTCCACCGAATGCCTTTGTAGCAAGTCA
GAACAACAACCCGTTTGTAGCCTCTCCTGATACAGTAGTTGATCCAAGCTGGTATGTCGATAGCGGTGCATCAAATCATGTAACAGCTGAGTATAATTCCATTGCCAATC
CAACTGACTATGAAGGTACAGAGTGTGTGACTGTGGGTGATGGAAATAAACTAAAAATTTTGCATGTAGGGAATTCTTGCTTGTCTGATGGTTTGAATAAACTGAGTTTA
GAAAGTGTTCTGTGTGTTCCACAAATAGCAAAAAATCTTGTGAGTGTGTCTAAGCTTGCTCAAGACAATGACTTATTTGTTGAATTTCATGATAACTTTTGCTTGGTAAA
GGACAAGGGTACGAGCCAAGTGCTGCTGAAAGGGGTCCTCAGTGAAGGGATATACCGGTTTGAGAATGCTAAAGCTACTGCCAAGGATATTTCAAAGAGGAAATGA
Protein sequenceShow/hide protein sequence
MTNVNSTGISTLATGTPNFSSPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGEKPCPPQYLAAATNGGNSSTEARASSSVTVVSGESSIAPAATVNPQY
EAWIVVNQLLLGWLYNSMSPEVATQVMGFETSQDLWVAVQDLFGVQSRAEEDYLRQIFQQCRKGSLKMAEYLRVMKSHADNLGQAGSPVTARNLVSQVLLGLDEEYNPVV
AMIQGRSGITWSEMQAELLVFEKRLELQNTQKSAVSFSHNTLVNMASNQNMGGQRGQNYNYNNSQGSYGRGNQRGNSGRSRGRGRGYDNFNNNKPICQVCGKPGHMALTC
YQRFNKEFSGPQSQNRGENGRQPMQSNPPPNAFVASQNNNPFVASPDTVVDPSWYVDSGASNHVTAEYNSIANPTDYEGTECVTVGDGNKLKILHVGNSCLSDGLNKLSL
ESVLCVPQIAKNLVSVSKLAQDNDLFVEFHDNFCLVKDKGTSQVLLKGVLSEGIYRFENAKATAKDISKRK