CuGenDBv2

Lag0029517 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029517
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr8:39672754..39673281
RNA-Seq Expression: Lag0029517
Synteny: Lag0029517
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
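
The genome location string above can be split into its parts and converted to a feature length with a few lines of Python. This is only a minimal sketch: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr8:39672754..39673281")
print(chrom, span_length(start, end))
```

Under that assumption the span is 528 bp, a multiple of three, which would be in line with an intronless coding sequence.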


Homology
GenBank top hits (with e value and % identity)
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 1.7e-29, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+S+  H +  V+  S       C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 8.4e-29, identity 38.82%)
Query:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPS---QAMAHNFCKEAFFWKGNS
        LW  E PKK K FIWT++H  INT DRLQ    +   SP+ C +C+   E ++HL +HCP +   W    A +  +      Q++  N C         +
Subjt:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPS---QAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAFL
        Q+ +I  +  A ILW +W+ERN+RIF    ++   +WED +A   LW+  SKLF+N+D  SIALN  AF+
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAFL

KAA0040037.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 4.9e-29, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQ--SLHAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+  S H    V+  S     + C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQ--SLHAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

TYK13741.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 1.7e-29, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+S+  H +  V+  S       C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo] (e value 1.7e-29, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+S+  H +  V+  S       C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

TrEMBL top hits (with e value and % identity)
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein (e value 8.2e-30, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+S+  H +  V+  S       C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value 4.1e-29, identity 38.82%)
Query:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPS---QAMAHNFCKEAFFWKGNS
        LW  E PKK K FIWT++H  INT DRLQ    +   SP+ C +C+   E ++HL +HCP +   W    A +  +      Q++  N C         +
Subjt:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPS---QAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAFL
        Q+ +I  +  A ILW +W+ERN+RIF    ++   +WED +A   LW+  SKLF+N+D  SIALN  AF+
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAFL

A0A5A7TES3 LINE-1 retrotransposable element ORF2 protein (e value 2.4e-29, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQ--SLHAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+  S H    V+  S     + C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQ--SLHAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

A0A5D3CPL6 LINE-1 retrotransposable element ORF2 protein (e value 8.2e-30, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+S+  H +  V+  S       C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein (e value 8.2e-30, identity 40.24%)
Query:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS
        NLW   IPKK   FIWT+L+  +NT ++L     +    PS C++C  + E   HL + CP+A + W+S+  H +  V+  S       C     WK  +
Subjt:  NLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSL--HAAMGVHFPSQAMAHNFCKEAFFWKGNS

Query:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF
        ++++I+ +  A+ LW++W+ERN+RIFN   ++ + IWEDI A+A LW S S LF+N+ ASSIALN  AF
Subjt:  QRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAF

SwissProt top hits (with e value and % identity)
No hits found

Arabidopsis top hits (with e value and % identity)
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value 3.7e-06, identity 26.24%)
Query:  EIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVH-FPSQAMAHNFCKEAFFWKGN----SQR
        +IPK   I  W  +  R++T DR+          P  C+ C++H ET  HL   C  A   W  ++    VH FP         ++   W  N       
Subjt:  EIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVH-FPSQAMAHNFCKEAFFWKGN----SQR

Query:  SIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAI
        + I++    A ++++W ERN+R+ ++ SR A+ +  +I ++
Subjt:  SIIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAI

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value 1.4e-10, identity 27.66%)
Query:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFWKGNSQRS
        +W      K     W V   R++T DRLQ    S    P+ C+LC++H ++  HL   C  +   W+   A+  ++ P+Q M      +   W  +  R 
Subjt:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFWKGNSQRS

Query:  ----IIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDI
            +II+    + ++++W ERN R+ + +SRS   I +DI
Subjt:  ----IIIQSLTAAILWSLWMERNSRIFNNISRSASHIWEDI

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1) (e value 1.2e-04, identity 28.3%)
Query:  PSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFWKGNSQR----SIIIQSLTAAILWSLWMERNSRIFNNISRSASHIW
        PS C+LC  ++E  +HL   C ++   W +      +  P   M      +   W  +  R    S+II+ +  A ++ LW ERN R+ ++ SRS++ I 
Subjt:  PSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFWKGNSQR----SIIIQSLTAAILWSLWMERNSRIFNNISRSASHIW

Query:  EDIIAI
        ++I  I
Subjt:  EDIIAI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value 1.8e-05, identity 26.81%)
Query:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFW-----KG
        +W ++   KI+ F+W  L   +     L     S     S CI C S  ET++HLL  C  A   W    A   +  P      +      +W      G
Subjt:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFW-----KG

Query:  NSQRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHI
        N Q     Q L   +LW LW  RN  +F     +A  +
Subjt:  NSQRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHI

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value 2.8e-06, identity 27.01%)
Query:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFW----KGN
        +W  E   +  +  W     R+ T DRL+    +    PS  +LC +  ET  HL   C  +   W+   +      P    A      A  W       
Subjt:  LWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFW----KGN

Query:  SQRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHI
        S  + I++ L  + ++ +W ERN+RIF +IS SAS +
Subjt:  SQRSIIIQSLTAAILWSLWMERNSRIFNNISRSASHI


Sequences
CDS sequence
ATGGAAGCTCAGGCCCTCTCTAACCTTTGGGGTGCTGAAATCCCAAAGAAAATCAAAATTTTCATCTGGACAGTCCTTCATCGGAGGATTAATACGACGGACAGATTACA
AGCGGTCTTCAAGAGCTGTGCCTTTAGTCCGAGCTGCTGCATTCTGTGTCATAGCCATTCGGAGACCTTAGACCACCTATTACTTCACTGCCCCCTTGCTTCAAACTTTT
GGCAATCTCTTCATGCTGCCATGGGGGTACATTTCCCGTCTCAGGCTATGGCTCATAATTTCTGCAAGGAGGCCTTTTTCTGGAAGGGCAATTCCCAGCGGAGTATTATT
ATCCAGTCGCTAACCGCAGCTATATTGTGGTCCCTATGGATGGAGCGAAACTCTAGGATTTTTAACAATATATCACGCTCAGCCTCCCACATATGGGAAGATATCATTGC
TATCGCTTCTTTATGGGCTTCCTCTTCTAAGCTTTTTGCTAACCATGACGCATCCTCAATAGCTTTGAACTGGAAAGCTTTCCTGTAG
mRNA sequence
ATGGAAGCTCAGGCCCTCTCTAACCTTTGGGGTGCTGAAATCCCAAAGAAAATCAAAATTTTCATCTGGACAGTCCTTCATCGGAGGATTAATACGACGGACAGATTACA
AGCGGTCTTCAAGAGCTGTGCCTTTAGTCCGAGCTGCTGCATTCTGTGTCATAGCCATTCGGAGACCTTAGACCACCTATTACTTCACTGCCCCCTTGCTTCAAACTTTT
GGCAATCTCTTCATGCTGCCATGGGGGTACATTTCCCGTCTCAGGCTATGGCTCATAATTTCTGCAAGGAGGCCTTTTTCTGGAAGGGCAATTCCCAGCGGAGTATTATT
ATCCAGTCGCTAACCGCAGCTATATTGTGGTCCCTATGGATGGAGCGAAACTCTAGGATTTTTAACAATATATCACGCTCAGCCTCCCACATATGGGAAGATATCATTGC
TATCGCTTCTTTATGGGCTTCCTCTTCTAAGCTTTTTGCTAACCATGACGCATCCTCAATAGCTTTGAACTGGAAAGCTTTCCTGTAG
Protein sequence
MEAQALSNLWGAEIPKKIKIFIWTVLHRRINTTDRLQAVFKSCAFSPSCCILCHSHSETLDHLLLHCPLASNFWQSLHAAMGVHFPSQAMAHNFCKEAFFWKGNSQRSII
IQSLTAAILWSLWMERNSRIFNNISRSASHIWEDIIAIASLWASSSKLFANHDASSIALNWKAFL
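
The relationship between the CDS and protein sequences listed here can be checked by translating the nucleotides. The sketch below is an illustration, not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base, and the `translate` helper is a hypothetical name. Only the first 42 bases of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases of the CDS sequence from this record.
cds_prefix = "ATGGAAGCTCAGGCCCTCTCTAACCTTTGGGGTGCTGAAATC"
print(translate(cds_prefix))
```

The printed residues can be compared with the start of the protein sequence; passing the full CDS (with its line breaks) to `translate` works the same way.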