CuGenDBv2

Lag0029094 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029094
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr8:35283253..35284995
RNA-Seq Expression: Lag0029094
Synteny: Lag0029094
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 2.4e-12 | identity: 36.29%
Query:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE
        S NTA++L     N    PS C++C  + E   HL I CP+A S W+S+  H +  V+ L P+ +    C      K  + +++I+    A+ LW++W E
Subjt:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE

Query:  RNSRIFNNISRSASQIWEDIIASA
        RN+RIFN   ++ ++IWEDI A A
Subjt:  RNSRIFNNISRSASQIWEDIIASA

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.8e-12 | identity: 33.33%
Query:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN
        NTADRLQ    N   +P+ C +C+   E ++HL IHCP +   W    A +  +  P   Q++  + C     L   + + +I    +A ILW +W ERN
Subjt:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN

Query:  SRIFNNISRSASQIWEDIIA
        +RIF    ++   +WED +A
Subjt:  SRIFNNISRSASQIWEDIIA

KAA0044451.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 2.2e-13 | identity: 35%
Query:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN
        NTADRLQ    N A +P+ C +C+   E ++HL IHCP +   W    A +  +  P   Q++  + C     L   + + +I    SA +LW +W ERN
Subjt:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN

Query:  SRIFNNISRSASQIWEDIIA
        +RIF    + +  +WEDI+A
Subjt:  SRIFNNISRSASQIWEDIIA

TYK29578.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 2.2e-13 | identity: 35%
Query:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN
        NTADRLQ    N A +P+ C +C+   E ++HL IHCP +   W    A +  +  P   Q++  + C     L   + + +I    SA +LW +W ERN
Subjt:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN

Query:  SRIFNNISRSASQIWEDIIA
        +RIF    + +  +WEDI+A
Subjt:  SRIFNNISRSASQIWEDIIA

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo] | e value: 2.4e-12 | identity: 36.29%
Query:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE
        S NTA++L     N    PS C++C  + E   HL I CP+A S W+S+  H +  V+ L P+ +    C      K  + +++I+    A+ LW++W E
Subjt:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE

Query:  RNSRIFNNISRSASQIWEDIIASA
        RN+RIFN   ++ ++IWEDI A A
Subjt:  RNSRIFNNISRSASQIWEDIIASA

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein | e value: 1.2e-12 | identity: 36.29%
Query:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE
        S NTA++L     N    PS C++C  + E   HL I CP+A S W+S+  H +  V+ L P+ +    C      K  + +++I+    A+ LW++W E
Subjt:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE

Query:  RNSRIFNNISRSASQIWEDIIASA
        RN+RIFN   ++ ++IWEDI A A
Subjt:  RNSRIFNNISRSASQIWEDIIASA

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein | e value: 8.9e-13 | identity: 33.33%
Query:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN
        NTADRLQ    N   +P+ C +C+   E ++HL IHCP +   W    A +  +  P   Q++  + C     L   + + +I    +A ILW +W ERN
Subjt:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN

Query:  SRIFNNISRSASQIWEDIIA
        +RIF    ++   +WED +A
Subjt:  SRIFNNISRSASQIWEDIIA

A0A5A7TSA7 LINE-1 retrotransposable element ORF2 protein | e value: 1.0e-13 | identity: 35%
Query:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN
        NTADRLQ    N A +P+ C +C+   E ++HL IHCP +   W    A +  +  P   Q++  + C     L   + + +I    SA +LW +W ERN
Subjt:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN

Query:  SRIFNNISRSASQIWEDIIA
        +RIF    + +  +WEDI+A
Subjt:  SRIFNNISRSASQIWEDIIA

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein | e value: 1.2e-12 | identity: 36.29%
Query:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE
        S NTA++L     N    PS C++C  + E   HL I CP+A S W+S+  H +  V+ L P+ +    C      K  + +++I+    A+ LW++W E
Subjt:  SFNTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSL--HAAMGVHFL-PQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSE

Query:  RNSRIFNNISRSASQIWEDIIASA
        RN+RIFN   ++ ++IWEDI A A
Subjt:  RNSRIFNNISRSASQIWEDIIASA

A0A5D3E1E3 LINE-1 retrotransposable element ORF2 protein | e value: 1.0e-13 | identity: 35%
Query:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN
        NTADRLQ    N A +P+ C +C+   E ++HL IHCP +   W    A +  +  P   Q++  + C     L   + + +I    SA +LW +W ERN
Subjt:  NTADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLP---QAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERN

Query:  SRIFNNISRSASQIWEDIIA
        +RIF    + +  +WEDI+A
Subjt:  SRIFNNISRSASQIWEDIIA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 5.2e-05 | identity: 25.26%
Query:  PSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSRIFNNISRSASQI
        P  C+ C++H ET  HL   C  A   W  ++    VH  P  +     +       + + + I++    A ++++W ERN+R+ ++ SR A+ +
Subjt:  PSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSRIFNNISRSASQI

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 4.7e-06 | identity: 26.72%
Query:  NTADRLQAVFKNCAFN-PSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSR
        +T DRLQ    N   + P+ C+LC++H ++  HL   C  +   W+   A+  ++  P A                +  +II+    + ++++W ERN R
Subjt:  NTADRLQAVFKNCAFN-PSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSR

Query:  IFNNISRSASQIWEDI
        + + +SRS   I +DI
Subjt:  IFNNISRSASQIWEDI

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1) | e value: 2.0e-04 | identity: 26.26%
Query:  PSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSRIFNNISRSASQIWEDI
        PS C+LC  ++E  +HL   C ++ + W +          P  +             + + S+II+ +  A ++ LW ERN R+ ++ SRS++ I ++I
Subjt:  PSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSRIFNNISRSASQIWEDI

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 7.5e-04 | identity: 28.57%
Query:  RLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSRIFN
        R + V      +P CC LC    ET DHL++ C  +SS W  + A + +  + Q            L   S  SII + ++ A + ++W +RN+ + N
Subjt:  RLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSRIFN

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 1.8e-05 | identity: 30.77%
Query:  TADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQ---SLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNS
        T DRL+    N    PS  +LC +  ET  HL   C  + + W+   S         LP A +         L   SH + I++ L  + ++ +W ERN+
Subjt:  TADRLQAVFKNCAFNPSCCILCHSHSETLDHLLIHCPLASSFWQ---SLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNS

Query:  RIFNNISRSASQIWEDIIASASFSVLLLPS
        RIF +IS SAS +   I  +    +L  PS
Subjt:  RIFNNISRSASQIWEDIIASASFSVLLLPS


Sequences
CDS sequence
ATGTCTTCTCCTATGGCCTCTCTTCTGTATGCTTCTTTGCTTCTATTGTTTGTCTCTCTAATTTTGCCCTCAACAGCCACTGCGAATCCTGTTTATGAATATCCGCCGCA
GCCGCCACCGCGAACGGCGGTTCAAGAGCCAGCCTTCAACAAACGATACAAACTTCTTTCGATTCATTCCACAATATCAACCATGAGTAAACATAAGCAAGCACAATTAC
AAGGTCAGCTCCAGTCACCTGACGCTGGATTGTCCTTTAACACGGCGGACAGATTACAAGCGGTCTTCAAGAACTGTGCCTTCAACCCGAGCTGCTGCATTTTGTGTCAT
AGCCACTCGGAGACCTTAGACCACCTCTTAATTCACTGCCCCCTTGCTTCAAGCTTTTGGCAATCTCTTCATGCTGCCATGGGGGTTCACTTCCTGCCTCAGGCTATGGC
TCATCATTTCTGCAAGGAGGCCTTTTTCTTGAAGGGCAATTCCCATCGAAGTATTATTATTCAGACGTTATCTGCAGCTATTTTGTGGTCACTTTGGTCGGAGCGAAACT
CTCGGATTTTTAACAATATATCTCGCTCAGCCTCTCAAATTTGGGAAGATATCATTGCTAGCGCTTCGTTTTCGGTTCTTCTTCTTCCAAGCTTTTTGCTAGCTAAAATG
CATCCTTAA
mRNA sequence
ATGTCTTCTCCTATGGCCTCTCTTCTGTATGCTTCTTTGCTTCTATTGTTTGTCTCTCTAATTTTGCCCTCAACAGCCACTGCGAATCCTGTTTATGAATATCCGCCGCA
GCCGCCACCGCGAACGGCGGTTCAAGAGCCAGCCTTCAACAAACGATACAAACTTCTTTCGATTCATTCCACAATATCAACCATGAGTAAACATAAGCAAGCACAATTAC
AAGGTCAGCTCCAGTCACCTGACGCTGGATTGTCCTTTAACACGGCGGACAGATTACAAGCGGTCTTCAAGAACTGTGCCTTCAACCCGAGCTGCTGCATTTTGTGTCAT
AGCCACTCGGAGACCTTAGACCACCTCTTAATTCACTGCCCCCTTGCTTCAAGCTTTTGGCAATCTCTTCATGCTGCCATGGGGGTTCACTTCCTGCCTCAGGCTATGGC
TCATCATTTCTGCAAGGAGGCCTTTTTCTTGAAGGGCAATTCCCATCGAAGTATTATTATTCAGACGTTATCTGCAGCTATTTTGTGGTCACTTTGGTCGGAGCGAAACT
CTCGGATTTTTAACAATATATCTCGCTCAGCCTCTCAAATTTGGGAAGATATCATTGCTAGCGCTTCGTTTTCGGTTCTTCTTCTTCCAAGCTTTTTGCTAGCTAAAATG
CATCCTTAA
Protein sequence
MSSPMASLLYASLLLLFVSLILPSTATANPVYEYPPQPPPRTAVQEPAFNKRYKLLSIHSTISTMSKHKQAQLQGQLQSPDAGLSFNTADRLQAVFKNCAFNPSCCILCH
SHSETLDHLLIHCPLASSFWQSLHAAMGVHFLPQAMAHHFCKEAFFLKGNSHRSIIIQTLSAAILWSLWSERNSRIFNNISRSASQIWEDIIASASFSVLLLPSFLLAKM
HP
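
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the CuGenDB record: the `translate` function and the `CODON_TABLE` name are hypothetical helpers introduced for illustration, and the sketch assumes the standard nuclear code (NCBI translation table 1) and an unambiguous A/C/G/T sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence wrapped
    over several lines can be pasted in directly. A codon containing a
    character other than A, C, G, or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above
print(translate("ATGTCTTCTCCTATGGCC"))  # MSSPMA
```

Passing the full CDS (all wrapped lines joined) to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two fields are consistent.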