CuGenDBv2

Lag0035556 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035556
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr3:23944560..23945276
RNA-Seq Expression: Lag0035556
Synteny: Lag0035556
Gene Ontology terms: GO:0009987 - cellular process (biological process)
InterPro domains: NA
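The genome location can be turned into a feature length with a little arithmetic. The sketch below assumes the `chrom:start..end` string uses 1-based, inclusive coordinates (an assumption, not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:23944560..23945276")
length = span_length(start, end)
# Number of codons in the span, and residues if the last codon is a stop.
print(chrom, length, length // 3, length // 3 - 1)
```

Under that assumption the span is 717 bp, which would correspond to 239 codons, or a 238-residue protein plus a stop codon if the locus is a single uninterrupted coding sequence.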


Homology
GenBank top hits (columns: e value, % identity, alignment)
BBG98819.1 VIRB2-interacting protein 2 [Prunus dulcis] (e value: 1.2e-30; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis] (e value: 1.2e-30; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

VVA13439.1 Hypothetical predicted protein, partial [Prunus dulcis] (e value: 1.2e-30; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

VVA21938.1 Hypothetical predicted protein, partial [Prunus dulcis] (e value: 1.2e-30; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

VVA41200.1 PREDICTED: RNA-directed DNA polymerase, partial [Prunus dulcis] (e value: 1.2e-30; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

TrEMBL top hits (columns: e value, % identity, alignment)
A0A4Y1R3V4 VIRB2-interacting protein 2 (e value: 5.9e-31; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

A0A5E4EEP2 Reverse transcriptase domain-containing protein (Fragment) (e value: 5.9e-31; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

A0A5E4F859 Reverse transcriptase domain-containing protein (Fragment) (e value: 5.9e-31; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

A0A5E4GN72 PREDICTED: RNA-directed DNA polymerase (Fragment) (e value: 5.9e-31; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

A0A5H2Y6K0 VIRB2-interacting protein 2 (e value: 5.9e-31; % identity: 38.8)
Query:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP
        +E  G L  + ++ R +L   + DL  +E ++W QR K++W ++GD NTKFFHRI + R+++N I ++    G  +V+  +IE+E I+F++ L++ +   
Subjt:  LEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNHP

Query:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
         +    ++W+ IS  +A  L+  F EEEV +A+   G  KSPGPDGF+   F+  W+I+K+DLM ++ DFFN  IIN   NET
Subjt:  RFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET

SwissProt top hits (columns: e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.1e-05; % identity: 21.48)
Query:  SQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTK-----DNHPRFLP
        S+++  + +R +++++  Q+ +Q     +  + +  ++  +   R++  ++ KN I  + + +G+      +I+    ++Y+ L+       +    FL 
Subjt:  SQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTK-----DNHPRFLP

Query:  TNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFK
        T      +++ +  SL    +  E+   ++SL + KSPGPDGFTAEF++
Subjt:  TNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFK

P08548 LINE-1 reverse transcriptase homolog (e value: 7.0e-05; % identity: 22.02)
Query:  SQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFI--DFYQKLFT-KDNHPRFLPTN
        S+++  + +R ++ ++  +  IQ   + K  + ++ ++  K    +   ++ K+ I+ +  R GN+ +T +  E++ I  ++Y+KL++ K  + + +   
Subjt:  SQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFI--DFYQKLFT-KDNHPRFLPTN

Query:  VD---WSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFF
        ++      +S+ +   L    S  E+   + +L   KSPGPDGFT+EF++      K++L+ ++ + F
Subjt:  VD---WSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFF

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 4.9e-06; % identity: 23.21)
Query:  SQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTK-----DNHPRFLP
        S+++    LR +I  +  +  IQ   + +  + ++ ++  K   R+    + K  I ++ + +G+      +I+     FY++L++      D   +FL 
Subjt:  SQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTK-----DNHPRFLP

Query:  TNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFN
               ++++Q   L +  S +E+   ++SL + KSPGPDGF+AEF++      K+DL+ +++  F+
Subjt:  TNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFN

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 2.3e-08; % identity: 28.99)
Query:  RCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDN-HPRFLPTNVDWSP-ISENQATSLEAVFSEEEVFQAM
        R +++ L + D  ++FF+ +   +  +  IT + + +G  L     I      FYQ LF+ D   P       D  P +SE +   LE   + +E+ QA+
Subjt:  RCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDN-HPRFLPTNVDWSP-ISENQATSLEAVFSEEEVFQAM

Query:  SSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFF
          +  +KSPG DG T EFF+F W+ +  D   ++ + F
Subjt:  SSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFF

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 6.3e-17; % identity: 29.49)
Query:  WHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNH---PRFLPTNVDWSPI--SENQATSLEAVFSEE
        + Q+ ++KWL++GD NT+FFH+++ A + KN I  +   +   +     ++   + +Y  L   D+    P  +    D  P   ++  A+ L A+ S++
Subjt:  WHQRCKLKWLKEGDENTKFFHRILAARKRKNAITEVLSREGNNLVTANDIEVEFIDFYQKLFTKDNH---PRFLPTNVDWSPI--SENQATSLEAVFSEE

Query:  EVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
        E+  A+ ++  +K+PGPD FTAEFF  SW ++K   +  + +FF T  +    N T
Subjt:  EVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIINVALNET
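In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single alignment block. It assumes the `Query:` label and the matching leading indentation have already been stripped, and `midline_stats` is a hypothetical helper name. The % identity reported for a hit is computed over the whole alignment, so a single block will not necessarily reproduce it.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one alignment block.

    `query` is the aligned query row (gaps included) and `midline` is the
    match line beneath it, both without their leading labels.
    """
    # Trailing spaces of the midline are often lost in text exports,
    # so pad it back to the width of the query row.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    percent_identity = 100.0 * identities / len(query)
    return identities, positives, percent_identity


# First eight columns of the first GenBank alignment, used as a toy input.
print(midline_stats("LEDNGLLT", "+E  G L"))
```

For this eight-column fragment the function reports 3 identities and 4 positives, that is 37.5% identity over the fragment.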


Sequences
CDS sequence
ATGGATTGTGTATTATGGCCATCCCTCCCATCCAACCACCTAAGCGAAAGGCTGCGAATACCACTGGGAAGAAAACTAAACTCACGAGAGAGCTCGATAATCTCAAGACC
ACAATGGGTCAAAAGGAATCTAGAGGATAATGGACTCCTTACTATTTCCCAAAAGGAATCTAGGAGCAATCTTCGTGAGCAGATTGAAGATTTAACGGGTCAAGAACACA
TCCAATGGCATCAACGTTGTAAGTTAAAATGGCTTAAGGAAGGTGATGAAAATACTAAATTCTTTCACCGTATTTTGGCAGCGCGTAAAAGGAAAAATGCGATTACTGAG
GTGTTATCCCGCGAAGGAAACAATTTAGTTACAGCTAATGATATTGAAGTGGAGTTCATTGATTTCTATCAAAAATTGTTCACCAAAGATAATCATCCCCGTTTTCTCCC
AACAAATGTTGATTGGAGTCCAATTAGCGAGAACCAAGCGACAAGCTTGGAAGCTGTCTTTTCTGAGGAAGAAGTCTTTCAGGCCATGAGTTCTTTAGGATCAAGTAAGT
CCCCTGGCCCGGATGGTTTTACAGCTGAATTTTTTAAGTTCTCATGGAATATTATTAAGCAAGATCTTATGACCATGATCAATGATTTTTTCAATACTGATATTATTAAT
GTGGCTCTGAATGAAACTGTTGGGTATCCAACTGATTTCCTGCAAAGAAAAACGTAA
mRNA sequence
ATGGATTGTGTATTATGGCCATCCCTCCCATCCAACCACCTAAGCGAAAGGCTGCGAATACCACTGGGAAGAAAACTAAACTCACGAGAGAGCTCGATAATCTCAAGACC
ACAATGGGTCAAAAGGAATCTAGAGGATAATGGACTCCTTACTATTTCCCAAAAGGAATCTAGGAGCAATCTTCGTGAGCAGATTGAAGATTTAACGGGTCAAGAACACA
TCCAATGGCATCAACGTTGTAAGTTAAAATGGCTTAAGGAAGGTGATGAAAATACTAAATTCTTTCACCGTATTTTGGCAGCGCGTAAAAGGAAAAATGCGATTACTGAG
GTGTTATCCCGCGAAGGAAACAATTTAGTTACAGCTAATGATATTGAAGTGGAGTTCATTGATTTCTATCAAAAATTGTTCACCAAAGATAATCATCCCCGTTTTCTCCC
AACAAATGTTGATTGGAGTCCAATTAGCGAGAACCAAGCGACAAGCTTGGAAGCTGTCTTTTCTGAGGAAGAAGTCTTTCAGGCCATGAGTTCTTTAGGATCAAGTAAGT
CCCCTGGCCCGGATGGTTTTACAGCTGAATTTTTTAAGTTCTCATGGAATATTATTAAGCAAGATCTTATGACCATGATCAATGATTTTTTCAATACTGATATTATTAAT
GTGGCTCTGAATGAAACTGTTGGGTATCCAACTGATTTCCTGCAAAGAAAAACGTAA
Protein sequence
MDCVLWPSLPSNHLSERLRIPLGRKLNSRESSIISRPQWVKRNLEDNGLLTISQKESRSNLREQIEDLTGQEHIQWHQRCKLKWLKEGDENTKFFHRILAARKRKNAITE
VLSREGNNLVTANDIEVEFIDFYQKLFTKDNHPRFLPTNVDWSPISENQATSLEAVFSEEEVFQAMSSLGSSKSPGPDGFTAEFFKFSWNIIKQDLMTMINDFFNTDIIN
VALNETVGYPTDFLQRKT
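The relationship between the CDS and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and the two inputs are the first 30 and the final 57 nucleotides of the CDS as listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (the third base varies fastest) and '*' marking stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGATTGTGTATTATGGCCATCCCTCCCA"
cds_end = "GTGGCTCTGAATGAAACTGTTGGGTATCCAACTGATTTCCTGCAAAGAAAAACGTAA"

print(translate(cds_start))  # MDCVLWPSLP
print(translate(cds_end))    # VALNETVGYPTDFLQRKT*
```

Both fragments agree with the corresponding ends of the protein sequence, with the trailing `*` representing the TAA stop codon that is not written in the protein record.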