CuGenDBv2

Lag0012569 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0012569
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr1:42289026..42290384
RNA-Seq Expression: Lag0012569
Synteny: Lag0012569
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
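
The genome location above uses a `chr:start..end` notation. A minimal sketch for splitting it into its parts and computing the genomic span, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and using a hypothetical helper name `parse_location`:

```python
import re
from typing import Tuple

def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr1:42289026..42290384")
# Inclusive span; this "+ 1" is only right if coordinates are 1-based inclusive (assumption).
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1,359 bp on chr1.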


Homology
GenBank top hits (e value, %identity, alignment)
KAA0044451.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.7e-16 | %identity: 25.66
Query:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------
        S LNIN +  KS    I++       +  ++G  +GH P++YLG+PLGG   + +FW  V++++Q  L S                              
Subjt:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------

Query:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----
                                           SV  FW  S  +  +              HN+  +  T LP          L  N+I D      
Subjt:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----

Query:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP
                  N H N L++ +W   +P K K F+W L  G INTA+RLQK +P++ LSP+WC MC  S E   HLF HC ++ + W+       W+   P
Subjt:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP

Query:  NSIQ
          +Q
Subjt:  NSIQ

RVW15386.1 putative ribonuclease H protein [Vitis vinifera] | e value: 3.5e-17 | %identity: 29.3
Query:  WPSTYLGLPLGGNSKAIHFWQLVIER-LQNLLISTSVI--------------LFWEGSRGEGGMHNVNWATTQLPQLLGG--------NDIDDLGP----
        WP +YLGLPLGGN K I FW  V+ER L++ ++S   I                W G+      H + W     P+ LGG         +I  LG     
Subjt:  WPSTYLGLPLGGNSKAIHFWQLVIER-LQNLLISTSVI--------------LFWEGSRGEGGMHNVNWATTQLPQLLGG--------NDIDDLGP----

Query:  -----------------NAHVNG---------LHRVIWTNLY-----PKIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCT
                           H NG          HR  W  L+      K+K   W +  G +NT N+LQ   P+  L P  CI+C  + E  +HLF HC 
Subjt:  -----------------NAHVNG---------LHRVIWTNLY-----PKIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCT

Query:  FASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWRKEDFVACFEQCLSLVFM
             W  + +  G + V P SI D + + F G   S R +        CL+L++M
Subjt:  FASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWRKEDFVACFEQCLSLVFM

RVW45746.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | e value: 3.5e-17 | %identity: 29.25
Query:  VSRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLISTSVILFWEGS--RGEGGMHNV-----NWA
        +S L +N    KS   GI+LD   ++ L  T   K   WP  YLGLPLGGN +A  FW  VIER+   L        W+ +  R E   H +        
Subjt:  VSRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLISTSVILFWEGS--RGEGGMHNV-----NWA

Query:  TTQLPQLLGGNDIDDLGPNAHV-----NGLHRV---------------------IWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCA
           L + L G  +    P+A       +GL  V                     +W +  P K++ F+W ++   +NT + LQ   P+  LSP  CI+C 
Subjt:  TTQLPQLLGGNDIDDLGPNAHV-----NGLHRV---------------------IWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCA

Query:  ASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWR
           E  +H+F HC+     W  +        V P SI D + + F G   S R
Subjt:  ASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWR

TYK29578.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.7e-16 | %identity: 25.66
Query:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------
        S LNIN +  KS    I++       +  ++G  +GH P++YLG+PLGG   + +FW  V++++Q  L S                              
Subjt:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------

Query:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----
                                           SV  FW  S  +  +              HN+  +  T LP          L  N+I D      
Subjt:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----

Query:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP
                  N H N L++ +W   +P K K F+W L  G INTA+RLQK +P++ LSP+WC MC  S E   HLF HC ++ + W+       W+   P
Subjt:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP

Query:  NSIQ
          +Q
Subjt:  NSIQ

XP_038891595.1 uncharacterized protein LOC120080986 [Benincasa hispida] | e value: 4.4e-20 | %identity: 47.54
Query:  KSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQN---------LLISTSVIL----FWEGSRGEGGMHNVNWATTQLP
        KSE LGIH+ D E +WL ++FG+K+ + P TYLGLPLGGN   + FWQLV+ER+Q+         L+IS    L    FWE S  + G+HNVNW  +Q P
Subjt:  KSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQN---------LLISTSVIL----FWEGSRGEGGMHNVNWATTQLP

Query:  QLLGGNDIDDLGPNAHVNGLHR
        QLLGG  I +        GL R
Subjt:  QLLGGNDIDDLGPNAHVNGLHR

TrEMBL top hits (e value, %identity, alignment)
A0A438BWM3 Putative ribonuclease H protein | e value: 1.7e-17 | %identity: 29.3
Query:  WPSTYLGLPLGGNSKAIHFWQLVIER-LQNLLISTSVI--------------LFWEGSRGEGGMHNVNWATTQLPQLLGG--------NDIDDLGP----
        WP +YLGLPLGGN K I FW  V+ER L++ ++S   I                W G+      H + W     P+ LGG         +I  LG     
Subjt:  WPSTYLGLPLGGNSKAIHFWQLVIER-LQNLLISTSVI--------------LFWEGSRGEGGMHNVNWATTQLPQLLGG--------NDIDDLGP----

Query:  -----------------NAHVNG---------LHRVIWTNLY-----PKIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCT
                           H NG          HR  W  L+      K+K   W +  G +NT N+LQ   P+  L P  CI+C  + E  +HLF HC 
Subjt:  -----------------NAHVNG---------LHRVIWTNLY-----PKIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCT

Query:  FASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWRKEDFVACFEQCLSLVFM
             W  + +  G + V P SI D + + F G   S R +        CL+L++M
Subjt:  FASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWRKEDFVACFEQCLSLVFM

A0A438ED36 Transposon TX1 uncharacterized 149 kDa protein | e value: 1.7e-17 | %identity: 29.25
Query:  VSRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLISTSVILFWEGS--RGEGGMHNV-----NWA
        +S L +N    KS   GI+LD   ++ L  T   K   WP  YLGLPLGGN +A  FW  VIER+   L        W+ +  R E   H +        
Subjt:  VSRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLISTSVILFWEGS--RGEGGMHNV-----NWA

Query:  TTQLPQLLGGNDIDDLGPNAHV-----NGLHRV---------------------IWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCA
           L + L G  +    P+A       +GL  V                     +W +  P K++ F+W ++   +NT + LQ   P+  LSP  CI+C 
Subjt:  TTQLPQLLGGNDIDDLGPNAHV-----NGLHRV---------------------IWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCA

Query:  ASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWR
           E  +H+F HC+     W  +        V P SI D + + F G   S R
Subjt:  ASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIPNSIQDSLPLIFMGHPFSWR

A0A5A7TSA7 LINE-1 retrotransposable element ORF2 protein | e value: 8.4e-17 | %identity: 25.66
Query:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------
        S LNIN +  KS    I++       +  ++G  +GH P++YLG+PLGG   + +FW  V++++Q  L S                              
Subjt:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------

Query:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----
                                           SV  FW  S  +  +              HN+  +  T LP          L  N+I D      
Subjt:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----

Query:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP
                  N H N L++ +W   +P K K F+W L  G INTA+RLQK +P++ LSP+WC MC  S E   HLF HC ++ + W+       W+   P
Subjt:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP

Query:  NSIQ
          +Q
Subjt:  NSIQ

A0A5D3E1E3 LINE-1 retrotransposable element ORF2 protein | e value: 8.4e-17 | %identity: 25.66
Query:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------
        S LNIN +  KS    I++       +  ++G  +GH P++YLG+PLGG   + +FW  V++++Q  L S                              
Subjt:  SRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLIS------------------------------

Query:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----
                                           SV  FW  S  +  +              HN+  +  T LP          L  N+I D      
Subjt:  ----------------------------------TSVILFWEGSRGEGGM--------------HNVNWA-TTQLPQ--------LLGGNDIDDLG----

Query:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP
                  N H N L++ +W   +P K K F+W L  G INTA+RLQK +P++ LSP+WC MC  S E   HLF HC ++ + W+       W+   P
Subjt:  ---------PNAHVNGLHRVIWTNLYP-KIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIP

Query:  NSIQ
          +Q
Subjt:  NSIQ

A0A803QGT5 Uncharacterized protein | e value: 9.3e-16 | %identity: 23.82
Query:  VSRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIER-------LQN--------------------------
        +S L +N +  K + LG+ +D+L ++   +  G + G WP  YLG+PL G+ +   FW LV+E+       L N                          
Subjt:  VSRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIER-------LQN--------------------------

Query:  --LLISTSV-----ILFWE-------------------------------GSRGEGGMHNVNW--------ATTQLPQLLG-------------GNDIDD
           L+   V     I FWE                                  G  G   V+W           ++P L+G               DI  
Subjt:  --LLISTSV-----ILFWE-------------------------------GSRGEGGMHNVNW--------ATTQLPQLLG-------------GNDIDD

Query:  LGPNA--------------HVNGLHRVIWTN------LYPKIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAM
          P+A                  + R  WT          ++KIF+W ++ G +N  + LQ+  P   +SP WC+ C ++ E  EHLF HC F+S+ W+M
Subjt:  LGPNA--------------HVNGLHRVIWTN------LYPKIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAM

Query:  ILDGFGWSTVIPNSIQDSL
        +L+ FG    +P S+   L
Subjt:  ILDGFGWSTVIPNSIQDSL

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 2.5e-05 | %identity: 30
Query:  LGPNAHVNGLHRVIW-TNLYPKIKIFLWELSLGAINTANRLQKHMPHFQLS-PSWCIMCAASSEHPEHLFFHCTFASRYW
        L P  H+   ++ +W  N  PK     W ++   ++T +RL+     + LS P+ C++C +  E   HLFF C F    W
Subjt:  LGPNAHVNGLHRVIW-TNLYPKIKIFLWELSLGAINTANRLQKHMPHFQLS-PSWCIMCAASSEHPEHLFFHCTFASRYW

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 2.3e-06 | %identity: 32.5
Query:  LGPNAHVNGLHRVIW-TNLYPKIKIFLWELSLGAINTANRLQKHMPHFQLS-PSWCIMCAASSEHPEHLFFHCTFASRYW
        L P +H    H+ +W  N  PK     W ++   ++T +RLQ    ++ LS P+ C++C A  +   HLFF C F+   W
Subjt:  LGPNAHVNGLHRVIW-TNLYPKIKIFLWELSLGAINTANRLQKHMPHFQLS-PSWCIMCAASSEHPEHLFFHCTFASRYW


Sequences
CDS sequence
ATGGTTTCTAGATTGAATATAAATTATGCCAAAGTCAAAAGTGAATTTCTGGGGATCCATTTAGATGATTTGGAATTGAATTGGCTAACATCAACTTTTGGTTAC
AAGCAAGGGCATTGGCCTTCTACATATCTTGGTTTACCTTTGGGAGGCAATTCAAAAGCTATTCACTTTTGGCAACTGGTTATTGAAAGGTTGCAAAATCTCTTG
ATAAGCACTTCCGTGATTTTGTTTTGGGAAGGATCTAGAGGTGAAGGTGGTATGCACAATGTTAATTGGGCTACAACACAGCTACCCCAATTGTTAGGTGGTAAT
GATATCGATGATCTTGGACCAAATGCACATGTGAATGGCCTCCATAGGGTGATTTGGACCAATCTATATCCTAAGATTAAGATTTTTCTATGGGAGCTTAGTCTT
GGAGCAATAAACACAGCTAATCGTCTTCAGAAGCATATGCCTCATTTCCAACTTTCTCCATCTTGGTGCATCATGTGTGCTGCTAGTTCTGAACACCCTGAACAT
CTATTTTTCCATTGCACATTTGCATCTAGATATTGGGCAATGATTCTTGATGGCTTTGGGTGGTCCACAGTAATACCAAATTCTATTCAAGACTCGCTTCCTCTC
ATTTTTATGGGTCATCCTTTTTCATGGAGAAAAGAAGATTTTGTGGCTTGCTTTGAACAGTGTCTTTCTTTGGTATTTATGGGGCGAAAGGAATGA
mRNA sequence
ATGGTTTCTAGATTGAATATAAATTATGCCAAAGTCAAAAGTGAATTTCTGGGGATCCATTTAGATGATTTGGAATTGAATTGGCTAACATCAACTTTTGGTTAC
AAGCAAGGGCATTGGCCTTCTACATATCTTGGTTTACCTTTGGGAGGCAATTCAAAAGCTATTCACTTTTGGCAACTGGTTATTGAAAGGTTGCAAAATCTCTTG
ATAAGCACTTCCGTGATTTTGTTTTGGGAAGGATCTAGAGGTGAAGGTGGTATGCACAATGTTAATTGGGCTACAACACAGCTACCCCAATTGTTAGGTGGTAAT
GATATCGATGATCTTGGACCAAATGCACATGTGAATGGCCTCCATAGGGTGATTTGGACCAATCTATATCCTAAGATTAAGATTTTTCTATGGGAGCTTAGTCTT
GGAGCAATAAACACAGCTAATCGTCTTCAGAAGCATATGCCTCATTTCCAACTTTCTCCATCTTGGTGCATCATGTGTGCTGCTAGTTCTGAACACCCTGAACAT
CTATTTTTCCATTGCACATTTGCATCTAGATATTGGGCAATGATTCTTGATGGCTTTGGGTGGTCCACAGTAATACCAAATTCTATTCAAGACTCGCTTCCTCTC
ATTTTTATGGGTCATCCTTTTTCATGGAGAAAAGAAGATTTTGTGGCTTGCTTTGAACAGTGTCTTTCTTTGGTATTTATGGGGCGAAAGGAATGA
Protein sequence
MVSRLNINYAKVKSEFLGIHLDDLELNWLTSTFGYKQGHWPSTYLGLPLGGNSKAIHFWQLVIERLQNLLISTSVILFWEGSRGEGGMHNVNWATTQLPQLLGGN
DIDDLGPNAHVNGLHRVIWTNLYPKIKIFLWELSLGAINTANRLQKHMPHFQLSPSWCIMCAASSEHPEHLFFHCTFASRYWAMILDGFGWSTVIPNSIQDSLPL
IFMGHPFSWRKEDFVACFEQCLSLVFMGRKE
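
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGTTTCTAGATTGAATATAAATTATGCC"))  # MVSRLNINYA
```

Passing the full CDS (all lines joined) to `translate` should give a string that can be compared directly with the listed protein sequence.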