; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000155 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000155
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr4:624396..629248
RNA-Seq ExpressionLag0000155
SyntenyLag0000155
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.0e-2334.64Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        P+L K +W+  +PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ S P ++  ++ ++    
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWNIEC
           N KGL+  N      W +W ERN R+F+  + +     ++T+ ++    C
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWNIEC

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.0e-2338.27Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        PSL K +W+ ++PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ S P N V  LA  +   
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK
             KGL+  N +    W +W ERN R+F+       Q KKE  +++W     +   W  K
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK

KAA0044451.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.6e-2238.97Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        P+L K +W+  +PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ + P ++  ++ ++    
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVF--QGLD
           N KGL+  N      W +W ERN R+F  QG D
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVF--QGLD

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.0e-2338.27Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        PSL K +W+ ++PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ S P N V  LA  +   
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK
             KGL+  N +    W +W ERN R+F+       Q KKE  +++W     +   W  K
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK

XP_030505044.1 uncharacterized protein LOC115720016 [Cannabis sativa]2.5e-2234.15Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        P   K +W+     ++K F+W +AH  +N  +VLQ+R P++ + PGWC  C+SS E  +H+F  C F+ + WS + + FG   +LP ++  +L S + G 
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSL----SQSKKETIKEVWNIECNFWDLKF
             + +LW   V A  W +W ERN R+F+ ++ SL     Q+K  T   V+  + +F DL F
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSL----SQSKKETIKEVWNIECNFWDLKF

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein1.4e-2334.64Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        P+L K +W+  +PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ S P ++  ++ ++    
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWNIEC
           N KGL+  N      W +W ERN R+F+  + +     ++T+ ++    C
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWNIEC

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein4.9e-2438.27Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        PSL K +W+ ++PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ S P N V  LA  +   
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK
             KGL+  N +    W +W ERN R+F+       Q KKE  +++W     +   W  K
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein4.9e-2438.27Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        PSL K +W+ ++PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ S P N V  LA  +   
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK
             KGL+  N +    W +W ERN R+F+       Q KKE  +++W     +   W  K
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWN---IECNFWDLK

A0A5A7TSA7 LINE-1 retrotransposable element ORF2 protein2.7e-2238.97Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        P+L K +W+  +PKK KFF+W L H  INT + LQKR+P  +L P WCY C  S E  +H+F  CP++ Q WS+      W+ + P ++  ++ ++    
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVF--QGLD
           N KGL+  N      W +W ERN R+F  QG D
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVF--QGLD

A0A803PZR8 Uncharacterized protein1.2e-2234.15Show/hide
Query:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH
        P   K +W+     ++K F+W +AH  +N  +VLQ+R P++ + PGWC  C+SS E  +H+F  C F+ + WS + + FG   +LP ++  +L S + G 
Subjt:  PSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGH

Query:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSL----SQSKKETIKEVWNIECNFWDLKF
             + +LW   V A  W +W ERN R+F+ ++ SL     Q+K  T   V+  + +F DL F
Subjt:  PFINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSL----SQSKKETIKEVWNIECNFWDLKF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGGAAATATTAGTCTCATTGAGCCATCTTTGGCAAAGATTATTTGGAGAGATAACTACCCAAAGAAAATCAAATTCTTTCTCTGGGAATTAGCCCACAAGGCCAT
CAACACCAAGGAGGTTCTTCAAAAACGCATGCCTTACTTGTCCCTCCCCCCTGGATGGTGTTATTTTTGTAGATCTAGTGATGAATCGCAGGATCACATTTTTGCTTCTT
GCCCCTTTGCTGGACAGTTCTGGAGTAGGATCACTAAATCTTTTGGGTGGTCCATGTCTCTTCCGTGTAATATGGTGGATATCCTTGCCTCCGTTATGTTGGGACATCCT
TTTATCAATATCAAAGGCCTTCTATGGATGAACATCGTTAGAGCTTTCTTTTGGATCTTATGGAAAGAAAGAAATAGAAGAGTTTTCCAAGGATTAGATCTCTCACTATC
ACAAAGCAAAAAGGAGACGATTAAAGAGGTTTGGAACATAGAATGCAACTTTTGGGACCTCAAATTTGGAAGGAATTTGAAGGATGAGGAGGCTGCTGAATGGGCTGACC
TTAGCCTCGATCTTGCCCCTATCGTCCTCTCTCACGAGGAAAATTCTTGGAGGTGGAACATCCACCCTAGCGGACAGTTCTCTACTAAATCCCTTCTCTCAAATATGAGC
GACTCTAGATGTCTCTTAGAGTCATCCTTGGCTAAAGGCATAGTAGCTATGAGCCCTGTCTTAAACTCCTATGGTACATTATTAAGAACAATACAGTTATTGATGGGAGG
GCTGCATGGTTCTCTGGAGAGTAGAATTTCGGCACTGGAAAAGCTGAACCAGAATGATGGTTCCCAACATGTTCAAACAGTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGGAAATATTAGTCTCATTGAGCCATCTTTGGCAAAGATTATTTGGAGAGATAACTACCCAAAGAAAATCAAATTCTTTCTCTGGGAATTAGCCCACAAGGCCAT
CAACACCAAGGAGGTTCTTCAAAAACGCATGCCTTACTTGTCCCTCCCCCCTGGATGGTGTTATTTTTGTAGATCTAGTGATGAATCGCAGGATCACATTTTTGCTTCTT
GCCCCTTTGCTGGACAGTTCTGGAGTAGGATCACTAAATCTTTTGGGTGGTCCATGTCTCTTCCGTGTAATATGGTGGATATCCTTGCCTCCGTTATGTTGGGACATCCT
TTTATCAATATCAAAGGCCTTCTATGGATGAACATCGTTAGAGCTTTCTTTTGGATCTTATGGAAAGAAAGAAATAGAAGAGTTTTCCAAGGATTAGATCTCTCACTATC
ACAAAGCAAAAAGGAGACGATTAAAGAGGTTTGGAACATAGAATGCAACTTTTGGGACCTCAAATTTGGAAGGAATTTGAAGGATGAGGAGGCTGCTGAATGGGCTGACC
TTAGCCTCGATCTTGCCCCTATCGTCCTCTCTCACGAGGAAAATTCTTGGAGGTGGAACATCCACCCTAGCGGACAGTTCTCTACTAAATCCCTTCTCTCAAATATGAGC
GACTCTAGATGTCTCTTAGAGTCATCCTTGGCTAAAGGCATAGTAGCTATGAGCCCTGTCTTAAACTCCTATGGTACATTATTAAGAACAATACAGTTATTGATGGGAGG
GCTGCATGGTTCTCTGGAGAGTAGAATTTCGGCACTGGAAAAGCTGAACCAGAATGATGGTTCCCAACATGTTCAAACAGTGGAGTAG
Protein sequenceShow/hide protein sequence
MMGNISLIEPSLAKIIWRDNYPKKIKFFLWELAHKAINTKEVLQKRMPYLSLPPGWCYFCRSSDESQDHIFASCPFAGQFWSRITKSFGWSMSLPCNMVDILASVMLGHP
FINIKGLLWMNIVRAFFWILWKERNRRVFQGLDLSLSQSKKETIKEVWNIECNFWDLKFGRNLKDEEAAEWADLSLDLAPIVLSHEENSWRWNIHPSGQFSTKSLLSNMS
DSRCLLESSLAKGIVAMSPVLNSYGTLLRTIQLLMGGLHGSLESRISALEKLNQNDGSQHVQTVE