; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036671 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036671
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr2:300594..301397
RNA-Seq ExpressionLag0036671
SyntenyLag0036671
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW25035.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.1e-5641.76Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS KKR +++  +S+ NP +V+LQETK    DR+ + S+W+   + W ++   GASGGIVILW+ S F   E V G FS++++ +  +  +FW+T +YG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P +   R+ FW +L DL  L  P W +GGDFNV R   EK   S  T  MR F+ FI ++ LLD PL N  F+WS+ + +P    +DRFL +    + FS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
         +    L R TSDH PICL     KWGP PF+  N WL H  F      WW+    +GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

RVX07754.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.1e-5642.15Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS KKR  ++  +S+ NP +V+LQETK    DR+++ S+W   ++ W ++   GASGGIVILW+   F   E V G FS++++L+  +  +FW+T +YG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        PN +  R  FW +L DL  L  P W +GGDFNV R   EK   S  T  MR+F+ FI ++ LLD PL N  F+WS+ + +P    +DRFL +    S FS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
              L R TSDH PICL      WGP PF+  N WL H  F      WW+    +GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

RVX15530.1 putative ribonuclease H protein [Vitis vinifera]1.5e-5641.38Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS KKR +++  +S+ NP +V+LQETK    DR+ + S+W    + W+++   GASGGIVILW+ S F   E V G FS++++ +  +  +FW+T +YG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P +   R+ FW +L DL  L  P W +GGDFNV R   EK   +  T  MR F+ FI ++ LLD PL N  F+WS+ + +P    +DRFL +    + FS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
         +    L R TSDH PICL     KWGP PF+  N WL H  F      WW+    +GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]2.9e-6546.74Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS  KRA IK+ I+S  P +VIL ETK   I+ K IKSLWSS +IAW+S+D  GASGGI++LW++      E++ G FS+S+   LAD FT+W+TG+Y 
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P   K R+LFW +L DL  LC P W+LG DFN+ RW+ E SS + P   M KFN FID   L+D  ++NG+++WS+ RP+  ++ I+RFL + G   KFS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
            +RL R  SDH+PI L    ++WG  PF+L N WL    F   +E+ W +  S G+ G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]3.1e-7550.19Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        + SWKK ALIK  IS  NP +VILQETK+  +D  I+KSLWS+  I WS++D  G + GI+ILWN+      E++EG+FSL++   L+DGF FW++GIYG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P++++   LFW +L+DL  LC  +WIL GDFNVTRW+WEKS+    T++M  FN FI+D+ L+D+PL+NG+ +WS    N + +LID FL+T+G I K  
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
            +R+ R TSDHFPI L+ G+  WG  PF+  N WLSH TF   +E+WW N    GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

TrEMBL top hitse value%identityAlignment
A0A438CP96 LINE-1 retrotransposable element ORF2 protein5.4e-5741.76Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS KKR +++  +S+ NP +V+LQETK    DR+ + S+W+   + W ++   GASGGIVILW+ S F   E V G FS++++ +  +  +FW+T +YG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P +   R+ FW +L DL  L  P W +GGDFNV R   EK   S  T  MR F+ FI ++ LLD PL N  F+WS+ + +P    +DRFL +    + FS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
         +    L R TSDH PICL     KWGP PF+  N WL H  F      WW+    +GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

A0A438JFM2 LINE-1 retrotransposable element ORF2 protein5.4e-5742.15Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS KKR  ++  +S+ NP +V+LQETK    DR+++ S+W   ++ W ++   GASGGIVILW+   F   E V G FS++++L+  +  +FW+T +YG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        PN +  R  FW +L DL  L  P W +GGDFNV R   EK   S  T  MR+F+ FI ++ LLD PL N  F+WS+ + +P    +DRFL +    S FS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
              L R TSDH PICL      WGP PF+  N WL H  F      WW+    +GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

A0A438K2W1 Putative ribonuclease H protein7.1e-5741.38Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS KKR +++  +S+ NP +V+LQETK    DR+ + S+W    + W+++   GASGGIVILW+ S F   E V G FS++++ +  +  +FW+T +YG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P +   R+ FW +L DL  L  P W +GGDFNV R   EK   +  T  MR F+ FI ++ LLD PL N  F+WS+ + +P    +DRFL +    + FS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
         +    L R TSDH PICL     KWGP PF+  N WL H  F      WW+    +GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

A0A6J1CVN2 uncharacterized protein LOC1110146571.4e-6546.74Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        +GS  KRA IK+ I+S  P +VIL ETK   I+ K IKSLWSS +IAW+S+D  GASGGI++LW++      E++ G FS+S+   LAD FT+W+TG+Y 
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P   K R+LFW +L DL  LC P W+LG DFN+ RW+ E SS + P   M KFN FID   L+D  ++NG+++WS+ RP+  ++ I+RFL + G   KFS
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
            +RL R  SDH+PI L    ++WG  PF+L N WL    F   +E+ W +  S G+ G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

A0A6J1E2G6 uncharacterized protein LOC1110254051.5e-7550.19Show/hide
Query:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG
        + SWKK ALIK  IS  NP +VILQETK+  +D  I+KSLWS+  I WS++D  G + GI+ILWN+      E++EG+FSL++   L+DGF FW++GIYG
Subjt:  MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYG

Query:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS
        P++++   LFW +L+DL  LC  +WIL GDFNVTRW+WEKS+    T++M  FN FI+D+ L+D+PL+NG+ +WS    N + +LID FL+T+G I K  
Subjt:  PNSSKDRRLFWAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFS

Query:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG
            +R+ R TSDHFPI L+ G+  WG  PF+  N WLSH TF   +E+WW N    GW G
Subjt:  TATTRRLERVTSDHFPICLNLGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTCCTGGAAAAAGAGAGCTCTTATTAAAAATCTTATATCCTCTCACAACCCAGCTTTGGTGATCCTCCAAGAGACTAAGATGGTTGGTATTGACAGAAAGATTAT
CAAATCCCTATGGAGTTCGGGGAATATTGCTTGGTCCTCTATAGATGTTGATGGTGCTTCTGGGGGTATTGTGATCCTTTGGAATGAATCTTTTTTTTATGTCAAGGAGA
TCGTGGAAGGTATGTTCTCTCTATCCCTACAACTATCATTAGCTGATGGCTTCACCTTCTGGATTACAGGAATTTATGGGCCTAATTCCTCCAAAGATAGGCGTTTATTT
TGGGCAAAACTAATGGATCTCCAAGCTTTATGTCTTCCTAATTGGATATTGGGTGGTGATTTCAATGTTACACGATGGACATGGGAGAAATCATCATATTCGGCCCCAAC
TCGAGCCATGAGGAAATTCAATAGATTTATTGATGACAACGAGCTACTTGACATCCCTTTATCCAACGGAAAATTTTCTTGGTCCAGCTTCAGGCCTAATCCCACCATGA
CCCTTATCGACAGGTTCCTCATAACTGATGGCATAATCTCCAAATTTTCCACTGCAACAACCCGTAGATTGGAACGTGTTACTTCTGATCACTTCCCAATCTGTCTAAAT
CTGGGAAAAGAAAAATGGGGACCGGCCCCTTTCAAGCTCAATAATGCATGGCTCTCGCATCATACTTTCCTCACCACGGTTGAATCATGGTGGAAGAACACACTTTCTCA
AGGATGGTCGGGACAGGATTCATTCATAAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTCCTGGAAAAAGAGAGCTCTTATTAAAAATCTTATATCCTCTCACAACCCAGCTTTGGTGATCCTCCAAGAGACTAAGATGGTTGGTATTGACAGAAAGATTAT
CAAATCCCTATGGAGTTCGGGGAATATTGCTTGGTCCTCTATAGATGTTGATGGTGCTTCTGGGGGTATTGTGATCCTTTGGAATGAATCTTTTTTTTATGTCAAGGAGA
TCGTGGAAGGTATGTTCTCTCTATCCCTACAACTATCATTAGCTGATGGCTTCACCTTCTGGATTACAGGAATTTATGGGCCTAATTCCTCCAAAGATAGGCGTTTATTT
TGGGCAAAACTAATGGATCTCCAAGCTTTATGTCTTCCTAATTGGATATTGGGTGGTGATTTCAATGTTACACGATGGACATGGGAGAAATCATCATATTCGGCCCCAAC
TCGAGCCATGAGGAAATTCAATAGATTTATTGATGACAACGAGCTACTTGACATCCCTTTATCCAACGGAAAATTTTCTTGGTCCAGCTTCAGGCCTAATCCCACCATGA
CCCTTATCGACAGGTTCCTCATAACTGATGGCATAATCTCCAAATTTTCCACTGCAACAACCCGTAGATTGGAACGTGTTACTTCTGATCACTTCCCAATCTGTCTAAAT
CTGGGAAAAGAAAAATGGGGACCGGCCCCTTTCAAGCTCAATAATGCATGGCTCTCGCATCATACTTTCCTCACCACGGTTGAATCATGGTGGAAGAACACACTTTCTCA
AGGATGGTCGGGACAGGATTCATTCATAAACTAA
Protein sequenceShow/hide protein sequence
MGSWKKRALIKNLISSHNPALVILQETKMVGIDRKIIKSLWSSGNIAWSSIDVDGASGGIVILWNESFFYVKEIVEGMFSLSLQLSLADGFTFWITGIYGPNSSKDRRLF
WAKLMDLQALCLPNWILGGDFNVTRWTWEKSSYSAPTRAMRKFNRFIDDNELLDIPLSNGKFSWSSFRPNPTMTLIDRFLITDGIISKFSTATTRRLERVTSDHFPICLN
LGKEKWGPAPFKLNNAWLSHHTFLTTVESWWKNTLSQGWSGQDSFIN