; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy01g007690 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy01g007690
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr01:14704500..14707846
RNA-Seq ExpressionLcy01g007690
SyntenyLcy01g007690
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048674.1 Ankyrin repeat protein [Cucumis melo var. makuwa]1.2e-2248.97Show/hide
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF D TGVV+ GLNHQFSFR FG    G P V G++  +V  Q   EAP GIAPVGL+T I+K+ KA S+ L+I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID
        KG   +S+ HE +  +G NI  S+GGKVS+KS GSV  S+GG I+
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID

KAA0048677.1 Ankyrin repeat protein [Cucumis melo var. makuwa]6.3e-2147.59Show/hide
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V  Q   EAP GIAPVGLETPI+K +K  S   +I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID--VSK-GGKGKLMGALLRSKK
        KG   VS+ HE    SG NI  S  GKVS+KS  SV  S+GG I+  +SK   KG  +G  +  KK
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID--VSK-GGKGKLMGALLRSKK

KAE8646351.1 hypothetical protein Csa_023818, partial [Cucumis sativus]1.3e-2150.96Show/hide
Query:  ATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKK----GLTTSLGFEIP-KCKGDIV
        ATFNDVTGVV+L LNH FSFR FG    G P   GS+A +V+  +  EAP GIAPVGL+TPI+K +K  +FGLEI K    G+  S   EI  K  G+I 
Subjt:  ATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKK----GLTTSLGFEIP-KCKGDIV

Query:  VSHGHEIS--KKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKGKLMGALLR
        +SHG +IS   K GV++ I NGGK  +KSSG+VL S+G    +SK  KG   G  ++
Subjt:  VSHGHEIS--KKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKGKLMGALLR

KAE8646352.1 hypothetical protein Csa_015951, partial [Cucumis sativus]9.5e-2551.72Show/hide
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V+ Q   EAP GIAPVGLET I+K++K  S+ L+I K +           
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID
         G  VVSH HE +  SG NI  S+GGKVS+KS GSV  S+GG I+
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID

XP_023004881.1 uncharacterized protein LOC111498059 [Cucurbita maxima]2.8e-1641.46Show/hide
Query:  CCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAPGIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIV
        CCA++ ATFNDVTGV+  G NH+FSFR FG H  GI K   +   S                  + +S D+KA SF  EIKKG    +G E+PK  G   
Subjt:  CCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAPGIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIV

Query:  VSHGHEISKKSGVNIDISNGGK---VSMKSSGSVLSNGGNIDVSKGGKGKLMGALLRSKKTLRV
         S GHEI +K G NI  S GGK   V +K SG    NGG     + G   L G+ +R K +  V
Subjt:  VSHGHEISKKSGVNIDISNGGK---VSMKSSGSVLSNGGNIDVSKGGKGKLMGALLRSKKTLRV

TrEMBL top hitse value%identityAlignment
A0A0A0K5J0 Uncharacterized protein6.6e-2451.23Show/hide
Query:  CAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKK----GLTTSLGFEIP-KC
        C KV ATFNDVTGVV+L LNH FSFR FG    G P   GS+A +V+  +  EAP GIAPVGL+TPI+K +K  +FGLEI K    G+  S   EI  K 
Subjt:  CAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKK----GLTTSLGFEIP-KC

Query:  KGDIVVSHGHEIS--KKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKGKLMGALLR
         G+I +SHG +IS   K GV++ I NGGK  +KSSG+VL S+G    +SK  KG   G  ++
Subjt:  KGDIVVSHGHEIS--KKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKGKLMGALLR

A0A0A0K7W7 Uncharacterized protein1.3e-2451.72Show/hide
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V  Q   EAP GIAPVGLET I+K++K  S+ L+I K +           
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID
         G  VVSH HE +  SG NI  S+GGKVS+KS GSV  S+GG I+
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID

A0A0A0KAZ8 Uncharacterized protein2.1e-2252.21Show/hide
Query:  TATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIVVSHG
        +ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V+ Q   EAP GIAPVGLET I+K++K  S+ L+I K +            G  VVSH 
Subjt:  TATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIVVSHG

Query:  HEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID
        HE +  SG NI  S+GGKVS+KS GSV  S+GG I+
Subjt:  HEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID

A0A5A7U057 Ankyrin repeat protein5.6e-2348.97Show/hide
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF D TGVV+ GLNHQFSFR FG    G P V G++  +V  Q   EAP GIAPVGL+T I+K+ KA S+ L+I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID
        KG   +S+ HE +  +G NI  S+GGKVS+KS GSV  S+GG I+
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID

A0A5A7U318 Ankyrin repeat protein3.1e-2147.59Show/hide
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V  Q   EAP GIAPVGLETPI+K +K  S   +I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID--VSK-GGKGKLMGALLRSKK
        KG   VS+ HE    SG NI  S  GKVS+KS  SV  S+GG I+  +SK   KG  +G  +  KK
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNID--VSK-GGKGKLMGALLRSKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.6e-0830.39Show/hide
Query:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHW--KPNSQRHI--LIQSFFTATLWTLWNERNSRIFKGISRSSAQIR
        P  C LC    +S  +LF  C  +  +W FF  +T    P  L      M+  +W   P+ +++I  +I+  F + ++ +W ERN R+  G+SRS+  I 
Subjt:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHW--KPNSQRHI--LIQSFFTATLWTLWNERNSRIFKGISRSSAQIR

Query:  ED
        +D
Subjt:  ED

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1)4.7e-0628.28Show/hide
Query:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQR----HILIQSFFTATLWTLWNERNSRIFKGISRSSAQI
        P+ C LC  + E  ++LF  C +++ +W        F+    L     +M+   W  +  R     ++I+  F A+++ LW ERN R+    SRSS  I
Subjt:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQR----HILIQSFFTATLWTLWNERNSRIFKGISRSSAQI

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.9e-0528.12Show/hide
Query:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQRHILIQSFFTATLWTLWNERNSRIFKGISRSSAQIR
        P+   LC    E+  +LF  C  +  IW FF  A+ F    P  +      I      S    +++    + ++ +W ERN+RIF  IS S++ +R
Subjt:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQRHILIQSFFTATLWTLWNERNSRIFKGISRSSAQIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTAGCCCGAACGGGTGTTTCCTTTGCTACAAAAGTTTGGAAAGTATGGACTATCTCTTTATTCGTTGTGACCTTGCTTATCCCATTTGGGTGTTCTTCCATCAGGC
CACTGGTTTCCTTTTGCCGATTCCTCTCCATGTGGATCAATTTTATATGGAGATCTTTCATTGGAAGCCAAATTCTCAGAGGCATATTCTTATCCAATCATTCTTTACTG
CCACTTTGTGGACTTTGTGGAATGAGCGCAACAGTCGAATCTTCAAAGGGATCTCTCGCAGCTCTGCTCAAATTAGGGAGGATAGCTTTGCTCTTTCTGGATTTTGGCCG
TGTACCTCAAGCCTTTTTTGTAATTATGAAGCCTCTTCTATTTTCCTTATAATTGGGAGGCTTTCATGTAGCTTTTCCTTGTGTTGTGCAAAGGTAACTGCCACTTTCAA
CGATGTCACTGGTGTTGTCGATCTCGGCCTCAATCATCAATTTTCCTTCCGTGTATTTGGATCACATGGTTTTGGGATCCCAAAAGTTGGTGGTAGTGTCGCACCAAGTG
TAGACATGCAGAGTCCTGTAGAAGCACCAGGGATTGCCCCTGTTGGGCTTGAAACTCCAATAAGCAAAGACAATAAAGCTTTGTCTTTTGGTCTTGAAATTAAGAAGGGT
CTCACTACATCCCTTGGTTTTGAAATTCCCAAGTGTAAGGGCGACATTGTTGTTTCTCATGGTCACGAAATAAGTAAGAAGAGTGGCGTCAACATTGATATTTCCAATGG
TGGAAAAGTTAGTATGAAGAGCAGTGGTAGTGTTCTTTCAAATGGTGGCAACATTGATGTTTCCAAAGGTGGAAAAGGGAAGCTTATGGGGGCTTTATTGAGGTCTAAAA
AAACACTACGCGTAAACTGGATCTTATCAAGGCTGTCATCAAGATTAAAGATAATTTATGTGGCTTCATTTTGGCGGCAATCAGAATTACAGATGATCAAGGGTAATGCT
TCACGGTTCGTACGGTGTTGCCGAACAAAGGAAAATGGATGGTTTGCTGAAACCCTAAGATCCATGGAACGTTTACTCGAGAAGCGTCTTTGGAATATGATGAATTTGAC
GTCAAATCTGAATCATTTCTGTTCAGTGGAAATGAGGCATGCAGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTAGCCCGAACGGGTGTTTCCTTTGCTACAAAAGTTTGGAAAGTATGGACTATCTCTTTATTCGTTGTGACCTTGCTTATCCCATTTGGGTGTTCTTCCATCAGGC
CACTGGTTTCCTTTTGCCGATTCCTCTCCATGTGGATCAATTTTATATGGAGATCTTTCATTGGAAGCCAAATTCTCAGAGGCATATTCTTATCCAATCATTCTTTACTG
CCACTTTGTGGACTTTGTGGAATGAGCGCAACAGTCGAATCTTCAAAGGGATCTCTCGCAGCTCTGCTCAAATTAGGGAGGATAGCTTTGCTCTTTCTGGATTTTGGCCG
TGTACCTCAAGCCTTTTTTGTAATTATGAAGCCTCTTCTATTTTCCTTATAATTGGGAGGCTTTCATGTAGCTTTTCCTTGTGTTGTGCAAAGGTAACTGCCACTTTCAA
CGATGTCACTGGTGTTGTCGATCTCGGCCTCAATCATCAATTTTCCTTCCGTGTATTTGGATCACATGGTTTTGGGATCCCAAAAGTTGGTGGTAGTGTCGCACCAAGTG
TAGACATGCAGAGTCCTGTAGAAGCACCAGGGATTGCCCCTGTTGGGCTTGAAACTCCAATAAGCAAAGACAATAAAGCTTTGTCTTTTGGTCTTGAAATTAAGAAGGGT
CTCACTACATCCCTTGGTTTTGAAATTCCCAAGTGTAAGGGCGACATTGTTGTTTCTCATGGTCACGAAATAAGTAAGAAGAGTGGCGTCAACATTGATATTTCCAATGG
TGGAAAAGTTAGTATGAAGAGCAGTGGTAGTGTTCTTTCAAATGGTGGCAACATTGATGTTTCCAAAGGTGGAAAAGGGAAGCTTATGGGGGCTTTATTGAGGTCTAAAA
AAACACTACGCGTAAACTGGATCTTATCAAGGCTGTCATCAAGATTAAAGATAATTTATGTGGCTTCATTTTGGCGGCAATCAGAATTACAGATGATCAAGGGTAATGCT
TCACGGTTCGTACGGTGTTGCCGAACAAAGGAAAATGGATGGTTTGCTGAAACCCTAAGATCCATGGAACGTTTACTCGAGAAGCGTCTTTGGAATATGATGAATTTGAC
GTCAAATCTGAATCATTTCTGTTCAGTGGAAATGAGGCATGCAGAGTGA
Protein sequenceShow/hide protein sequence
MISPNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQRHILIQSFFTATLWTLWNERNSRIFKGISRSSAQIREDSFALSGFWP
CTSSLFCNYEASSIFLIIGRLSCSFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAPGIAPVGLETPISKDNKALSFGLEIKKG
LTTSLGFEIPKCKGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVLSNGGNIDVSKGGKGKLMGALLRSKKTLRVNWILSRLSSRLKIIYVASFWRQSELQMIKGNA
SRFVRCCRTKENGWFAETLRSMERLLEKRLWNMMNLTSNLNHFCSVEMRHAE