CuGenDBv2

Lcy06g015330 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g015330
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Integrase catalytic domain-containing protein
Genome location: Chr06:33358591..33371466
RNA-Seq Expression: Lcy06g015330
Synteny: Lcy06g015330
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain; IPR029472 - Retrotransposon Copia-like, N-terminal
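The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the `Chr:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

# Pattern for "Chr06:33358591..33371466" style strings (layout assumed from the record).
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match["chrom"], int(match["start"]), int(match["end"])

chrom, start, end = parse_location("Chr06:33358591..33371466")
# Span length under the assumption of 1-based, inclusive coordinates.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 12,876 bp, which is much longer than the 693 nt CDS listed further down.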


Homology
GenBank top hits | e value | % identity | Alignment
KAA0032807.1 UBN2_3 domain-containing protein [Cucumis melo var. makuwa] | 1.6e-59 | 52.84
Query:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK
        SSS++++   S          S+SPY L+H DTSNL+LV++L+TD+NYV WS SM+LA+SI NKLGFID  +TKP+G+LLP+WI NNNVVIAWILNS SK
Subjt:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK

Query:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------
         I SSI+ + SA + W+DL+D FQ +NGP IF+LK  L+TL QDQ++VT YF+++K +WDEY SY P C+CG+  C G ++++                 
Subjt:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------

Query:  ------SQILLMDPPPAISKAFSLIAQEE
              SQ+L+MDPPPAI+KAFSLI Q+E
Subjt:  ------SQILLMDPPPAISKAFSLIAQEE

XP_008457013.1 PREDICTED: uncharacterized protein LOC103496792 [Cucumis melo] | 1.1e-60 | 53.28
Query:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK
        SSS++++   S          S+SPY L+H DTSNL+LV++L+TD+NYV WS SM+LA+SI NKLGFID  +TKP+G+LLP+WI NNNVVIAWILNS SK
Subjt:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK

Query:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------
         I SSI+F+ SA + W+DL+D FQ +NGP IF+LK  L+TL QDQ++VT YF+++K +WDEY SY P C+CG+  C G ++++                 
Subjt:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------

Query:  ------SQILLMDPPPAISKAFSLIAQEE
              SQ+L+MDPPPAI+KAFSLI Q+E
Subjt:  ------SQILLMDPPPAISKAFSLIAQEE

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia] | 5.1e-58 | 49.57
Query:  SSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDR
        ++PYFLHH D ++LVLVSDLLTD+NY +WS S+++AL++ NK+GF+DG++++P+   L  WI  NNVVI+WI NS+SK IS+S++FSDSAH IWLDLK+R
Subjt:  SSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDR

Query:  FQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAF
        FQ QN P IF+L+R L+ LTQDQ +VT YF+ LK +W E   YRP CSCGR + GG ++I+                      ++Q+LLM+P P I++AF
Subjt:  FQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAF

Query:  SLIAQEEHQRSLRILPLLSATALVVAECLRSTTA
        +L+AQE  QRS+ +  + S TA  V     S+ +
Subjt:  SLIAQEEHQRSLRILPLLSATALVVAECLRSTTA

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia] | 7.4e-65 | 59.55
Query:  LHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDRFQSQN
        +HH DTSNLVLVS  LT+ NYV+WS SM +ALSI NKLGFI+G+L KP+GDLLP+WI N +VVIAW LNSVSK IS+S+IF++S H IWLDLKDRFQ QN
Subjt:  LHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDRFQSQN

Query:  GPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAFSLIAQ
        GP IF+L+R LATLTQDQ +VT Y++ LK +WDEYVSYRP C+CG  +CGG   ++                      ++QILLMDPPP+I KAFSLI+Q
Subjt:  GPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAFSLIAQ

Query:  EEHQRSLRILPLLSATALVV
        EE Q   R++PL S  +  V
Subjt:  EEHQRSLRILPLLSATALVV

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida] | 2.6e-62 | 55.51
Query:  VDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDL
        +D  +PY LHH DTSNLVLVS+LLTDDNYV+WS SM+L L I NKLGFIDG+L +P+GDLL +WI NNNVV++WIL SVSK+ISSSI+F++SA +IWLDL
Subjt:  VDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDL

Query:  KDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAIS
        +D FQ +NGP IF LKR L++L QDQ +VT YF+ +K   DEYVSYRP C+CG+  CGG ++++                      +SQ+LLMDPPP ++
Subjt:  KDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAIS

Query:  KAFSLIAQEEHQRSLRILPLLSATALV
        KAFS + Q+E  +SL   P    T  V
Subjt:  KAFSLIAQEEHQRSLRILPLLSATALV

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C5T4 uncharacterized protein LOC103496792 | 5.3e-61 | 53.28
Query:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK
        SSS++++   S          S+SPY L+H DTSNL+LV++L+TD+NYV WS SM+LA+SI NKLGFID  +TKP+G+LLP+WI NNNVVIAWILNS SK
Subjt:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK

Query:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------
         I SSI+F+ SA + W+DL+D FQ +NGP IF+LK  L+TL QDQ++VT YF+++K +WDEY SY P C+CG+  C G ++++                 
Subjt:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------

Query:  ------SQILLMDPPPAISKAFSLIAQEE
              SQ+L+MDPPPAI+KAFSLI Q+E
Subjt:  ------SQILLMDPPPAISKAFSLIAQEE

A0A5A7SU21 UBN2_3 domain-containing protein | 7.7e-60 | 52.84
Query:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK
        SSS++++   S          S+SPY L+H DTSNL+LV++L+TD+NYV WS SM+LA+SI NKLGFID  +TKP+G+LLP+WI NNNVVIAWILNS SK
Subjt:  SSSSSSVQTVSA------LVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSK

Query:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------
         I SSI+ + SA + W+DL+D FQ +NGP IF+LK  L+TL QDQ++VT YF+++K +WDEY SY P C+CG+  C G ++++                 
Subjt:  AISSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQ----------------

Query:  ------SQILLMDPPPAISKAFSLIAQEE
              SQ+L+MDPPPAI+KAFSLI Q+E
Subjt:  ------SQILLMDPPPAISKAFSLIAQEE

A0A6J1DLQ9 uncharacterized protein LOC111022117 | 3.6e-65 | 59.55
Query:  LHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDRFQSQN
        +HH DTSNLVLVS  LT+ NYV+WS SM +ALSI NKLGFI+G+L KP+GDLLP+WI N +VVIAW LNSVSK IS+S+IF++S H IWLDLKDRFQ QN
Subjt:  LHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDRFQSQN

Query:  GPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAFSLIAQ
        GP IF+L+R LATLTQDQ +VT Y++ LK +WDEYVSYRP C+CG  +CGG   ++                      ++QILLMDPPP+I KAFSLI+Q
Subjt:  GPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAFSLIAQ

Query:  EEHQRSLRILPLLSATALVV
        EE Q   R++PL S  +  V
Subjt:  EEHQRSLRILPLLSATALVV

A0A6J1DNP7 uncharacterized protein LOC111022065 | 2.5e-58 | 49.57
Query:  SSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDR
        ++PYFLHH D ++LVLVSDLLTD+NY +WS S+++AL++ NK+GF+DG++++P+   L  WI  NNVVI+WI NS+SK IS+S++FSDSAH IWLDLK+R
Subjt:  SSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFSDSAHSIWLDLKDR

Query:  FQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAF
        FQ QN P IF+L+R L+ LTQDQ +VT YF+ LK +W E   YRP CSCGR + GG ++I+                      ++Q+LLM+P P I++AF
Subjt:  FQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ----------------------QSQILLMDPPPAISKAF

Query:  SLIAQEEHQRSLRILPLLSATALVVAECLRSTTA
        +L+AQE  QRS+ +  + S TA  V     S+ +
Subjt:  SLIAQEEHQRSLRILPLLSATALVVAECLRSTTA

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | 4.2e-58 | 54.27
Query:  SSSSSSSVQTVSALVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSG---DLLPIWICNNNVVIAWILNSVSKAI
        S +S SS+  + A  D SSPYFLHH D   LVLVS  LT DNY +W+ +M++ALS+ NKLGFIDG++TKP G   +LL  WI NNNVVI+WILNSVSK I
Subjt:  SSSSSSSVQTVSALVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSG---DLLPIWICNNNVVIAWILNSVSKAI

Query:  SSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ-------------------
        S+SIIFS SA+ IW+DLKDRFQ  NGP IF+L+R L    QDQ  V+ YF+ LK IW+E  +YRP CSCG   CGG + +                    
Subjt:  SSSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQ-------------------

Query:  ---QSQILLMDPPPAISKAFSLIAQEEHQRSLRI
           + Q+LLMDP P I+K FSLI+QEEHQR + I
Subjt:  ---QSQILLMDPPPAISKAFSLIAQEEHQRSLRI

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 9.4e-26 | 31.17
Query:  SSSVQTVSALVDSSSPYFL----HHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPS--GDLLPIWICNNNVVIAWILNSVSKAIS
        + ++++VS   D  SPY+L    HH    ++  +S    +DNYV W +     L +  K GFIDGTL KP     L   W   N +V+ W++NS++  + 
Subjt:  SSSVQTVSALVDSSSPYFL----HHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPS--GDLLPIWICNNNVVIAWILNSVSKAIS

Query:  SSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYR--PRCSCGRSNC-------------------------
         S++++++AH +W DL+  F       I++L+RRLATL Q   +V  YF  L  +W E   Y   P C CG  NC                         
Subjt:  SSIIFSDSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYR--PRCSCGRSNC-------------------------

Query:  GGNEAIQQSQILLMDPPPAISKAFSLIAQEE
         G EA+  ++I+   PPP++ +AF+++   E
Subjt:  GGNEAIQQSQILLMDPPPAISKAFSLIAQEE


Sequences
CDS sequence
ATGTCTTCTTCGTCTTCTTCCTCCGTTCAAACCGTTTCTGCACTTGTTGATTCTTCTAGCCCCTACTTTCTTCACCATTTCGATACCTCTAATCTTGTTCTGGTA
TCTGATCTACTCACTGATGACAATTACGTCACCTGGAGCCTGTCCATGCTACTTGCTCTTTCAATATGGAACAAGCTAGGTTTCATCGATGGCACTTTAACCAAA
CCTTCTGGCGACCTTCTCCCAATCTGGATTTGTAACAACAATGTTGTAATCGCTTGGATTCTGAACTCAGTATCCAAAGCCATCTCCTCTAGCATTATCTTCTCT
GATTCGGCTCACTCTATCTGGCTCGATCTCAAAGATCGATTCCAAAGTCAGAATGGACCACACATCTTTGAGTTGAAACGCAGACTTGCTACTTTGACTCAAGAT
CAACAAACTGTCACCAATTACTTCTCCCATTTGAAGGGAATCTGGGATGAATACGTTTCTTATCGCCCAAGATGTTCGTGTGGACGCAGCAATTGTGGAGGAAAT
GAGGCCATTCAACAGTCTCAAATTCTTCTCATGGATCCTCCACCTGCGATTTCGAAGGCCTTTTCTTTAATCGCTCAAGAAGAACATCAGCGGTCTCTCCGTATC
CTCCCTCTGTTGTCCGCGACTGCTCTCGTTGTTGCCGAATGCCTCCGTTCCACAACAGCATAA
mRNA sequence
ATGTCTTCTTCGTCTTCTTCCTCCGTTCAAACCGTTTCTGCACTTGTTGATTCTTCTAGCCCCTACTTTCTTCACCATTTCGATACCTCTAATCTTGTTCTGGTA
TCTGATCTACTCACTGATGACAATTACGTCACCTGGAGCCTGTCCATGCTACTTGCTCTTTCAATATGGAACAAGCTAGGTTTCATCGATGGCACTTTAACCAAA
CCTTCTGGCGACCTTCTCCCAATCTGGATTTGTAACAACAATGTTGTAATCGCTTGGATTCTGAACTCAGTATCCAAAGCCATCTCCTCTAGCATTATCTTCTCT
GATTCGGCTCACTCTATCTGGCTCGATCTCAAAGATCGATTCCAAAGTCAGAATGGACCACACATCTTTGAGTTGAAACGCAGACTTGCTACTTTGACTCAAGAT
CAACAAACTGTCACCAATTACTTCTCCCATTTGAAGGGAATCTGGGATGAATACGTTTCTTATCGCCCAAGATGTTCGTGTGGACGCAGCAATTGTGGAGGAAAT
GAGGCCATTCAACAGTCTCAAATTCTTCTCATGGATCCTCCACCTGCGATTTCGAAGGCCTTTTCTTTAATCGCTCAAGAAGAACATCAGCGGTCTCTCCGTATC
CTCCCTCTGTTGTCCGCGACTGCTCTCGTTGTTGCCGAATGCCTCCGTTCCACAACAGCATAA
Protein sequence
MSSSSSSSVQTVSALVDSSSPYFLHHFDTSNLVLVSDLLTDDNYVTWSLSMLLALSIWNKLGFIDGTLTKPSGDLLPIWICNNNVVIAWILNSVSKAISSSIIFS
DSAHSIWLDLKDRFQSQNGPHIFELKRRLATLTQDQQTVTNYFSHLKGIWDEYVSYRPRCSCGRSNCGGNEAIQQSQILLMDPPPAISKAFSLIAQEEHQRSLRI
LPLLSATALVVAECLRSTTA
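
The protein sequence above can be cross-checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch, assuming the standard code (NCBI translation table 1) applies to this nuclear gene; `translate`, `CDS_START` and `CDS_END` are names introduced here for illustration, and only the first CDS line and the last 15 nt are embedded to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the CDS shown above (105 nt, 35 codons).
CDS_START = (
    "ATGTCTTCTTCGTCTTCTTCCTCCGTTCAAACCGTTTCTGCACTTGTTGATTCTTCTAGC"
    "CCCTACTTTCTTCACCATTTCGATACCTCTAATCTTGTTCTGGTA"
)
# Last 15 nt of the CDS, including the TAA stop codon.
CDS_END = "TCCACAACAGCATAA"

print(translate(CDS_START))  # MSSSSSSSVQTVSALVDSSSPYFLHHFDTSNLVLV
print(translate(CDS_END))    # STTA
```

Both outputs agree with the start and end of the listed protein. Passing the full 693 nt CDS to `translate` would be the way to compare against all 230 residues.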