; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035217 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035217
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationscaffold4:24307289..24307873
RNA-Seq ExpressionSpg035217
SyntenySpg035217
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142327.1 uncharacterized protein LOC111012468 [Momordica charantia]1.4e-1951.2Show/hide
Query:  SSSSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSK---MDNELK-------HSWIICNEFVTASILNSLSKEIST
        SSSS+T    + SI+E Y NPY+L       LVLVSD L E+NYTSWS++M+I+LTVK+K   +D  +        +SW ICN  V A +LNSLSKEIS 
Subjt:  SSSSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSK---MDNELK-------HSWIICNEFVTASILNSLSKEIST

Query:  SVNFAKTAREIWLDLQQRSSAEELP
        SV F+ +AR+IWLDLQ+R   +  P
Subjt:  SVNFAKTAREIWLDLQQRSSAEELP

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]2.6e-2152.03Show/hide
Query:  SSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEISTSV
        ++ST+   +   IEQY NPYFL       LVLVSDPLT  NYTSWS++M+I+LTVK+K+            +L HSWIICN  V + ILNSLSKEIS S+
Subjt:  SSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEISTSV

Query:  NFAKTAREIWLDLQQRSSAEELP
         F+ +AREIWLDL++R   +  P
Subjt:  NFAKTAREIWLDLQQRSSAEELP

XP_038874906.1 uncharacterized protein LOC120067409 [Benincasa hispida]7.5e-2150.78Show/hide
Query:  NPSSSSSTT-QPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKMD----------NELKHSWIICNEFVTASILNSLSKE
        +PSS+  TT  P  QS ++QY   YFL       LVLVSD LT+SNY+SWSQ+M +S TVK+KM            +L++SWIICN  VT  I N+LSK+
Subjt:  NPSSSSSTT-QPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKMD----------NELKHSWIICNEFVTASILNSLSKE

Query:  ISTSVNFAKTAREIWLDLQQRSSAEELP
        I+ SVNF+ + REIWLDLQQR  +++ P
Subjt:  ISTSVNFAKTAREIWLDLQQRSSAEELP

XP_038875043.1 uncharacterized protein LOC120067569 [Benincasa hispida]2.0e-2152Show/hide
Query:  SSSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEIST
        S+S    + +  SI+EQY+NPYFL       LV +S+ LTESNY SWSQAM I LTVK+K+            EL  SWIICN  VTA ILNSLSKEIST
Subjt:  SSSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEIST

Query:  SVNFAKTAREIWLDLQQRSSAEELP
        S+NF+ + +EIW+D Q+R   +  P
Subjt:  SVNFAKTAREIWLDLQQRSSAEELP

XP_038887186.1 uncharacterized protein LOC120077373 [Benincasa hispida]2.4e-1951.2Show/hide
Query:  SSSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEIST
        S+S    + +  S +EQY+NPYFL       LVLVS+ LTESNY SWSQAM I LTVK+K+            EL  SWII N  VT  ILNSLSKEI  
Subjt:  SSSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEIST

Query:  SVNFAKTAREIWLDLQQRSSAEELP
        S+NF+ +A+EIW DLQ+R   +  P
Subjt:  SVNFAKTAREIWLDLQQRSSAEELP

TrEMBL top hitse value%identityAlignment
A0A5J5BKC2 Uncharacterized protein1.1e-1444.09Show/hide
Query:  SSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKM-------------DNELKHSWIICNEFVTASILNSLSKEI
        S+S     +++S IE+  NPY+L      R +LVS  LT  NYT+WS+AM+I+L+VK+K+              N L +SWI  N  V + ILNS+SKEI
Subjt:  SSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKM-------------DNELKHSWIICNEFVTASILNSLSKEI

Query:  STSVNFAKTAREIWLDLQQRSSAEELP
        S S+ FA +AREIWLDL+ R      P
Subjt:  STSVNFAKTAREIWLDLQQRSSAEELP

A0A6J1CMF8 uncharacterized protein LOC1110124686.9e-2051.2Show/hide
Query:  SSSSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSK---MDNELK-------HSWIICNEFVTASILNSLSKEIST
        SSSS+T    + SI+E Y NPY+L       LVLVSD L E+NYTSWS++M+I+LTVK+K   +D  +        +SW ICN  V A +LNSLSKEIS 
Subjt:  SSSSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSK---MDNELK-------HSWIICNEFVTASILNSLSKEIST

Query:  SVNFAKTAREIWLDLQQRSSAEELP
        SV F+ +AR+IWLDLQ+R   +  P
Subjt:  SVNFAKTAREIWLDLQQRSSAEELP

A0A6J1DIP8 uncharacterized protein LOC1110203991.3e-2152.03Show/hide
Query:  SSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEISTSV
        ++ST+   +   IEQY NPYFL       LVLVSDPLT  NYTSWS++M+I+LTVK+K+            +L HSWIICN  V + ILNSLSKEIS S+
Subjt:  SSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKM----------DNELKHSWIICNEFVTASILNSLSKEISTSV

Query:  NFAKTAREIWLDLQQRSSAEELP
         F+ +AREIWLDL++R   +  P
Subjt:  NFAKTAREIWLDLQQRSSAEELP

A0A6J1DKR8 uncharacterized protein LOC1110218316.6e-1541.3Show/hide
Query:  MDTNPPVNP-----------SSSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKMD----------NELKHSWI
        M T PP +P             SSS    +S S ++   NPY+L       LVLV+ PLTE NY+SWS++M+I+L++K+K+            EL  +WI
Subjt:  MDTNPPVNP-----------SSSSSTTQPTSQSIIEQYENPYFLQ-----RLVLVSDPLTESNYTSWSQAMVISLTVKSKMD----------NELKHSWI

Query:  ICNEFVTASILNSLSKEISTSVNFAKTAREIWLDLQQR
          N  V A ILNS+SKEIS+S+ F+++AR+IW+DL++R
Subjt:  ICNEFVTASILNSLSKEISTSVNFAKTAREIWLDLQQR

A0A6J1DNP7 uncharacterized protein LOC1110220654.9e-1845.93Show/hide
Query:  MDTNPPVNPSSSSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKM-----------DNELKHSWIICNEFVTASI
        M+ +  +NP++      P +  ++EQ+ NPYFL       LVLVSD LT+ NYTSWS+++VI+LTVK+K+           D  L HSWIICN  V + I
Subjt:  MDTNPPVNPSSSSSTTQPTSQSIIEQYENPYFLQR-----LVLVSDPLTESNYTSWSQAMVISLTVKSKM-----------DNELKHSWIICNEFVTASI

Query:  LNSLSKEISTSVNFAKTAREIWLDLQQRSSAEELP
         NSLSK+IS SV F+ +A EIWLDL++R   +  P
Subjt:  LNSLSKEISTSVNFAKTAREIWLDLQQRSSAEELP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACACTAATCCACCAGTAAACCCTAGTTCTTCTTCATCCACGACTCAACCTACCAGTCAATCGATTATTGAACAGTATGAGAATCCTTATTTTCTTCAACGTCTTGT
TCTCGTATCTGATCCCTTAACCGAATCTAATTACACTTCTTGGAGTCAAGCAATGGTAATCAGTCTCACTGTCAAGAGCAAGATGGATAATGAATTGAAGCATTCTTGGA
TCATCTGCAATGAATTCGTGACTGCCTCGATCCTTAATTCTCTTTCGAAAGAAATTTCCACAAGTGTGAATTTTGCTAAGACTGCTAGAGAAATATGGCTCGACCTTCAG
CAGCGCTCATCAGCGGAAGAATTGCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACACTAATCCACCAGTAAACCCTAGTTCTTCTTCATCCACGACTCAACCTACCAGTCAATCGATTATTGAACAGTATGAGAATCCTTATTTTCTTCAACGTCTTGT
TCTCGTATCTGATCCCTTAACCGAATCTAATTACACTTCTTGGAGTCAAGCAATGGTAATCAGTCTCACTGTCAAGAGCAAGATGGATAATGAATTGAAGCATTCTTGGA
TCATCTGCAATGAATTCGTGACTGCCTCGATCCTTAATTCTCTTTCGAAAGAAATTTCCACAAGTGTGAATTTTGCTAAGACTGCTAGAGAAATATGGCTCGACCTTCAG
CAGCGCTCATCAGCGGAAGAATTGCCCTAA
Protein sequenceShow/hide protein sequence
MDTNPPVNPSSSSSTTQPTSQSIIEQYENPYFLQRLVLVSDPLTESNYTSWSQAMVISLTVKSKMDNELKHSWIICNEFVTASILNSLSKEISTSVNFAKTAREIWLDLQ
QRSSAEELP