; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004790 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004790
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF3128 domain-containing protein
Genome locationscaffold5:21661229..21671563
RNA-Seq ExpressionSpg004790
SyntenySpg004790
Gene Ontology termsNA
InterPro domainsIPR021475 - Protein of unknown function DUF3128
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652796.1 hypothetical protein Csa_022703 [Cucumis sativus]9.8e-3563.48Show/hide
Query:  IPFELGDMEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWAL
        +PFELGDMEGVE ITPEK++N+ GSR RLSCTTCFD+LWFCYSPVHQMQQYYR+GVFDNCS+KWT L DCL LKTKRASEVQ ++     ++   W++  
Subjt:  IPFELGDMEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWAL

Query:  EDNRAFSTSSLSKHL
         +  +     L  HL
Subjt:  EDNRAFSTSSLSKHL

XP_022964668.1 uncharacterized protein LOC111464679 [Cucurbita moschata]1.9e-3387.18Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI
        MEGVEKITPEK+DNS GSR RLSCTTCFD+LWFCYSPVHQMQQYYRLG FDNCSDKWT+L DCLNLKTKRASEVQ ++
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI

XP_022970353.1 uncharacterized protein LOC111469344 [Cucurbita maxima]1.9e-3387.18Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI
        MEGVEKITPEK+DNS GSR RLSCTTCFD+LWFCYSPVHQMQQYYRLG FDNCSDKWT+L DCLNLKTKRASEVQ ++
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI

XP_023519221.1 uncharacterized protein LOC111782657 [Cucurbita pepo subsp. pepo]4.1e-3378.65Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI-NLSRGDSWSW
        MEGVEKITPEK+DNS GSR +LSCTTCFD+LWFCYSPVHQMQQYYRLG FDNCSDKWT+L DCLNLKTKRASEVQ ++ +  R  S  W
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI-NLSRGDSWSW

XP_038875103.1 uncharacterized protein LOC120067633, partial [Benincasa hispida]4.9e-3465.18Show/hide
Query:  ELGDMEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDN
        +LGDMEGVE ITPEK++NS GSR RLSCTTCFD+LWFCYSPVHQMQQYYRLGVFDNCSDKWT L DCLNLKTKR SEVQ ++     ++   W++   + 
Subjt:  ELGDMEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDN

Query:  RAFSTSSLSKHL
         +     L  HL
Subjt:  RAFSTSSLSKHL

TrEMBL top hitse value%identityAlignment
A0A0A0LSX6 Uncharacterized protein7.1e-3162.04Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDNRAFS
        MEGVE ITPEK++N+ GSR RLSCTTCFD+LWFCYSPVHQMQQYYR+GVFDNCS+KWT L DCL LKTKRASEVQ ++     ++   W++   +  +  
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDNRAFS

Query:  TSSLSKHL
           L  HL
Subjt:  TSSLSKHL

A0A1S3CG12 uncharacterized protein LOC1035004953.5e-3061.11Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDNRAFS
        MEGVE IT EK++NS G + RLSCTTCFD+LWFCYSPVHQMQQYYR+GVFDNCS KWT L DCLNLKTKRASEVQ ++     ++   W++   +  +  
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDNRAFS

Query:  TSSLSKHL
           L  HL
Subjt:  TSSLSKHL

A0A6J1DHI6 uncharacterized protein C227.17c8.4e-3263.89Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDNRAFS
        ME VEK+TPEKED   G RGRLSCTTCFD+LWFCYSPVHQMQQYYRLGVFDNCSDKWT L DCL+LKTKRASEVQ ++     ++   W++   +  +  
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI---NLSRGDSWSWALEDNRAFS

Query:  TSSLSKHL
           L  HL
Subjt:  TSSLSKHL

A0A6J1HJL4 uncharacterized protein LOC1114646799.0e-3487.18Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI
        MEGVEKITPEK+DNS GSR RLSCTTCFD+LWFCYSPVHQMQQYYRLG FDNCSDKWT+L DCLNLKTKRASEVQ ++
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI

A0A6J1I2M7 uncharacterized protein LOC1114693449.0e-3487.18Show/hide
Query:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI
        MEGVEKITPEK+DNS GSR RLSCTTCFD+LWFCYSPVHQMQQYYRLG FDNCSDKWT+L DCLNLKTKRASEVQ ++
Subjt:  MEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G25315.1 Expressed protein4.4e-1746.15Show/hide
Query:  ITPEKEDNSKGSRGRLSCTTCFDSLWFCY-------------------SPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI
        ++ E ED+S   R R+SCT CFD+LWFCY                   +P +QMQQYYR+G  D+C+ K++ LFDCL+LKTKRASE + ++
Subjt:  ITPEKEDNSKGSRGRLSCTTCFDSLWFCY-------------------SPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI

AT4G25315.2 Expressed protein6.6e-2159.72Show/hide
Query:  ITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI
        ++ E ED+S   R R+SCT CFD+LWFCYSP +QMQQYYR+G  D+C+ K++ LFDCL+LKTKRASE + ++
Subjt:  ITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNCSDKWTTLFDCLNLKTKRASEVQPLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGGTCTCAGGTGAATATTGGAGCACCTGGGTACAAAAGGCCAGAGATCTATATTTGTTGTAGGTATTTTGGGAATATTGAGGCGCCTGAGTACAAAGGTACAAA
GGTTAGGGATCGATGTTGCAAGTATATGGGAGATATTGGGGTGCCTGGGTACAAAGGTTACGAGCCAATATTTTTCATATACAGGTATCGGGGTACTTGGGTACAAAGGC
CAAGAGTTGATGGATTAGAATGTAGAGCAAGTGAGGCTCTTTTCAGCTTAAAAGTTGTGTTTGAGTGCTTTGCTAGAGTCAGGATTTTGAGTGCAAAGGTTAGGAATCGA
CTAGTTAACCTATGTTGTGATTGTAGGGCAAGTCCTGTGTTTTATTTGATTACGTGTGAGCTACTTACTAAGTACACATTATGTACCGACTCAGCACCGAGTTTTGCAGG
TGATGGAAACCTATTCGAACTTGGTGATGTAGAGGAGTTCATACCTTTTGAACTGGGAGACATGGAAGGAGTAGAGAAAATAACACCAGAGAAGGAGGATAACTCCAAGG
GATCAAGGGGGCGGTTGTCCTGCACCACCTGCTTTGATTCTCTGTGGTTCTGCTATTCTCCAGTTCATCAGATGCAGCAATATTACAGGCTTGGGGTTTTTGATAACTGT
TCTGACAAATGGACAACTCTTTTTGACTGTTTAAATCTCAAGACAAAAAGAGCTTCCGAGGTTCAGCCATTGATCAACCTATCCAGAGGAGATTCGTGGTCATGGGCTCT
CGAAGATAATCGGGCCTTTTCCACCAGCTCACTCTCGAAGCATCTCTCCACCACAAGCCCCAACGTATTATCAGTTTTATATAAGCATATATGGGCTGGTCATTATCCAA
AGAAGGTGAAATTCTTCCTCTGGGAGGTTAGCCACTCATGCATCAACACTCAAGACAAGCTCCAACGTAGATCTCCATGGCTGGTTATTTCTCCATCTTGCTGCCCTATG
TGCTATGGAGATGCAGAATCCTTGATGCACATTTTCAGCAACTGTCCATATGCTTCTAAGTATTGGAGCTTCCTCCAAGCGGTTTTTGAATGGTCCTTCCCTAGACCGGG
TGATATTCTCTCTCTCTTATCTCTTCTTCTCATGGGCCACCCCTTTAAGAATGAAAAGAAGATCCTTTGGCTTTGTCATGTTTACATTATTGTGACACCGTCTCAACGCT
ATGCCCTCAGGTTCCTTGACTTGGAGCTTGACTTTTTGACTTCTCTGCACTACTTGGATGTGAATTGGACTTTTGATAATGAGGAGTCTCATTCCAATGGAGATGATAAT
TTATCAAGTGAAAGTGGGAATTCTTTACCAACAAGGACGTTCATGTCGCTTCATCAGTTGAAGCACAACCCCAATCAGGTAGAGCACATCCCCTATATTAGCTTCAAGAA
GAAACCTCTAATTAAGGGACCCTTGTTGGTTAAGGATTGGAAAAGTTGTTGGTTCCTTGTCAAGGGGAGTTCGATGGCTTTGGATGTGAACATGTTGAATTATGTGGTTT
CGACGATATTTAAGGATTATGACATGAAGGCTCACATAAATCCCCTCTATGAAAGTCGTCAGAGGCACAATTGGAGGATAGGCTATGAGAGTTGCAACAAGGACTTCAAG
CCTAACGGGTCACATTCCCTCATTAGAGGTGCAGCAATGGCATCAAAGTGTTACAATCCCCCAGCCCAGGATGAGGTTAAACCCCACTTAAGTCAGCCATCCATAGCACT
GAAGTTCCTTGGCGTTGAGGTCGAGCCTCCAAGTTCTTCTGAACTGATGTGTTTGCTTGGTTGCAAGCACTCCATGGGAGACACCGTCTATTGGGCCAAGCCCGTATCAT
ACTCCCCAAATGGATTCCATGTATTTGGTCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGGTCTCAGGTGAATATTGGAGCACCTGGGTACAAAAGGCCAGAGATCTATATTTGTTGTAGGTATTTTGGGAATATTGAGGCGCCTGAGTACAAAGGTACAAA
GGTTAGGGATCGATGTTGCAAGTATATGGGAGATATTGGGGTGCCTGGGTACAAAGGTTACGAGCCAATATTTTTCATATACAGGTATCGGGGTACTTGGGTACAAAGGC
CAAGAGTTGATGGATTAGAATGTAGAGCAAGTGAGGCTCTTTTCAGCTTAAAAGTTGTGTTTGAGTGCTTTGCTAGAGTCAGGATTTTGAGTGCAAAGGTTAGGAATCGA
CTAGTTAACCTATGTTGTGATTGTAGGGCAAGTCCTGTGTTTTATTTGATTACGTGTGAGCTACTTACTAAGTACACATTATGTACCGACTCAGCACCGAGTTTTGCAGG
TGATGGAAACCTATTCGAACTTGGTGATGTAGAGGAGTTCATACCTTTTGAACTGGGAGACATGGAAGGAGTAGAGAAAATAACACCAGAGAAGGAGGATAACTCCAAGG
GATCAAGGGGGCGGTTGTCCTGCACCACCTGCTTTGATTCTCTGTGGTTCTGCTATTCTCCAGTTCATCAGATGCAGCAATATTACAGGCTTGGGGTTTTTGATAACTGT
TCTGACAAATGGACAACTCTTTTTGACTGTTTAAATCTCAAGACAAAAAGAGCTTCCGAGGTTCAGCCATTGATCAACCTATCCAGAGGAGATTCGTGGTCATGGGCTCT
CGAAGATAATCGGGCCTTTTCCACCAGCTCACTCTCGAAGCATCTCTCCACCACAAGCCCCAACGTATTATCAGTTTTATATAAGCATATATGGGCTGGTCATTATCCAA
AGAAGGTGAAATTCTTCCTCTGGGAGGTTAGCCACTCATGCATCAACACTCAAGACAAGCTCCAACGTAGATCTCCATGGCTGGTTATTTCTCCATCTTGCTGCCCTATG
TGCTATGGAGATGCAGAATCCTTGATGCACATTTTCAGCAACTGTCCATATGCTTCTAAGTATTGGAGCTTCCTCCAAGCGGTTTTTGAATGGTCCTTCCCTAGACCGGG
TGATATTCTCTCTCTCTTATCTCTTCTTCTCATGGGCCACCCCTTTAAGAATGAAAAGAAGATCCTTTGGCTTTGTCATGTTTACATTATTGTGACACCGTCTCAACGCT
ATGCCCTCAGGTTCCTTGACTTGGAGCTTGACTTTTTGACTTCTCTGCACTACTTGGATGTGAATTGGACTTTTGATAATGAGGAGTCTCATTCCAATGGAGATGATAAT
TTATCAAGTGAAAGTGGGAATTCTTTACCAACAAGGACGTTCATGTCGCTTCATCAGTTGAAGCACAACCCCAATCAGGTAGAGCACATCCCCTATATTAGCTTCAAGAA
GAAACCTCTAATTAAGGGACCCTTGTTGGTTAAGGATTGGAAAAGTTGTTGGTTCCTTGTCAAGGGGAGTTCGATGGCTTTGGATGTGAACATGTTGAATTATGTGGTTT
CGACGATATTTAAGGATTATGACATGAAGGCTCACATAAATCCCCTCTATGAAAGTCGTCAGAGGCACAATTGGAGGATAGGCTATGAGAGTTGCAACAAGGACTTCAAG
CCTAACGGGTCACATTCCCTCATTAGAGGTGCAGCAATGGCATCAAAGTGTTACAATCCCCCAGCCCAGGATGAGGTTAAACCCCACTTAAGTCAGCCATCCATAGCACT
GAAGTTCCTTGGCGTTGAGGTCGAGCCTCCAAGTTCTTCTGAACTGATGTGTTTGCTTGGTTGCAAGCACTCCATGGGAGACACCGTCTATTGGGCCAAGCCCGTATCAT
ACTCCCCAAATGGATTCCATGTATTTGGTCCATAG
Protein sequenceShow/hide protein sequence
MSGSQVNIGAPGYKRPEIYICCRYFGNIEAPEYKGTKVRDRCCKYMGDIGVPGYKGYEPIFFIYRYRGTWVQRPRVDGLECRASEALFSLKVVFECFARVRILSAKVRNR
LVNLCCDCRASPVFYLITCELLTKYTLCTDSAPSFAGDGNLFELGDVEEFIPFELGDMEGVEKITPEKEDNSKGSRGRLSCTTCFDSLWFCYSPVHQMQQYYRLGVFDNC
SDKWTTLFDCLNLKTKRASEVQPLINLSRGDSWSWALEDNRAFSTSSLSKHLSTTSPNVLSVLYKHIWAGHYPKKVKFFLWEVSHSCINTQDKLQRRSPWLVISPSCCPM
CYGDAESLMHIFSNCPYASKYWSFLQAVFEWSFPRPGDILSLLSLLLMGHPFKNEKKILWLCHVYIIVTPSQRYALRFLDLELDFLTSLHYLDVNWTFDNEESHSNGDDN
LSSESGNSLPTRTFMSLHQLKHNPNQVEHIPYISFKKKPLIKGPLLVKDWKSCWFLVKGSSMALDVNMLNYVVSTIFKDYDMKAHINPLYESRQRHNWRIGYESCNKDFK
PNGSHSLIRGAAMASKCYNPPAQDEVKPHLSQPSIALKFLGVEVEPPSSSELMCLLGCKHSMGDTVYWAKPVSYSPNGFHVFGP