; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020837 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020837
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionD-ribose-binding periplasmic protein
Genome locationChr05:2851233..2851964
RNA-Seq ExpressionHG10020837
SyntenyHG10020837
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584235.1 hypothetical protein SDJN03_20167, partial [Cucurbita argyrosperma subsp. sororia]5.0e-6981.11Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIM +NPGHYVALLISTKICPS++TA   RR    D QTN+TN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRN MDFEGK GN EKGSEGE+N QGMK E+NR       VS++AKSRGWQPSLQSISE GS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

XP_004152463.1 uncharacterized protein LOC101220404 [Cucumis sativus]2.3e-8290Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIM TNPGHYVALLISTK+C SETT++HHRRR +ND+QTNSTN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRNL++FEGK GNSEKGSEGE+N QGMK ERNRVKKCNSTVST+AKSRGWQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

XP_008438114.1 PREDICTED: uncharacterized protein LOC103483316 [Cucumis melo]6.1e-8390.56Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIM TNPGHYVALLISTK+C SETT++HHRRR +N++QTNSTN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRNL+DFEGKQGNSEKGSEGE+N QGMK ERNRVKKCNSTVST+AKSRGWQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

XP_022923864.1 uncharacterized protein LOC111431455 [Cucurbita moschata]2.3e-6981.11Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIM +NPGHYVALLISTKICPS++TA   RR  ++D QTN+TN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRN MDFEGK GN EKGSEGE+N QGMK E+NR       VS++AKSRGWQPSLQSISE GS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

XP_038894996.1 uncharacterized protein LOC120083345 [Benincasa hispida]3.7e-7283.89Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIM TNPGHYVALLISTKIC SETTA HHRRR +N +QTNSTN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLK KQEAKTK        KQGN + GSEGE++ +GMKYERN VKKC+STVS +AKSRGWQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

TrEMBL top hitse value%identityAlignment
A0A0A0LU27 Uncharacterized protein1.1e-8290Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIM TNPGHYVALLISTK+C SETT++HHRRR +ND+QTNSTN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRNL++FEGK GNSEKGSEGE+N QGMK ERNRVKKCNSTVST+AKSRGWQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

A0A1S3AV95 uncharacterized protein LOC1034833163.0e-8390.56Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIM TNPGHYVALLISTK+C SETT++HHRRR +N++QTNSTN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRNL+DFEGKQGNSEKGSEGE+N QGMK ERNRVKKCNSTVST+AKSRGWQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

A0A5D3BH89 Uncharacterized protein3.0e-8390.56Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIM TNPGHYVALLISTK+C SETT++HHRRR +N++QTNSTN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRNL+DFEGKQGNSEKGSEGE+N QGMK ERNRVKKCNSTVST+AKSRGWQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

A0A6J1E7K0 uncharacterized protein LOC1114314551.1e-6981.11Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIM +NPGHYVALLISTKICPS++TA   RR  ++D QTN+TN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRN MDFEGK GN EKGSEGE+N QGMK E+NR       VS++AKSRGWQPSLQSISE GS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

A0A6J1KF34 uncharacterized protein LOC1114952313.2e-6980.56Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQAIDTASLIIQHPNGKVDR YWPVNAGEIM +NPGHYVALLISTKICPS++TA   RR  + D QTN+TN+NSVRLTRIKLLKPTDSLVLGQIYRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        VT+QDVLQGLKAKQEAK KRN +DFEGK GN EKGSEGE+N QGMK E+NR       VS++AKSRGWQPSLQSISE GS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10530.1 unknown protein1.4e-2439.56Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQA++ A L++QHP G +DR Y  V+  E+M   PGHYV+L+I         +    +     +   +     +VR TR++LL+PT++LVLG  YRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEV--NQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        +TSQ+V++ L+ K+ AKTK++ ++   K   ++K S+ +V   +QG ++   R    NST  +  KS+ W+PSLQSISE  S
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEV--NQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

AT1G60010.1 unknown protein6.8e-3243.48Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLI--STKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIY
        MGNCQA+D A+L++QHP+GK+DR Y PV+  EIM   PGHYV+L+I    K  P+ TT +            + +    VR TR+KLL+PT++LVLG  Y
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLI--STKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIY

Query:  RLVTSQDVLQGLKAKQEAKTKRNLMDF--EGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        RL+TSQ+V++ L+AK+ AKTK++  +   E K+ +SEK  + E ++      ++  ++   T S S++S+ W+PSLQSISE  S
Subjt:  RLVTSQDVLQGLKAKQEAKTKRNLMDF--EGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

AT5G50090.1 unknown protein9.9e-3142.47Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQA+DTA ++IQHPNGK ++L  PV+A  +M  NPGH V+LLIST                   S  +S +   +RLTRIKLL+PTD+LVLG +YRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQ------GNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        +T+++V++GL AK+ +K K+     + K        +++  +E ++  +  + ER+R+            SR WQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQ------GNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

AT5G50090.2 unknown protein4.4e-3143.33Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQA+DTA ++IQHPNGK ++L  PV+A  +M  NPGH V+LLIST                   S  +S +   +RLTRIKLL+PTD+LVLG +YRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        +T+++V++GL AK+ +K K+     + K    +  +  +++ +  + ER+R+            SR WQPSLQSISEGGS
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS

AT5G62900.1 unknown protein1.1e-2436.36Show/hide
Query:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL
        MGNCQA + A+ +IQ P+GK  R Y  VNA E++ ++PGH+VALL+S+ +                       +  S+R+TRIKLL+P+D+L+LG +YRL
Subjt:  MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRL

Query:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQG-------NSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS
        ++S++V++G++AK+  K K+   +F   +         SE  S+ +  ++  + +R  +    +T   + K R WQPSLQSISE  S
Subjt:  VTSQDVLQGLKAKQEAKTKRNLMDFEGKQG-------NSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGCCAAGCCATAGACACAGCTTCTCTAATCATCCAACACCCAAACGGAAAAGTCGACCGACTTTACTGGCCGGTAAACGCCGGCGAGATCATGAACACAAA
TCCCGGCCACTACGTCGCCCTTCTCATCTCCACAAAAATCTGCCCATCGGAAACCACCGCCAGCCACCACCGTCGCCGCCTTGAAAACGACAGCCAAACAAACAGTACAA
ATTACAACTCGGTCCGATTGACCCGAATCAAACTCCTGAAGCCGACCGATTCCCTCGTTCTCGGCCAAATTTACCGCCTCGTCACATCCCAAGATGTTTTGCAAGGATTG
AAAGCCAAACAGGAAGCAAAAACGAAGAGAAATTTGATGGATTTTGAAGGGAAACAGGGGAATTCGGAGAAGGGATCTGAAGGAGAGGTTAATCAGCAGGGGATGAAATA
TGAGAGAAACAGAGTGAAAAAATGCAATTCAACAGTGTCGACATCGGCGAAATCGAGAGGGTGGCAGCCTTCATTGCAAAGCATTTCAGAGGGTGGAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAATTGCCAAGCCATAGACACAGCTTCTCTAATCATCCAACACCCAAACGGAAAAGTCGACCGACTTTACTGGCCGGTAAACGCCGGCGAGATCATGAACACAAA
TCCCGGCCACTACGTCGCCCTTCTCATCTCCACAAAAATCTGCCCATCGGAAACCACCGCCAGCCACCACCGTCGCCGCCTTGAAAACGACAGCCAAACAAACAGTACAA
ATTACAACTCGGTCCGATTGACCCGAATCAAACTCCTGAAGCCGACCGATTCCCTCGTTCTCGGCCAAATTTACCGCCTCGTCACATCCCAAGATGTTTTGCAAGGATTG
AAAGCCAAACAGGAAGCAAAAACGAAGAGAAATTTGATGGATTTTGAAGGGAAACAGGGGAATTCGGAGAAGGGATCTGAAGGAGAGGTTAATCAGCAGGGGATGAAATA
TGAGAGAAACAGAGTGAAAAAATGCAATTCAACAGTGTCGACATCGGCGAAATCGAGAGGGTGGCAGCCTTCATTGCAAAGCATTTCAGAGGGTGGAAGTTGA
Protein sequenceShow/hide protein sequence
MGNCQAIDTASLIIQHPNGKVDRLYWPVNAGEIMNTNPGHYVALLISTKICPSETTASHHRRRLENDSQTNSTNYNSVRLTRIKLLKPTDSLVLGQIYRLVTSQDVLQGL
KAKQEAKTKRNLMDFEGKQGNSEKGSEGEVNQQGMKYERNRVKKCNSTVSTSAKSRGWQPSLQSISEGGS