CuGenDBv2

HG10005413 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10005413
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1068)
Genome location: Chr07:2342385..2344354
RNA-Seq Expression: HG10005413
Synteny: HG10005413
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068
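
The genome location above is written as chromosome:start..end. A minimal Python sketch for splitting such a string into its parts and computing the span might look like the following. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

# Assumed format: <chromosome>:<start>..<end>
# Assumption: coordinates are 1-based and inclusive (not stated on the page).
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("chrom"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of an inclusive coordinate interval."""
    return end - start + 1


chrom, start, end = parse_location("Chr07:2342385..2344354")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1970 bp on Chr07.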


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7011900.1 hypothetical protein SDJN02_26807, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 3.4e-89; identity 92.97%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPDDCVKHDPEVSQDTEKNFADLLLEELKLKEA
        MAIQ SSCNPSLIKFGLALIA+SIAGYILGPPLYWHFKEGLAVV+ SSSSSSCPPCFCDCPSQP+I+IP DC+KHDPEVSQDTEKNFADLLLEELKLKE 
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPDDCVKHDPEVSQDTEKNFADLLLEELKLKEA

Query:  EALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG AKSRTQKQ NIQTA
Subjt:  EALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

XP_004144504.1 uncharacterized protein LOC101202853 isoform X1 [Cucumis sativus]; e-value 1.4e-90; identity 92.23%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQPSSCNPSLIK GLALIAI+I GYILGPPLYWHFKEGLAVV+HSSSSSSCPPCFCDCPS PVISIP+        DCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

XP_008455457.1 PREDICTED: uncharacterized protein LOC103495614 isoform X1 [Cucumis melo]; e-value 4.1e-90; identity 91.71%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQP+SCNPSLIK GLALIAI+I GYILGPPLYWHFKEGLAVVSHSS+SSSCPPCFCDCPSQPVISIP+        DCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTA WEQRARQRGWKEGTAKSRTQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

XP_022952351.1 uncharacterized protein LOC111455059 isoform X1 [Cucurbita moschata]; e-value 5.5e-87; identity 89.12%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQ SSCNPSLIKFGLALIA+SIAGYILGPPLYWHFKEGLAVV+ SSSSSSCPPCFCDCPSQP+I+IP         DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKE EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG AKSRTQKQ NIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

XP_038888598.1 uncharacterized protein LOC120078397 [Benincasa hispida]; e-value 9.1e-90; identity 93.26%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQ SSCNPSLIKFGLALIAISI GYILGPPLYWHFKEGLAVVSH SSSSSCPPCFCDCPSQPVISIP+        DCVKHDPE+SQDTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K245 Uncharacterized protein; e-value 6.8e-91; identity 92.23%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQPSSCNPSLIK GLALIAI+I GYILGPPLYWHFKEGLAVV+HSSSSSSCPPCFCDCPS PVISIP+        DCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

A0A1S3C135 uncharacterized protein LOC103495614 isoform X1; e-value 2.0e-90; identity 91.71%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQP+SCNPSLIK GLALIAI+I GYILGPPLYWHFKEGLAVVSHSS+SSSCPPCFCDCPSQPVISIP+        DCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTA WEQRARQRGWKEGTAKSRTQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

A0A6J1ESC6 uncharacterized protein LOC111437126 isoform X2; e-value 5.0e-86; identity 90.81%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPDDCVKHDPEVSQDTEKNFADLLLEELKLKEA
        MAIQ  SC+PSLIK GLALIA+SIAGYILGPPLYWH KEG AVVS SSSSSSCPPCFCDCPSQPVISIP+DCVKHDPEVSQDTEKNFADLLLEELKLKEA
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPDDCVKHDPEVSQDTEKNFADLLLEELKLKEA

Query:  EALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EA E+ RRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTA WEQRARQRGWKEG AKSR QKQGNIQTA
Subjt:  EALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

A0A6J1GK01 uncharacterized protein LOC111455059 isoform X1; e-value 2.7e-87; identity 89.12%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQ SSCNPSLIKFGLALIA+SIAGYILGPPLYWHFKEGLAVV+ SSSSSSCPPCFCDCPSQP+I+IP         DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKE EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG AKSRTQKQ NIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

A0A6J1HXB7 uncharacterized protein LOC111468321; e-value 7.7e-87; identity 88.6%
Query:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL
        MAIQ SSCNPSLIKFGLALIA+SIAGYIL PPLYWHFKEGLAVV+ SSSSSSCPPCFCDCPSQP+I+IP         DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
        EELKLKE EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG AKSRTQKQ NIQTA
Subjt:  EELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G05070.1 Protein of unknown function (DUF1068); e-value 2.8e-57; identity 61.33%
Query:  IKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE
        +K GLAL+ +S+AGYILGPPLYWH  E LA V    S+SSCP C C+C +   ++IP         DC KHDPEV++DTEKN+A+LL EELKL+EAE+LE
Subjt:  IKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE

Query:  NQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
          +RAD+ LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT+ WE+RARQ+GW+EG+ K   + + N+Q A
Subjt:  NQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

AT2G32580.1 Protein of unknown function (DUF1068); e-value 2.5e-53; identity 59.12%
Query:  IKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIP--------DDCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE
        +K GLAL+A+S+ GYILGPPLYWH  E LAV     S++SC  C CDC S P+++IP         DC K DPEV++DTEKN+A+LL EELK +EA ++E
Subjt:  IKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIP--------DDCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE

Query:  NQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
          +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT+MWEQRARQ+G+K+G  KS  + +   + A
Subjt:  NQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA

AT2G32580.2 Protein of unknown function (DUF1068); e-value 5.2e-35; identity 63.48%
Query:  DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEG
        +C K DPEV++DTEKN+A+LL EELK +EA ++E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT+MWEQRARQ+G+K+G
Subjt:  DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEG

Query:  TAKSRTQKQGNIQTA
          KS  + +   + A
Subjt:  TAKSRTQKQGNIQTA

AT4G04360.1 Protein of unknown function (DUF1068); e-value 5.2e-43; identity 56.71%
Query:  LIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRAD
        ++ + I  YI GP LYWH  E +A     S  SSCPPC CDC SQP++SIPD        DC++H+ E S+++E +F +++ EELKL+EA+A E++ RAD
Subjt:  LIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD--------DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRAD

Query:  VALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKS
          LL+AKK  SQYQKEADKC+ GMETCE AREKAEA L  Q+RL+ MWE RARQ GWKEGT  S
Subjt:  VALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKS

AT4G30996.1 Protein of unknown function (DUF1068); e-value 2.6e-34; identity 47.85%
Query:  LALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD-----------DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALEN
        L + A+  A  + GP LYW F +G   V  + ++S CPPC CDCP  P +S+             DC   DPE+ Q+ EK F DLL EELKL+EA A E+
Subjt:  LALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPD-----------DCVKHDPEVSQDTEKNFADLLLEELKLKEAEALEN

Query:  QRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWK
         R  +V L EAK++ SQYQKEA+KCN+  E CE ARE+AEA+L  ++++T++WE+RARQ GW+
Subjt:  QRRADVALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWK
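
The alignments above follow the usual BLAST text layout: a query line, a midline, and a subject line. In that layout the midline repeats the residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, a percent identity can be estimated from one alignment block by counting the letters in the midline. The function name `percent_identity` is hypothetical, and the result need not match the identity column exactly, since the page does not say how that figure was computed or over which region.

```python
def percent_identity(query, midline):
    """Estimate percent identity from a BLAST-style query line and midline.

    Assumptions (not confirmed by the page): both strings have had the
    'Query:' label and leading padding removed, they cover the same columns,
    and the denominator is the aligned length including gap columns.
    """
    # Pad the midline in case trailing spaces were stripped during copying.
    midline = midline.ljust(len(query))
    if not query:
        raise ValueError("empty alignment")
    # A letter in the midline marks an identical residue; '+' and ' ' do not.
    identical = sum(1 for ch in midline[: len(query)] if ch.isalpha())
    return 100.0 * identical / len(query)


# First eight columns of the first GenBank alignment on this page.
print(percent_identity("MAIQPSSC", "MAIQ SSC"))
```

For a hit whose alignment is wrapped over several blocks, concatenating the query lines and the midlines block by block before calling the function keeps the column positions aligned.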


Sequences
CDS sequence
ATGGCAATTCAACCCAGTTCTTGCAATCCCTCACTCATCAAGTTCGGATTGGCTTTGATCGCCATCTCCATTGCCGGTTACATTCTCGGCCCTCCTCTCTATTGGCACTT
CAAGGAAGGTTTAGCCGTCGTTAGCCACTCTTCTTCTTCTTCCTCATGCCCTCCTTGTTTCTGCGATTGCCCTTCTCAGCCTGTCATCTCTATTCCCGATGATTGTGTTA
AGCATGACCCAGAAGTGAGTCAAGACACAGAGAAGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAGTTAAAGGAAGCCGAAGCTTTGGAAAATCAACGCCGTGCTGAT
GTGGCTCTACTCGAGGCGAAAAAGATGACATCTCAGTATCAAAAAGAAGCAGACAAGTGCAATTCTGGGATGGAAACATGTGAAGAAGCAAGAGAGAAAGCTGAAGCAGT
ATTAACTGCACAAAAGAGACTAACAGCTATGTGGGAGCAAAGGGCTCGCCAAAGGGGATGGAAAGAAGGCACTGCCAAGTCTCGTACTCAAAAACAAGGAAATATTCAAA
CTGCATAA
mRNA sequence
ATGGCAATTCAACCCAGTTCTTGCAATCCCTCACTCATCAAGTTCGGATTGGCTTTGATCGCCATCTCCATTGCCGGTTACATTCTCGGCCCTCCTCTCTATTGGCACTT
CAAGGAAGGTTTAGCCGTCGTTAGCCACTCTTCTTCTTCTTCCTCATGCCCTCCTTGTTTCTGCGATTGCCCTTCTCAGCCTGTCATCTCTATTCCCGATGATTGTGTTA
AGCATGACCCAGAAGTGAGTCAAGACACAGAGAAGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAGTTAAAGGAAGCCGAAGCTTTGGAAAATCAACGCCGTGCTGAT
GTGGCTCTACTCGAGGCGAAAAAGATGACATCTCAGTATCAAAAAGAAGCAGACAAGTGCAATTCTGGGATGGAAACATGTGAAGAAGCAAGAGAGAAAGCTGAAGCAGT
ATTAACTGCACAAAAGAGACTAACAGCTATGTGGGAGCAAAGGGCTCGCCAAAGGGGATGGAAAGAAGGCACTGCCAAGTCTCGTACTCAAAAACAAGGAAATATTCAAA
CTGCATAA
Protein sequence
MAIQPSSCNPSLIKFGLALIAISIAGYILGPPLYWHFKEGLAVVSHSSSSSSCPPCFCDCPSQPVISIPDDCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRAD
VALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGTAKSRTQKQGNIQTA
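
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below uses the standard genetic code, which is an assumption (the page does not name a translation table), and `translate` is a hypothetical helper written for this example rather than a library function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove whitespace so that wrapped sequence lines can be pasted directly.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 15 bases of the CDS shown on this page.
print(translate("ATGGCAATTCAACCCAGTTCTTGC"))
print(translate("ATTCAAACTGCATAA"))
```

Passing the full CDS, with its wrapped lines pasted as one string, and comparing the result with the protein sequence gives a quick consistency check on the record.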