; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020843 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020843
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNHL domain protein
Genome locationChr05:2889946..2890446
RNA-Seq ExpressionHG10020843
SyntenyHG10020843
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011660353.1 uncharacterized protein LOC105436380 [Cucumis sativus]4.5e-8089.94Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGED-HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN
        MGDRDGAFKELES   IDGADALL SKRYSCFCFPCFGP RS SDELSWWERVKTKAKSTKFD ED HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGED-HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYG-GGGFQNFSDRFAAV-PTPVKSSGAAAVKG
        RNRPATVKLGKFQYDPISYALNFDEGHNGDVDF+GDEY  GGGFQNFSDRFAA+ P PVKSS +AAV G
Subjt:  RNRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYG-GGGFQNFSDRFAAV-PTPVKSSGAAAVKG

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata]1.2e-8090.12Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+RY CFCFPCFG  RSASDELSWWER K KAKSTKFDGEDHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA
        NRPA VKLGKFQYDPISYALNFDEGHNGDVDFEGDEY  GGFQNFSDRF+AVP   KSSGAA
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima]6.9e-8190.12Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+RYSCFCFPCFG  RSASDELSWWER K KAKSTKFDGEDHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA
        NRPA VKLGKFQYDPISYALNFDEGHNGDV+FEGDEY  GGFQNFSDRF+AVP   KSSGAA
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo]4.0e-8190.74Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+RYSCFCFPCFG  RSASDELSWWER K KAKSTKFDGEDHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA
        NRPA VKLGKFQYDPISYALNFDEGHNGDVDFEGDEY  GGFQNFSDRF+AVP   KSSGAA
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida]1.6e-9096.39Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDE+SWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAAAVKG
        NRPATVKLGKFQYDPISYALNFD+GHNGDVDF+GDEY GGGFQNFSDRFAAVPTPVKSSGA AV G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAAAVKG

TrEMBL top hitse value%identityAlignment
A0A0A0LU30 Uncharacterized protein2.2e-8089.94Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGED-HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN
        MGDRDGAFKELES   IDGADALL SKRYSCFCFPCFGP RS SDELSWWERVKTKAKSTKFD ED HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGED-HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYG-GGGFQNFSDRFAAV-PTPVKSSGAAAVKG
        RNRPATVKLGKFQYDPISYALNFDEGHNGDVDF+GDEY  GGGFQNFSDRFAA+ P PVKSS +AAV G
Subjt:  RNRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYG-GGGFQNFSDRFAAV-PTPVKSSGAAAVKG

A0A5A7ULQ3 Uncharacterized protein1.1e-7990Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKELES   IDGADALLASKRYSCFCFPCFGP RS SDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGH-NGDVDFEGD-EY-GGGGFQNFSDRFAAV-PTPVKSSGAAAVKG
        NRPATVKLGKFQYDPISYALNFDEGH NGDVDFEG+ EY GGGGFQNFSDRFAAV P P+KSS + A+ G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGH-NGDVDFEGD-EY-GGGGFQNFSDRFAAV-PTPVKSSGAAAVKG

A0A5D3BLD0 Uncharacterized protein9.1e-7989.41Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKELES   IDGADALLASKRYSCFCFPCFGP RS SDELSWWERVKTKAKSTKFDGEDHWWTG IRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGH-NGDVDFEGD-EY-GGGGFQNFSDRFAAV-PTPVKSSGAAAVKG
        NRPATVKLGKFQYDPISYALNFDEGH NGDVDFEG+ EY GGGGFQNFSDRFAAV P P+KSS + A+ G
Subjt:  NRPATVKLGKFQYDPISYALNFDEGH-NGDVDFEGD-EY-GGGGFQNFSDRFAAV-PTPVKSSGAAAVKG

A0A6J1ED51 uncharacterized protein LOC1114314565.7e-8190.12Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+RY CFCFPCFG  RSASDELSWWER K KAKSTKFDGEDHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA
        NRPA VKLGKFQYDPISYALNFDEGHNGDVDFEGDEY  GGFQNFSDRF+AVP   KSSGAA
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA

A0A6J1KJ01 uncharacterized protein LOC1114950353.3e-8190.12Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+RYSCFCFPCFG  RSASDELSWWER K KAKSTKFDGEDHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA
        NRPA VKLGKFQYDPISYALNFDEGHNGDV+FEGDEY  GGFQNFSDRF+AVP   KSSGAA
Subjt:  NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)3.8e-2136.05Show/hide
Query:  IGEIDGADAL---LASKRYSCFCFPCFGPRRSASDELS-WWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR-------
        I E+D  D +   L +KR  CF  PC    + ++   S WW+R+ T     K + ++ WW   IR  +++REWSE+VAGPRWKT+IRRF R+        
Subjt:  IGEIDGADAL---LASKRYSCFCFPCFGPRRSASDELS-WWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR-------

Query:  ---------------PATVKLGKFQYDPISYALNFDEGH-NGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKS
                         +   GKF+YD +SY+LNFD+G+  G  D   DE+    ++++S RFAA   PV +
Subjt:  ---------------PATVKLGKFQYDPISYALNFDEGH-NGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKS

AT3G48020.1 unknown protein9.5e-2043.9Show/hide
Query:  SASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGHNGDVDFEGDEYG
        S + + SWW+R+           E  WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YDP+SY L+F++    D     DE G
Subjt:  SASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDPISYALNFDEGHNGDVDFEGDEYG

Query:  GGGFQNFSDRFAAVPTPVKSSGA
         GG ++FS R+A+VP     S A
Subjt:  GGGFQNFSDRFAAVPTPVKSSGA

AT5G14890.1 NHL domain-containing protein2.7e-2240.83Show/hide
Query:  FKELES--IGEIDGADAL---LASKRYSCFCFPCFGPRRSASDELS-WWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-
        F  LES  I E+D  D +   + +KR  CF  PC G  + +    S WW+R++T     K + ++ WW  G     K+REWSEIVAGP+WKTFIRRF R 
Subjt:  FKELES--IGEIDGADAL---LASKRYSCFCFPCFGPRRSASDELS-WWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR-

Query:  -----------NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKS
                   NRP  V    F+YD  SY+LNFD+G      FE DE+    ++++S RFAA   PV +
Subjt:  -----------NRPATVKLGKFQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKS

AT5G25240.1 unknown protein2.2e-0847.37Show/hide
Query:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGHNG
        G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G +G
Subjt:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDPISYALNFDEGHNG

AT5G62865.1 unknown protein2.4e-2349.65Show/hide
Query:  CFCFPCFGPRRSASD-ELSWWERVKT---KAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF
        C CFP F   RS++    S W R++T      S     E  WW   IR+  K+REWSEIVAGPRWKTFIRRFNR+ P   +      KFQYDP+SY+LNF
Subjt:  CFCFPCFGPRRSASD-ELSWWERVKT---KAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDPISYALNF

Query:  DEGHNGDVDFEGDEYGG-GGFQNFSDRFAAVPTPVKSSGAAAV
        D+      D E DEY G GG ++FS RFA+V  PV S  A A+
Subjt:  DEGHNGDVDFEGDEYGG-GGFQNFSDRFAAVPTPVKSSGAAAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATTAGAATCAATCGGAGAAATCGACGGCGCCGATGCTCTTTTAGCCTCTAAACGCTATAGTTGTTTTTGCTTCCCTTGCTT
CGGACCTCGCCGGTCGGCTTCCGATGAGCTTTCATGGTGGGAGCGGGTGAAGACGAAGGCAAAATCCACGAAGTTCGACGGTGAAGATCACTGGTGGACCGGCGGAATCA
GATCCCTCAAAAAGCTTCGTGAATGGTCGGAGATCGTCGCCGGTCCGAGATGGAAAACTTTCATTCGTCGGTTCAACCGGAACCGTCCCGCCACTGTGAAGCTCGGGAAA
TTCCAATATGATCCTATCAGTTACGCTTTGAATTTCGACGAGGGACATAATGGTGATGTGGATTTCGAAGGGGATGAATACGGTGGTGGTGGGTTTCAGAATTTCTCCGA
CCGGTTTGCTGCCGTGCCGACGCCTGTGAAGTCGTCGGGTGCGGCGGCGGTGAAGGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATTAGAATCAATCGGAGAAATCGACGGCGCCGATGCTCTTTTAGCCTCTAAACGCTATAGTTGTTTTTGCTTCCCTTGCTT
CGGACCTCGCCGGTCGGCTTCCGATGAGCTTTCATGGTGGGAGCGGGTGAAGACGAAGGCAAAATCCACGAAGTTCGACGGTGAAGATCACTGGTGGACCGGCGGAATCA
GATCCCTCAAAAAGCTTCGTGAATGGTCGGAGATCGTCGCCGGTCCGAGATGGAAAACTTTCATTCGTCGGTTCAACCGGAACCGTCCCGCCACTGTGAAGCTCGGGAAA
TTCCAATATGATCCTATCAGTTACGCTTTGAATTTCGACGAGGGACATAATGGTGATGTGGATTTCGAAGGGGATGAATACGGTGGTGGTGGGTTTCAGAATTTCTCCGA
CCGGTTTGCTGCCGTGCCGACGCCTGTGAAGTCGTCGGGTGCGGCGGCGGTGAAGGGTTAG
Protein sequenceShow/hide protein sequence
MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASDELSWWERVKTKAKSTKFDGEDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGK
FQYDPISYALNFDEGHNGDVDFEGDEYGGGGFQNFSDRFAAVPTPVKSSGAAAVKG