CuGenDBv2

CaUC01G013400 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G013400
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: NHL domain protein
Genome location: Ciama_Chr01:26337053..26337553
RNA-Seq Expression: CaUC01G013400
Synteny: CaUC01G013400
Gene Ontology terms: NA
InterPro domains: NA
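The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span can be parsed and its length computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Ciama_Chr01:26337053..26337553")
# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)  # Ciama_Chr01 26337053 26337553 501
```

Under that assumption the span is 501 bp, which is the same length as the 501-nucleotide CDS listed in the Sequences section.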


Homology
GenBank top hits | e value | % identity
XP_011660353.1 uncharacterized protein LOC105436380 [Cucumis sativus] | 1.9e-78 | 87.57
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDD-HWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFK
        MGDRDGAFKELES   IDGADALL SKRYSCFCFPCFGP RS S+ELSWWE+VKTKAKSTKFDS+D HWWTGGIRSL+KLREWSEIVAGPRWKTFIRRF 
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDD-HWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFK

Query:  RNRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PTPVKSSLAAAVNG
        RNRPATVKLGK QYDPISYALNFDEGHNGDVDF+GD+YN GGGFQNFSDRFAA+ P PVKSS +AAVNG
Subjt:  RNRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PTPVKSSLAAAVNG

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata] | 1.1e-75 | 83.03
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDG FKE+ESIGEIDGADALLAS+RY CFCFPCFG  RSAS+ELSWWE+ K KAKSTKFD +DHWW+GGIRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN
        NRPA VKLGK QYDPISYALNFDEGHNGDVDFEGD+Y+ GGFQNFSDRF+AVP   KSS AA  +
Subjt:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima] | 6.7e-76 | 83.03
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDG FKE+ESIGEIDGADALLAS+RYSCFCFPCFG  RSAS+ELSWWE+ K KAKSTKFD +DHWW+GGIRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN
        NRPA VKLGK QYDPISYALNFDEGHNGDV+FEGD+Y+ GGFQNFSDRF+AVP   KSS AA  +
Subjt:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo] | 3.9e-76 | 83.64
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDG FKE+ESIGEIDGADALLAS+RYSCFCFPCFG  RSAS+ELSWWE+ K KAKSTKFD +DHWW+GGIRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN
        NRPA VKLGK QYDPISYALNFDEGHNGDVDFEGD+Y+ GGFQNFSDRF+AVP   KSS AA  +
Subjt:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida] | 4.2e-86 | 91.57
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSAS+E+SWWE+VKTKAKSTKFD +DHWWTGGIRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVNG
        NRPATVKLGK QYDPISYALNFD+GHNGDVDF+GD+Y+GGGFQNFSDRFAAVPTPVKSS A AVNG
Subjt:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVNG

TrEMBL top hits | e value | % identity
A0A0A0LU30 Uncharacterized protein | 9.1e-79 | 87.57
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDD-HWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFK
        MGDRDGAFKELES   IDGADALL SKRYSCFCFPCFGP RS S+ELSWWE+VKTKAKSTKFDS+D HWWTGGIRSL+KLREWSEIVAGPRWKTFIRRF 
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDD-HWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFK

Query:  RNRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PTPVKSSLAAAVNG
        RNRPATVKLGK QYDPISYALNFDEGHNGDVDF+GD+YN GGGFQNFSDRFAA+ P PVKSS +AAVNG
Subjt:  RNRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PTPVKSSLAAAVNG

A0A5A7ULQ3 Uncharacterized protein | 1.6e-75 | 85.29
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDGAFKELES   IDGADALLASKRYSCFCFPCFGP RS S+ELSWWE+VKTKAKSTKFD +DHWWTGGIRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PTPVKSSLAAAVNG
        NRPATVKLGK QYDPISYALNFDEGH NGDVDFEG+ +Y  GGGFQNFSDRFAAV P P+KSS + A+NG
Subjt:  NRPATVKLGKLQYDPISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PTPVKSSLAAAVNG

A0A5D3BLD0 Uncharacterized protein | 1.4e-74 | 84.71
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDGAFKELES   IDGADALLASKRYSCFCFPCFGP RS S+ELSWWE+VKTKAKSTKFD +DHWWTG IRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PTPVKSSLAAAVNG
        NRPATVKLGK QYDPISYALNFDEGH NGDVDFEG+ +Y  GGGFQNFSDRFAAV P P+KSS + A+NG
Subjt:  NRPATVKLGKLQYDPISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PTPVKSSLAAAVNG

A0A6J1ED51 uncharacterized protein LOC111431456 | 5.5e-76 | 83.03
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDG FKE+ESIGEIDGADALLAS+RY CFCFPCFG  RSAS+ELSWWE+ K KAKSTKFD +DHWW+GGIRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN
        NRPA VKLGK QYDPISYALNFDEGHNGDVDFEGD+Y+ GGFQNFSDRF+AVP   KSS AA  +
Subjt:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN

A0A6J1KJ01 uncharacterized protein LOC111495035 | 3.2e-76 | 83.03
Query:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR
        MGDRDG FKE+ESIGEIDGADALLAS+RYSCFCFPCFG  RSAS+ELSWWE+ K KAKSTKFD +DHWW+GGIRSL+KLREWSEIVAGPRWKTFIRRF R
Subjt:  MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR

Query:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN
        NRPA VKLGK QYDPISYALNFDEGHNGDV+FEGD+Y+ GGFQNFSDRF+AVP   KSS AA  +
Subjt:  NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVN

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) | 7.2e-20 | 34.88
Query:  IGEIDGADAL---LASKRYSCFCFPCFGPRRSASEELS-WWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKRNR-------
        I E+D  D +   L +KR  CF  PC    + ++   S WW+++ T     K + D+ WW   IR  +++REWSE+VAGPRWKT+IRRF R+        
Subjt:  IGEIDGADAL---LASKRYSCFCFPCFGPRRSASEELS-WWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKRNR-------

Query:  ---------------PATVKLGKLQYDPISYALNFDEGH-NGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKS
                         +   GK +YD +SY+LNFD+G+  G  D   D++    ++++S RFAA   PV +
Subjt:  ---------------PATVKLGKLQYDPISYALNFDEGH-NGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKS

AT3G48020.1 unknown protein | 3.4e-17 | 42.24
Query:  SASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKRNRPATV---KLGKLQYDPISYALNFDEGHNGDVDFEGDDYN
        S + + SWW+++           +  WW   +R+  K+REWSEIVAGPRWKTFIRRF R+           K +YDP+SY L+F+     D D + DD  
Subjt:  SASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKRNRPATV---KLGKLQYDPISYALNFDEGHNGDVDFEGDDYN

Query:  G-GGFQNFSDRFAAVP
        G GG ++FS R+A+VP
Subjt:  G-GGFQNFSDRFAAVP

AT5G14890.1 NHL domain-containing protein | 3.8e-21 | 39.64
Query:  FKELES--IGEIDGADAL---LASKRYSCFCFPCFGPRRSASEELS-WWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR-
        F  LES  I E+D  D +   + +KR  CF  PC G  + +    S WW++++T     K + D+ WW  G     K+REWSEIVAGP+WKTFIRRF R 
Subjt:  FKELES--IGEIDGADAL---LASKRYSCFCFPCFGPRRSASEELS-WWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKR-

Query:  -----------NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKS
                   NRP  V     +YD  SY+LNFD+G      FE D++    ++++S RFAA   PV +
Subjt:  -----------NRPATVKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKS

AT5G25240.1 unknown protein | 4.1e-07 | 43.86
Query:  GIRSLQKLREWSEIVAGPRWKTFIRRFKRNRPATVKLGKLQYDPISYALNFDEGHNG
        G   L+ L+E SE +AGP+WK FIR F   R    +     YD  +Y+LNFD+G +G
Subjt:  GIRSLQKLREWSEIVAGPRWKTFIRRFKRNRPATVKLGKLQYDPISYALNFDEGHNG

AT5G62865.1 unknown protein | 1.0e-21 | 47.26
Query:  CFCFPCFGPRRSASE-ELSWWEQVKTKAKSTKFDSDDH-----WWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKRNRPATVK----LGKLQYDPISYAL
        C CFP F   RS++    S W +++T   S    S DH     WW   IR+  K+REWSEIVAGPRWKTFIRRF R+ P   +      K QYDP+SY+L
Subjt:  CFCFPCFGPRRSASE-ELSWWEQVKTKAKSTKFDSDDH-----WWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKRNRPATVK----LGKLQYDPISYAL

Query:  NFDEGHNGDVDFEGDDYNG-GGFQNFSDRFAAVPTPVKSSLAAAVN
        NFD+      D E D+Y G GG ++FS RFA+V  PV S  A A++
Subjt:  NFDEGHNGDVDFEGDDYNG-GGFQNFSDRFAAVPTPVKSSLAAAVN
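In the alignments above, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, percent identity can be estimated as identical columns divided by alignment length. The helper `percent_identity` is hypothetical, and the page does not say how its % identity column was computed.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Estimate percent identity from a BLAST-style midline.

    Only letters count as identities; '+' and spaces do not.
    aligned_length is the number of alignment columns, gaps included.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# First ten columns of the Cucurbita moschata alignment shown above.
query = "MGDRDGAFKE"
midline = "MGDRDG FKE"
print(percent_identity(midline, len(query)))  # 90.0
```

When applying this to a full hit, concatenate the midline segments of all blocks and pass the full query line length, since trailing spaces in the midline are easy to lose when copying text.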


Sequences
CDS sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATTAGAATCAATTGGAGAAATCGACGGCGCTGATGCTCTTTTAGCCTCTAAACGTTATAGTTGTTTTTGTTTCCCT
TGCTTTGGACCTCGCCGGTCGGCTTCTGAGGAGCTCTCATGGTGGGAACAGGTGAAGACGAAGGCAAAATCCACGAAGTTCGACAGCGATGATCACTGGTGGACT
GGTGGAATCAGATCCCTCCAAAAGCTTCGTGAATGGTCGGAGATCGTCGCCGGTCCGAGATGGAAAACTTTCATTCGCCGGTTCAAGAGGAACCGTCCGGCCACT
GTGAAGCTAGGGAAATTGCAATATGATCCTATAAGTTACGCTTTGAATTTCGACGAGGGGCACAATGGTGATGTGGATTTCGAAGGGGATGATTACAATGGTGGT
GGGTTTCAGAATTTCTCCGACCGGTTTGCTGCCGTGCCGACTCCTGTGAAGTCATCGTTGGCAGCGGCGGTGAATGGTTAG
mRNA sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATTAGAATCAATTGGAGAAATCGACGGCGCTGATGCTCTTTTAGCCTCTAAACGTTATAGTTGTTTTTGTTTCCCT
TGCTTTGGACCTCGCCGGTCGGCTTCTGAGGAGCTCTCATGGTGGGAACAGGTGAAGACGAAGGCAAAATCCACGAAGTTCGACAGCGATGATCACTGGTGGACT
GGTGGAATCAGATCCCTCCAAAAGCTTCGTGAATGGTCGGAGATCGTCGCCGGTCCGAGATGGAAAACTTTCATTCGCCGGTTCAAGAGGAACCGTCCGGCCACT
GTGAAGCTAGGGAAATTGCAATATGATCCTATAAGTTACGCTTTGAATTTCGACGAGGGGCACAATGGTGATGTGGATTTCGAAGGGGATGATTACAATGGTGGT
GGGTTTCAGAATTTCTCCGACCGGTTTGCTGCCGTGCCGACTCCTGTGAAGTCATCGTTGGCAGCGGCGGTGAATGGTTAG
Protein sequence
MGDRDGAFKELESIGEIDGADALLASKRYSCFCFPCFGPRRSASEELSWWEQVKTKAKSTKFDSDDHWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFKRNRPAT
VKLGKLQYDPISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPTPVKSSLAAAVNG
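
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper written for illustration, and a library such as Biopython would normally be used instead.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGGAGACCGTGACGGAGCTTTCAAAGAA"))  # MGDRDGAFKE
# Last 12 nucleotides, ending in the TAG stop codon.
print(translate("GTGAATGGTTAG"))  # VNG
```

Passing the full CDS (line breaks are stripped by the helper) should give a string that can be compared directly with the protein sequence listed above.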