CuGenDBv2

HG10008659 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10008659
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: BEST Arabidopsis thaliana protein match is: NHL domain-containing protein.
Genome location: Chr10:24963298..24963795
RNA-Seq Expression: HG10008659
Synteny: HG10008659
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0057881.1 putative Serine/arginine repetitive matrix protein 2 [Cucumis melo var. makuwa] (e-value: 3.6e-82; identity: 88%)
Query:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV   +S  CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
              GGGG+G+RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDFNG+RNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA

KAG6589457.1 hypothetical protein SDJN03_14880, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 9.6e-75; identity: 82.63%)
Query:  MAFTDVADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MA+ D  +  PCEA  SNSLLPT RCGCFRLSC  SRR   VGPSWWERIRASQVHSEGRWWARGVR +LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAFTDVADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  AGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGT--KDVSAVA
        A  R GKF YDPLSYAMNFDEGSRQIGELDDD DDFNG+RNFSARYASIPAPLKTGGT  KD+S VA
Subjt:  AGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGT--KDVSAVA

XP_008453108.1 PREDICTED: uncharacterized protein LOC103493921 [Cucumis melo] (e-value: 2.1e-82; identity: 89.02%)
Query:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV   +S  CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ----GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
            GGGG+G+RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDFNG+RNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ----GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA

XP_011657987.1 uncharacterized protein LOC105435925 [Cucumis sativus] (e-value: 3.6e-82; identity: 87.43%)
Query:  MAFTDVADSLP-CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS---
        MAFTDV +  P C+ADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVH+EGRWW RGVR LLKLREWSEIVAGPRWKTFIRRFNRNRS   
Subjt:  MAFTDVADSLP-CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS---

Query:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
              GGGG+G+RAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNG+RNFSARYASIPAPLKTGGTK+VSAVA
Subjt:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA

XP_038878924.1 uncharacterized protein LOC120071012 [Benincasa hispida] (e-value: 2.1e-85; identity: 92.81%)
Query:  MAFTDVADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAFTDV +  PCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWER+RASQVHSEGRWWARGVR LLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
Subjt:  MAFTDVADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  AGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGT--KDVSAVA
         G+RAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNG+RNFSARYASIPAPLKTGGT  KDVS+VA
Subjt:  AGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGT--KDVSAVA

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LUB2 Uncharacterized protein (e-value: 1.8e-82; identity: 87.43%)
Query:  MAFTDVADSLP-CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS---
        MAFTDV +  P C+ADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVH+EGRWW RGVR LLKLREWSEIVAGPRWKTFIRRFNRNRS   
Subjt:  MAFTDVADSLP-CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS---

Query:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
              GGGG+G+RAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNG+RNFSARYASIPAPLKTGGTK+VSAVA
Subjt:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA

A0A1S3BVG0 uncharacterized protein LOC103493921 (e-value: 1.0e-82; identity: 89.02%)
Query:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV   +S  CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ----GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
            GGGG+G+RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDFNG+RNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ----GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA

A0A5A7UWM1 Putative Serine/arginine repetitive matrix protein 2 (e-value: 1.8e-82; identity: 88%)
Query:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV   +S  CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
              GGGG+G+RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDFNG+RNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ------GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA

A0A5D3BK38 Putative Serine/arginine repetitive matrix protein 2 (e-value: 1.0e-82; identity: 89.02%)
Query:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV   +S  CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--ADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ----GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
            GGGG+G+RAGKFQYDPLSYAMNFDEGSRQIGELDDD+DDFNG+RNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ----GGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA

A0A6J1E188 uncharacterized protein LOC111429865 (e-value: 1.4e-74; identity: 82.63%)
Query:  MAFTDVADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MA+ D  +  PCE D SNSLLPT RCGCFRLSC   RR   VGPSWWERIRASQVHSEGRWWARGVR +LK+REWSEIVAGPRWKTFIRRFNRNR+GG G
Subjt:  MAFTDVADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  AGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGT--KDVSAVA
        AG R GKFQYDPLSYAMNFDEGSRQIGELDDD DDFNG+RNFSARYASIPAPLKTGGT  KD+S VA
Subjt:  AGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGT--KDVSAVA

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) (e-value: 7.4e-25; identity: 39.75%)
Query:  DKSNSLLPTARCGCFRLSCFGSRRVATVGPS-WWERI-RASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAG----------
        D  +  L   R  CF + C  S + +T G S WW+RI    ++  + RWW RG R   ++REWSE+VAGPRWKT+IRRF R+   GGG G          
Subjt:  DKSNSLLPTARCGCFRLSCFGSRRVATVGPS-WWERI-RASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAG----------

Query:  --------LRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKT
                   GKF+YD LSY++NFD+G+ Q G  DD+      +R++S R+A+   P+ T
Subjt:  --------LRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKT

AT3G48020.1 unknown protein (e-value: 7.4e-25; identity: 53.64%)
Query:  TVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFR
        TV  SWW+RI  +  H E RWW   VRA LK+REWSEIVAGPRWKTFIRRFNR+   G      + KF+YDP+SY ++F++  +     DDD     G R
Subjt:  TVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGKFQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFR

Query:  NFSARYASIP
        +FS RYAS+P
Subjt:  NFSARYASIP

AT5G14890.1 NHL domain-containing protein (e-value: 4.1e-23; identity: 39.86%)
Query:  DKSNSLLPTARCGCFRLSCFGSRRVA-TVGPSWWERIR-ASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGK-----
        D+ +  +   R  CF L C GS + +   G  WW+RIR   ++  + RWW  G    +K+REWSEIVAGP+WKTFIRRF RN    GG      +     
Subjt:  DKSNSLLPTARCGCFRLSCFGSRRVA-TVGPSWWERIR-ASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGK-----

Query:  FQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKT
        F+YD  SY++NFD+G +Q G  +D+      +R++S R+A+   P+ T
Subjt:  FQYDPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKT

AT5G25240.1 unknown protein (e-value: 5.9e-06; identity: 42.03%)
Query:  QVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGKFQYDPLSYAMNFDEG
        Q    G W   G   L  L+E SE +AGP+WK FIR F    S G     R   F YD  +Y++NFD+G
Subjt:  QVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGKFQYDPLSYAMNFDEG

AT5G62865.1 unknown protein (e-value: 6.3e-24; identity: 50.74%)
Query:  RCGCFRLSCFGSRRVATVGPSWWERIRA--SQVHS-----EGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGKFQYDPLSYAM
        RC CF  S   SR    VG S W RIR      HS     E RWW   +RA LK+REWSEIVAGPRWKTFIRRFNR+   G      + KFQYDPLSY++
Subjt:  RCGCFRLSCFGSRRVATVGPSWWERIRA--SQVHS-----EGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGKFQYDPLSYAM

Query:  NFDEGSRQIGELDDDIDDF---NGFRNFSARYASIP
        NFD+        DD+ D++    G R+FS R+AS+P
Subjt:  NFDEGSRQIGELDDDIDDF---NGFRNFSARYASIP


Sequences
CDS sequence
ATGGCGTTTACTGATGTAGCTGACTCTCTTCCTTGCGAAGCCGATAAATCCAATTCTTTGTTGCCGACCGCCAGGTGCGGTTGTTTCCGGTTATCTTGCTTTGGATCTCG
CCGTGTGGCCACAGTCGGACCGTCGTGGTGGGAACGGATTAGGGCATCGCAGGTTCATAGCGAGGGGCGCTGGTGGGCGAGAGGCGTTAGGGCTCTTTTGAAGCTCCGAG
AATGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACTTTCATTCGCCGATTTAATCGGAATCGGAGTGGTGGTGGTGGAGCTGGTCTTAGGGCTGGGAAATTTCAATAC
GATCCTTTGAGTTACGCTATGAATTTCGATGAAGGTTCGAGGCAGATTGGAGAATTGGACGACGATATCGATGATTTCAATGGTTTTCGGAACTTCTCTGCTCGTTACGC
TTCGATTCCGGCGCCGTTGAAGACTGGAGGAACCAAGGATGTTTCTGCCGTCGCGTAA
mRNA sequence
ATGGCGTTTACTGATGTAGCTGACTCTCTTCCTTGCGAAGCCGATAAATCCAATTCTTTGTTGCCGACCGCCAGGTGCGGTTGTTTCCGGTTATCTTGCTTTGGATCTCG
CCGTGTGGCCACAGTCGGACCGTCGTGGTGGGAACGGATTAGGGCATCGCAGGTTCATAGCGAGGGGCGCTGGTGGGCGAGAGGCGTTAGGGCTCTTTTGAAGCTCCGAG
AATGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACTTTCATTCGCCGATTTAATCGGAATCGGAGTGGTGGTGGTGGAGCTGGTCTTAGGGCTGGGAAATTTCAATAC
GATCCTTTGAGTTACGCTATGAATTTCGATGAAGGTTCGAGGCAGATTGGAGAATTGGACGACGATATCGATGATTTCAATGGTTTTCGGAACTTCTCTGCTCGTTACGC
TTCGATTCCGGCGCCGTTGAAGACTGGAGGAACCAAGGATGTTTCTGCCGTCGCGTAA
Protein sequence
MAFTDVADSLPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGAGLRAGKFQY
DPLSYAMNFDEGSRQIGELDDDIDDFNGFRNFSARYASIPAPLKTGGTKDVSAVA
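
The genome location, CDS and protein sequence in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the database record: it assumes the coordinates are 1-based and inclusive, and that the CDS uses the standard genetic code (NCBI translation table 1). The helper names `span_length` and `translate` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(location: str) -> int:
    """Length in bp of a 'Chr:start..end' location.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record).
    """
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length = span_length("Chr10:24963298..24963795")
print(length, length // 3)              # 498 166
print(translate("ATGGCGTTTACTGATGTA"))  # MAFTDV (first six residues)
```

Under these assumptions the locus spans 498 bp, that is 166 codons, which agrees with a 165-residue protein followed by one stop codon. To check the whole record, join the wrapped CDS lines into a single string before passing it to `translate` and compare the result with the protein sequence above.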