; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014708 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014708
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionARM repeat superfamily protein
Genome locationChr02:18269177..18270721
RNA-Seq ExpressionHG10014708
SyntenyHG10014708
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030635.1 hypothetical protein SDJN02_04672 [Cucurbita argyrosperma subsp. argyrosperma]6.0e-7594.41Show/hide
Query:  IELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTR
        IELISEIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTR
Subjt:  IELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTR

Query:  KEALKALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        KEALKALHAIS SDEAVGALHKAGAILVIKSTPDSAED +VNE+KS+LMKRF DL YDVSS
Subjt:  KEALKALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

TYK25830.1 ARM repeat superfamily protein [Cucumis melo var. makuwa]8.4e-7790.34Show/hide
Query:  MSTMLIVIVFERLIEIELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQ
        MS MLIVIVF  + EIELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSV+LL+VKDPLFKRMGASRLARFSIDD+RRMKIVE GGAQ
Subjt:  MSTMLIVIVFERLIEIELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQ

Query:  ELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ELLNML  AKDDRTRKEALKAL+AIS SDEAVG LHKAGAILVIKSTPDSAEDMKVNE+KS+LMKRF DLRYDVSS
Subjt:  ELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

XP_004146451.1 uncharacterized protein LOC101212969 isoform X1 [Cucumis sativus]3.0e-7495.51Show/hide
Query:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK
        EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLL+VKDPLFKRMGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALK
Subjt:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK

Query:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ALHAIS SDEA GALHKAGAILVIKSTPDSAEDMKVNE+KS+LMKRF DLRYDVSS
Subjt:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

XP_011655026.1 uncharacterized protein LOC101212969 isoform X2 [Cucumis sativus]3.0e-7495.51Show/hide
Query:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK
        EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLL+VKDPLFKRMGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALK
Subjt:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK

Query:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ALHAIS SDEA GALHKAGAILVIKSTPDSAEDMKVNE+KS+LMKRF DLRYDVSS
Subjt:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

XP_038891777.1 uncharacterized protein LOC120081166 isoform X1 [Benincasa hispida]1.3e-7496.15Show/hide
Query:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK
        EIEKEAERKVGWLLKLIFAGTATF+GYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK
Subjt:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK

Query:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ALHAIS SDEAVGALHKAGAILVIKSTPDSAEDMKVNE+KS+LMKRF DL YDVSS
Subjt:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

TrEMBL top hitse value%identityAlignment
A0A0A0KM85 Uncharacterized protein1.4e-7495.51Show/hide
Query:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK
        EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLL+VKDPLFKRMGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALK
Subjt:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK

Query:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ALHAIS SDEA GALHKAGAILVIKSTPDSAEDMKVNE+KS+LMKRF DLRYDVSS
Subjt:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

A0A5D3DRA8 ARM repeat superfamily protein4.1e-7790.34Show/hide
Query:  MSTMLIVIVFERLIEIELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQ
        MS MLIVIVF  + EIELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSV+LL+VKDPLFKRMGASRLARFSIDD+RRMKIVE GGAQ
Subjt:  MSTMLIVIVFERLIEIELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQ

Query:  ELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ELLNML  AKDDRTRKEALKAL+AIS SDEAVG LHKAGAILVIKSTPDSAEDMKVNE+KS+LMKRF DLRYDVSS
Subjt:  ELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

A0A6J1D932 uncharacterized protein LOC1110183886.1e-7392.36Show/hide
Query:  SEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEAL
        SEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGD+L+QQSV+LLQVKDPLFKRMGASRLARFSIDDERRMKIVE+GGAQELLNMLGAAKDDRTRKEAL
Subjt:  SEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEAL

Query:  KALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        KALHAIS SDEAVGALHKAGAILVIKS P+S ED+KVNE+KS+LMKRF DLRYDVSS
Subjt:  KALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

A0A6J1FNN3 uncharacterized protein LOC111447100 isoform X14.6e-7394.23Show/hide
Query:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK
        EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALK
Subjt:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK

Query:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ALHAIS SDEAVGALHKAGAILVIKSTPDSAED +VNE+KS+LMKRF DL YDVSS
Subjt:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

A0A6J1FTA5 uncharacterized protein LOC111447100 isoform X24.6e-7394.23Show/hide
Query:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK
        EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALK
Subjt:  EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALK

Query:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        ALHAIS SDEAVGALHKAGAILVIKSTPDSAED +VNE+KS+LMKRF DL YDVSS
Subjt:  ALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56210.1 ARM repeat superfamily protein3.6e-5464.52Show/hide
Query:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA
        +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMGASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKA
Subjt:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA

Query:  LHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        L A+S+S EA   L   GA+ ++KSTP+S ED  ++ +KS+++++  +    VSS
Subjt:  LHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

AT3G56210.2 ARM repeat superfamily protein3.6e-5464.52Show/hide
Query:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA
        +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMGASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKA
Subjt:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA

Query:  LHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        L A+S+S EA   L   GA+ ++KSTP+S ED  ++ +KS+++++  +    VSS
Subjt:  LHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

AT3G56210.4 ARM repeat superfamily protein3.6e-5464.52Show/hide
Query:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA
        +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMGASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKA
Subjt:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA

Query:  LHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        L A+S+S EA   L   GA+ ++KSTP+S ED  ++ +KS+++++  +    VSS
Subjt:  LHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

AT3G56210.5 ARM repeat superfamily protein1.2e-5263.69Show/hide
Query:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA
        +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMGASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKA
Subjt:  IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKA

Query:  LHAISRS--DEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        L A+S+S   EA   L   GA+ ++KSTP+S ED  ++ +KS+++++  +    VSS
Subjt:  LHAISRS--DEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTACCATGCTTATTGTCATTGTTTTCGAACGCCTGATTGAGATTGAGTTGATTTCAGAGATCGAGAAGGAGGCTGAAAGAAAAGTAGGATGGTTATTAAAACTAAT
CTTTGCTGGGACTGCAACATTTCTGGGTTACCAGATTTTTCCATACATGGGGGATAACTTGTTGCAGCAATCTGTGTCGCTCTTGCAAGTCAAGGATCCATTGTTTAAGA
GGATGGGAGCGTCTAGATTGGCTCGCTTTTCAATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGCGCTCAAGAGCTCTTAAACATGCTCGGCGCTGCCAAA
GATGACCGGACACGTAAGGAAGCTTTGAAGGCTTTACATGCCATCTCACGTTCAGATGAAGCTGTTGGTGCCTTGCATAAAGCAGGGGCAATATTGGTTATTAAATCTAC
TCCAGATTCAGCTGAAGATATGAAAGTGAATGAGTTCAAGTCGGACCTAATGAAGAGATTTAGTGATCTTAGATATGATGTTTCATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTACCATGCTTATTGTCATTGTTTTCGAACGCCTGATTGAGATTGAGTTGATTTCAGAGATCGAGAAGGAGGCTGAAAGAAAAGTAGGATGGTTATTAAAACTAAT
CTTTGCTGGGACTGCAACATTTCTGGGTTACCAGATTTTTCCATACATGGGGGATAACTTGTTGCAGCAATCTGTGTCGCTCTTGCAAGTCAAGGATCCATTGTTTAAGA
GGATGGGAGCGTCTAGATTGGCTCGCTTTTCAATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGCGCTCAAGAGCTCTTAAACATGCTCGGCGCTGCCAAA
GATGACCGGACACGTAAGGAAGCTTTGAAGGCTTTACATGCCATCTCACGTTCAGATGAAGCTGTTGGTGCCTTGCATAAAGCAGGGGCAATATTGGTTATTAAATCTAC
TCCAGATTCAGCTGAAGATATGAAAGTGAATGAGTTCAAGTCGGACCTAATGAAGAGATTTAGTGATCTTAGATATGATGTTTCATCTTGA
Protein sequenceShow/hide protein sequence
MSTMLIVIVFERLIEIELISEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAK
DDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS