; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041010 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041010
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionARM repeat superfamily protein
Genome locationchr13:10774157..10776578
RNA-Seq ExpressionLag0041010
SyntenyLag0041010
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011655026.1 uncharacterized protein LOC101212969 isoform X2 [Cucumis sativus]6.4e-9193.09Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +IASKL THLCRREP RTLQFR FS YDEREIEK+AERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LL+VKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        +RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

XP_022941868.1 uncharacterized protein LOC111447100 isoform X2 [Cucurbita moschata]2.9e-9193.09Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +I SKL  HLCRREPARTLQFRPFS YDEREIEK+AERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        +RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRDL YDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

XP_022996604.1 uncharacterized protein LOC111491787 isoform X2 [Cucurbita maxima]1.4e-9092.55Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +I SKL  HLCRREPARTLQFRPFS YDEREIEK+AERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        +RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS+SDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRDL YDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

XP_023526983.1 uncharacterized protein LOC111790336 isoform X2 [Cucurbita pepo subsp. pepo]5.8e-9293.62Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +I SKL  HLCRREPARTLQFRPFS YDEREIEK+AERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        +RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAEDM+VNEYKSNLMKRFRDL YDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

XP_038891777.1 uncharacterized protein LOC120081166 isoform X1 [Benincasa hispida]2.9e-9193.62Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +IASKL THLCRREP RTLQFR FS YDEREIEK+AERKVGWLLKLIFAGTATF+GY IFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDL YDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

TrEMBL top hitse value%identityAlignment
A0A0A0KM85 Uncharacterized protein3.1e-9193.09Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +IASKL THLCRREP RTLQFR FS YDEREIEK+AERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSV+LL+VKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        +RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

A0A6J1FNN3 uncharacterized protein LOC111447100 isoform X14.5e-9092.11Show/hide
Query:  MSMIASKLAT--HLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        M +I SKL    HLCRREPARTLQFRPFS YDEREIEK+AERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSID
Subjt:  MSMIASKLAT--HLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        DE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRDL YDVSS
Subjt:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

A0A6J1FTA5 uncharacterized protein LOC111447100 isoform X21.4e-9193.09Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +I SKL  HLCRREPARTLQFRPFS YDEREIEK+AERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        +RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRDL YDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

A0A6J1K583 uncharacterized protein LOC111491787 isoform X12.2e-8991.58Show/hide
Query:  MSMIASKLAT--HLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        M +I SKL    HLCRREPARTLQFRPFS YDEREIEK+AERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSID
Subjt:  MSMIASKLAT--HLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        DE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS+SDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRDL YDVSS
Subjt:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

A0A6J1K968 uncharacterized protein LOC111491787 isoform X26.9e-9192.55Show/hide
Query:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE
        M +I SKL  HLCRREPARTLQFRPFS YDEREIEK+AERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSV+LLQVKDPLFKRMGASRLARFSIDDE
Subjt:  MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDE

Query:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        +RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS+SDEAV ALHKAGAILVIKSTPDSAED +VNEYKSNLMKRFRDL YDVSS
Subjt:  RRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56210.1 ARM repeat superfamily protein7.6e-5856.84Show/hide
Query:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        M ++ +++A H CR   A  R+  F   +  D+  +E++AERK+GW LK+ FAGTAT++GY  FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+ID
Subjt:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        DERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S ED  ++ YKSN++++  +    VSS
Subjt:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

AT3G56210.2 ARM repeat superfamily protein7.6e-5856.84Show/hide
Query:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        M ++ +++A H CR   A  R+  F   +  D+  +E++AERK+GW LK+ FAGTAT++GY  FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+ID
Subjt:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        DERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S ED  ++ YKSN++++  +    VSS
Subjt:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

AT3G56210.4 ARM repeat superfamily protein7.6e-5856.84Show/hide
Query:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        M ++ +++A H CR   A  R+  F   +  D+  +E++AERK+GW LK+ FAGTAT++GY  FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+ID
Subjt:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        DERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S ED  ++ YKSN++++  +    VSS
Subjt:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS

AT3G56210.5 ARM repeat superfamily protein2.4e-5656.25Show/hide
Query:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID
        M ++ +++A H CR   A  R+  F   +  D+  +E++AERK+GW LK+ FAGTAT++GY  FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+ID
Subjt:  MSMIASKLATHLCRREPA--RTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS
        DERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S S   EA   L   GA+ ++KSTP+S ED  ++ YKSN++++  +    VSS
Subjt:  DERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATGATTGCATCGAAGCTAGCCACTCATCTGTGCAGAAGGGAACCTGCGCGGACCCTGCAATTTCGCCCCTTTTCAGTTTACGATGAAAGAGAAATCGAGAAGGA
TGCTGAAAGAAAAGTAGGATGGTTATTAAAACTAATCTTTGCTGGAACTGCGACATTTCTGGGTTACCATATTTTTCCATACATGGGGGATAACTTGTTGCAGCAATCTG
TGGCGCTCTTGCAAGTCAAGGATCCACTGTTTAAGAGGATGGGAGCATCTAGATTGGCTCGCTTTTCGATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGG
GCTCAAGAGCTCTTAAACATGCTCGGGGCTGCCAAAGACGATCGTACACGTAAGGAAGCTTTGAAGGCTTTACACGCCATCTCACATTCAGATGAAGCTGTTGATGCCTT
GCATAAAGCAGGGGCAATCTTGGTTATTAAATCTACCCCGGATTCAGCTGAAGATATGAAAGTGAATGAGTACAAGTCAAACCTAATGAAGAGATTTAGAGATCTGAGAT
ATGATGTTTCATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCATGATTGCATCGAAGCTAGCCACTCATCTGTGCAGAAGGGAACCTGCGCGGACCCTGCAATTTCGCCCCTTTTCAGTTTACGATGAAAGAGAAATCGAGAAGGA
TGCTGAAAGAAAAGTAGGATGGTTATTAAAACTAATCTTTGCTGGAACTGCGACATTTCTGGGTTACCATATTTTTCCATACATGGGGGATAACTTGTTGCAGCAATCTG
TGGCGCTCTTGCAAGTCAAGGATCCACTGTTTAAGAGGATGGGAGCATCTAGATTGGCTCGCTTTTCGATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGG
GCTCAAGAGCTCTTAAACATGCTCGGGGCTGCCAAAGACGATCGTACACGTAAGGAAGCTTTGAAGGCTTTACACGCCATCTCACATTCAGATGAAGCTGTTGATGCCTT
GCATAAAGCAGGGGCAATCTTGGTTATTAAATCTACCCCGGATTCAGCTGAAGATATGAAAGTGAATGAGTACAAGTCAAACCTAATGAAGAGATTTAGAGATCTGAGAT
ATGATGTTTCATCTTGA
Protein sequenceShow/hide protein sequence
MSMIASKLATHLCRREPARTLQFRPFSVYDEREIEKDAERKVGWLLKLIFAGTATFLGYHIFPYMGDNLLQQSVALLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGG
AQELLNMLGAAKDDRTRKEALKALHAISHSDEAVDALHKAGAILVIKSTPDSAEDMKVNEYKSNLMKRFRDLRYDVSS