CuGenDBv2

Spg028440 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg028440
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: GATA-N domain-containing protein
Genome location: scaffold7:11356984..11362797
RNA-Seq Expression: Spg028440
Synteny: Spg028440
Gene Ontology terms:
GO:0006813 - potassium ion transport (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR004332 - Transposase, MuDR, plant
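
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the following Python splits such a string and reports the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold7:11356984..11362797")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 5814 bp on scaffold7. This is the genomic extent only; it says nothing about exon structure.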


Homology
GenBank top hits
XP_008456929.1 PREDICTED: uncharacterized protein LOC103496731 [Cucumis melo] (e value: 3.9e-39; % identity: 80.99)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS GGFPSQLGQ WK+   RP  RSSNLRV    AE GEERE  G  ENKKSLFSSVTEALDFS VRSSRDAELLDDARQAT++GG+MTREQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

XP_022941869.1 uncharacterized protein LOC111447101 [Cucurbita moschata] (e value: 1.4e-41; % identity: 80.17)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS  GF SQ GQ WK A E    RSSNLRVSAK+ E GEE+EDGG R+NK+SLFSSVTEALDFSQVRSSRDAELLDDARQAT++GG+M+REQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

XP_022996627.1 uncharacterized protein LOC111491808 [Cucurbita maxima] (e value: 1.2e-40; % identity: 79.34)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS  GF SQ GQ WK A E    RSSNLRVSAKA E GEE+EDG  R+NK+SLFSSVTEALDFSQVRSSRDAELL+DARQAT+SGG+M++EQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

XP_023532465.1 uncharacterized protein LOC111794620 [Cucurbita pepo subsp. pepo] (e value: 3.2e-41; % identity: 80.17)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS  GF SQ GQ WK A E    RSSNLRVSAKA E GEE+ED G R+NK+SLFSSVTEALDFSQVRSSRDAELLDDARQAT++GG+M+REQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

XP_038891315.1 uncharacterized protein LOC120080762 [Benincasa hispida] (e value: 1.2e-40; % identity: 82.79)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEERED-GGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYG
        MVAISPS GGFPSQL Q WK    RP  RSSNLRV AKAAE GEERE  GG  ENKKSLFSSVTEALDFS VRSSRDAELLDDARQAT++GG+MTREQYG
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEERED-GGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYG

Query:  ALRRKIGGTYKDFFKSYIEDPG
        ALRRKIGGTYKDFFKSYIE  G
Subjt:  ALRRKIGGTYKDFFKSYIEDPG

TrEMBL top hits
A0A0A0KM41 GATA-N domain-containing protein (e value: 4.2e-39; % identity: 79.34)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAI PS GGFPSQLGQ WK+   RP  RSSNLRV    AE GEERE GG  ENKKSLFSSVTEALDFS VRS+RDAELLDDARQAT++GG+M+REQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

A0A1S3C3X8 uncharacterized protein LOC103496731 (e value: 1.9e-39; % identity: 80.99)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS GGFPSQLGQ WK+   RP  RSSNLRV    AE GEERE  G  ENKKSLFSSVTEALDFS VRSSRDAELLDDARQAT++GG+MTREQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

A0A5D3DFR3 GATA-N domain-containing protein (e value: 1.9e-39; % identity: 80.99)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS GGFPSQLGQ WK+   RP  RSSNLRV    AE GEERE  G  ENKKSLFSSVTEALDFS VRSSRDAELLDDARQAT++GG+MTREQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

A0A6J1FPP4 uncharacterized protein LOC111447101 (e value: 7.0e-42; % identity: 80.17)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS  GF SQ GQ WK A E    RSSNLRVSAK+ E GEE+EDGG R+NK+SLFSSVTEALDFSQVRSSRDAELLDDARQAT++GG+M+REQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

A0A6J1K2I3 uncharacterized protein LOC111491808 (e value: 5.9e-41; % identity: 79.34)
Query:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA
        MVAISPS  GF SQ GQ WK A E    RSSNLRVSAKA E GEE+EDG  R+NK+SLFSSVTEALDFSQVRSSRDAELL+DARQAT+SGG+M++EQYGA
Subjt:  MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEDPG
        LRRKIGGTYKDFFKSYIE  G
Subjt:  LRRKIGGTYKDFFKSYIEDPG

SwissProt top hits
No hits found
Arabidopsis top hits
AT3G56290.1 unknown protein (e value: 5.5e-23; % identity: 66.67)
Query:  VSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGALRRKIGGTYKDFFKSYIEDPG
        VSAK     E+ ED  + E + SLF+ +T+ALDFSQVRS +DAELL +AR+AT+SG KMT+EQYGALRRKIGGTYKDFFKSY+E  G
Subjt:  VSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGALRRKIGGTYKDFFKSYIEDPG
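
The % identity reported for the Arabidopsis hit can be checked against the alignment midline, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below is illustrative only: `percent_identity` is a hypothetical helper, and it uses the aligned query length as the denominator, which is valid here only because this alignment contains no gap columns. BLAST itself divides by the full alignment length, gaps included.

```python
def percent_identity(query, midline):
    """Return (identities, aligned_length, percent) from a BLAST-style midline.

    Letters in the midline are identities; '+' and spaces are not.
    ASSUMPTION: the alignment is gap-free, so len(query) is the alignment length.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    aligned_length = len(query)
    return identities, aligned_length, 100.0 * identities / aligned_length


# Query and midline copied from the AT3G56290.1 alignment above.
QUERY = ("VSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMT"
         "REQYGALRRKIGGTYKDFFKSYIEDPG")
MIDLINE = ("VSAK     E+ ED  + E + SLF+ +T+ALDFSQVRS +DAELL +AR+AT+SG KMT+"
           "EQYGALRRKIGGTYKDFFKSY+E  G")

identities, length, pct = percent_identity(QUERY, MIDLINE)
print(f"{identities}/{length} = {pct:.2f}%")
```

For this alignment the count is 58 identities over 87 aligned positions, which agrees with the 66.67 shown for the hit.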


Sequences
CDS sequence
ATGGTGGCGATTTCTCCATCACAGGGCGGATTCCCGTCGCAATTAGGGCAGAGATGGAAGGCGGCGGGGGAAAGGCCTAGTGGCCGGAGCTCGAATTTGAGAGTATCGGC
GAAAGCGGCGGAGAACGGCGAGGAGAGAGAGGACGGCGGCAGAAGGGAGAACAAGAAGTCGCTGTTCAGCAGCGTGACGGAGGCGTTGGATTTCTCTCAGGTCCGATCGA
GTCGCGACGCTGAGCTCCTCGATGATGCTCGTCAAGCCACCAGATCCGGCGGCAAAATGACCCGGGAACAGTATGGAGCTCTAAGAAGGAAGATAGGAGGGACCTACAAG
GACTTCTTCAAATCTTACATAGAAGATCCTGGTCTGGAACTTGGTCAAAATCCTGGTCGAAATCCTGGTCGAAATCCTGGTCGAGATCTTGGTCAAGATTTTTTCATGCA
CATCATGCCTCGTGTGTGGGTTTGTTGTGGTGGGATATGGAATGAGAGTGAAAAGGAGTACGAAGGTGGGAAGTTGAGAGGGTTCGATGTGGATGTTGGAATTACACATG
TCGACTTCGTAGGTCGGGTTTATAGAATAAGTCGTATAAATCCCACTGAGTTTGATATTGTGATAAGGTGTGTACTCCATCTAAAGTCCAAAGCTCCAGCATTTGTTATC
CAAGATGACGAAGACCTTCATACTTTCCTGACGTGGGAAGAGGTCTCTGTAAGACCTCTCTACGTATCGACTGTGCCAAAGTTTTCGAGTAATGAGAGACATAGGTTACT
TCCCATTCCATACACGGTATCAAATAACCCCAATCAATGTAATCCTTCCTCATCCTTCCCATATAACCAAGGACAAGATATCCCCTCCACAAACATCTCTAACCAAGGAC
AAAGTGCAGTAGCATCACTTACTCCGATGTCCAATAATGTATCTGCATATAACTTGGGAGATGATGTAGACCATGCTTGGGGGGAACTGAGAGATGAAGGGTTGGAAGTA
GATGAAGATGATGACTGGAGTGTGGATAGAGATGATGAGTCAAATGTAGATGTAGATTACGATGAGGATAGAGATGATGGACTTGATGAGACAGAGACAGATGGATATGG
TCATAGAGAGGCCCCGCCTGCTAATGCATCTGAAGCTCCACCTGTTAATGCATCTGAAGCTCCCCCTGTTAATGCACCTGAAGCTCCCCCTGTTAATGCACCTGAGGCTA
TGCATGCATCAGTTTCGGTCGCCCCACAAACTTCTGTGACAGCTCCATCAGGTAACTCAATTGTCATGTCTGGTCAGTCTTCTGGTTTTGATGATATACAAGTTGGGGAT
ATATTCATGTGCAAAAAAGACCTGACCATAAGATTGTCTGTTCTGGCGATGAAGAGAAACTTCGAGTTTAAGGTTAATAAGTCCAAAAAAGATATATACGTAGTTGTCTG
TCGCACTGATGAATGTAAATGGAGACTTCGAGCCATGAGATTGAAAGGTATCTGA
mRNA sequence
ATGGTGGCGATTTCTCCATCACAGGGCGGATTCCCGTCGCAATTAGGGCAGAGATGGAAGGCGGCGGGGGAAAGGCCTAGTGGCCGGAGCTCGAATTTGAGAGTATCGGC
GAAAGCGGCGGAGAACGGCGAGGAGAGAGAGGACGGCGGCAGAAGGGAGAACAAGAAGTCGCTGTTCAGCAGCGTGACGGAGGCGTTGGATTTCTCTCAGGTCCGATCGA
GTCGCGACGCTGAGCTCCTCGATGATGCTCGTCAAGCCACCAGATCCGGCGGCAAAATGACCCGGGAACAGTATGGAGCTCTAAGAAGGAAGATAGGAGGGACCTACAAG
GACTTCTTCAAATCTTACATAGAAGATCCTGGTCTGGAACTTGGTCAAAATCCTGGTCGAAATCCTGGTCGAAATCCTGGTCGAGATCTTGGTCAAGATTTTTTCATGCA
CATCATGCCTCGTGTGTGGGTTTGTTGTGGTGGGATATGGAATGAGAGTGAAAAGGAGTACGAAGGTGGGAAGTTGAGAGGGTTCGATGTGGATGTTGGAATTACACATG
TCGACTTCGTAGGTCGGGTTTATAGAATAAGTCGTATAAATCCCACTGAGTTTGATATTGTGATAAGGTGTGTACTCCATCTAAAGTCCAAAGCTCCAGCATTTGTTATC
CAAGATGACGAAGACCTTCATACTTTCCTGACGTGGGAAGAGGTCTCTGTAAGACCTCTCTACGTATCGACTGTGCCAAAGTTTTCGAGTAATGAGAGACATAGGTTACT
TCCCATTCCATACACGGTATCAAATAACCCCAATCAATGTAATCCTTCCTCATCCTTCCCATATAACCAAGGACAAGATATCCCCTCCACAAACATCTCTAACCAAGGAC
AAAGTGCAGTAGCATCACTTACTCCGATGTCCAATAATGTATCTGCATATAACTTGGGAGATGATGTAGACCATGCTTGGGGGGAACTGAGAGATGAAGGGTTGGAAGTA
GATGAAGATGATGACTGGAGTGTGGATAGAGATGATGAGTCAAATGTAGATGTAGATTACGATGAGGATAGAGATGATGGACTTGATGAGACAGAGACAGATGGATATGG
TCATAGAGAGGCCCCGCCTGCTAATGCATCTGAAGCTCCACCTGTTAATGCATCTGAAGCTCCCCCTGTTAATGCACCTGAAGCTCCCCCTGTTAATGCACCTGAGGCTA
TGCATGCATCAGTTTCGGTCGCCCCACAAACTTCTGTGACAGCTCCATCAGGTAACTCAATTGTCATGTCTGGTCAGTCTTCTGGTTTTGATGATATACAAGTTGGGGAT
ATATTCATGTGCAAAAAAGACCTGACCATAAGATTGTCTGTTCTGGCGATGAAGAGAAACTTCGAGTTTAAGGTTAATAAGTCCAAAAAAGATATATACGTAGTTGTCTG
TCGCACTGATGAATGTAAATGGAGACTTCGAGCCATGAGATTGAAAGGTATCTGA
Protein sequence
MVAISPSQGGFPSQLGQRWKAAGERPSGRSSNLRVSAKAAENGEEREDGGRRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATRSGGKMTREQYGALRRKIGGTYK
DFFKSYIEDPGLELGQNPGRNPGRNPGRDLGQDFFMHIMPRVWVCCGGIWNESEKEYEGGKLRGFDVDVGITHVDFVGRVYRISRINPTEFDIVIRCVLHLKSKAPAFVI
QDDEDLHTFLTWEEVSVRPLYVSTVPKFSSNERHRLLPIPYTVSNNPNQCNPSSSFPYNQGQDIPSTNISNQGQSAVASLTPMSNNVSAYNLGDDVDHAWGELRDEGLEV
DEDDDWSVDRDDESNVDVDYDEDRDDGLDETETDGYGHREAPPANASEAPPVNASEAPPVNAPEAPPVNAPEAMHASVSVAPQTSVTAPSGNSIVMSGQSSGFDDIQVGD
IFMCKKDLTIRLSVLAMKRNFEFKVNKSKKDIYVVVCRTDECKWRLRAMRLKGI
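
The protein sequence corresponds to a translation of the CDS under the standard genetic code. A minimal sketch of that translation is given below, applied to the first 27 and last 18 nucleotides of the CDS rather than the full sequence. It assumes the standard code (NCBI translation table 1) applies to this nuclear gene; `translate` and `CODON_TABLE` are names introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last six codons of the CDS listed above.
cds_start = "ATGGTGGCGATTTCTCCATCACAGGGC"
cds_end = "AGATTGAAAGGTATCTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MVAISPSQG and RLKGI, matching the beginning and end of the protein sequence above. Translating the full CDS the same way requires joining its lines into a single string first.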