; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007950 (gene) of Snake gourd v1 genome

Gene IDTan0007950
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGATA-N domain-containing protein
Genome locationLG04:16055067..16056527
RNA-Seq ExpressionTan0007950
SyntenyTan0007950
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0071805 - potassium ion transmembrane transport (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0015079 - potassium ion transmembrane transporter activity (molecular function)
InterPro domainsIPR008013 - GATA-type transcription activator, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146444.1 uncharacterized protein LOC101211273 [Cucumis sativus]8.9e-7989.77Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAI PSHGGFPSQL QGWK+ R RP  RSSNLRV    AEKGEERE GGG ENKKSLFSSVTEALDFS VRS+RDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSS GNFFSKFFSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

XP_008456929.1 PREDICTED: uncharacterized protein LOC103496731 [Cucumis melo]5.2e-7990.91Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSHGGFPSQL QGWK+ R RP  RSSNLRV    AEKGEERE  GG ENKKSLFSSVTEALDFS VRSSRDAELLDDARQATK+GGRMTREQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSK FSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

XP_022941869.1 uncharacterized protein LOC111447101 [Cucurbita moschata]5.8e-7887.5Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSH GF SQ  QGWK ARE    RSSNLRV AK+ EKGEE+EDGG R+NK+SLFSSVTEALDFSQVRSSRDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSSS NFF+KFF K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

XP_023532465.1 uncharacterized protein LOC111794620 [Cucurbita pepo subsp. pepo]1.3e-7787.5Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSH GF SQ  QGWK ARE    RSSNLRV AKA EKGEE+ED G R+NK+SLFSSVTEALDFSQVRSSRDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSSS NFF+KFF K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

XP_038891315.1 uncharacterized protein LOC120080762 [Benincasa hispida]7.3e-8192.09Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEERED-GGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYG
        MVAISPSHGGFPSQL QGWK  R RP  RSSNLRV AKAAEKGEERE  GGG ENKKSLFSSVTEALDFS VRSSRDAELLDDARQATK+GGRMTREQYG
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEERED-GGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYG

Query:  ALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        ALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKS SSSGNFFSK FSK
Subjt:  ALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

TrEMBL top hitse value%identityAlignment
A0A0A0KM41 GATA-N domain-containing protein4.3e-7989.77Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAI PSHGGFPSQL QGWK+ R RP  RSSNLRV    AEKGEERE GGG ENKKSLFSSVTEALDFS VRS+RDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSS GNFFSKFFSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

A0A1S3C3X8 uncharacterized protein LOC1034967312.5e-7990.91Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSHGGFPSQL QGWK+ R RP  RSSNLRV    AEKGEERE  GG ENKKSLFSSVTEALDFS VRSSRDAELLDDARQATK+GGRMTREQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSK FSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

A0A5D3DFR3 GATA-N domain-containing protein2.5e-7990.91Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSHGGFPSQL QGWK+ R RP  RSSNLRV    AEKGEERE  GG ENKKSLFSSVTEALDFS VRSSRDAELLDDARQATK+GGRMTREQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSK FSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

A0A6J1D0J3 uncharacterized protein LOC1110159364.1e-7788Show/hide
Query:  VAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGAL
        VAISPSHGGFP    QGW A+R   +GR SN RV AKAA  GEEREDGGG+E KKSLFSSVTEALDFSQVRSSRDAELL+DARQAT+SGGRMTREQYGAL
Subjt:  VAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGAL

Query:  RRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        RRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLG YVHVACLE SNSSSGNFFSKFFSK
Subjt:  RRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

A0A6J1FPP4 uncharacterized protein LOC1114471012.8e-7887.5Show/hide
Query:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSH GF SQ  QGWK ARE    RSSNLRV AK+ EKGEE+EDGG R+NK+SLFSSVTEALDFSQVRSSRDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSSS NFF+KFF K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56290.1 unknown protein9.0e-5370.92Show/hide
Query:  WAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKV
        W   + K E+ ED    E + SLF+ +T+ALDFSQVRS +DAELL +AR+ATKSG +MT+EQYGALRRKIGGTYKDFFKSY+EVDGQYVEEGWVDKTCK+
Subjt:  WAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKV

Query:  CKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK
        CKKDT+GEARQVDKLGRY HV+CL+  N  SGNFF++ FS+
Subjt:  CKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCGATTTCTCCGTCGCACGGTGGATTCCCGTCGCAATTAGTGCAGGGATGGAAGGCGGCGAGGGAGAGGCCTATTGGCCGGAGCTCGAATTTGAGAGTATGGGC
AAAAGCGGCAGAGAAGGGCGAGGAGAGAGAAGATGGCGGCGGAAGGGAGAACAAGAAGTCGCTGTTCAGCAGCGTGACGGAGGCGTTGGATTTCTCTCAGGTCCGATCGA
GTCGCGACGCTGAGCTGCTCGACGACGCCCGTCAAGCCACTAAATCCGGCGGTAGAATGACCCGGGAACAGTATGGAGCTCTAAGAAGGAAAATAGGAGGAACCTACAAG
GACTTCTTCAAATCATACATAGAAGTGGATGGGCAATATGTTGAAGAAGGGTGGGTGGACAAGACATGCAAAGTGTGCAAGAAGGACACAAGGGGAGAGGCAAGACAAGT
TGACAAACTAGGAAGATATGTTCATGTTGCTTGTTTGGAGAAATCCAACTCTTCCTCAGGGAATTTCTTCTCCAAGTTCTTCTCCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCGATTTCTCCGTCGCACGGTGGATTCCCGTCGCAATTAGTGCAGGGATGGAAGGCGGCGAGGGAGAGGCCTATTGGCCGGAGCTCGAATTTGAGAGTATGGGC
AAAAGCGGCAGAGAAGGGCGAGGAGAGAGAAGATGGCGGCGGAAGGGAGAACAAGAAGTCGCTGTTCAGCAGCGTGACGGAGGCGTTGGATTTCTCTCAGGTCCGATCGA
GTCGCGACGCTGAGCTGCTCGACGACGCCCGTCAAGCCACTAAATCCGGCGGTAGAATGACCCGGGAACAGTATGGAGCTCTAAGAAGGAAAATAGGAGGAACCTACAAG
GACTTCTTCAAATCATACATAGAAGTGGATGGGCAATATGTTGAAGAAGGGTGGGTGGACAAGACATGCAAAGTGTGCAAGAAGGACACAAGGGGAGAGGCAAGACAAGT
TGACAAACTAGGAAGATATGTTCATGTTGCTTGTTTGGAGAAATCCAACTCTTCCTCAGGGAATTTCTTCTCCAAGTTCTTCTCCAAATGA
Protein sequenceShow/hide protein sequence
MVAISPSHGGFPSQLVQGWKAARERPIGRSSNLRVWAKAAEKGEEREDGGGRENKKSLFSSVTEALDFSQVRSSRDAELLDDARQATKSGGRMTREQYGALRRKIGGTYK
DFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKFFSK