; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012679 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012679
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGATA transcription factor
Genome locationtig00153489:100225..101216
RNA-Seq ExpressionSgr012679
SyntenySgr012679
Gene Ontology termsGO:0030154 - cell differentiation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA0412423.1 unnamed protein product [Arabidopsis thaliana]1.7e-0739.82Show/hide
Query:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK
        +S   +D G  L  +S FS  DD  SLPTSELS+PADDLA+LEWLSHFVE    DS       N+            G R+ P TA       +S   +K
Subjt:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK

Query:  QADENRRSSLVPW
           +  R+ L  W
Subjt:  QADENRRSSLVPW

CAD1844476.1 unnamed protein product [Ananas comosus var. bracteatus]1.3e-0745.83Show/hide
Query:  PDGESQNLRGPPAS-----AVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRP
        P   S +  G P S     AVQPLR  E+P+V   +   Q+SLQRLRRP+Q+RP   R+P RLQPH   R  LQ P + PR+A ++G R     RP
Subjt:  PDGESQNLRGPPAS-----AVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRP

KAF5960955.1 hypothetical protein HYC85_002164, partial [Camellia sinensis]3.2e-2237.77Show/hide
Query:  EEEDKASASISDSGQQLHH--------NSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVED-LSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTA
        EEE K  +   D  QQ  +         S FSA DD  S+P SELSVPADDL  LEWLSHFV+D  S  SL   L  N     +   R  P +   P   
Subjt:  EEEDKASASISDSGQQLHH--------NSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVED-LSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTA

Query:  AALFQDPRSGQGSKQADENRRSSLVPWVTVAHRVILQFDDVLLLLVAGESLAYFPEFRPLRTGPVAGN----SRLEESEEKVPDGESQNLRGPPASAVQP
           F  P     +K   +  R+    W ++  R + +         +  S +       + T PV        +L    +K  +      R P A+AVQP
Subjt:  AALFQDPRSGQGSKQADENRRSSLVPWVTVAHRVILQFDDVLLLLVAGESLAYFPEFRPLRTGPVAGN----SRLEESEEKVPDGESQNLRGPPASAVQP

Query:  LRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRPGSSGP
        L  PENP V +RS R QN+LQRLR P+QI  A+TR+P RLQP  L+R  LQ P E   +AAE+G  R  RRR  S+GP
Subjt:  LRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRPGSSGP

ONM02241.1 GATA zinc finger family protein, partial [Zea mays]1.0e-0741.67Show/hide
Query:  RPLRTGPVAGNSRLEESEEKVPDGESQNLR----GPPASAVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGP
        RP RT      +R EE E    D   +  R    GP   AV+PL   ++P+VE  +   Q+ +QRLRR VQ+R A  R+P R QPH  + +ALQ P EG 
Subjt:  RPLRTGPVAGNSRLEESEEKVPDGESQNLR----GPPASAVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGP

Query:  RDAAEKGSRRSIRRRPGSSG
         DAA    RR+   RP S G
Subjt:  RDAAEKGSRRSIRRRPGSSG

XP_038876782.1 GATA transcription factor 5-like [Benincasa hispida]3.5e-0866.67Show/hide
Query:  EEDKASASISD--SGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVED
        + DK S S+S+  S Q++HH+S  S   DLPSLP+SEL+VPADDLADLEWLSHFVED
Subjt:  EEDKASASISD--SGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVED

TrEMBL top hitse value%identityAlignment
A0A178UHW1 GATA transcription factor8.4e-0839.82Show/hide
Query:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK
        +S   +D G  L  +S FS  DD  SLPTSELS+PADDLA+LEWLSHFVE    DS       N+            G R+ P TA       +S   +K
Subjt:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK

Query:  QADENRRSSLVPW
           +  R+ L  W
Subjt:  QADENRRSSLVPW

A0A1D6KGN9 GATA zinc finger family protein4.9e-0841.67Show/hide
Query:  RPLRTGPVAGNSRLEESEEKVPDGESQNLR----GPPASAVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGP
        RP RT      +R EE E    D   +  R    GP   AV+PL   ++P+VE  +   Q+ +QRLRR VQ+R A  R+P R QPH  + +ALQ P EG 
Subjt:  RPLRTGPVAGNSRLEESEEKVPDGESQNLR----GPPASAVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGP

Query:  RDAAEKGSRRSIRRRPGSSG
         DAA    RR+   RP S G
Subjt:  RDAAEKGSRRSIRRRPGSSG

A0A5S9YHS5 GATA transcription factor8.4e-0839.82Show/hide
Query:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK
        +S   +D G  L  +S FS  DD  SLPTSELS+PADDLA+LEWLSHFVE    DS       N+            G R+ P TA       +S   +K
Subjt:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK

Query:  QADENRRSSLVPW
           +  R+ L  W
Subjt:  QADENRRSSLVPW

A0A6V7QNX9 Uncharacterized protein6.4e-0845.83Show/hide
Query:  PDGESQNLRGPPAS-----AVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRP
        P   S +  G P S     AVQPLR  E+P+V   +   Q+SLQRLRRP+Q+RP   R+P RLQPH   R  LQ P + PR+A ++G R     RP
Subjt:  PDGESQNLRGPPAS-----AVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRP

A0A7J7I7Z8 Uncharacterized protein (Fragment)1.6e-2237.77Show/hide
Query:  EEEDKASASISDSGQQLHH--------NSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVED-LSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTA
        EEE K  +   D  QQ  +         S FSA DD  S+P SELSVPADDL  LEWLSHFV+D  S  SL   L  N     +   R  P +   P   
Subjt:  EEEDKASASISDSGQQLHH--------NSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVED-LSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTA

Query:  AALFQDPRSGQGSKQADENRRSSLVPWVTVAHRVILQFDDVLLLLVAGESLAYFPEFRPLRTGPVAGN----SRLEESEEKVPDGESQNLRGPPASAVQP
           F  P     +K   +  R+    W ++  R + +         +  S +       + T PV        +L    +K  +      R P A+AVQP
Subjt:  AALFQDPRSGQGSKQADENRRSSLVPWVTVAHRVILQFDDVLLLLVAGESLAYFPEFRPLRTGPVAGN----SRLEESEEKVPDGESQNLRGPPASAVQP

Query:  LRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRPGSSGP
        L  PENP V +RS R QN+LQRLR P+QI  A+TR+P RLQP  L+R  LQ P E   +AAE+G  R  RRR  S+GP
Subjt:  LRSPENPSVENRSPRGQNSLQRLRRPVQIRPAITRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRPGSSGP

SwissProt top hitse value%identityAlignment
Q9FH57 GATA transcription factor 52.3e-1039.82Show/hide
Query:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK
        +S   +D G  L  +S FS  DD  SLPTSELS+PADDLA+LEWLSHFVE    DS       N+            G R+ P TA       +S   +K
Subjt:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK

Query:  QADENRRSSLVPW
           +  R+ L  W
Subjt:  QADENRRSSLVPW

Arabidopsis top hitse value%identityAlignment
AT3G51080.1 GATA transcription factor 68.6e-0557.14Show/hide
Query:  LHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLS
        LH ++ FS AD      TS LSVP DD+A+LEWLS+FV+D S
Subjt:  LHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLS

AT5G66320.1 GATA transcription factor 51.6e-1139.82Show/hide
Query:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK
        +S   +D G  L  +S FS  DD  SLPTSELS+PADDLA+LEWLSHFVE    DS       N+            G R+ P TA       +S   +K
Subjt:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK

Query:  QADENRRSSLVPW
           +  R+ L  W
Subjt:  QADENRRSSLVPW

AT5G66320.2 GATA transcription factor 51.6e-1139.82Show/hide
Query:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK
        +S   +D G  L  +S FS  DD  SLPTSELS+PADDLA+LEWLSHFVE    DS       N+            G R+ P TA       +S   +K
Subjt:  ASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSK

Query:  QADENRRSSLVPW
           +  R+ L  W
Subjt:  QADENRRSSLVPW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCGAGGAAGAGGACAAGGCCTCCGCTTCCATTTCTGACTCCGGCCAACAGCTTCATCACAACTCTTGCTTTTCTGCCGCCGACGATCTTCCTTCTCTACCGACGAGCGAG
CTTAGCGTTCCGGCAGATGATTTGGCGGACCTAGAATGGCTGTCTCATTTTGTTGAGGATCTTTCACGGGATTCTCTGCTCCGTTCCCTCCGCCGGAATATCTGCGTTGA
AGTCCAAGGAGGCGGCCGAGGTGCACCAGGAAAACGACGGTTCCCTTTCACCGCCGCAGCCTTGTTTCAAGATCCCCGTTCCGGTCAAGGCTCGAAGCAAGCGGACGAGA
ACCGGCGGTCGAGTTTGGTGCCTTGGGTCACCGTCGCTCACCGAGTCATCCTCCAGTTCGACGACGTCCTCCTCCTCCTCGTCGCCGGTGAGTCCTTGGCTTATTTCCCA
GAATTCCGACCACTTCGAACAGGCCCTGTTGCCGGAAATTCACGCCTCGAAGAAAGCGAAGAAAAGGTTCCCGACGGAGAAAGCCAGAACCTGCGGGGCCCCCCCGCCTC
GGCGGTGCAGCCACTGCGGAGTCCAGAAAACCCCTCAGTGGAGAACCGGTCCCCTCGGGGCCAAAACTCTCTGCAACGCTTGCGGCGTCCGGTACAAATCCGGCCGGCTA
TTACCCGAATACCGACCCGCCTGCAGCCCCACTTTCTCCAGCGAATTGCACTCCAACCACCATCGGAAGGTCCTCGAGATGCGGCGGAAAAAGGAAGTCGTCGGAGCATC
AGACGCCGCCCCGGCTCCTCCGGCCCTGCCGAGTTTTTGACATCAGAAAACCAAAAAAAAAAAAGAAGAAAATTGTCGTAG
mRNA sequenceShow/hide mRNA sequence
TCGAGGAAGAGGACAAGGCCTCCGCTTCCATTTCTGACTCCGGCCAACAGCTTCATCACAACTCTTGCTTTTCTGCCGCCGACGATCTTCCTTCTCTACCGACGAGCGAG
CTTAGCGTTCCGGCAGATGATTTGGCGGACCTAGAATGGCTGTCTCATTTTGTTGAGGATCTTTCACGGGATTCTCTGCTCCGTTCCCTCCGCCGGAATATCTGCGTTGA
AGTCCAAGGAGGCGGCCGAGGTGCACCAGGAAAACGACGGTTCCCTTTCACCGCCGCAGCCTTGTTTCAAGATCCCCGTTCCGGTCAAGGCTCGAAGCAAGCGGACGAGA
ACCGGCGGTCGAGTTTGGTGCCTTGGGTCACCGTCGCTCACCGAGTCATCCTCCAGTTCGACGACGTCCTCCTCCTCCTCGTCGCCGGTGAGTCCTTGGCTTATTTCCCA
GAATTCCGACCACTTCGAACAGGCCCTGTTGCCGGAAATTCACGCCTCGAAGAAAGCGAAGAAAAGGTTCCCGACGGAGAAAGCCAGAACCTGCGGGGCCCCCCCGCCTC
GGCGGTGCAGCCACTGCGGAGTCCAGAAAACCCCTCAGTGGAGAACCGGTCCCCTCGGGGCCAAAACTCTCTGCAACGCTTGCGGCGTCCGGTACAAATCCGGCCGGCTA
TTACCCGAATACCGACCCGCCTGCAGCCCCACTTTCTCCAGCGAATTGCACTCCAACCACCATCGGAAGGTCCTCGAGATGCGGCGGAAAAAGGAAGTCGTCGGAGCATC
AGACGCCGCCCCGGCTCCTCCGGCCCTGCCGAGTTTTTGACATCAGAAAACCAAAAAAAAAAAAGAAGAAAATTGTCGTAG
Protein sequenceShow/hide protein sequence
EEEDKASASISDSGQQLHHNSCFSAADDLPSLPTSELSVPADDLADLEWLSHFVEDLSRDSLLRSLRRNICVEVQGGGRGAPGKRRFPFTAAALFQDPRSGQGSKQADEN
RRSSLVPWVTVAHRVILQFDDVLLLLVAGESLAYFPEFRPLRTGPVAGNSRLEESEEKVPDGESQNLRGPPASAVQPLRSPENPSVENRSPRGQNSLQRLRRPVQIRPAI
TRIPTRLQPHFLQRIALQPPSEGPRDAAEKGSRRSIRRRPGSSGPAEFLTSENQKKKRRKLS