CuGenDBv2

Sgr025609 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025609
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: GATA transcription factor
Genome location: tig00152936:1121405..1123021
RNA-Seq Expression: Sgr025609
Synteny: Sgr025609
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: NA
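
The genome location field combines a contig name with a start..end range. A minimal Python sketch for splitting it follows; `parse_location` is a hypothetical helper name, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state:

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00152936:1121405..1123021")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 1617 bp on contig tig00152936.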


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG7013448.1 GATA transcription factor 1 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.8e-53 | % identity: 61.63
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKA--TAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLK
        MESSLAFMDDLLDFSSDIG EDEEDDAV PF+VKPKA  T AAA D S+ NA FHP+D SSCR+LP                                  
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKA--TAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLK

Query:  TRFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD------QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGK
                  E+DYAEEELEWLSNED FPAVETFVDILSD      QPP   SVSKQNSPVSVLE +S SSH  N    K S HGSILMSCC GLKVPGK
Subjt:  TRFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD------QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGK

Query:  ARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA
        ARSKRRR +H+S H LWFK QPSSRN KQ Q  PTTTAT T  AA
Subjt:  ARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA

XP_008460721.1 PREDICTED: GATA transcription factor 1 [Cucumis melo] | e value: 1.4e-52 | % identity: 55.96
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT
        MESSLAFMDDLLDFSSDIGEEDEEDDAV PF+VK K+++  A D S  N  A HPDD SSCR+LP                                   
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT

Query:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD-------QPPP--SVSKQNSPVSVLENSSSSSHSDNHN-ISKASIHG-SILMSCCGGLKVP
                 E+DYAEEELEWLSNEDAFPAVETFVDILSD       QPPP  SVSKQNSPVSVLE++S SSH +  N  +K S+HG SILMSCCGGLKVP
Subjt:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD-------QPPP--SVSKQNSPVSVLENSSSSSHSDNHN-ISKASIHG-SILMSCCGGLKVP

Query:  GKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAGGVEGWEESAYIVELRRPLNGGPGPSDPK
        GKARSKRRR +HIS H LWFK QPSS+N K  QV+P TT TA A AA  G  G           +      GP  PK
Subjt:  GKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAGGVEGWEESAYIVELRRPLNGGPGPSDPK

XP_023006311.1 GATA transcription factor 1-like isoform X1 [Cucurbita maxima] | e value: 3.1e-52 | % identity: 61.83
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR
        MESSLAFMDDLLDFSSDIG EDEEDDAV PF+VKPK  AAA  D S+ NA FHP+D SSCR+LP                                    
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR

Query:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD----QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKARSK
                E+DYAEEELEWLSNED FPAVETFVDILSD    QPP   SVSKQNSPVSVLE +S SSH  N    K S HGSILMSCC GLKVPGKARSK
Subjt:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD----QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKARSK

Query:  RRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA
        RRR +HIS H LWFK QPSSRN K  Q+LP T ATAT  AA
Subjt:  RRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA

XP_023549360.1 GATA transcription factor 1-like [Cucurbita pepo subsp. pepo] | e value: 3.1e-52 | % identity: 61.32
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAA---DPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELL
        MESSLAFMDDLLDFSSDIG EDEEDDAV PF++KPKA   AAA   D S+ NA FHP+D SSCR+LP                                 
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAA---DPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELL

Query:  KTRFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD---QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKAR
                   E+DYAEEELEWLSNED FPAVETFVDILSD   QPP   SVSKQNSPVSVLE +S SSH  N    K S HGSILMSCC GLKVPGKAR
Subjt:  KTRFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD---QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKAR

Query:  SKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA
        SKRRR +H+S H LWFK QPSSRN KQ Q  P TTATAT  AA
Subjt:  SKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA

XP_038875635.1 GATA transcription factor 1 [Benincasa hispida] | e value: 1.4e-52 | % identity: 59.2
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT
        MESSLAFMDDLLDFSSDIGEEDEEDD V PF+VKPK+++  AAD S+ N  A HPDD SSCR+LP                                   
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT

Query:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSDQ-------PPP--SVSKQNSPVSVLENSSSSSHSD-NHNISKASIH--GSILMSCCGGLKV
                 E+DY EEELEWLSNEDAFPAVETFVDILSD        PPP  SVSKQNSPVSVLE++S SSH + N+  +K S+H  GSILMSCCGGLKV
Subjt:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSDQ-------PPP--SVSKQNSPVSVLENSSSSSHSD-NHNISKASIH--GSILMSCCGGLKV

Query:  PGKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAG
        PGKARSKRRR +HIS H LWFK QPSS+N K Q V  T TA  T TAA G
Subjt:  PGKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAG

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LKH9 GATA-type domain-containing protein | e value: 2.1e-49 | % identity: 54.15
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT
        MESSLAFMDDLLDFSSDIGEEDEEDDAV PF+VKPK+++  A D S  N  A HPDD SSCR+LP                                   
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT

Query:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD-------QPP--PSVSKQNSPVSVLENSSSSSHSDNHN-ISKASIH-GSILMSCCGGLKVP
                  ++YAEEELEWLSNEDAFPAVETFVDILSD       QPP  PSVSKQNSPVSVLE++S SSH +  N  +K S+H  SILMSCCG LKVP
Subjt:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD-------QPP--PSVSKQNSPVSVLENSSSSSHSDNHN-ISKASIH-GSILMSCCGGLKVP

Query:  GKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAGGVEGWEESAYIVELRRPLNGGPGPSDPK
         KARSKRRR +HIS H L FK QPSS+N K  QV+P TTATA   AA  G  G           +      GP  PK
Subjt:  GKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAGGVEGWEESAYIVELRRPLNGGPGPSDPK

A0A1S3CCM6 GATA transcription factor | e value: 6.8e-53 | % identity: 55.96
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT
        MESSLAFMDDLLDFSSDIGEEDEEDDAV PF+VK K+++  A D S  N  A HPDD SSCR+LP                                   
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSN--AFHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKT

Query:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD-------QPPP--SVSKQNSPVSVLENSSSSSHSDNHN-ISKASIHG-SILMSCCGGLKVP
                 E+DYAEEELEWLSNEDAFPAVETFVDILSD       QPPP  SVSKQNSPVSVLE++S SSH +  N  +K S+HG SILMSCCGGLKVP
Subjt:  RFVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD-------QPPP--SVSKQNSPVSVLENSSSSSHSDNHN-ISKASIHG-SILMSCCGGLKVP

Query:  GKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAGGVEGWEESAYIVELRRPLNGGPGPSDPK
        GKARSKRRR +HIS H LWFK QPSS+N K  QV+P TT TA A AA  G  G           +      GP  PK
Subjt:  GKARSKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAAAGGVEGWEESAYIVELRRPLNGGPGPSDPK

A0A6J1H6G5 GATA transcription factor | e value: 4.9e-51 | % identity: 60.49
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR
        MESSLAFMDDLLDFSSDIG EDEEDDAV PF+VKPK    AA D S+ NA FHP+D SSCR+LP                                    
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR

Query:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD------QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKAR
                 +DYAEEELEWLSNED FPAVETFVDILSD      QPP   SVSKQNSPVSVLE +S SSH  N    K S HGSILMSCC GLKVPGKAR
Subjt:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD------QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKAR

Query:  SKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA
        SKRRR +H+S H LWFK QPSSRN KQ Q  P TTATAT  AA
Subjt:  SKRRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA

A0A6J1KVI4 GATA transcription factor 1-like isoform X2 | e value: 5.8e-52 | % identity: 61.41
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR
        MESSLAFMDDLLDFSSDIG EDEEDDAV PF+VKPK  AAA  D S+ NA FHP+D SSCR+LP                                    
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR

Query:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD----QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKARSK
                 +DYAEEELEWLSNED FPAVETFVDILSD    QPP   SVSKQNSPVSVLE +S SSH  N    K S HGSILMSCC GLKVPGKARSK
Subjt:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD----QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKARSK

Query:  RRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA
        RRR +HIS H LWFK QPSSRN K  Q+LP T ATAT  AA
Subjt:  RRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA

A0A6J1KXF9 GATA transcription factor | e value: 1.5e-52 | % identity: 61.83
Query:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR
        MESSLAFMDDLLDFSSDIG EDEEDDAV PF+VKPK  AAA  D S+ NA FHP+D SSCR+LP                                    
Subjt:  MESSLAFMDDLLDFSSDIGEEDEEDDAV-PFNVKPKATAAAAADPSQSNA-FHPDD-SSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTR

Query:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD----QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKARSK
                E+DYAEEELEWLSNED FPAVETFVDILSD    QPP   SVSKQNSPVSVLE +S SSH  N    K S HGSILMSCC GLKVPGKARSK
Subjt:  FVLAFWANEQDYAEEELEWLSNEDAFPAVETFVDILSD----QPPP--SVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKARSK

Query:  RRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA
        RRR +HIS H LWFK QPSSRN K  Q+LP T ATAT  AA
Subjt:  RRR-KHISVHQLWFK-QPSSRNAKQQQVLPTTTATATATAA

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q8LAU9 GATA transcription factor 1 | e value: 1.3e-11 | % identity: 48.54
Query:  EEELEWLSNEDAFPAVETFVDILSDQPPPSVS---------KQNSPVSVLENSSSSSHSDNHNISKASIHGS----------ILMSCCGGLKVPGKARSK
        EE+LEW+SN++AFP +ETFV +L  +  P  S         KQ SPVSVLE SS SS +   N S  S +GS           +MSCC G K P KARSK
Subjt:  EEELEWLSNEDAFPAVETFVDILSDQPPPSVS---------KQNSPVSVLENSSSSSHSDNHNISKASIHGS----------ILMSCCGGLKVPGKARSK

Query:  RRR
        RRR
Subjt:  RRR

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G24050.1 GATA transcription factor 1 | e value: 8.9e-13 | % identity: 48.54
Query:  EEELEWLSNEDAFPAVETFVDILSDQPPPSVS---------KQNSPVSVLENSSSSSHSDNHNISKASIHGS----------ILMSCCGGLKVPGKARSK
        EE+LEW+SN++AFP +ETFV +L  +  P  S         KQ SPVSVLE SS SS +   N S  S +GS           +MSCC G K P KARSK
Subjt:  EEELEWLSNEDAFPAVETFVDILSDQPPPSVS---------KQNSPVSVLENSSSSSHSDNHNISKASIHGS----------ILMSCCGGLKVPGKARSK

Query:  RRR
        RRR
Subjt:  RRR


Sequences
CDS sequence
ATGGAGTCTTCTTTGGCTTTCATGGATGACCTGCTGGATTTCTCCTCAGACATCGGCGAGGAAGACGAAGAAGACGACGCCGTACCCTTTAACGTTAAGCCTAAAGCTAC
TGCTGCGGCGGCGGCTGACCCCTCCCAGTCGAACGCCTTCCACCCGGACGATTCTTCTTGCCGTCTTTTGCCTACGCCGCCGGGATATCTCCGGTGGGTTTCAGTTTTGA
GACTTTTTTGGGGGGGGGGGGGAATGGACTCACTCGGGGCCGAGTTTGAGACCGAGTTGCTAAAAACTCGTTTCGTTTTGGCTTTTTGGGCGAATGAGCAGGATTATGCA
GAGGAAGAACTCGAGTGGCTGTCGAACGAAGATGCATTTCCGGCCGTCGAGACGTTCGTCGACATTCTCTCCGACCAGCCGCCGCCGAGCGTCTCCAAGCAGAACAGTCC
GGTGTCGGTTCTCGAGAACTCCTCAAGCAGCAGCCATAGCGACAACCACAATATCAGTAAAGCGAGCATCCATGGCAGTATTCTGATGAGCTGCTGCGGCGGCCTGAAAG
TGCCCGGAAAGGCCCGCAGCAAGCGCCGCCGCAAGCACATTTCCGTCCACCAGCTCTGGTTCAAGCAACCCAGTAGCAGGAATGCGAAACAGCAGCAAGTATTACCCACC
ACGACGGCTACAGCGACGGCGACGGCGGCGGCGGGGGGAGTGGAGGGATGGGAAGAAAGTGCCTACATTGTGGAGCTGAGAAGACCCCTCAATGGCGGGCCGGGCCCCTC
GGACCCAAAACGCTGTCCCGACGTTCTCGCCGGAGTTGCACTCGAATTCCCACCGGAAGGTGATGGAGATGAGGAGGCAGAAGCAGTTCGGTATGTTGGTGAACCCCATG
GATAA
mRNA sequence
ATGGAGTCTTCTTTGGCTTTCATGGATGACCTGCTGGATTTCTCCTCAGACATCGGCGAGGAAGACGAAGAAGACGACGCCGTACCCTTTAACGTTAAGCCTAAAGCTAC
TGCTGCGGCGGCGGCTGACCCCTCCCAGTCGAACGCCTTCCACCCGGACGATTCTTCTTGCCGTCTTTTGCCTACGCCGCCGGGATATCTCCGGTGGGTTTCAGTTTTGA
GACTTTTTTGGGGGGGGGGGGGAATGGACTCACTCGGGGCCGAGTTTGAGACCGAGTTGCTAAAAACTCGTTTCGTTTTGGCTTTTTGGGCGAATGAGCAGGATTATGCA
GAGGAAGAACTCGAGTGGCTGTCGAACGAAGATGCATTTCCGGCCGTCGAGACGTTCGTCGACATTCTCTCCGACCAGCCGCCGCCGAGCGTCTCCAAGCAGAACAGTCC
GGTGTCGGTTCTCGAGAACTCCTCAAGCAGCAGCCATAGCGACAACCACAATATCAGTAAAGCGAGCATCCATGGCAGTATTCTGATGAGCTGCTGCGGCGGCCTGAAAG
TGCCCGGAAAGGCCCGCAGCAAGCGCCGCCGCAAGCACATTTCCGTCCACCAGCTCTGGTTCAAGCAACCCAGTAGCAGGAATGCGAAACAGCAGCAAGTATTACCCACC
ACGACGGCTACAGCGACGGCGACGGCGGCGGCGGGGGGAGTGGAGGGATGGGAAGAAAGTGCCTACATTGTGGAGCTGAGAAGACCCCTCAATGGCGGGCCGGGCCCCTC
GGACCCAAAACGCTGTCCCGACGTTCTCGCCGGAGTTGCACTCGAATTCCCACCGGAAGGTGATGGAGATGAGGAGGCAGAAGCAGTTCGGTATGTTGGTGAACCCCATG
GATAA
Protein sequence
MESSLAFMDDLLDFSSDIGEEDEEDDAVPFNVKPKATAAAAADPSQSNAFHPDDSSCRLLPTPPGYLRWVSVLRLFWGGGGMDSLGAEFETELLKTRFVLAFWANEQDYA
EEELEWLSNEDAFPAVETFVDILSDQPPPSVSKQNSPVSVLENSSSSSHSDNHNISKASIHGSILMSCCGGLKVPGKARSKRRRKHISVHQLWFKQPSSRNAKQQQVLPT
TTATATATAAAGGVEGWEESAYIVELRRPLNGGPGPSDPKRCPDVLAGVALEFPPEGDGDEEAEAVRYVGEPHG
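
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal Python illustration, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first and last six codons of the CDS are used as inputs:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove line wrapping and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAGTCTTCTTTGGCT"))  # first six codons of the CDS
print(translate("GGTGAACCCCATGGATAA"))  # last six codons, ending in TAA
```

The two calls give MESSLA and GEPHG, matching the start and end of the protein sequence above. Passing the full wrapped CDS works the same way, since whitespace is stripped first.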