CuGenDBv2

MS022759 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS022759
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: GATA transcription factor 16-like isoform X4
Genome location: scaffold73:270930..272120
RNA-Seq Expression: MS022759
Synteny: MS022759
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type


Homology
GenBank top hits (e value, % identity, alignment)
XP_022135612.1 GATA transcription factor 17-like isoform X1 [Momordica charantia] | e value: 7.9e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

XP_022135613.1 GATA transcription factor 17-like isoform X2 [Momordica charantia] | e value: 7.9e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

XP_022135614.1 GATA transcription factor 16-like isoform X3 [Momordica charantia] | e value: 7.9e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

XP_022135615.1 GATA transcription factor 16-like isoform X4 [Momordica charantia] | e value: 7.9e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

XP_038880207.1 GATA transcription factor 17-like [Benincasa hispida] | e value: 7.8e-44 | % identity: 69.28
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKRR+STIGTNRG DRKRE+ H++G + T  +SATTSS+ T     S +G  DG   +E+LGECGSL 
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQN----ISKQR--PPRKLGEEE-QAAVSLMALSCGSVFA
        MRLMMAL EEV+V QN    + KQR    RKLGEEE QAAVSLMALSCGSV +
Subjt:  MRLMMALGEEVVVQQN----ISKQR--PPRKLGEEE-QAAVSLMALSCGSVFA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C1I3 GATA transcription factor 16-like isoform X4 | e value: 3.8e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

A0A6J1C1Y1 GATA transcription factor 16-like isoform X3 | e value: 3.8e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

A0A6J1C373 GATA transcription factor 17-like isoform X1 | e value: 3.8e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

A0A6J1C5A4 GATA transcription factor 17-like isoform X2 | e value: 3.8e-73 | % identity: 99.32
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS+NGGADGEEEEEDLGECGSLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

A0A6J1JPG4 GATA transcription factor 16-like | e value: 5.2e-38 | % identity: 67.81
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        K CVDC TTKTPLWRGGPAGPKSLCNACGIRFRKRR+S   TNRG  RKRE+ HS   STT    +++SSS T A   + +GG  G + EEDLGEC SLR
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        MR+MM   EEVVV QN+S  +   KLGEEEQAAV LMALSCGSVFA
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

SwissProt top hits (e value, % identity, alignment)
Q8LC59 GATA transcription factor 23 | e value: 6.2e-12 | % identity: 76.32
Query:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI
        C +CKTTKTP+WRGGP GPKSLCNACGIR RK+R S +
Subjt:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI

Q8LG10 GATA transcription factor 15 | e value: 5.8e-18 | % identity: 44.22
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        K C  C T+KTPLWRGGPAGPKSLCNACGIR RK+R  T+ +NR  D+K+ K+H+                                      G+  SL+
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA
         RL M LG EV++Q++ ++ +   KLGEEEQAAV LMALS   SV+A
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA

Q9FJ10 GATA transcription factor 16 | e value: 2.4e-16 | % identity: 44.52
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        K C DC T+KTPLWRGGP GPKSLCNACGIR RK+R    GT    D K+ K  S GG                           GE  ++ L + G +R
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
         R              + KQR  +KLGEEEQAAV LMALS GSV+A
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

Q9LIB5 GATA transcription factor 17 | e value: 1.3e-17 | % identity: 40.65
Query:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR--
        CVDC T +TPLWRGGPAGPKSLCNACGI+ RK+R + +G      +K  K++ +        +A         D K D         ++D   C + R  
Subjt:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR--

Query:  -------MRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCGSVFA
               +   + LG +V V +   + K+R  RKLGEEE+AAV LMALSC SV+A
Subjt:  -------MRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCGSVFA

Q9SZI6 Putative GATA transcription factor 22 | e value: 3.1e-11 | % identity: 69.05
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGT
        + C DC TTKTPLWR GP GPKSLCNACGIR RK R + + T
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGT

Arabidopsis top hits (e value, % identity, alignment)
AT3G06740.1 GATA transcription factor 15 | e value: 4.1e-19 | % identity: 44.22
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        K C  C T+KTPLWRGGPAGPKSLCNACGIR RK+R  T+ +NR  D+K+ K+H+                                      G+  SL+
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA
         RL M LG EV++Q++ ++ +   KLGEEEQAAV LMALS   SV+A
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA

AT3G16870.1 GATA transcription factor 17 | e value: 9.2e-19 | % identity: 40.65
Query:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR--
        CVDC T +TPLWRGGPAGPKSLCNACGI+ RK+R + +G      +K  K++ +        +A         D K D         ++D   C + R  
Subjt:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR--

Query:  -------MRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCGSVFA
               +   + LG +V V +   + K+R  RKLGEEE+AAV LMALSC SV+A
Subjt:  -------MRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCGSVFA

AT4G16141.1 GATA type zinc finger transcription factor family protein | e value: 1.1e-16 | % identity: 37.27
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNG------GADG------EE
        K CVDC T++TPLWRGGPAGPKSLCNACGI+ RK+R + +G  +   + + K++++ G  +  +            AK + G      G  G      + 
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNG------GADG------EE

Query:  EEEDLGECGS-----LRMRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCG
        + E+     +      R+   +  G +V   +   + K+R  RKLGEEE+AAV LMALSCG
Subjt:  EEEDLGECGS-----LRMRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCG

AT5G26930.1 GATA transcription factor 23 | e value: 4.4e-13 | % identity: 76.32
Query:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI
        C +CKTTKTP+WRGGP GPKSLCNACGIR RK+R S +
Subjt:  CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI

AT5G49300.1 GATA transcription factor 16 | e value: 1.7e-17 | % identity: 44.52
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR
        K C DC T+KTPLWRGGP GPKSLCNACGIR RK+R    GT    D K+ K  S GG                           GE  ++ L + G +R
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLR

Query:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
         R              + KQR  +KLGEEEQAAV LMALS GSV+A
Subjt:  MRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA


Sequences
CDS sequence
AAATACTGTGTTGATTGTAAGACTACCAAGACCCCTTTATGGCGTGGAGGCCCCGCTGGACCTAAGTCACTGTGTAACGCATGTGGGATCAGGTTTAGAAAGAGAAGAGT
CTCCACCATTGGAACCAACAGAGGGTGTGACAGAAAGAGAGAAAAGGCTCATAGCCATGGCGGCTCCACCACTGCCGCCATGTCAGCCACCACTTCCTCTAGTGCCACCG
CTGCCGATGCAAAATCCGACAATGGTGGTGCAGATGGGGAGGAGGAAGAAGAGGATTTAGGGGAATGTGGGTCATTGAGGATGAGGCTGATGATGGCGTTGGGGGAGGAG
GTGGTGGTGCAGCAGAATATTTCGAAACAGCGGCCCCCGAGGAAGCTCGGGGAGGAGGAGCAGGCGGCGGTGTCGTTAATGGCGCTGTCCTGTGGCTCTGTGTTTGCC
mRNA sequence
AAATACTGTGTTGATTGTAAGACTACCAAGACCCCTTTATGGCGTGGAGGCCCCGCTGGACCTAAGTCACTGTGTAACGCATGTGGGATCAGGTTTAGAAAGAGAAGAGT
CTCCACCATTGGAACCAACAGAGGGTGTGACAGAAAGAGAGAAAAGGCTCATAGCCATGGCGGCTCCACCACTGCCGCCATGTCAGCCACCACTTCCTCTAGTGCCACCG
CTGCCGATGCAAAATCCGACAATGGTGGTGCAGATGGGGAGGAGGAAGAAGAGGATTTAGGGGAATGTGGGTCATTGAGGATGAGGCTGATGATGGCGTTGGGGGAGGAG
GTGGTGGTGCAGCAGAATATTTCGAAACAGCGGCCCCCGAGGAAGCTCGGGGAGGAGGAGCAGGCGGCGGTGTCGTTAATGGCGCTGTCCTGTGGCTCTGTGTTTGCC
Protein sequence
KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSDNGGADGEEEEEDLGECGSLRMRLMMALGEE
VVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
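
The CDS and protein sequences listed for this record can be cross-checked against each other. The following is a minimal sketch, not part of the CuGenDBv2 record, that assumes the standard genetic code (NCBI translation table 1) and translates the first 45 nucleotides of the CDS shown above. The names `CODON_TABLE`, `translate`, and `CDS_PREFIX` are hypothetical and chosen only for illustration.

```python
import itertools

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Using this table is an assumption of the sketch.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 45 nucleotides of the CDS sequence listed in this record.
CDS_PREFIX = "AAATACTGTGTTGATTGTAAGACTACCAAGACCCCTTTATGGCGT"

if __name__ == "__main__":
    print(translate(CDS_PREFIX))
```

The translated prefix can be compared with the start of the protein sequence listed above; passing the full CDS to the same function allows the whole record to be checked the same way.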