; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G007930 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G007930
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Descriptiontranscription factor TGA1-like isoform X1
Genome locationCma_Chr04:4044330..4057454
RNA-Seq ExpressionCmaCh04G007930
SyntenyCmaCh04G007930
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025422 - Transcription factor TGA like domain
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600623.1 Transcription factor TGA1, partial [Cucurbita argyrosperma subsp. sororia]9.7e-25682.7Show/hide
Query:  MRPIGFDPETGKSRSYAIFIYRTNEGARRALEEPHKVFEGNKLHCQRAAEGKNKNQNSTQAVQSLAQTQPPMMAAMATASNLPLFAQHPSLNPVCGGFGN
        M PIGFDPETGKSR YAIFIYRTNEGARRALEEPHKVFEGNKLHCQRAAEGKNKNQNSTQAVQSLAQTQPPMMAAMATASNLPLFAQHPSLN VCGGFGN
Subjt:  MRPIGFDPETGKSRSYAIFIYRTNEGARRALEEPHKVFEGNKLHCQRAAEGKNKNQNSTQAVQSLAQTQPPMMAAMATASNLPLFAQHPSLNPVCGGFGN

Query:  TALGVGMLNQGVVPMSQVGLVGSSVGAGIGLSGYSGGSYGLSQLSAGGSSMLGSYGSDSSSLKGLTHIYSSTMLGKAVSDRGPAASGGSLGGYTSYLWLF
         ALG GMLNQGVVPMSQVGLVGSSVGAGIGLSGYSGGSYGLSQLSAGGSSMLGSYGSDSSSLKGL HIYSSTMLGKAVSDRGPAASGGSLGGYTSYL   
Subjt:  TALGVGMLNQGVVPMSQVGLVGSSVGAGIGLSGYSGGSYGLSQLSAGGSSMLGSYGSDSSSLKGLTHIYSSTMLGKAVSDRGPAASGGSLGGYTSYLWLF

Query:  LLACNKGADIRLKTDFMAVEAVNILCSCDAGELVVEDVSFGLVASL-LLRLHPHSHFHHFCVYFHSPVLVSQVFIELLLTIDCCTEESKLVVMMDGAERD
                              N LC C    L++  + F L  S   LRL P S                                     ++ G    
Subjt:  LLACNKGADIRLKTDFMAVEAVNILCSCDAGELVVEDVSFGLVASL-LLRLHPHSHFHHFCVYFHSPVLVSQVFIELLLTIDCCTEESKLVVMMDGAERD

Query:  RMGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRL
        RMGLYEPVNKL MWGNTFRSNANL+VPSST +ME DTKLENQSDDASLG LGDPHIYDQEDTKR+DKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRL
Subjt:  RMGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRL

Query:  IRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIM
        IRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIM
Subjt:  IRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIM

Query:  SGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVN
        SGMWKTSAERLFLWIGGIRPSELLKVLIP+LETLTDQQISETSSL KSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVN
Subjt:  SGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVN

Query:  Q
        Q
Subjt:  Q

KAG7031257.1 Transcription factor TGA4 [Cucurbita argyrosperma subsp. argyrosperma]1.5e-17996.26Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKL MWGNTFRSNANL+VPSST +ME DTKLENQSDDASLG LGDPHIYDQEDTKR+DKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIP+LETLTDQQISETSSL KSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

Query:  VTFNLKFMNRVILTLSLVILTFHGPSPSPGDIHLLMSPKVGLRAALAS
        VTFNLKFMN VILT SLVILTFHGPSPSP DI+LLMSPKVGLRAALAS
Subjt:  VTFNLKFMNRVILTLSLVILTFHGPSPSPGDIHLLMSPKVGLRAALAS

XP_022941789.1 transcription factor TGA4 isoform X1 [Cucurbita moschata]2.0e-15597.33Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKL MWGNTFRSNANL+VPSST IME DTKLENQSDDASLGSLGDPHIYDQEDTKR+DKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVI AFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIP+LETLTDQQISETSSL KSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

XP_022981686.1 transcription factor TGA1-like isoform X1 [Cucurbita maxima]2.6e-160100Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

XP_023541501.1 transcription factor TGA1-like [Cucurbita pepo subsp. pepo]8.8e-15697.67Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKL MWGNTFRSNANL+VPSST IMEADTKLENQSDDAS GSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIP+LETLTDQQISETSSL KSCLQAEDALRQGMEKLQQNLFESIVASQ+DEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

TrEMBL top hitse value%identityAlignment
A0A0A0KUL2 Uncharacterized protein8.9e-13886.38Show/hide
Query:  RMGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRL
        RMGLYEP++ + MWG TFR+NANL+ PSS FI+EAD KLENQSDDASLGSLGDPH+YDQ+DTKRIDKIQRRLAQNREAARKSR+RKKAYIKQLETSR++L
Subjt:  RMGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRL

Query:  IRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIM
        I+LEQELEKARQQ L AGSRFD+ Q+GLSGTTNS I AFESEYEQWVEEQN+QICDLR  VHADITDIELRILVENAMRHYFKFF MKAKAAK DV YIM
Subjt:  IRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIM

Query:  SGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVN
        SGMWKTSAERLFLWIGG RPSELLKVLIP+LETLT+QQISET SL KSCLQAEDALRQGMEKLQQNLFES+VA QL EGSYPLQMTAA+ERLEAL+SFVN
Subjt:  SGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVN

Query:  Q
        Q
Subjt:  Q

A0A6J1FPG2 transcription factor TGA4 isoform X19.5e-15697.33Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKL MWGNTFRSNANL+VPSST IME DTKLENQSDDASLGSLGDPHIYDQEDTKR+DKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVI AFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIP+LETLTDQQISETSSL KSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

A0A6J1FUS2 transcription factor TGA4 isoform X24.4e-13789.33Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKL MWGNTFRSNANL+VPSST IME DTKLENQ                         IQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVI AFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIP+LETLTDQQISETSSL KSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

A0A6J1IX85 transcription factor TGA1-like isoform X11.3e-160100Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

A0A6J1J0B6 transcription factor TGA1-like isoform X21.3e-14191.67Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
        MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQ                         IQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLI

Query:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
        RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS
Subjt:  RLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMS

Query:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
        GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ
Subjt:  GMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQ

SwissProt top hitse value%identityAlignment
P14232 TGACG-sequence-specific DNA-binding protein TGA-1A4.8e-8051.21Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEAD-----------TKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYI
        MG+ +P+++L MW +    N++    S+T I+E D            +L+N+++D S G++G  + Y+ E +K ++K+ RRLAQNREAARKSR+RKKAY+
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEAD-----------TKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYI

Query:  KQLETSRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAK
        +QLE S+L+LI+LEQELE+AR+Q +  G   D  Q+  SGT +S  + F+ EY  WVEEQ +Q  DLR  +H+ I + ELRI+V+  + HYF  F MKA 
Subjt:  KQLETSRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAK

Query:  AAKTDVFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPL-QMTAAI
        AAK DV YIMSGMWKTSAER F+WIGG RPSELLKVL P LE LT+QQ+ E  +L +SC QAEDAL QGM KL Q L E++ A +L EG+Y L QM  AI
Subjt:  AAKTDVFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPL-QMTAAI

Query:  ERLEALISFVNQVTF----NLKFMNRVILT
        E+LE L+ FVNQ        L+ M+R++ T
Subjt:  ERLEALISFVNQVTF----NLKFMNRVILT

Q39162 Transcription factor TGA49.0e-9558.26Show/hide
Query:  RMGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRL
        R  +YEP+N++ MW  +F++N ++  P S  I+  + K ++ S+D S G+ G PH +DQE   ++  DKIQRRLAQNREAARKSR+RKKAY++QLETSRL
Subjt:  RMGLYEPVNKLEMWGNTFRSNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRL

Query:  RLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFY
        +LI LEQEL++ARQQ  + G+  D   +  S   +S I AFE EY  WVEEQN+QIC+LR V+H  ++DIELR LVENAM+HYF+ F MK+ AAK DVFY
Subjt:  RLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFY

Query:  IMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISF
        +MSGMWKTSAER FLWIGG RPSELLKVL+P  + LTDQQ+ +  +L +SC QAEDAL QGMEKLQ  L ES+ A +L EGSY  QMT A+ERLEAL+SF
Subjt:  IMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISF

Query:  VNQVTF----NLKFMNRVILT
        VNQ        L+ M+R++ T
Subjt:  VNQVTF----NLKFMNRVILT

Q39234 Transcription factor TGA37.2e-6846.41Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNAN------LNVPSSTFIMEADTKLE-------NQSDDASLGSLGDPHIYDQEDTKRI-DKIQRRLAQNREAARKSRIRKK
        MG+YEP  +L  W + F+S+ N       N  SS+  +E D + E       N +   +     +P   + +D  RI DK++RRLAQNREAARKSR+RKK
Subjt:  MGLYEPVNKLEMWGNTFRSNAN------LNVPSSTFIMEADTKLE-------NQSDDASLGSLGDPHIYDQEDTKRI-DKIQRRLAQNREAARKSRIRKK

Query:  AYIKQLETSRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHM
        A+++QLE SRL+L +LEQEL +ARQQ L   +  D   +G +G  NS I+AFE EY  W+EEQN+++ ++R  + A I DIEL++LV++ + HY   F M
Subjt:  AYIKQLETSRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHM

Query:  KAKAAKTDVFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIV--ASQLDEGSYPLQM
        KA AAK DVF++MSGMW+TS ER F WIGG RPSELL V++P +E LTDQQ+ E  +L +S  QAE+AL QG++KLQQ L ESI      ++  ++   M
Subjt:  KAKAAKTDVFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIV--ASQLDEGSYPLQM

Query:  TAAIERLEALISFVNQVTF----NLKFMNRVILT
         +A+E L+AL SFVNQ        L+ M++++ T
Subjt:  TAAIERLEALISFVNQVTF----NLKFMNRVILT

Q39237 Transcription factor TGA14.0e-9557.41Show/hide
Query:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET
        R+G+YEPV++  MWG +F+   SN  +N P+   I        N S+D S G+ G PH++DQE   ++  DKIQRRLAQNREAARKSR+RKKAY++QLET
Subjt:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET

Query:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD
        SRL+LI+LEQEL++ARQQ  + G+  D   +G S T N  I+AFE EY  WVEEQN+QIC+LR V+H  I DIELR LVENAM+HYF+ F MK+ AAK D
Subjt:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD

Query:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL
        VF++MSGMW+TSAER FLWIGG RPS+LLKVL+P  + LTDQQ+ +  +L +SC QAEDAL QGMEKLQ  L + + A QL EGSY  Q+ +A++RLEAL
Subjt:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL

Query:  ISFVNQVTF----NLKFMNRVILT
        +SFVNQ        L+ M R++ T
Subjt:  ISFVNQVTF----NLKFMNRVILT

Q93ZE2 Transcription factor TGA74.1e-7149.2Show/hide
Query:  MGLYEPVNKLEMWGNTFRSNANLNVP--SSTFIMEADTKLENQSDDASLG--------SLGDPHIYD-QEDTKRI-DKIQRRLAQNREAARKSRIRKKAY
        MG+YEP  ++  WGN F+S+ N + P  +++ I++ D ++++ +++  +             P   D Q+D  RI DK++RRLAQNREAARKSR+RKKAY
Subjt:  MGLYEPVNKLEMWGNTFRSNANLNVP--SSTFIMEADTKLENQSDDASLG--------SLGDPHIYD-QEDTKRI-DKIQRRLAQNREAARKSRIRKKAY

Query:  IKQLETSRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKA
        ++QLE SRL+L +LEQELEK +QQ            +G SG+ N+ I++FE EY  W++EQ++++ +LR  + + I+DIEL++LVE+ + HY   F MK+
Subjt:  IKQLETSRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKA

Query:  KAAKTDVFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGS-YPLQMTAA
         AAK DVFY++SGMW+TS ER F WIGG RPSELL V++P L+ LTDQQI E  +L +S  QAEDAL QG++KLQQ+L ESIV   + E + YP  M AA
Subjt:  KAAKTDVFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGS-YPLQMTAA

Query:  IERLEALISFVNQ
        IE L+AL  FVNQ
Subjt:  IERLEALISFVNQ

Arabidopsis top hitse value%identityAlignment
AT5G65210.1 bZIP transcription factor family protein2.9e-9657.41Show/hide
Query:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET
        R+G+YEPV++  MWG +F+   SN  +N P+   I        N S+D S G+ G PH++DQE   ++  DKIQRRLAQNREAARKSR+RKKAY++QLET
Subjt:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET

Query:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD
        SRL+LI+LEQEL++ARQQ  + G+  D   +G S T N  I+AFE EY  WVEEQN+QIC+LR V+H  I DIELR LVENAM+HYF+ F MK+ AAK D
Subjt:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD

Query:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL
        VF++MSGMW+TSAER FLWIGG RPS+LLKVL+P  + LTDQQ+ +  +L +SC QAEDAL QGMEKLQ  L + + A QL EGSY  Q+ +A++RLEAL
Subjt:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL

Query:  ISFVNQVTF----NLKFMNRVILT
        +SFVNQ        L+ M R++ T
Subjt:  ISFVNQVTF----NLKFMNRVILT

AT5G65210.2 bZIP transcription factor family protein2.9e-9657.41Show/hide
Query:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET
        R+G+YEPV++  MWG +F+   SN  +N P+   I        N S+D S G+ G PH++DQE   ++  DKIQRRLAQNREAARKSR+RKKAY++QLET
Subjt:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET

Query:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD
        SRL+LI+LEQEL++ARQQ  + G+  D   +G S T N  I+AFE EY  WVEEQN+QIC+LR V+H  I DIELR LVENAM+HYF+ F MK+ AAK D
Subjt:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD

Query:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL
        VF++MSGMW+TSAER FLWIGG RPS+LLKVL+P  + LTDQQ+ +  +L +SC QAEDAL QGMEKLQ  L + + A QL EGSY  Q+ +A++RLEAL
Subjt:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL

Query:  ISFVNQVTF----NLKFMNRVILT
        +SFVNQ        L+ M R++ T
Subjt:  ISFVNQVTF----NLKFMNRVILT

AT5G65210.3 bZIP transcription factor family protein2.9e-9657.41Show/hide
Query:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET
        R+G+YEPV++  MWG +F+   SN  +N P+   I        N S+D S G+ G PH++DQE   ++  DKIQRRLAQNREAARKSR+RKKAY++QLET
Subjt:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET

Query:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD
        SRL+LI+LEQEL++ARQQ  + G+  D   +G S T N  I+AFE EY  WVEEQN+QIC+LR V+H  I DIELR LVENAM+HYF+ F MK+ AAK D
Subjt:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD

Query:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL
        VF++MSGMW+TSAER FLWIGG RPS+LLKVL+P  + LTDQQ+ +  +L +SC QAEDAL QGMEKLQ  L + + A QL EGSY  Q+ +A++RLEAL
Subjt:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL

Query:  ISFVNQVTF----NLKFMNRVILT
        +SFVNQ        L+ M R++ T
Subjt:  ISFVNQVTF----NLKFMNRVILT

AT5G65210.4 bZIP transcription factor family protein2.9e-9657.41Show/hide
Query:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET
        R+G+YEPV++  MWG +F+   SN  +N P+   I        N S+D S G+ G PH++DQE   ++  DKIQRRLAQNREAARKSR+RKKAY++QLET
Subjt:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET

Query:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD
        SRL+LI+LEQEL++ARQQ  + G+  D   +G S T N  I+AFE EY  WVEEQN+QIC+LR V+H  I DIELR LVENAM+HYF+ F MK+ AAK D
Subjt:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD

Query:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL
        VF++MSGMW+TSAER FLWIGG RPS+LLKVL+P  + LTDQQ+ +  +L +SC QAEDAL QGMEKLQ  L + + A QL EGSY  Q+ +A++RLEAL
Subjt:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL

Query:  ISFVNQVTF----NLKFMNRVILT
        +SFVNQ        L+ M R++ T
Subjt:  ISFVNQVTF----NLKFMNRVILT

AT5G65210.5 bZIP transcription factor family protein2.9e-9657.41Show/hide
Query:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET
        R+G+YEPV++  MWG +F+   SN  +N P+   I        N S+D S G+ G PH++DQE   ++  DKIQRRLAQNREAARKSR+RKKAY++QLET
Subjt:  RMGLYEPVNKLEMWGNTFR---SNANLNVPSSTFIMEADTKLENQSDDASLGSLGDPHIYDQE--DTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLET

Query:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD
        SRL+LI+LEQEL++ARQQ  + G+  D   +G S T N  I+AFE EY  WVEEQN+QIC+LR V+H  I DIELR LVENAM+HYF+ F MK+ AAK D
Subjt:  SRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFESEYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTD

Query:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL
        VF++MSGMW+TSAER FLWIGG RPS+LLKVL+P  + LTDQQ+ +  +L +SC QAEDAL QGMEKLQ  L + + A QL EGSY  Q+ +A++RLEAL
Subjt:  VFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQAEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEAL

Query:  ISFVNQVTF----NLKFMNRVILT
        +SFVNQ        L+ M R++ T
Subjt:  ISFVNQVTF----NLKFMNRVILT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCCGATTGGGTTCGATCCCGAAACAGGGAAATCGAGGAGTTATGCTATTTTTATTTACAGAACAAATGAAGGTGCGAGGAGGGCTCTTGAGGAGCCACACAAAGT
ATTCGAAGGGAACAAATTGCATTGCCAGAGGGCGGCTGAGGGGAAGAACAAGAATCAGAATTCAACGCAGGCAGTGCAAAGCTTGGCGCAGACTCAGCCGCCGATGATGG
CGGCCATGGCTACAGCTTCAAATTTGCCATTGTTTGCTCAACATCCGAGCCTCAATCCTGTGTGCGGTGGATTTGGGAACACAGCTTTGGGAGTTGGAATGTTGAACCAG
GGAGTAGTTCCCATGAGTCAAGTGGGTCTGGTTGGTAGTTCAGTTGGGGCTGGGATAGGCTTGAGTGGTTACAGTGGGGGTTCATATGGCTTAAGCCAATTAAGTGCCGG
TGGTTCGTCAATGCTAGGCTCTTACGGGTCCGATTCTTCATCACTGAAGGGGTTGACGCACATTTACTCGAGCACAATGCTTGGCAAGGCCGTATCGGACCGGGGTCCGG
CGGCTTCTGGTGGGTCTCTCGGAGGATACACATCATATTTATGGCTTTTCTTGTTGGCATGCAACAAAGGAGCAGATATTCGCTTAAAAACTGATTTCATGGCAGTTGAG
GCTGTGAACATTTTATGTTCTTGTGATGCTGGTGAATTAGTAGTGGAGGATGTTTCATTTGGTTTAGTAGCTTCACTGCTTCTTCGCCTTCATCCTCATTCGCACTTTCA
TCATTTTTGCGTCTACTTCCATTCTCCTGTTCTTGTTTCCCAAGTTTTCATCGAATTGCTTCTTACAATTGACTGTTGTACAGAAGAAAGCAAGCTGGTGGTGATGATGG
ACGGAGCAGAACGAGATAGAATGGGATTGTACGAACCTGTCAACAAGCTTGAAATGTGGGGGAACACTTTTAGAAGCAATGCCAATTTAAATGTACCATCATCAACTTTC
ATTATGGAAGCTGATACCAAACTCGAGAATCAGTCCGATGATGCTTCACTTGGCTCGTTGGGAGATCCTCACATTTATGATCAAGAAGATACTAAACGTATTGATAAGAT
TCAAAGACGGCTCGCCCAAAATCGGGAAGCTGCCCGCAAAAGTCGCATACGAAAGAAGGCCTATATTAAGCAACTGGAAACGAGCCGTTTGAGACTTATTCGATTAGAGC
AAGAGCTTGAAAAAGCAAGGCAACAAGATCTGTTTGCTGGATCTCGATTTGATCATTATCAGATGGGCTTATCTGGAACCACAAATTCAGTCATCTCTGCATTTGAATCA
GAGTACGAGCAATGGGTAGAAGAGCAGAACAAGCAGATTTGTGATCTAAGGAATGTTGTGCATGCTGATATCACTGATATTGAGCTTCGAATACTTGTAGAAAACGCAAT
GAGACACTACTTCAAATTTTTTCACATGAAAGCCAAAGCTGCAAAAACCGATGTTTTTTACATTATGTCAGGCATGTGGAAAACTTCAGCCGAAAGACTTTTCTTATGGA
TAGGAGGAATTCGCCCTTCAGAACTTCTCAAGGTCTTGATACCAAAACTGGAGACACTGACCGATCAACAAATTTCGGAAACTAGTAGCCTTGGGAAATCATGTCTACAA
GCAGAAGATGCTTTAAGACAAGGTATGGAAAAACTACAACAAAACCTATTTGAGAGCATAGTGGCTAGTCAGCTCGATGAAGGAAGTTATCCCCTACAGATGACCGCTGC
AATCGAAAGATTAGAAGCACTCATTAGCTTTGTGAATCAGGTAACATTTAATCTCAAGTTCATGAACCGTGTTATTTTAACACTCTCTTTGGTCATCTTAACCTTCCATG
GCCCAAGTCCAAGCCCTGGTGATATCCACCTCTTAATGTCTCCTAAAGTTGGCCTTAGGGCAGCTTTGGCTTCCTAG
mRNA sequenceShow/hide mRNA sequence
CCTCGACGCAAAATTTTGATTCTTCTTCTCAAAGAGTTGGTTCTGTTCTGTTGAAGCGATTCAGTAGTTCTCCAATGGAGGGAACCAGCTAAACCACCAAAAGAAAGCTG
GTCAAGAAATCTGACAACAAGGTCGATAAGAAACTCAATAAGGAGCTTGATGATCCCGTTTTCTCTCAGCCACACGACGATGGCACCGACTCCGATTTGGACGATCTTCC
CAAACTCCTCGAGCCCTTCTCCAAGACCCAACTAATCGAGCTCATCTGCACCACTGCTCTCGAAGACGTTAATCTCGAAGCCCAAATTCGAGCTGCTGCCGACCGTGATG
TCACGCACCGCAAGATTTTCGTCCATGGCCTCTGTTGGGACACCACCAAAGAAACCCTTACTTCGGGTTTTGAATCATTTGGTGAAACTGAAGACTGCAATGTGGTTTTG
GATAAGAATACAGGGAAGGCTAAGGGCTACGGGTTTATTTTGTTCAAATCTCGTCAGGGTGCGATTAAAGCGTTGAAGGAACCTAGGAAGAAGATGAATAACAGGATGGC
TTCGTGTCAATTGGCTTCTGTGGGTTCAGTGCCGCCGCCTTAAAGTCAGGAGATTGGGCCGCGGAAGATTTATGTGGCGAATGTGCACCATAACGTGGATGCTGAGAGGC
TTAGGACGTTCTTTGCGAAGTTTGGGGAATTGGAGATGCGTCCGATTGGGTTCGATCCCGAAACAGGGAAATCGAGGAGTTATGCTATTTTTATTTACAGAACAAATGAA
GGTGCGAGGAGGGCTCTTGAGGAGCCACACAAAGTATTCGAAGGGAACAAATTGCATTGCCAGAGGGCGGCTGAGGGGAAGAACAAGAATCAGAATTCAACGCAGGCAGT
GCAAAGCTTGGCGCAGACTCAGCCGCCGATGATGGCGGCCATGGCTACAGCTTCAAATTTGCCATTGTTTGCTCAACATCCGAGCCTCAATCCTGTGTGCGGTGGATTTG
GGAACACAGCTTTGGGAGTTGGAATGTTGAACCAGGGAGTAGTTCCCATGAGTCAAGTGGGTCTGGTTGGTAGTTCAGTTGGGGCTGGGATAGGCTTGAGTGGTTACAGT
GGGGGTTCATATGGCTTAAGCCAATTAAGTGCCGGTGGTTCGTCAATGCTAGGCTCTTACGGGTCCGATTCTTCATCACTGAAGGGGTTGACGCACATTTACTCGAGCAC
AATGCTTGGCAAGGCCGTATCGGACCGGGGTCCGGCGGCTTCTGGTGGGTCTCTCGGAGGATACACATCATATTTATGGCTTTTCTTGTTGGCATGCAACAAAGGAGCAG
ATATTCGCTTAAAAACTGATTTCATGGCAGTTGAGGCTGTGAACATTTTATGTTCTTGTGATGCTGGTGAATTAGTAGTGGAGGATGTTTCATTTGGTTTAGTAGCTTCA
CTGCTTCTTCGCCTTCATCCTCATTCGCACTTTCATCATTTTTGCGTCTACTTCCATTCTCCTGTTCTTGTTTCCCAAGTTTTCATCGAATTGCTTCTTACAATTGACTG
TTGTACAGAAGAAAGCAAGCTGGTGGTGATGATGGACGGAGCAGAACGAGATAGAATGGGATTGTACGAACCTGTCAACAAGCTTGAAATGTGGGGGAACACTTTTAGAA
GCAATGCCAATTTAAATGTACCATCATCAACTTTCATTATGGAAGCTGATACCAAACTCGAGAATCAGTCCGATGATGCTTCACTTGGCTCGTTGGGAGATCCTCACATT
TATGATCAAGAAGATACTAAACGTATTGATAAGATTCAAAGACGGCTCGCCCAAAATCGGGAAGCTGCCCGCAAAAGTCGCATACGAAAGAAGGCCTATATTAAGCAACT
GGAAACGAGCCGTTTGAGACTTATTCGATTAGAGCAAGAGCTTGAAAAAGCAAGGCAACAAGATCTGTTTGCTGGATCTCGATTTGATCATTATCAGATGGGCTTATCTG
GAACCACAAATTCAGTCATCTCTGCATTTGAATCAGAGTACGAGCAATGGGTAGAAGAGCAGAACAAGCAGATTTGTGATCTAAGGAATGTTGTGCATGCTGATATCACT
GATATTGAGCTTCGAATACTTGTAGAAAACGCAATGAGACACTACTTCAAATTTTTTCACATGAAAGCCAAAGCTGCAAAAACCGATGTTTTTTACATTATGTCAGGCAT
GTGGAAAACTTCAGCCGAAAGACTTTTCTTATGGATAGGAGGAATTCGCCCTTCAGAACTTCTCAAGGTCTTGATACCAAAACTGGAGACACTGACCGATCAACAAATTT
CGGAAACTAGTAGCCTTGGGAAATCATGTCTACAAGCAGAAGATGCTTTAAGACAAGGTATGGAAAAACTACAACAAAACCTATTTGAGAGCATAGTGGCTAGTCAGCTC
GATGAAGGAAGTTATCCCCTACAGATGACCGCTGCAATCGAAAGATTAGAAGCACTCATTAGCTTTGTGAATCAGGTAACATTTAATCTCAAGTTCATGAACCGTGTTAT
TTTAACACTCTCTTTGGTCATCTTAACCTTCCATGGCCCAAGTCCAAGCCCTGGTGATATCCACCTCTTAATGTCTCCTAAAGTTGGCCTTAGGGCAGCTTTGGCTTCCT
AGAGATTAGTGATCATCTCGATTGTTCTAATAGAATCTGAATAGCTCAAACAATATTGAATGATGAAGATGAAGTCTTTAGTATTCGTACTCATGCCTTAAGCAGCTAAA
TAATTGCCTACCTTCATTCTTTCTCCATCCCCACTGACAACTCTTTAGCTAAATACATAATAATATTCCCACTTCTATCAGAACAGCATTCCTATCTCATTTAGAATCAA
ATTCAATTGACTTAGATTATAACTTGTACACTTATGATCTAGAGTTTTGTTCAGTCTTCTTCAGCATTTCATCAACATCGTCTACCTTTTTCTTCTCTCTTTTCCAATGT
CCAAATAGGCTGACCATCTACGGCAGGAAACATTACAACAGATGTACAAAATTCTAACAACTCGGCAATCTGCTCAAGGTCTCCTTACTTTAGGGGAGTTTTTCCAACGA
CTCCGAGCGTTGAGCTCACTTTGGGCTAACCGTCCTTGTGGGCCTGTATAGTTCATCATCAAAAAAATTTAAGCTCAAAGTTACCACACCCATTTGAATATCAAGAATCT
TGCTAGGGAAATTTTGTAAATATGATTAATTTTTTAACTATATAGTAATGCCATGTTGTCTTGAAAATAATCTATTTAGTGTCAATATAATCCATCTTCGTAGTTCTTCT
GGTG
Protein sequenceShow/hide protein sequence
MRPIGFDPETGKSRSYAIFIYRTNEGARRALEEPHKVFEGNKLHCQRAAEGKNKNQNSTQAVQSLAQTQPPMMAAMATASNLPLFAQHPSLNPVCGGFGNTALGVGMLNQ
GVVPMSQVGLVGSSVGAGIGLSGYSGGSYGLSQLSAGGSSMLGSYGSDSSSLKGLTHIYSSTMLGKAVSDRGPAASGGSLGGYTSYLWLFLLACNKGADIRLKTDFMAVE
AVNILCSCDAGELVVEDVSFGLVASLLLRLHPHSHFHHFCVYFHSPVLVSQVFIELLLTIDCCTEESKLVVMMDGAERDRMGLYEPVNKLEMWGNTFRSNANLNVPSSTF
IMEADTKLENQSDDASLGSLGDPHIYDQEDTKRIDKIQRRLAQNREAARKSRIRKKAYIKQLETSRLRLIRLEQELEKARQQDLFAGSRFDHYQMGLSGTTNSVISAFES
EYEQWVEEQNKQICDLRNVVHADITDIELRILVENAMRHYFKFFHMKAKAAKTDVFYIMSGMWKTSAERLFLWIGGIRPSELLKVLIPKLETLTDQQISETSSLGKSCLQ
AEDALRQGMEKLQQNLFESIVASQLDEGSYPLQMTAAIERLEALISFVNQVTFNLKFMNRVILTLSLVILTFHGPSPSPGDIHLLMSPKVGLRAALAS