CuGenDBv2

Tan0019031 (gene) of Snake gourd v1 genome

Gene ID: Tan0019031
Organism: Trichosanthes anguina (Snake gourd v1)
Description: B3 domain-containing protein Os03g0120900-like
Genome location: LG06:44839824..44841738
RNA-Seq Expression: Tan0019031
Synteny: Tan0019031
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
  IPR003340 - B3 DNA binding domain
  IPR015300 - DNA-binding pseudobarrel domain superfamily
  IPR044800 - B3 domain-containing transcription factor LEC2-like
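The genome location field uses a seqid:start..end notation. The following is a minimal sketch of how such a string could be parsed and its span computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG06:44839824..44841738'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("LG06:44839824..44841738")
print(loc.seqid, loc.start, loc.end, loc.span)  # LG06 44839824 44841738 1915
```

Under the inclusive-coordinate assumption, the locus spans 1915 bases on LG06.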


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6592800.1 B3 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.2e-127 | % identity: 74.64
Query:  KSKEEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDVGKLNRLVIPKQHAEK
        + +EEEEEEEEEEEI +VACDFF NSH   +    +   +   NQ P MDLSLRIDS      NNNGF+VEREHMFDKVVTPSDVGKLNRLVIPKQHAEK
Subjt:  KSKEEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDVGKLNRLVIPKQHAEK

Query:  YFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLR
        YFPL+STTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD AKDRLFIDWR R D  SRP  HPLLLLPQSLR
Subjt:  YFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLR

Query:  WGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN--NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSG
        W   S R FTLPPPPP      PRRTH    ++LYPNYA E+ N G  +N  N++S+MYYF+P SISSS+SSSLYR+ NG+ IVVN EGSSMGI+KG SG
Subjt:  WGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN--NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSG

Query:  AATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD
          T  KRLRLFGVNMECAAA+GE GE D+  GGVSRRGKEPLSLNWD
Subjt:  AATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD

XP_022960203.1 B3 domain-containing protein Os03g0120900-like [Cucurbita moschata] | e value: 7.9e-127 | % identity: 72.73
Query:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV
        M+F TKSK          EEEEEEEEEEEI +VACDFF NSH   +    +   +   NQ P MDLSLRIDS      NNNGF+VEREHMFDKVVTPSDV
Subjt:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV

Query:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS
        GKLNRLVIPKQHAEKYFPL+STTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD AKDRLFIDWR R D  S
Subjt:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS

Query:  RPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIV
        RP  HPLLLLPQSLRW   S R F TLPPPPP      PRRTH    ++LYPNYA ++ N G  +  +N++S+MYYF+P SISSS+SSSLYR+ NGD IV
Subjt:  RPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIV

Query:  VNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD
        VN EGSSMGI+KG SG  T  KRLRLFGVNMECAAA+GE GE D+ NGGVSRRGKEPL LNWD
Subjt:  VNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD

XP_023004852.1 B3 domain-containing protein Os03g0120900-like [Cucurbita maxima] | e value: 2.4e-128 | % identity: 72.88
Query:  MEFFTKSK---------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVE---DQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV
        M+F TKSK         EEEEEEEEEEEI +VACDFF NSH ++N    V+   +   NQ P M+LSLRIDS      NNNGF+VEREHMFDKVVTPSDV
Subjt:  MEFFTKSK---------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVE---DQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV

Query:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS
        GKLNRLVIPKQHAEKYFPL+STTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD AKDRLFIDWR R D  S
Subjt:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS

Query:  RPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN--NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVV
        RP  HPLLLLPQSLRW   S R F LPPPPP      PRRTH    ++LYPNYA E+ N G  +N  N++S+MYYF+P SISSS+SSSLYR+ NGD IVV
Subjt:  RPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN--NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVV

Query:  NKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGE---GGEEDLSNGGVSRRGKEPLSLNWD
        N EG S+GI+KG SG  T AKRLRLFGVNMECAAA+GE   GGE D+ NGG SRRGKEPLSLNWD
Subjt:  NKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGE---GGEEDLSNGGVSRRGKEPLSLNWD

XP_023514974.1 B3 domain-containing protein Os03g0120900-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.8e-126 | % identity: 70.08
Query:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHC--------------------NKNQQKWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNG
        M+F TKSK          EEEEEEEEEEEI +VACDFF NSH                     N +Q +   +   NQ P MDLSLRIDS      NNNG
Subjt:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHC--------------------NKNQQKWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNG

Query:  FSVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD
        F+VEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPL+STTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD
Subjt:  FSVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD

Query:  SAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSI
         AKDRLFIDWR R D  SRP  HPLLLLPQSLRW   S R F TLPPPPP      PRRTH    ++LYPNYA ++ N G  +  +N++S+MYYF+P SI
Subjt:  SAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSI

Query:  SSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD
        SSS+SSSLYR+ NGD IVVN EGSSMGI+KG SG  T  KRLRLFGVNMECAAA+GE GE D+ NGGVSRRGKEPLSLNWD
Subjt:  SSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD

XP_023514975.1 B3 domain-containing protein Os03g0120900-like isoform X2 [Cucurbita pepo subsp. pepo] | e value: 2.1e-127 | % identity: 73
Query:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV
        M+F TKSK          EEEEEEEEEEEI +VACDFF NSH   +    +   +   NQ P MDLSLRIDS      NNNGF+VEREHMFDKVVTPSDV
Subjt:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV

Query:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS
        GKLNRLVIPKQHAEKYFPL+STTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD AKDRLFIDWR R D  S
Subjt:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS

Query:  RPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIV
        RP  HPLLLLPQSLRW   S R F TLPPPPP      PRRTH    ++LYPNYA ++ N G  +  +N++S+MYYF+P SISSS+SSSLYR+ NGD IV
Subjt:  RPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIV

Query:  VNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD
        VN EGSSMGI+KG SG  T  KRLRLFGVNMECAAA+GE GE D+ NGGVSRRGKEPLSLNWD
Subjt:  VNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KCC3 TF-B3 domain-containing protein | e value: 5.5e-110 | % identity: 64.12
Query:  MEFFTKS----------KEEEEEEEEEEEIRSVACDFFPNSHCNK--NQQKWVEDQGT---------NQQPLMDLSLRIDSNNNNNNNNNGFS--VEREH
        MEF TKS          +EEEE+E++E++   VAC+FFPNSH  +   QQ   +DQ +         +Q  LMDLSLR++S        NGF+  VEREH
Subjt:  MEFFTKS----------KEEEEEEEEEEEIRSVACDFFPNSHCNK--NQQKWVEDQGT---------NQQPLMDLSLRIDSNNNNNNNNNGFS--VEREH

Query:  MFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGV-GDSAKDRL
        MFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDS+TNDKGLILNFEDR+GKPWRFRYSYWNSSQSYVMTKGWSRFVK+KKLDAGD+VSF R +   S  DRL
Subjt:  MFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGV-GDSAKDRL

Query:  FIDW-RRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSL
        FIDW RRR DAP+  HHH     P  LRWG+ +     LPPPPP      PRRT  ++ ++LYPNY FEIPNFG  +N ++++MYYFRP     SSSSSL
Subjt:  FIDW-RRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSL

Query:  YRIGNGDEIVVNKEG-SSMG-IHKGDSGAATAAKRLRLFGVNMECAAAEGE--GGEEDLSNGGVSRRGKEPLSLNWDLL
        YR+GNGDEIVVN +G SSMG I+K  SG   AAKRLRLFGVNMECA+ +GE  GG ED+SNGGV RRGKEPLSLNWDLL
Subjt:  YRIGNGDEIVVNKEG-SSMG-IHKGDSGAATAAKRLRLFGVNMECAAAEGE--GGEEDLSNGGVSRRGKEPLSLNWDLL

A0A6J1ETF9 B3 domain-containing protein Os03g0120900-like | e value: 2.5e-118 | % identity: 67.75
Query:  MEFFTKS------KEEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVEDQGTNQ----QPLMDLSLRIDSNNNNNNNNNGFS-VEREHMFDKVVTPSDVG
        MEF TKS      +EEEEEEEE++EIR VACDFFPNS      Q+WV+    NQ     PLMDLSLR+++        NGF  VEREHMFDKVVTPSDVG
Subjt:  MEFFTKS------KEEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVEDQGTNQ----QPLMDLSLRIDSNNNNNNNNNGFS-VEREHMFDKVVTPSDVG

Query:  KLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPSR
        KLNRLVIPKQHAEKYFPLDS TNDKGLILNFED NGKPWRFRYSYWNSSQSYVMTKGWSRFVK+KKLDAGDVVSF RG+GDSA+DRLFIDWRRRP+AP+ 
Subjt:  KLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPSR

Query:  PHHH-----PLLLLPQSLRWGSGSGRQFTLPPP--PPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGD
        P HH     PLL LPQSLRW +       LPPP  PP H             + LY NYAFEIPN G P NN+SS MYYFRPTS+SSSSS          
Subjt:  PHHH-----PLLLLPQSLRWGSGSGRQFTLPPP--PPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGD

Query:  EIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAE-GEGGEEDLSNGGVSRRGKEPLSLNWDLL
        E+VV+ EGSSMGI K   G   AAKRLRLFGVNMECAA + G GG EDLSNGGVSRRGK+P SLNWDL+
Subjt:  EIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAE-GEGGEEDLSNGGVSRRGKEPLSLNWDLL

A0A6J1H862 B3 domain-containing protein Os03g0120900-like | e value: 3.8e-127 | % identity: 72.73
Query:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV
        M+F TKSK          EEEEEEEEEEEI +VACDFF NSH   +    +   +   NQ P MDLSLRIDS      NNNGF+VEREHMFDKVVTPSDV
Subjt:  MEFFTKSK----------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQ--KWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV

Query:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS
        GKLNRLVIPKQHAEKYFPL+STTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD AKDRLFIDWR R D  S
Subjt:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS

Query:  RPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIV
        RP  HPLLLLPQSLRW   S R F TLPPPPP      PRRTH    ++LYPNYA ++ N G  +  +N++S+MYYF+P SISSS+SSSLYR+ NGD IV
Subjt:  RPHHHPLLLLPQSLRWGSGSGRQF-TLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPT--NNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIV

Query:  VNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD
        VN EGSSMGI+KG SG  T  KRLRLFGVNMECAAA+GE GE D+ NGGVSRRGKEPL LNWD
Subjt:  VNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWD

A0A6J1JVF1 B3 domain-containing protein Os03g0120900-like | e value: 8.0e-117 | % identity: 67.49
Query:  MEFFTKS------KEEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVEDQGTNQ----QPLMDLSLRIDSNNNNNNNNNGFS-VEREHMFDKVVTPSDVG
        MEF TKS      +EEEEEEEE++EIR VACDFFPNS      Q+WV+    NQ     PLMDLSLR+++        NGF  VEREHMFDKVVTPSDVG
Subjt:  MEFFTKS------KEEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVEDQGTNQ----QPLMDLSLRIDSNNNNNNNNNGFS-VEREHMFDKVVTPSDVG

Query:  KLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPSR
        KLNRLVIPKQHAEKYFPLDS T DKGLILNFED NGKPWRFRYSYWNSSQSYVMTKGWSRFVK+KKLDAGDVVSF RGVGDSA+DRLFIDWRRRP+A + 
Subjt:  KLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPSR

Query:  PHHH-----PLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGDEI
        P HH     PLL LPQSLRW +          PPP HP  + R         LY NYAFEIPN G P NN+SS MYYFRPTS+SSSS           EI
Subjt:  PHHH-----PLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGDEI

Query:  VVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWDLL
        VV+ EGSSMGI K  SG   AAKRLRLFGVNMECA    +GGEEDLSNG VSRRGK+P SLNWDL+
Subjt:  VVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDLSNGGVSRRGKEPLSLNWDLL

A0A6J1KRJ4 B3 domain-containing protein Os03g0120900-like | e value: 1.2e-128 | % identity: 72.88
Query:  MEFFTKSK---------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVE---DQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV
        M+F TKSK         EEEEEEEEEEEI +VACDFF NSH ++N    V+   +   NQ P M+LSLRIDS      NNNGF+VEREHMFDKVVTPSDV
Subjt:  MEFFTKSK---------EEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVE---DQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDV

Query:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS
        GKLNRLVIPKQHAEKYFPL+STTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD AKDRLFIDWR R D  S
Subjt:  GKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPS

Query:  RPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN--NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVV
        RP  HPLLLLPQSLRW   S R F LPPPPP      PRRTH    ++LYPNYA E+ N G  +N  N++S+MYYF+P SISSS+SSSLYR+ NGD IVV
Subjt:  RPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN--NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVV

Query:  NKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGE---GGEEDLSNGGVSRRGKEPLSLNWD
        N EG S+GI+KG SG  T AKRLRLFGVNMECAAA+GE   GGE D+ NGG SRRGKEPLSLNWD
Subjt:  NKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGE---GGEEDLSNGGVSRRGKEPLSLNWD

SwissProt top hits (columns: hit, e value, % identity, alignment)
O82799 B3 domain-containing transcription factor NGA1 | e value: 1.2e-58 | % identity: 50.36
Query:  EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAK
        +REHMFDKVVTPSDVGKLNRLVIPKQHAE++FPLDS++N+KGL+LNFED  GK WRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGD+VSFQR VGDS +
Subjt:  EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAK

Query:  D-RLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIP-NFGTPTNNHSSNMYYFRPTSISSSS
        D RLFIDWRRRP  P  PH     + P+   +             P T+ + Y  +   ++ +    NY  +IP  FG          Y+ R     ++ 
Subjt:  D-RLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIP-NFGTPTNNHSSNMYYFRPTSISSSS

Query:  SSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG------EEDLSNGGVSRRG
        ++++      D +V+      M          TA KRLRLFGV+MEC    GE G      EE  S+GG   RG
Subjt:  SSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG------EEDLSNGGVSRRG

Q7F9W2 B3 domain-containing protein Os04g0581400 | e value: 1.3e-55 | % identity: 69.33
Query:  VEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSA
        +E+EHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDS  N+KGL+L+FEDR GK WRFRYSYWNSSQSYVMTKGWSRFVK+K+LDAGD VSF RG  ++ 
Subjt:  VEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSA

Query:  KDRLFIDWRRRPDAPSRPHHHPLLLLPQSL---RWGSGSGRQFTLPPPPP
        +DRLFIDW+RR D    PH    L LP +     WG G+G     P  PP
Subjt:  KDRLFIDWRRRPDAPSRPHHHPLLLLPQSL---RWGSGSGRQFTLPPPPP

Q8LMR9 B3 domain-containing protein Os03g0120900 | e value: 5.2e-57 | % identity: 45.42
Query:  SVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDS
        +VE+EHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLD+ +N+KGL+L+FEDR GKPWRFRYSYWNSSQSYVMTKGWSRFVK+K+LDAGD VSF RGVG++
Subjt:  SVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDS

Query:  AKDRLFIDWRRRPD---APSRPHHHPLLLLPQSLRW---------------GSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN
        A+ RLFIDWRRRPD   A   P H     LP S+ +                + +G +F LPP            + P Y ++    +A     +   T 
Subjt:  AKDRLFIDWRRRPD---APSRPHHHPLLLLPQSLRW---------------GSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTN

Query:  NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAE-------GEGGEEDL---SNGGVSRRGKEPL
          S  + ++RP         +         +V+      M     +  +A  +KR+RLFGVN++CA +E       G+     L    +   S  GK   
Subjt:  NHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAE-------GEGGEEDL---SNGGVSRRGKEPL

Query:  SLNWDL
        SLN DL
Subjt:  SLNWDL

Q9M268 B3 domain-containing transcription factor NGA2 | e value: 4.4e-56 | % identity: 52.9
Query:  SVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLD-STTND--KGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGV
        S+EREHMFDKVVTPSDVGKLNRLVIPKQHAE+YFPLD STTND  KGL+LNFEDR+G  WRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGD+VSFQR  
Subjt:  SVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLD-STTND--KGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGV

Query:  GDSA-KDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSI
         DS  KD+L+IDWRRRP  P   HHH           G+   R +T   P P  PT Y   TH     NLY  + F   + G     +  +M    PT++
Subjt:  GDSA-KDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSI

Query:  SSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG---EEDLSNGGVSRRGK
          S    + R           + +SM        A+   KRLRLFGV+MEC    G      EE  S+GG   RG+
Subjt:  SSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG---EEDLSNGGVSRRGK

Q9MAN1 B3 domain-containing transcription factor NGA3 | e value: 1.9e-54 | % identity: 44.74
Query:  EDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSV----EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNS
        ++Q  +Q+   ++     S  + NNNN    +    E+EHMFDKVVTPSDVGKLNRLVIPKQHAE+YFPLDS+ N  G +LNF+DRNGK WRFRYSYWNS
Subjt:  EDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSV----EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNS

Query:  SQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD-SAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHP-----
        SQSYVMTKGWSRFVK+KKLDAGD+VSFQRG+GD S + +L+IDWR RPD          + L Q+ ++G+  G  F  P       +QY  R HP     
Subjt:  SQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD-SAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHP-----

Query:  -----------NYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMG------IHKGDSGAATAAKRLRLFGV
                   N+  + Y     E   +G    N +   YY       + S      I   + +V++      G      +       +TA KRLRLFGV
Subjt:  -----------NYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMG------IHKGDSGAATAAKRLRLFGV

Query:  NMEC
        NMEC
Subjt:  NMEC

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G01030.1 AP2/B3-like transcriptional factor family protein | e value: 1.3e-55 | % identity: 44.74
Query:  EDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSV----EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNS
        ++Q  +Q+   ++     S  + NNNN    +    E+EHMFDKVVTPSDVGKLNRLVIPKQHAE+YFPLDS+ N  G +LNF+DRNGK WRFRYSYWNS
Subjt:  EDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSV----EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNS

Query:  SQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD-SAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHP-----
        SQSYVMTKGWSRFVK+KKLDAGD+VSFQRG+GD S + +L+IDWR RPD          + L Q+ ++G+  G  F  P       +QY  R HP     
Subjt:  SQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGD-SAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHP-----

Query:  -----------NYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMG------IHKGDSGAATAAKRLRLFGV
                   N+  + Y     E   +G    N +   YY       + S      I   + +V++      G      +       +TA KRLRLFGV
Subjt:  -----------NYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMG------IHKGDSGAATAAKRLRLFGV

Query:  NMEC
        NMEC
Subjt:  NMEC

AT2G46870.1 AP2/B3-like transcriptional factor family protein | e value: 8.8e-60 | % identity: 50.36
Query:  EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAK
        +REHMFDKVVTPSDVGKLNRLVIPKQHAE++FPLDS++N+KGL+LNFED  GK WRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGD+VSFQR VGDS +
Subjt:  EREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAK

Query:  D-RLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIP-NFGTPTNNHSSNMYYFRPTSISSSS
        D RLFIDWRRRP  P  PH     + P+   +             P T+ + Y  +   ++ +    NY  +IP  FG          Y+ R     ++ 
Subjt:  D-RLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIP-NFGTPTNNHSSNMYYFRPTSISSSS

Query:  SSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG------EEDLSNGGVSRRG
        ++++      D +V+      M          TA KRLRLFGV+MEC    GE G      EE  S+GG   RG
Subjt:  SSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG------EEDLSNGGVSRRG

AT3G61970.1 AP2/B3-like transcriptional factor family protein | e value: 3.1e-57 | % identity: 52.9
Query:  SVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLD-STTND--KGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGV
        S+EREHMFDKVVTPSDVGKLNRLVIPKQHAE+YFPLD STTND  KGL+LNFEDR+G  WRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGD+VSFQR  
Subjt:  SVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLD-STTND--KGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGV

Query:  GDSA-KDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSI
         DS  KD+L+IDWRRRP  P   HHH           G+   R +T   P P  PT Y   TH     NLY  + F   + G     +  +M    PT++
Subjt:  GDSA-KDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPTHPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSI

Query:  SSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG---EEDLSNGGVSRRGK
          S    + R           + +SM        A+   KRLRLFGV+MEC    G      EE  S+GG   RG+
Subjt:  SSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGG---EEDLSNGGVSRRGK

AT5G06250.1 AP2/B3-like transcriptional factor family protein | e value: 4.9e-42 | % identity: 66.13
Query:  REHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPL----------DSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSF
        +E +F+K +TPSDVGKLNRLVIPKQHAEKYFPL          D+++++KG++L+FED +GK WRFRYSYWNSSQSYV+TKGWSRFVKDK+LD GDVV F
Subjt:  REHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPL----------DSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSF

Query:  QRGVGDSAKDRLFIDWRRRPDAPS
        QR   DS   RLFI WRRR    S
Subjt:  QRGVGDSAKDRLFIDWRRRPDAPS

AT5G06250.2 AP2/B3-like transcriptional factor family protein | e value: 4.9e-42 | % identity: 66.13
Query:  REHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPL----------DSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSF
        +E +F+K +TPSDVGKLNRLVIPKQHAEKYFPL          D+++++KG++L+FED +GK WRFRYSYWNSSQSYV+TKGWSRFVKDK+LD GDVV F
Subjt:  REHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPL----------DSTTNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSF

Query:  QRGVGDSAKDRLFIDWRRRPDAPS
        QR   DS   RLFI WRRR    S
Subjt:  QRGVGDSAKDRLFIDWRRRPDAPS


Sequences
CDS sequence
ATGGAGTTTTTCACAAAATCGAAAGAAGAAGAAGAGGAGGAGGAAGAAGAAGAAGAAATTCGGAGTGTTGCATGCGATTTCTTCCCAAATTCACACTGTAATAAGAACCA
ACAGAAATGGGTTGAAGATCAGGGTACAAATCAGCAGCCTCTCATGGACTTATCCCTCCGAATCGACTCCAACAACAACAACAATAACAACAACAATGGGTTTTCTGTTG
AGAGAGAGCACATGTTCGACAAAGTGGTCACTCCAAGCGATGTGGGAAAATTAAACCGTCTAGTCATCCCAAAGCAACACGCCGAAAAATACTTCCCTTTGGACTCAACC
ACCAACGACAAAGGCCTCATCTTAAACTTCGAGGACCGCAACGGAAAGCCCTGGAGGTTTCGTTACTCTTACTGGAACAGCAGCCAGAGTTACGTGATGACTAAAGGCTG
GAGCCGTTTTGTCAAAGACAAGAAATTGGATGCAGGCGACGTCGTTTCCTTCCAGCGTGGCGTCGGCGATTCCGCTAAGGACCGCCTTTTCATCGACTGGCGCCGCCGTC
CCGATGCCCCATCTCGCCCTCACCATCATCCTCTCCTCCTCCTCCCACAGTCTCTCCGGTGGGGCTCTGGCAGTGGCCGTCAATTTACTCTTCCACCACCTCCTCCAACT
CATCCGACGCAGTACCCCAGACGAACCCACCCTAATTACTCCAATAATCTCTATCCCAATTACGCCTTTGAAATCCCCAATTTTGGGACGCCTACCAATAATCATTCTTC
CAATATGTATTATTTTCGACCTACCTCTATTTCCTCTTCCTCTTCCTCCTCTCTCTATCGGATCGGGAATGGGGATGAGATTGTTGTTAATAAGGAAGGCTCTTCCATGG
GGATTCACAAAGGCGACTCCGGCGCTGCCACCGCGGCCAAACGACTCCGACTGTTTGGAGTGAATATGGAGTGTGCTGCAGCCGAGGGAGAAGGAGGAGAAGAGGATCTC
TCAAACGGCGGCGTTTCCAGGAGAGGGAAAGAGCCCTTGTCGTTGAATTGGGATTTACTTTAA
mRNA sequence
GAAACCACATCACACACATACTCTCTCTCTCTCTCTTTCTATCTGTCTGTTTGTCTCTCTCTCTCTCTCTCTGTGATTTGTGTTTGTGAGTGAAAGCTCAATTTCCTCCC
TCAAGTGTGTATGGAAATTTCAAGAATTTAAAACCACCCAAAGAGAGAAAAAAAAGAAAAAGAGGGTAAGAGAGAATCAATTTGAGCTTATCGAGTCCGAAATGGCTTAT
CCTGTTTAGGGTTTAGAAGGTTTCCACACACAGGTAAAAACCAGCAAGCTCTTCCATTATCCGGGCTGGCCTCCTCCTAAATCCAGAAACTGAATTTTGGAAGAAAATAA
AAGCGCAAGGCATGGAGTTTTTCACAAAATCGAAAGAAGAAGAAGAGGAGGAGGAAGAAGAAGAAGAAATTCGGAGTGTTGCATGCGATTTCTTCCCAAATTCACACTGT
AATAAGAACCAACAGAAATGGGTTGAAGATCAGGGTACAAATCAGCAGCCTCTCATGGACTTATCCCTCCGAATCGACTCCAACAACAACAACAATAACAACAACAATGG
GTTTTCTGTTGAGAGAGAGCACATGTTCGACAAAGTGGTCACTCCAAGCGATGTGGGAAAATTAAACCGTCTAGTCATCCCAAAGCAACACGCCGAAAAATACTTCCCTT
TGGACTCAACCACCAACGACAAAGGCCTCATCTTAAACTTCGAGGACCGCAACGGAAAGCCCTGGAGGTTTCGTTACTCTTACTGGAACAGCAGCCAGAGTTACGTGATG
ACTAAAGGCTGGAGCCGTTTTGTCAAAGACAAGAAATTGGATGCAGGCGACGTCGTTTCCTTCCAGCGTGGCGTCGGCGATTCCGCTAAGGACCGCCTTTTCATCGACTG
GCGCCGCCGTCCCGATGCCCCATCTCGCCCTCACCATCATCCTCTCCTCCTCCTCCCACAGTCTCTCCGGTGGGGCTCTGGCAGTGGCCGTCAATTTACTCTTCCACCAC
CTCCTCCAACTCATCCGACGCAGTACCCCAGACGAACCCACCCTAATTACTCCAATAATCTCTATCCCAATTACGCCTTTGAAATCCCCAATTTTGGGACGCCTACCAAT
AATCATTCTTCCAATATGTATTATTTTCGACCTACCTCTATTTCCTCTTCCTCTTCCTCCTCTCTCTATCGGATCGGGAATGGGGATGAGATTGTTGTTAATAAGGAAGG
CTCTTCCATGGGGATTCACAAAGGCGACTCCGGCGCTGCCACCGCGGCCAAACGACTCCGACTGTTTGGAGTGAATATGGAGTGTGCTGCAGCCGAGGGAGAAGGAGGAG
AAGAGGATCTCTCAAACGGCGGCGTTTCCAGGAGAGGGAAAGAGCCCTTGTCGTTGAATTGGGATTTACTTTAATTACTCTACTCGGCTTTGGATTTCGCTCGCTCATCG
CTTCTTTTTTCTGCAACACCAACACAATTCAAGAGAAAACGACTACAGAACAACCGAAAACACCCAAAAAAAAAAAAAAACACAAAAATAAAATCAAGGTAGTGGAAATT
TGATCCTATGAATTTTCTCTATTTTTTTTTTTAATTTTTTGTTTTTCTAAACACATTTTCTGTTATAATTAATTTTTACTCCAAAAAGTTGATCGAAATGCGATGGAAAA
GTAGAAGAAAAACAAGATATTGAGATCGCAAAAATTGTAGGCTTTGAAGTTGGTGAAACAAAGTATAGAGATTATTGTTGTTCCATATATATTTGAGATATGAAGTGCGT
AGCGGTAATGATGATGAATTGGATGAGTCGAATTTAGTTAAGGAGAATTAATTAGTTAGAATGAATGTGGTAGTGGTAGTGGTAGTGGAAGAGATTATTGTTGTATAGAA
AATGAAGAGAAGTTTTATAATAAGAATAATAAGAGAATGAAAATT
Protein sequence
MEFFTKSKEEEEEEEEEEEIRSVACDFFPNSHCNKNQQKWVEDQGTNQQPLMDLSLRIDSNNNNNNNNNGFSVEREHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDST
TNDKGLILNFEDRNGKPWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDVVSFQRGVGDSAKDRLFIDWRRRPDAPSRPHHHPLLLLPQSLRWGSGSGRQFTLPPPPPT
HPTQYPRRTHPNYSNNLYPNYAFEIPNFGTPTNNHSSNMYYFRPTSISSSSSSSLYRIGNGDEIVVNKEGSSMGIHKGDSGAATAAKRLRLFGVNMECAAAEGEGGEEDL
SNGGVSRRGKEPLSLNWDLL
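The protein sequence should correspond to a translation of the CDS sequence. The sketch below shows one way to check this, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration. The two test strings are the first 21 and the last 12 bases of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing anything
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAGTTTTTCACAAAATCG"))  # MEFFTKS
print(translate("GATTTACTTTAA"))           # DLL
```

Passing the full CDS (line breaks are tolerated) and comparing the result with the protein sequence above would confirm whether the two records agree.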