CuGenDBv2

Tan0009482 (gene) of Snake gourd v1 genome

Gene ID: Tan0009482
Organism: Trichosanthes anguina (Snake gourd v1)
Description: transcription factor MYB102-like
Genome location: LG05:84176790..84178152
RNA-Seq Expression: Tan0009482
Synteny: Tan0009482
Gene Ontology terms: NA
InterPro domains:
  IPR001005 - SANT/Myb domain
  IPR009057 - Homeobox-like domain superfamily
  IPR017930 - Myb domain
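
The genome location field uses a `sequence:start..end` notation. The following is a minimal Python sketch for splitting it into its parts. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split a string such as 'LG05:84176790..84178152' into (sequence ID, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seq_id, start, end = parse_location("LG05:84176790..84178152")
print(seq_id, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1363 bp on LG05.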


Homology
GenBank top hits | e value | % identity
XP_022140118.1 transcription factor MYB102-like [Momordica charantia] | 6.3e-150 | 83.38
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGRTPCCDKNGLKKGPWTPEEDQKL+DYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAT+LPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSS-ERKSPSFSPQNSTPTVQFSHPQ--
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLS+LLGVQPLVNPELLKLAASLMSS +RKSPSFSPQNS+  +QFS PQ  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSS-ERKSPSFSPQNSTPTVQFSHPQ--

Query:  -------LQEIVQFPQEA--QLVEPIGSELNDQWQSGLIFPTNLSLDCNF--ELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNN-NFSLSSVLSS
               LQEIVQ+P+EA  QLVEPIGSELNDQWQSG I P+NLS +C+F   LP FD YCGL         D SYET TF SHN NNN NFSL+SVLSS
Subjt:  -------LQEIVQFPQEA--QLVEPIGSELNDQWQSGLIFPTNLSLDCNF--ELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNN-NFSLSSVLSS

Query:  PCSSSPTQMNSN-STNITSATEDERESYCSQILNFEIPDIFDE
        P SSSPTQMNSN ST  TSATEDERESYCSQ+L+FEI DIF+E
Subjt:  PCSSSPTQMNSN-STNITSATEDERESYCSQILNFEIPDIFDE

XP_022927584.1 transcription factor MYB102-like [Cucurbita moschata] | 1.5e-151 | 84.96
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR PCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLP NAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKS SFSPQN   T+QFSHPQL  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--

Query:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSHN--GNN-NNFSLSSVLSSPCSS
            QEIVQFP   QLVEPI SELNDQWQ        +S DC N E+P GFD YCG DQQ  AA +D SYE PTFSSH   GNN NNF+LSSVLSSPCSS
Subjt:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSHN--GNN-NNFSLSSVLSSPCSS

Query:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA
        SPTQMNSNST  TS  EDERESYCSQILNF+IPD FDE+
Subjt:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA

XP_023000788.1 transcription factor MYB102-like [Cucurbita maxima] | 2.2e-150 | 83.78
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR PCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLP NAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEE IIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKS SFSPQN   T+QFSHPQL  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--

Query:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSH---NGNNNNFSLSSVLSSPCSS
            QEIVQFP   QLVEPI SELNDQWQ        +S DC N ++P GFD YCG DQQ  AA +D SYE PTFSSH     N NNFSLSSV+SSPCSS
Subjt:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSH---NGNNNNFSLSSVLSSPCSS

Query:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA
        SPTQMNSNST  TS  EDERESYCSQILNF+IPD FDE+
Subjt:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA

XP_023519871.1 transcription factor MYB102-like [Cucurbita pepo subsp. pepo] | 3.3e-151 | 84.66
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR PCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLP NAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKS SFSPQN   T+QFSHPQL  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--

Query:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSHN--GNN-NNFSLSSVLSSPCSS
            QEIVQFP   QLVEPI SELNDQWQ        +S +C N E+P GFD YCG DQQ  AA +D SYE PTFSSH   GNN NNF+LSSVLSSPCSS
Subjt:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSHN--GNN-NNFSLSSVLSSPCSS

Query:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA
        SPTQMNSNST  TS  EDERESYCSQILNF+IPD FDE+
Subjt:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA

XP_038894939.1 transcription factor MYB41-like [Benincasa hispida] | 1.1e-157 | 87.16
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGRTPCCDKNGLKKGPWTPEED KLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNST-PTVQFSHPQL-
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERK+PSFSPQNST  T+QFSHPQL 
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNST-PTVQFSHPQL-

Query:  -----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGLD-QQQPAAGVDRSYETPTFSSHNGNNNNFSLSSVLSSPCSSSPT
             QEIVQFP  +Q+VEPI SELNDQWQ+G I P+NLS  CNF+L     Y GLD QQQ AA V+ SYET TFSSHNGN  NF L SVLSSPCSSSPT
Subjt:  -----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGLD-QQQPAAGVDRSYETPTFSSHNGNNNNFSLSSVLSSPCSSSPT

Query:  QMNSNSTNITSATEDERESYCSQILNFEIPDIFDE
        QMNSNST  TS TEDERESYCSQILNFEI DIFDE
Subjt:  QMNSNSTNITSATEDERESYCSQILNFEIPDIFDE

TrEMBL top hits | e value | % identity
A0A1S3B5A9 protein ODORANT1-like | 1.2e-149 | 84.3
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNS----TPTVQFSHP
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNS    T T+QFS+P
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNS----TPTVQFSHP

Query:  QL------QEIVQFPQEAQLVEP-IGSELN-DQWQSGLIFPTNLSLDCNFELPGFDY-YCGLDQQQP--AAGVDRSYETPTFSSHNGNN--NNFSLSSVL
        QL      QEIVQFP  +Q+VEP I SELN DQW +G +         NF+L    + YCGLDQQQ   A GVD SYETPTFS HNGN+  NNFSL SVL
Subjt:  QL------QEIVQFPQEAQLVEP-IGSELN-DQWQSGLIFPTNLSLDCNFELPGFDY-YCGLDQQQP--AAGVDRSYETPTFSSHNGNN--NNFSLSSVL

Query:  SSPCSSSPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDE
        SSPCSSSPTQMNSNST  TS TEDERESYCSQILNFEI DIFDE
Subjt:  SSPCSSSPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDE

A0A5A7UQN0 Protein ODORANT1-like | 1.2e-149 | 84.3
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNS----TPTVQFSHP
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNS    T T+QFS+P
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNS----TPTVQFSHP

Query:  QL------QEIVQFPQEAQLVEP-IGSELN-DQWQSGLIFPTNLSLDCNFELPGFDY-YCGLDQQQP--AAGVDRSYETPTFSSHNGNN--NNFSLSSVL
        QL      QEIVQFP  +Q+VEP I SELN DQW +G +         NF+L    + YCGLDQQQ   A GVD SYETPTFS HNGN+  NNFSL SVL
Subjt:  QL------QEIVQFPQEAQLVEP-IGSELN-DQWQSGLIFPTNLSLDCNFELPGFDY-YCGLDQQQP--AAGVDRSYETPTFSSHNGNN--NNFSLSSVL

Query:  SSPCSSSPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDE
        SSPCSSSPTQMNSNST  TS TEDERESYCSQILNFEI DIFDE
Subjt:  SSPCSSSPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDE

A0A6J1CHA1 transcription factor MYB102-like | 3.0e-150 | 83.38
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGRTPCCDKNGLKKGPWTPEEDQKL+DYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAT+LPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSS-ERKSPSFSPQNSTPTVQFSHPQ--
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLS+LLGVQPLVNPELLKLAASLMSS +RKSPSFSPQNS+  +QFS PQ  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSS-ERKSPSFSPQNSTPTVQFSHPQ--

Query:  -------LQEIVQFPQEA--QLVEPIGSELNDQWQSGLIFPTNLSLDCNF--ELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNN-NFSLSSVLSS
               LQEIVQ+P+EA  QLVEPIGSELNDQWQSG I P+NLS +C+F   LP FD YCGL         D SYET TF SHN NNN NFSL+SVLSS
Subjt:  -------LQEIVQFPQEA--QLVEPIGSELNDQWQSGLIFPTNLSLDCNF--ELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNN-NFSLSSVLSS

Query:  PCSSSPTQMNSN-STNITSATEDERESYCSQILNFEIPDIFDE
        P SSSPTQMNSN ST  TSATEDERESYCSQ+L+FEI DIF+E
Subjt:  PCSSSPTQMNSN-STNITSATEDERESYCSQILNFEIPDIFDE

A0A6J1EIE8 transcription factor MYB102-like | 7.3e-152 | 84.96
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR PCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLP NAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKS SFSPQN   T+QFSHPQL  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--

Query:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSHN--GNN-NNFSLSSVLSSPCSS
            QEIVQFP   QLVEPI SELNDQWQ        +S DC N E+P GFD YCG DQQ  AA +D SYE PTFSSH   GNN NNF+LSSVLSSPCSS
Subjt:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSHN--GNN-NNFSLSSVLSSPCSS

Query:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA
        SPTQMNSNST  TS  EDERESYCSQILNF+IPD FDE+
Subjt:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA

A0A6J1KNL9 transcription factor MYB102-like | 1.0e-150 | 83.78
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR PCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLP NAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEE IIQLHSVLGNKWSAIATRLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--
        DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKS SFSPQN   T+QFSHPQL  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQL--

Query:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSH---NGNNNNFSLSSVLSSPCSS
            QEIVQFP   QLVEPI SELNDQWQ        +S DC N ++P GFD YCG DQQ  AA +D SYE PTFSSH     N NNFSLSSV+SSPCSS
Subjt:  ----QEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDC-NFELP-GFDYYCGLDQQQPAAGVDRSYETPTFSSH---NGNNNNFSLSSVLSSPCSS

Query:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA
        SPTQMNSNST  TS  EDERESYCSQILNF+IPD FDE+
Subjt:  SPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEA

SwissProt top hits | e value | % identity
Q9LDR8 Transcription factor MYB102 | 7.4e-93 | 55.83
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        M R+PCC+KNGLKKGPWT EEDQKL+DYIQK+G+GNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHS LGNKWSAIA RLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS--QMNLSSLL--------GVQPLVNPELLKLAASLMS--------------SE
        DNEIKN+WNTHIRK+LLRMGIDPVTHSPRLDLLD+SSIL S+LYNSS   MN+S L+           PLVNPE+LKLA SL S              ++
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS--QMNLSSLL--------GVQPLVNPELLKLAASLMS--------------SE

Query:  RKSPSFSPQ--NSTPTVQFSHPQLQEIVQ-----FPQEAQLVEPIGSELNDQWQSGLIFPTNLSL-DCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFS
         K   +S    N   T Q+    + + +Q     FP EA+    +    N   +  L+  +  S+ DC    P F+     D       +D SY   +F 
Subjt:  RKSPSFSPQ--NSTPTVQFSHPQLQEIVQ-----FPQEAQLVEPIGSELNDQWQSGLIFPTNLSL-DCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFS

Query:  SHNGNNNNFSLSSVLSSPCSS-SPTQMNSNSTNITS-ATEDERESYCSQILNFEIPDIFD
               NF+ +SVL++P SS SPT +NS+  N +S +TEDE ESYCS ++ F+IPD  D
Subjt:  SHNGNNNNFSLSSVLSSPCSS-SPTQMNSNSTNITS-ATEDERESYCSQILNFEIPDIFD

Q9LE63 Transcription factor MYB106 | 1.0e-62 | 75.18
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR+PCCDK GLKKGPWTPEEDQKL+ YI+++GHG+WR+LP+ AGLQRCGKSCRLRWTNYLRPDIKRG+F+ +EE+TIIQLH++LGN+WSAIAT LP RT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSS
        DNEIKNYWNTH++KRL++MGIDPVTH  + + L  S+
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSS

Q9LXF1 Transcription factor MYB16 | 1.0e-62 | 78.29
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR+PCCDK GLKKGPWTPEEDQKL+ YI+++GHG+WR+LP+ AGL RCGKSCRLRWTNYLRPDIKRG+F+ +EE+TIIQLH++LGN+WSAIAT LP RT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPR
        DNEIKNYWNTH++KRL++MGIDPVTH P+
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPR

Q9M0J5 Transcription factor MYB41 | 7.9e-79 | 63.52
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR+PCCDKNG+KKGPWT EEDQKLIDYI+ +G GNWRTLPKNAGL RCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSV+GNKWSAIA RLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSL-LGVQPLVNPELLKLAASLMSSERKSPSFSPQN-----STPTV--Q
        DNEIKN+WNTHIRKRL+R GIDPVTHSPRLDLLDLSS+L S L+N  Q N S++      L+NP++L+LA+ L+  +  +P + P N      TP    +
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSL-LGVQPLVNPELLKLAASLMSSERKSPSFSPQN-----STPTV--Q

Query:  FSHPQLQ-EIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLD
         S PQ +   V    E   +EP+ + L+D   + ++ P + S D
Subjt:  FSHPQLQ-EIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLD

Q9M0Y5 Transcription factor MYB74 | 1.0e-86 | 55.75
Query:  MGRTPCCD-KNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGR
        MGR+PCC+ KNGLKKGPWTPEEDQKLIDYI  +G+GNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHS++GNKWSAIA RLPGR
Subjt:  MGRTPCCD-KNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGR

Query:  TDNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS---------QMNLSSLL----GVQPLVNPELLKLAASLMSSERKSPSFSPQN
        TDNEIKNYWNTHIRKRLL+MGIDPVTH+PRLDLLD+SSIL S++YNSS          MN+S L+      QPLVNPE+LKLA SL S++   P+ + +N
Subjt:  TDNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS---------QMNLSSLL----GVQPLVNPELLKLAASLMSSERKSPSFSPQN

Query:  STPTVQFSHPQLQEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNNNFSL-------
        +T                           +E+N Q+Q+G   P N  L   F  P  D +       P   +  + +       + + +NF L       
Subjt:  STPTVQFSHPQLQEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNNNFSL-------

Query:  SSVLSSPCSSSPTQMNSNST----NITSATEDERESYCS
        +SVL++P SSSPT +NS+S+    + T +TEDE+ESY S
Subjt:  SSVLSSPCSSSPTQMNSNST----NITSATEDERESYCS

Arabidopsis top hits | e value | % identity
AT3G02940.1 myb domain protein 107 | 4.2e-67 | 62.18
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR+PCCD++GLKKGPWTPEEDQKLI++I+K+GHG+WR LPK AGL RCGKSCRLRWTNYLRPDIKRG F+ EEE+TII LHS+LGNKWS+IA  LPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSL-----LGVQPLVNPELLKLAASLMSSERKSPSFSPQNST
        DNEIKNYWNTHIRK+L++MGIDPVTH PR D L++ + L   L  ++  NL +L     L    +   +LL     ++S+   S SF   ++T
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSL-----LGVQPLVNPELLKLAASLMSSERKSPSFSPQNST

AT4G05100.1 myb domain protein 74 | 7.3e-88 | 55.75
Query:  MGRTPCCD-KNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGR
        MGR+PCC+ KNGLKKGPWTPEEDQKLIDYI  +G+GNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHS++GNKWSAIA RLPGR
Subjt:  MGRTPCCD-KNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGR

Query:  TDNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS---------QMNLSSLL----GVQPLVNPELLKLAASLMSSERKSPSFSPQN
        TDNEIKNYWNTHIRKRLL+MGIDPVTH+PRLDLLD+SSIL S++YNSS          MN+S L+      QPLVNPE+LKLA SL S++   P+ + +N
Subjt:  TDNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS---------QMNLSSLL----GVQPLVNPELLKLAASLMSSERKSPSFSPQN

Query:  STPTVQFSHPQLQEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNNNFSL-------
        +T                           +E+N Q+Q+G   P N  L   F  P  D +       P   +  + +       + + +NF L       
Subjt:  STPTVQFSHPQLQEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNNNFSL-------

Query:  SSVLSSPCSSSPTQMNSNST----NITSATEDERESYCS
        +SVL++P SSSPT +NS+S+    + T +TEDE+ESY S
Subjt:  SSVLSSPCSSSPTQMNSNST----NITSATEDERESYCS

AT4G21440.1 MYB-like 102 | 5.2e-94 | 55.83
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        M R+PCC+KNGLKKGPWT EEDQKL+DYIQK+G+GNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHS LGNKWSAIA RLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS--QMNLSSLL--------GVQPLVNPELLKLAASLMS--------------SE
        DNEIKN+WNTHIRK+LLRMGIDPVTHSPRLDLLD+SSIL S+LYNSS   MN+S L+           PLVNPE+LKLA SL S              ++
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSS--QMNLSSLL--------GVQPLVNPELLKLAASLMS--------------SE

Query:  RKSPSFSPQ--NSTPTVQFSHPQLQEIVQ-----FPQEAQLVEPIGSELNDQWQSGLIFPTNLSL-DCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFS
         K   +S    N   T Q+    + + +Q     FP EA+    +    N   +  L+  +  S+ DC    P F+     D       +D SY   +F 
Subjt:  RKSPSFSPQ--NSTPTVQFSHPQLQEIVQ-----FPQEAQLVEPIGSELNDQWQSGLIFPTNLSL-DCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFS

Query:  SHNGNNNNFSLSSVLSSPCSS-SPTQMNSNSTNITS-ATEDERESYCSQILNFEIPDIFD
               NF+ +SVL++P SS SPT +NS+  N +S +TEDE ESYCS ++ F+IPD  D
Subjt:  SHNGNNNNFSLSSVLSSPCSS-SPTQMNSNSTNITS-ATEDERESYCSQILNFEIPDIFD

AT4G28110.1 myb domain protein 41 | 5.6e-80 | 63.52
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MGR+PCCDKNG+KKGPWT EEDQKLIDYI+ +G GNWRTLPKNAGL RCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSV+GNKWSAIA RLPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSL-LGVQPLVNPELLKLAASLMSSERKSPSFSPQN-----STPTV--Q
        DNEIKN+WNTHIRKRL+R GIDPVTHSPRLDLLDLSS+L S L+N  Q N S++      L+NP++L+LA+ L+  +  +P + P N      TP    +
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSL-LGVQPLVNPELLKLAASLMSSERKSPSFSPQN-----STPTV--Q

Query:  FSHPQLQ-EIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLD
         S PQ +   V    E   +EP+ + L+D   + ++ P + S D
Subjt:  FSHPQLQ-EIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLD

AT5G54230.1 myb domain protein 49 | 1.5e-69 | 47.6
Query:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT
        MG++   +++ +KKGPWTPEED+KL+ YIQ +G G WRTLPKNAGL+RCGKSCRLRWTNYLRPDIKRG FS +EEETIIQLH +LGNKWSAIA  LPGRT
Subjt:  MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRT

Query:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPL-VNPELLK-LAASL--MSSERKSPSFSPQNSTPTVQFSHP
        DNEIKNYWNTHI+K+LLRMGIDPVTH PR++LL LSS L S+L+ S    +++   +    +NP++L  L ASL  + +E   P+   QN   T Q +  
Subjt:  DNEIKNYWNTHIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPL-VNPELLK-LAASL--MSSERKSPSFSPQNSTPTVQFSHP

Query:  QLQEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGL----DQQQPAAGVDRSYETPTF-SSHNGNNNNFSL-----SSVLSSP
         L               + S    QWQ+   +           L  +  Y G     + + P AG   +Y +  F S H  +  NF       SS+L+  
Subjt:  QLQEIVQFPQEAQLVEPIGSELNDQWQSGLIFPTNLSLDCNFELPGFDYYCGL----DQQQPAAGVDRSYETPTF-SSHNGNNNNFSL-----SSVLSSP

Query:  CSSSPTQMNSNST-NITSATEDERESYCSQILNF
         SSS T +NS+ST  +   +ED+RES+ S +L F
Subjt:  CSSSPTQMNSNST-NITSATEDERESYCSQILNF
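
Each hit above reports a % identity alongside a BLAST-style pairwise alignment, in which the unlabeled middle row marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch of how identities could be tallied from that middle row, the hypothetical helper `identity_from_midline` below counts the letters. It assumes the midline string has been padded to the full width of the Query row (trailing spaces are easily lost in copied text), and it is not claimed to reproduce the exact values reported by the database.

```python
def identity_from_midline(midline):
    """Return (identities, positives, aligned_columns, percent_identity).

    A letter in the midline is an identical residue, '+' is a conservative
    substitution, and a space is a mismatch or a gap column.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    columns = len(midline)
    percent = 100.0 * identities / columns if columns else 0.0
    return identities, positives, columns, percent


# Midline for the first 30 alignment columns of the XP_022140118.1 hit.
midline = "MGRTPCCDKNGLKKGPWTPEEDQKL+DYIQ"
ids, pos, cols, pct = identity_from_midline(midline)
print(f"{ids}/{cols} identical ({pct:.2f}%), {pos}/{cols} positives")
```

To score a whole hit, the midline rows of all its alignment blocks would be concatenated before calling the function.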


Sequences
CDS sequence
ATGGGAAGAACACCTTGCTGTGATAAAAATGGACTCAAGAAAGGCCCATGGACGCCTGAGGAAGATCAGAAATTGATTGATTATATTCAGAAAAATGGACATGGCAACTG
GAGGACTCTTCCCAAGAATGCAGGGCTCCAAAGATGTGGGAAGAGCTGTCGTTTGAGATGGACTAATTATTTGAGACCTGATATCAAACGAGGGAGATTTTCATTTGAGG
AGGAAGAAACCATCATTCAACTCCATAGCGTTTTGGGTAATAAGTGGTCTGCCATTGCGACTCGATTACCAGGAAGAACAGACAATGAAATAAAGAACTACTGGAACACG
CACATAAGAAAGAGGCTTCTAAGAATGGGAATTGATCCAGTGACCCACAGTCCAAGGCTTGATCTTCTGGACTTGTCCTCCATTTTACGCTCCACTCTCTACAACTCTTC
ACAGATGAACCTTTCAAGCTTGCTTGGGGTACAGCCATTAGTCAATCCCGAGCTTCTTAAATTGGCTGCCTCATTAATGTCTTCTGAACGCAAAAGTCCAAGCTTTTCGC
CTCAAAATTCTACACCAACCGTTCAATTTTCTCATCCTCAGCTGCAAGAAATTGTTCAATTTCCTCAAGAAGCCCAGCTGGTAGAGCCAATTGGAAGTGAATTAAATGAT
CAGTGGCAAAGTGGTCTGATTTTTCCAACAAATTTAAGCCTAGATTGTAATTTTGAGTTGCCGGGTTTTGACTATTATTGTGGCTTGGATCAGCAACAGCCCGCCGCTGG
GGTGGATCGTTCGTATGAAACGCCGACGTTTAGCTCCCACAACGGCAACAATAATAACTTTAGTTTGAGTTCAGTTTTATCTTCCCCGTGTTCGTCAAGCCCTACACAAA
TGAATTCAAACTCCACGAATATCACAAGTGCAACCGAGGATGAGAGAGAAAGTTACTGCAGCCAGATTTTGAATTTTGAAATCCCGGATATCTTTGATGAAGCTTGCTTT
CATGTAATCTAA
mRNA sequence
AATTTCTTTCTCTTTCTCCTAAATCCAAATTCCCAAATTATTTTGTACTTACATAACCTGGGAGAGAGAGAGAGAGAAAAAAAAAAAACAGATTTTGATTTGAATATTCA
CCATGGGAAGAACACCTTGCTGTGATAAAAATGGACTCAAGAAAGGCCCATGGACGCCTGAGGAAGATCAGAAATTGATTGATTATATTCAGAAAAATGGACATGGCAAC
TGGAGGACTCTTCCCAAGAATGCAGGGCTCCAAAGATGTGGGAAGAGCTGTCGTTTGAGATGGACTAATTATTTGAGACCTGATATCAAACGAGGGAGATTTTCATTTGA
GGAGGAAGAAACCATCATTCAACTCCATAGCGTTTTGGGTAATAAGTGGTCTGCCATTGCGACTCGATTACCAGGAAGAACAGACAATGAAATAAAGAACTACTGGAACA
CGCACATAAGAAAGAGGCTTCTAAGAATGGGAATTGATCCAGTGACCCACAGTCCAAGGCTTGATCTTCTGGACTTGTCCTCCATTTTACGCTCCACTCTCTACAACTCT
TCACAGATGAACCTTTCAAGCTTGCTTGGGGTACAGCCATTAGTCAATCCCGAGCTTCTTAAATTGGCTGCCTCATTAATGTCTTCTGAACGCAAAAGTCCAAGCTTTTC
GCCTCAAAATTCTACACCAACCGTTCAATTTTCTCATCCTCAGCTGCAAGAAATTGTTCAATTTCCTCAAGAAGCCCAGCTGGTAGAGCCAATTGGAAGTGAATTAAATG
ATCAGTGGCAAAGTGGTCTGATTTTTCCAACAAATTTAAGCCTAGATTGTAATTTTGAGTTGCCGGGTTTTGACTATTATTGTGGCTTGGATCAGCAACAGCCCGCCGCT
GGGGTGGATCGTTCGTATGAAACGCCGACGTTTAGCTCCCACAACGGCAACAATAATAACTTTAGTTTGAGTTCAGTTTTATCTTCCCCGTGTTCGTCAAGCCCTACACA
AATGAATTCAAACTCCACGAATATCACAAGTGCAACCGAGGATGAGAGAGAAAGTTACTGCAGCCAGATTTTGAATTTTGAAATCCCGGATATCTTTGATGAAGCTTGCT
TTCATGTAATCTAATGTCTAATGTGTAATGTAAAAATCAAATAAAT
Protein sequence
MGRTPCCDKNGLKKGPWTPEEDQKLIDYIQKNGHGNWRTLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIATRLPGRTDNEIKNYWNT
HIRKRLLRMGIDPVTHSPRLDLLDLSSILRSTLYNSSQMNLSSLLGVQPLVNPELLKLAASLMSSERKSPSFSPQNSTPTVQFSHPQLQEIVQFPQEAQLVEPIGSELND
QWQSGLIFPTNLSLDCNFELPGFDYYCGLDQQQPAAGVDRSYETPTFSSHNGNNNNFSLSSVLSSPCSSSPTQMNSNSTNITSATEDERESYCSQILNFEIPDIFDEACF
HVI
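
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 0 starting at the first base. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGGAAGAACACCTTGCTGTGATAAAAAT"))  # MGRTPCCDKN
print(translate("CATGTAATCTAA"))                    # HVI
```

The two fragments reproduce the start (`MGRTPCCDKN`) and end (`HVI`) of the protein sequence. Passing the full CDS, with its line breaks, should yield the complete protein if the record is internally consistent.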