CuGenDBv2

MC09g0162 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g0162
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: trihelix transcription factor ASIL2
Genome location: MC09:1456054..1457181
RNA-Seq Expression: MC09g0162
Synteny: MC09g0162
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domains:
IPR044822 - Myb/SANT-like DNA-binding domain 4
IPR044823 - Trihelix transcription factor ASIL1/2-like
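
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states), the span can be parsed and its length computed as follows. The helper name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC09:1456054..1457181")
# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the span covers 1128 bp.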


Homology
GenBank top hits (e value, %identity, alignment)
XP_008462265.1 PREDICTED: trihelix transcription factor ASIL2 [Cucumis melo] (e value: 2.38e-207; %identity: 86.02)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA APPPPP SSQ  ITLALPNQQ+KGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+AT VNPPL QNQKVP+GIPVVNRS+ 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     QQQ KG+KAQKIQ HKRPRT DSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN+G   K  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNN  SNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

XP_022152404.1 trihelix transcription factor ASIL1-like [Momordica charantia] (e value: 2.38e-256; %identity: 100)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
        MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK

Query:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
        EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
Subjt:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY

Query:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
        HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
Subjt:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF

Query:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
Subjt:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

XP_022964374.1 trihelix transcription factor ASIL2-like [Cucurbita moschata] (e value: 1.68e-207; %identity: 85.79)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MK+DDEIQS     SGSP SP+SNGRITVTVAAA PPP P SQ TITLALPNQQSKG GGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKNPVSSAG+AT VNPPLQ NQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK
        Y  H     QQQQKG KAQK+Q HKRPRTDSDSS SERETSPTSSDS+   N+RRK+ RVQKEVVNPN+GQ+GK  KGRNGSREKGW NAV++LT+AILK
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK

Query:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN
        FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNNNN   SNS+SSN
Subjt:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN

XP_023514686.1 trihelix transcription factor ASIL2-like [Cucurbita pepo subsp. pepo] (e value: 8.94e-208; %identity: 85.68)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MK+DDEIQS     SGSP SP+SNGRITVTVAAA PPP P SQ TITLALPNQQSKG GGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKNPVSSAG+AT VNPPLQ NQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK
        Y  H     QQQQKG KAQK+Q HKRPRTDSDSS SERETSPTSSDS+   N+RRK+ RVQKEVVNPN+GQ+GK  KGRNGSREKGW NAV++LT+AILK
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK

Query:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNNNN+N++ SN
Subjt:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

XP_038898910.1 trihelix transcription factor ASIL2 [Benincasa hispida] (e value: 7.35e-216; %identity: 88.89)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAAA PPPPPSSQ  ITLALPNQQSKGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKN  SSAG+ATAVNPPLQQNQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAIL
        Y  H     QQQ KG KAQKIQ HKRPRT DSDSS S+RETSPTSSDS+   NF+RKNVRVQKE VNPN+GQVGK  KGRNGSREKGWGNAV+ELTQAIL
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAIL

Query:  KFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        KFGEAYEQAESSKL+QVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNN  SNSDSSN
Subjt:  KFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KCE8 Uncharacterized protein (e value: 4.68e-207; %identity: 85.75)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA APPPPP SSQ  ITLALPNQQSKGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+ATAVNPPL QNQKVP+GIPV+NR + 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     Q Q KG+KAQKIQ HKRPRT DSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN+G   K  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNN  SNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A1S3CH31 trihelix transcription factor ASIL2 (e value: 1.15e-207; %identity: 86.02)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA APPPPP SSQ  ITLALPNQQ+KGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+AT VNPPL QNQKVP+GIPVVNRS+ 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     QQQ KG+KAQKIQ HKRPRT DSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN+G   K  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNN  SNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A5A7V0L0 Trihelix transcription factor ASIL2 (e value: 1.15e-207; %identity: 86.02)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA APPPPP SSQ  ITLALPNQQ+KGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+AT VNPPL QNQKVP+GIPVVNRS+ 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     QQQ KG+KAQKIQ HKRPRT DSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN+G   K  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPRT-DSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNN  SNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A6J1DFX4 trihelix transcription factor ASIL1-like (e value: 1.15e-256; %identity: 100)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
        MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK

Query:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
        EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
Subjt:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY

Query:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
        HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
Subjt:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF

Query:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
Subjt:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A6J1HMY9 trihelix transcription factor ASIL2-like (e value: 8.11e-208; %identity: 85.79)
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MK+DDEIQS     SGSP SP+SNGRITVTVAAA PPP P SQ TITLALPNQQSKG GGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKNPVSSAG+AT VNPPLQ NQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK
        Y  H     QQQQKG KAQK+Q HKRPRTDSDSS SERETSPTSSDS+   N+RRK+ RVQKEVVNPN+GQ+GK  KGRNGSREKGW NAV++LT+AILK
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK

Query:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN
        FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA NHH CSNNNNNN   SNS+SSN
Subjt:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN

SwissProt top hits (e value, %identity, alignment)
Q9LJG8 Trihelix transcription factor ASIL2 (e value: 1.9e-76; %identity: 46.41)
Query:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG
        M++D++I+   S GS SPD   S   GRITVTVA+A PP     PP +S       + LAL   Q+ GGG        GGGGREDCWSE AT+VLIDAWG
Subjt:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG

Query:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------
        ERYLELSRGNLKQKHWKEVA+IVSSREDY KIP+TDIQCKNRIDTVKKKYK EK +I  GG  S+W FFD+LD+LIG  +K P +++G+           
Subjt:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------

Query:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK
                       A A  PP                             +  VP+GIP+ +RS          P       QQQQQ    ++ S++++
Subjt:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK

Query:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME
         +  KR  +DSDS  SE   S  S DS  P    ++    +K              K ++G    G GN   ELT+AI++FGEAYEQ E++KLQQVVEME
Subjt:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME

Query:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN
        K+RMKF K+LELQRMQFF+KTQLEISQLK  HGRR+    N HH S  NN N+  +++N
Subjt:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN

Q9SYG2 Trihelix transcription factor ASIL1 (e value: 2.4e-55; %identity: 41.81)
Query:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS
        M++DDEIQS PSPG  S     P SP       VTVA    P P  SSQ     AL                N+  +GGG   GGGGGGR+DCWSE AT 
Subjt:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS

Query:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV
        VLI+AWG+R+ E  +G LKQ+HWKEVA+IV ++    K P+TDIQCKNRIDTVKKKYK EKAKI +G  PSKW FF +L+ LIG  +    SS       
Subjt:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV

Query:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV
            + ++K P+G  + N   + +      +Q  QQQQ+++GS + +    KR  ++++S S  E E SP  S   LP            + + P    +
Subjt:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV

Query:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH
         K +K     +  G G+ V ++ +AIL F EAYE+AE++KL+ + E+EK+RMKFAK++ELQRMQ F+KTQLEI+Q           +  RR+V   +  +
Subjt:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH

Query:  CSNNNNNNS
          NN N +S
Subjt:  CSNNNNNNS

Arabidopsis top hits (e value, %identity, alignment)
AT1G54060.1 6B-interacting protein 1-like 1 (e value: 1.7e-56; %identity: 41.81)
Query:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS
        M++DDEIQS PSPG  S     P SP       VTVA    P P  SSQ     AL                N+  +GGG   GGGGGGR+DCWSE AT 
Subjt:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS

Query:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV
        VLI+AWG+R+ E  +G LKQ+HWKEVA+IV ++    K P+TDIQCKNRIDTVKKKYK EKAKI +G  PSKW FF +L+ LIG  +    SS       
Subjt:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV

Query:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV
            + ++K P+G  + N   + +      +Q  QQQQ+++GS + +    KR  ++++S S  E E SP  S   LP            + + P    +
Subjt:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV

Query:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH
         K +K     +  G G+ V ++ +AIL F EAYE+AE++KL+ + E+EK+RMKFAK++ELQRMQ F+KTQLEI+Q           +  RR+V   +  +
Subjt:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH

Query:  CSNNNNNNS
          NN N +S
Subjt:  CSNNNNNNS

AT3G11100.1 sequence-specific DNA binding transcription factors (e value: 1.5e-41; %identity: 39.38)
Query:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK
        GRED WSE AT+ LI+AWG+RY+ L+RGNL+Q  WKEVAD V+S     + P+TD+QCKNRIDT+KKKYKTEKAK +     S W FFDRLD LIGP  K
Subjt:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK

Query:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVV
            +   +  +NP                 ++NP              GSK+    L      D D    E +        F+        VR  ++V 
Subjt:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVV

Query:  NPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA
        + +  +                G+A  EL ++ILK GEA+E+ E  K Q ++E+EKQRM+ AK+LELQRM   M+ QLE+ + K G+R  A+
Subjt:  NPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA

AT3G14180.1 sequence-specific DNA binding transcription factors (e value: 1.3e-77; %identity: 46.41)
Query:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG
        M++D++I+   S GS SPD   S   GRITVTVA+A PP     PP +S       + LAL   Q+ GGG        GGGGREDCWSE AT+VLIDAWG
Subjt:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG

Query:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------
        ERYLELSRGNLKQKHWKEVA+IVSSREDY KIP+TDIQCKNRIDTVKKKYK EK +I  GG  S+W FFD+LD+LIG  +K P +++G+           
Subjt:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------

Query:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK
                       A A  PP                             +  VP+GIP+ +RS          P       QQQQQ    ++ S++++
Subjt:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK

Query:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME
         +  KR  +DSDS  SE   S  S DS  P    ++    +K              K ++G    G GN   ELT+AI++FGEAYEQ E++KLQQVVEME
Subjt:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME

Query:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN
        K+RMKF K+LELQRMQFF+KTQLEISQLK  HGRR+    N HH S  NN N+  +++N
Subjt:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN

AT5G05550.1 sequence-specific DNA binding transcription factors (e value: 1.7e-40; %identity: 37.54)
Query:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK
        GRED WSE AT+ L++AWG RY++L+ GNL+Q  WK+VAD V+SR       +TD+QCKNR+DT+KKKYKTEKAK+    +PS W F++RLD LIGP  K
Subjt:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK

Query:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV
           S+ G+  +                       P+ NH                        T S+S+ S  E      D      F  RK+ RV++  
Subjt:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV

Query:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA
        ++                     G+   EL  AILKFGE YE+ E  K Q ++E+EKQRM+  K++EL+RM   M+ QLEI + KH +R  A+
Subjt:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA

AT5G05550.2 sequence-specific DNA binding transcription factors (e value: 4.4e-41; %identity: 37.76)
Query:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK
        GRED WSE AT+ L++AWG RY++L+ GNL+Q  WK+VAD V+SR       +TD+QCKNR+DT+KKKYKTEKAK+    +PS W F++RLD LIGP  K
Subjt:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK

Query:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV
           S+ G+  +                       P+ NH                        T S+S+ S  E      D      F  RK+ RV++  
Subjt:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV

Query:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAV
        ++                     G+   EL  AILKFGE YE+ E  K Q ++E+EKQRM+  K++EL+RM   M+ QLEI + KH +R  A+V
Subjt:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAV
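
The alignments above use the BLAST-style middle line, where a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch of how identity can be tallied from such a middle line, the snippet below counts identities and positives over a short fragment copied from the first GenBank alignment. The function name `alignment_stats` is hypothetical, and the percentages reported in the hit tables are computed over each full alignment, not over a fragment like this one.

```python
def alignment_stats(query, midline):
    """Count identities and positives in a BLAST-style alignment fragment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    # A letter on the middle line marks an identical residue.
    identities = sum(1 for m in midline if m.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Fragment of the first GenBank alignment (XP_008462265.1).
query = "GSGSPDSPMSNGRITV"
midline = "GSGSP SP+SNGRITV"
identities, positives, length = alignment_stats(query, midline)
percent_identity = 100.0 * identities / length
print(f"{identities}/{length} identical ({percent_identity:.1f}%), {positives} positive")
```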


Sequences
CDS sequence
ATGAAGGAAGACGATGAGATTCAGTCGTATCCGTCGCCGGGGAGTGGATCTCCTGATTCTCCGATGTCCAACGGCCGGATAACCGTGACGGTGGCCGCCGCAGCACCTCC
TCCTCCGCCTTCTTCTCAGGGCACAATAACCTTGGCTCTGCCGAATCAGCAGTCCAAGGGCGGCGGCGGCGGAGGTGGAGGCGGAAGAGAAGATTGCTGGAGCGAGGGAG
CAACGTCGGTGCTGATCGATGCTTGGGGCGAGAGGTATTTGGAACTGAGCAGAGGTAATTTGAAGCAGAAGCATTGGAAAGAGGTAGCTGATATTGTGAGTAGCAGAGAG
GACTATACGAAGATACCGAGGACTGATATTCAGTGCAAGAATCGGATTGATACGGTCAAGAAGAAGTATAAGACTGAGAAGGCTAAGATTATTGCTGGAGGAGCACCCAG
TAAGTGGCCATTTTTTGATAGATTGGATCAATTGATCGGTCCGGCCTCGAAGAATCCTGTGTCTAGCGCTGGTTTAGCCACCGCTGTAAATCCGCCTCTGCAGCAGAACC
AGAAAGTTCCACTCGGAATTCCTGTAGTGAATCGATCCGTGAATCCTTACCATAATCACCAGCAGCAGCAGCAGCAGCAGCAACAGAAGGGGTCTAAAGCTCAGAAGATA
CAGCTCCACAAGCGGCCCCGAACAGATTCCGACTCCTCAGCATCGGAGAGAGAAACATCCCCAACTTCGAGTGATAGTTTCCTACCAGAAAATTTTCGGAGGAAAAATGT
TAGGGTTCAAAAGGAGGTGGTGAATCCGAATGTAGGTCAGGTGGGGAAAGTGGTGAAGGGAAGGAATGGATCAAGAGAAAAAGGGTGGGGGAACGCCGTGAATGAATTGA
CACAAGCAATACTGAAGTTTGGGGAGGCGTATGAACAAGCCGAGAGCTCGAAGCTGCAGCAAGTGGTGGAAATGGAGAAGCAGAGGATGAAATTTGCCAAGGATCTTGAA
TTGCAGAGAATGCAATTCTTCATGAAGACGCAGTTGGAAATCTCGCAGCTGAAGCATGGGAGAAGAGTCGTGGCTGCTGTTAATCACCATCATTGCAGCAACAACAACAA
CAACAACAGTAACAGCGATAGCAGCAAC
mRNA sequence
ATGAAGGAAGACGATGAGATTCAGTCGTATCCGTCGCCGGGGAGTGGATCTCCTGATTCTCCGATGTCCAACGGCCGGATAACCGTGACGGTGGCCGCCGCAGCACCTCC
TCCTCCGCCTTCTTCTCAGGGCACAATAACCTTGGCTCTGCCGAATCAGCAGTCCAAGGGCGGCGGCGGCGGAGGTGGAGGCGGAAGAGAAGATTGCTGGAGCGAGGGAG
CAACGTCGGTGCTGATCGATGCTTGGGGCGAGAGGTATTTGGAACTGAGCAGAGGTAATTTGAAGCAGAAGCATTGGAAAGAGGTAGCTGATATTGTGAGTAGCAGAGAG
GACTATACGAAGATACCGAGGACTGATATTCAGTGCAAGAATCGGATTGATACGGTCAAGAAGAAGTATAAGACTGAGAAGGCTAAGATTATTGCTGGAGGAGCACCCAG
TAAGTGGCCATTTTTTGATAGATTGGATCAATTGATCGGTCCGGCCTCGAAGAATCCTGTGTCTAGCGCTGGTTTAGCCACCGCTGTAAATCCGCCTCTGCAGCAGAACC
AGAAAGTTCCACTCGGAATTCCTGTAGTGAATCGATCCGTGAATCCTTACCATAATCACCAGCAGCAGCAGCAGCAGCAGCAACAGAAGGGGTCTAAAGCTCAGAAGATA
CAGCTCCACAAGCGGCCCCGAACAGATTCCGACTCCTCAGCATCGGAGAGAGAAACATCCCCAACTTCGAGTGATAGTTTCCTACCAGAAAATTTTCGGAGGAAAAATGT
TAGGGTTCAAAAGGAGGTGGTGAATCCGAATGTAGGTCAGGTGGGGAAAGTGGTGAAGGGAAGGAATGGATCAAGAGAAAAAGGGTGGGGGAACGCCGTGAATGAATTGA
CACAAGCAATACTGAAGTTTGGGGAGGCGTATGAACAAGCCGAGAGCTCGAAGCTGCAGCAAGTGGTGGAAATGGAGAAGCAGAGGATGAAATTTGCCAAGGATCTTGAA
TTGCAGAGAATGCAATTCTTCATGAAGACGCAGTTGGAAATCTCGCAGCTGAAGCATGGGAGAAGAGTCGTGGCTGCTGTTAATCACCATCATTGCAGCAACAACAACAA
CAACAACAGTAACAGCGATAGCAGCAAC
Protein sequence
MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSRE
DYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKI
QLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLE
LQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
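
The protein sequence should correspond to a codon-by-codon translation of the CDS. The sketch below checks this for the first 16 codons only, assuming the standard genetic code (an assumption; the page does not name a translation table). `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 48 bases of the CDS and first 16 residues of the protein listed above.
cds_prefix = "ATGAAGGAAGACGATGAGATTCAGTCGTATCCGTCGCCGGGGAGTGGA"
protein_prefix = "MKEDDEIQSYPSPGSG"
print(translate(cds_prefix) == protein_prefix)
```

The same function can be applied to the full CDS after joining its lines into one string.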