; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g01720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g01720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptiontrihelix transcription factor ASIL2
Genome locationchr9:1443795..1444925
RNA-Seq ExpressionMoc09g01720
SyntenyMoc09g01720
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4
IPR044823 - Trihelix transcription factor ASIL1/2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008462265.1 PREDICTED: trihelix transcription factor ASIL2 [Cucumis melo]1.6e-16586.02Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA A PPPPPSSQ  ITLALPNQQ+KGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+AT VNPPL QNQKVP+GIPVVNRS+ 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     QQQ KG+KAQKIQ HKRPR TDSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN   +GK  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCS  NNNNSNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

XP_022152404.1 trihelix transcription factor ASIL1-like [Momordica charantia]7.2e-203100Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
        MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK

Query:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
        EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
Subjt:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY

Query:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
        HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
Subjt:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF

Query:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
Subjt:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

XP_022964374.1 trihelix transcription factor ASIL2-like [Cucurbita moschata]1.6e-16585.79Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKG-GGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MK+DDEIQS     SGSP SP+SNGRITVTVAAA PPP P SQ TITLALPNQQSKG GGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKG-GGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKNPVSSAG+AT VNPPL QNQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK
        Y  H     QQQQKG KAQK+Q HKRPRTDSDSS SERETSPTSSDS+   N+RRK+ RVQKEVVNPN+GQ+GK  KGRNGSREKGW NAV++LT+AILK
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK

Query:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN
        FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCSNNNNNN   SNS+SSN
Subjt:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN

XP_023514686.1 trihelix transcription factor ASIL2-like [Cucurbita pepo subsp. pepo]9.2e-16685.68Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKG-GGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MK+DDEIQS     SGSP SP+SNGRITVTVAAA PPP P SQ TITLALPNQQSKG GGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKG-GGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKNPVSSAG+AT VNPPL QNQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK
        Y  H     QQQQKG KAQK+Q HKRPRTDSDSS SERETSPTSSDS+   N+RRK+ RVQKEVVNPN+GQ+GK  KGRNGSREKGW NAV++LT+AILK
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK

Query:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCSNNNNNN+N++ SN
Subjt:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

XP_038898910.1 trihelix transcription factor ASIL2 [Benincasa hispida]3.9e-17288.89Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTV AAAPPPPPSSQ  ITLALPNQQSKGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKN  SSAG+ATAVNPPLQQNQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAIL
        Y  H     QQQ KG KAQKIQ HKRPR TDSDSS S+RETSPTSSDS+   NF+RKNVRVQKE VNPN+GQVGK  KGRNGSREKGWGNAV+ELTQAIL
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAIL

Query:  KFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        KFGEAYEQAESSKL+QVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCS  NNNNSNSDSSN
Subjt:  KFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

TrEMBL top hitse value%identityAlignment
A0A0A0KCE8 Uncharacterized protein2.2e-16585.75Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA A PPPPPSSQ  ITLALPNQQSKGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+ATAVNPPL QNQKVP+GIPV+NR + 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     Q Q KG+KAQKIQ HKRPR TDSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN   +GK  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCS  NNNNSNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A1S3CH31 trihelix transcription factor ASIL27.6e-16686.02Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA A PPPPPSSQ  ITLALPNQQ+KGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+AT VNPPL QNQKVP+GIPVVNRS+ 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     QQQ KG+KAQKIQ HKRPR TDSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN   +GK  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCS  NNNNSNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A5A7V0L0 Trihelix transcription factor ASIL27.6e-16686.02Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
        MKEDDEIQSYPSPGSGSP SP+SNGRITVTVAA A PPPPPSSQ  ITLALPNQQ+KGGGGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAA-PPPPPSSQGTITLALPNQQSKGGGGG-GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKH

Query:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN
        WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGP SKN  S+AG+AT VNPPL QNQKVP+GIPVVNRS+ 
Subjt:  WKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVN

Query:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI
        P+  H     QQQ KG+KAQKIQ HKRPR TDSDSS S+RETSPTSSDS+L   FRRKNVRVQKE VNPN   +GK  KG+NGSREKGWGNAV+EL QAI
Subjt:  PYHNHQQQQQQQQQKGSKAQKIQLHKRPR-TDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAI

Query:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCS  NNNNSNSDSSN
Subjt:  LKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A6J1DFX4 trihelix transcription factor ASIL1-like3.5e-203100Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
        MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWK

Query:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
        EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY
Subjt:  EVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPY

Query:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
        HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF
Subjt:  HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKF

Query:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
        GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN
Subjt:  GEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN

A0A6J1HMY9 trihelix transcription factor ASIL2-like7.6e-16685.79Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKG-GGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
        MK+DDEIQS     SGSP SP+SNGRITVTVAAA PPP P SQ TITLALPNQQSKG GGG GGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW
Subjt:  MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKG-GGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHW

Query:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP
        KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKI AGG PSKWPF+DRLDQLIGPASKNPVSSAG+AT VNPPL QNQKVPLGIPVVNRS+ P
Subjt:  KEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNP

Query:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK
        Y  H     QQQQKG KAQK+Q HKRPRTDSDSS SERETSPTSSDS+   N+RRK+ RVQKEVVNPN+GQ+GK  KGRNGSREKGW NAV++LT+AILK
Subjt:  YHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILK

Query:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN
        FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA N HHCSNNNNNN   SNS+SSN
Subjt:  FGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNN---SNSDSSN

SwissProt top hitse value%identityAlignment
Q9LJG8 Trihelix transcription factor ASIL21.9e-7646.41Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG
        M++D++I+   S GS SPD   S   GRITVTVA+A PP     PP +S       + LAL   Q+ GGG        GGGGREDCWSE AT+VLIDAWG
Subjt:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG

Query:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------
        ERYLELSRGNLKQKHWKEVA+IVSSREDY KIP+TDIQCKNRIDTVKKKYK EK +I  GG  S+W FFD+LD+LIG  +K P +++G+           
Subjt:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------

Query:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK
                       A A  PP                             +  VP+GIP+ +RS          P       QQQQQ    ++ S++++
Subjt:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK

Query:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME
         +  KR  +DSDS  SE   S  S DS  P    ++    +K              K ++G    G GN   ELT+AI++FGEAYEQ E++KLQQVVEME
Subjt:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME

Query:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN
        K+RMKF K+LELQRMQFF+KTQLEISQLK  HGRR+    N HH S  NN N+  +++N
Subjt:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN

Q9SYG2 Trihelix transcription factor ASIL12.4e-5541.81Show/hide
Query:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS
        M++DDEIQS PSPG  S     P SP       VTVA    P P  SSQ     AL                N+  +GGG   GGGGGGR+DCWSE AT 
Subjt:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS

Query:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV
        VLI+AWG+R+ E  +G LKQ+HWKEVA+IV ++    K P+TDIQCKNRIDTVKKKYK EKAKI +G  PSKW FF +L+ LIG  +    SS       
Subjt:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV

Query:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV
            + ++K P+G  + N   + +      +Q  QQQQ+++GS + +    KR  ++++S S  E E SP  S   LP            + + P    +
Subjt:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV

Query:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH
         K +K     +  G G+ V ++ +AIL F EAYE+AE++KL+ + E+EK+RMKFAK++ELQRMQ F+KTQLEI+Q           +  RR+V   +  +
Subjt:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH

Query:  CSNNNNNNS
          NN N +S
Subjt:  CSNNNNNNS

Arabidopsis top hitse value%identityAlignment
AT1G54060.1 6B-interacting protein 1-like 11.7e-5641.81Show/hide
Query:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS
        M++DDEIQS PSPG  S     P SP       VTVA    P P  SSQ     AL                N+  +GGG   GGGGGGR+DCWSE AT 
Subjt:  MKEDDEIQSYPSPGSGS-----PDSPMSNGRITVTVAAAAPPPPP-SSQGTITLALP---------------NQQSKGGG---GGGGGGREDCWSEGATS

Query:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV
        VLI+AWG+R+ E  +G LKQ+HWKEVA+IV ++    K P+TDIQCKNRIDTVKKKYK EKAKI +G  PSKW FF +L+ LIG  +    SS       
Subjt:  VLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAV

Query:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV
            + ++K P+G  + N   + +      +Q  QQQQ+++GS + +    KR  ++++S S  E E SP  S   LP            + + P    +
Subjt:  NPPLQQNQKVPLGIPVVNRSVNPY----HNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDS-SASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQV

Query:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH
         K +K     +  G G+ V ++ +AIL F EAYE+AE++KL+ + E+EK+RMKFAK++ELQRMQ F+KTQLEI+Q           +  RR+V   +  +
Subjt:  GKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQ----------LKHGRRVVAAVNHHH

Query:  CSNNNNNNS
          NN N +S
Subjt:  CSNNNNNNS

AT3G11100.1 sequence-specific DNA binding transcription factors1.5e-4139.38Show/hide
Query:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK
        GRED WSE AT+ LI+AWG+RY+ L+RGNL+Q  WKEVAD V+S     + P+TD+QCKNRIDT+KKKYKTEKAK +     S W FFDRLD LIGP  K
Subjt:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK

Query:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVV
            +   +  +NP                 ++NP              GSK+    L      D D    E +        F+        VR  ++V 
Subjt:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVV

Query:  NPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA
        + +  +                G+A  EL ++ILK GEA+E+ E  K Q ++E+EKQRM+ AK+LELQRM   M+ QLE+ + K G+R  A+
Subjt:  NPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA

AT3G14180.1 sequence-specific DNA binding transcription factors1.3e-7746.41Show/hide
Query:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG
        M++D++I+   S GS SPD   S   GRITVTVA+A PP     PP +S       + LAL   Q+ GGG        GGGGREDCWSE AT+VLIDAWG
Subjt:  MKEDDEIQSYPSPGSGSPDSPMS--NGRITVTVAAAAPP-----PPPSSQ----GTITLALPNQQSKGGGGG------GGGGREDCWSEGATSVLIDAWG

Query:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------
        ERYLELSRGNLKQKHWKEVA+IVSSREDY KIP+TDIQCKNRIDTVKKKYK EK +I  GG  S+W FFD+LD+LIG  +K P +++G+           
Subjt:  ERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGL-----------

Query:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK
                       A A  PP                             +  VP+GIP+ +RS          P       QQQQQ    ++ S++++
Subjt:  ---------------ATAVNPPLQQ--------------------------NQKVPLGIPVVNRSVN--------PYHNHQQQQQQQQ----QKGSKAQK

Query:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME
         +  KR  +DSDS  SE   S  S DS  P    ++    +K              K ++G    G GN   ELT+AI++FGEAYEQ E++KLQQVVEME
Subjt:  IQLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEME

Query:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN
        K+RMKF K+LELQRMQFF+KTQLEISQLK  HGRR+    N HH S  NN N+  +++N
Subjt:  KQRMKFAKDLELQRMQFFMKTQLEISQLK--HGRRVVAAVNHHHCSNNNNNNSNSDSSN

AT5G05550.1 sequence-specific DNA binding transcription factors1.7e-4037.54Show/hide
Query:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK
        GRED WSE AT+ L++AWG RY++L+ GNL+Q  WK+VAD V+SR       +TD+QCKNR+DT+KKKYKTEKAK+    +PS W F++RLD LIGP  K
Subjt:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK

Query:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV
           S+ G+  +                       P+ NH                        T S+S+ S  E      D      F  RK+ RV++  
Subjt:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV

Query:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA
        ++                     G+   EL  AILKFGE YE+ E  K Q ++E+EKQRM+  K++EL+RM   M+ QLEI + KH +R  A+
Subjt:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAA

AT5G05550.2 sequence-specific DNA binding transcription factors4.4e-4137.76Show/hide
Query:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK
        GRED WSE AT+ L++AWG RY++L+ GNL+Q  WK+VAD V+SR       +TD+QCKNR+DT+KKKYKTEKAK+    +PS W F++RLD LIGP  K
Subjt:  GREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSREDYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASK

Query:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV
           S+ G+  +                       P+ NH                        T S+S+ S  E      D      F  RK+ RV++  
Subjt:  NPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKIQLHKRPRTDSDSSASERETSPTSSDSFLPENF-RRKNVRVQKEV

Query:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAV
        ++                     G+   EL  AILKFGE YE+ E  K Q ++E+EKQRM+  K++EL+RM   M+ QLEI + KH +R  A+V
Subjt:  VNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLELQRMQFFMKTQLEISQLKHGRRVVAAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAAGACGATGAGATTCAGTCGTATCCGTCGCCGGGGAGTGGATCTCCTGATTCTCCGATGTCCAACGGCCGGATAACCGTGACGGTGGCCGCCGCAGCACCTCC
TCCTCCGCCTTCTTCTCAGGGCACAATAACCTTGGCTCTGCCGAATCAGCAGTCCAAGGGCGGCGGCGGCGGAGGTGGAGGCGGAAGAGAAGATTGCTGGAGCGAGGGAG
CAACGTCGGTGCTGATCGATGCTTGGGGCGAGAGGTATTTGGAACTGAGCAGAGGTAATTTGAAGCAGAAGCATTGGAAAGAGGTAGCTGATATTGTGAGTAGCAGAGAG
GACTATACGAAGATACCGAGGACTGATATTCAGTGCAAGAATCGGATTGATACGGTCAAGAAGAAGTATAAGACTGAGAAGGCTAAGATTATTGCTGGAGGAGCACCCAG
TAAGTGGCCATTTTTTGATAGATTGGATCAATTGATCGGTCCGGCCTCGAAGAATCCTGTGTCTAGCGCTGGTTTAGCCACCGCTGTAAATCCGCCTCTGCAGCAGAACC
AGAAAGTTCCACTCGGAATTCCTGTAGTGAATCGATCCGTGAATCCTTACCATAATCACCAGCAGCAGCAGCAGCAGCAGCAACAGAAGGGGTCTAAAGCTCAGAAGATA
CAGCTCCACAAGCGGCCCCGAACAGATTCCGACTCCTCAGCATCGGAGAGAGAAACATCCCCAACTTCGAGTGATAGTTTCCTACCAGAAAATTTTCGGAGGAAAAATGT
TAGGGTTCAAAAGGAGGTGGTGAATCCGAATGTAGGTCAGGTGGGGAAAGTGGTGAAGGGAAGGAATGGATCAAGAGAAAAAGGGTGGGGGAACGCCGTGAATGAATTGA
CACAAGCAATACTGAAGTTTGGGGAGGCGTATGAACAAGCCGAGAGCTCGAAGCTGCAGCAAGTGGTGGAAATGGAGAAGCAGAGGATGAAATTTGCCAAGGATCTTGAA
TTGCAGAGAATGCAATTCTTCATGAAGACGCAGTTGGAAATCTCGCAGCTGAAGCATGGGAGAAGAGTCGTGGCTGCTGTTAATCACCATCATTGCAGCAACAACAACAA
CAACAACAGTAACAGCGATAGCAGCAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGAAGACGATGAGATTCAGTCGTATCCGTCGCCGGGGAGTGGATCTCCTGATTCTCCGATGTCCAACGGCCGGATAACCGTGACGGTGGCCGCCGCAGCACCTCC
TCCTCCGCCTTCTTCTCAGGGCACAATAACCTTGGCTCTGCCGAATCAGCAGTCCAAGGGCGGCGGCGGCGGAGGTGGAGGCGGAAGAGAAGATTGCTGGAGCGAGGGAG
CAACGTCGGTGCTGATCGATGCTTGGGGCGAGAGGTATTTGGAACTGAGCAGAGGTAATTTGAAGCAGAAGCATTGGAAAGAGGTAGCTGATATTGTGAGTAGCAGAGAG
GACTATACGAAGATACCGAGGACTGATATTCAGTGCAAGAATCGGATTGATACGGTCAAGAAGAAGTATAAGACTGAGAAGGCTAAGATTATTGCTGGAGGAGCACCCAG
TAAGTGGCCATTTTTTGATAGATTGGATCAATTGATCGGTCCGGCCTCGAAGAATCCTGTGTCTAGCGCTGGTTTAGCCACCGCTGTAAATCCGCCTCTGCAGCAGAACC
AGAAAGTTCCACTCGGAATTCCTGTAGTGAATCGATCCGTGAATCCTTACCATAATCACCAGCAGCAGCAGCAGCAGCAGCAACAGAAGGGGTCTAAAGCTCAGAAGATA
CAGCTCCACAAGCGGCCCCGAACAGATTCCGACTCCTCAGCATCGGAGAGAGAAACATCCCCAACTTCGAGTGATAGTTTCCTACCAGAAAATTTTCGGAGGAAAAATGT
TAGGGTTCAAAAGGAGGTGGTGAATCCGAATGTAGGTCAGGTGGGGAAAGTGGTGAAGGGAAGGAATGGATCAAGAGAAAAAGGGTGGGGGAACGCCGTGAATGAATTGA
CACAAGCAATACTGAAGTTTGGGGAGGCGTATGAACAAGCCGAGAGCTCGAAGCTGCAGCAAGTGGTGGAAATGGAGAAGCAGAGGATGAAATTTGCCAAGGATCTTGAA
TTGCAGAGAATGCAATTCTTCATGAAGACGCAGTTGGAAATCTCGCAGCTGAAGCATGGGAGAAGAGTCGTGGCTGCTGTTAATCACCATCATTGCAGCAACAACAACAA
CAACAACAGTAACAGCGATAGCAGCAACTAG
Protein sequenceShow/hide protein sequence
MKEDDEIQSYPSPGSGSPDSPMSNGRITVTVAAAAPPPPPSSQGTITLALPNQQSKGGGGGGGGGREDCWSEGATSVLIDAWGERYLELSRGNLKQKHWKEVADIVSSRE
DYTKIPRTDIQCKNRIDTVKKKYKTEKAKIIAGGAPSKWPFFDRLDQLIGPASKNPVSSAGLATAVNPPLQQNQKVPLGIPVVNRSVNPYHNHQQQQQQQQQKGSKAQKI
QLHKRPRTDSDSSASERETSPTSSDSFLPENFRRKNVRVQKEVVNPNVGQVGKVVKGRNGSREKGWGNAVNELTQAILKFGEAYEQAESSKLQQVVEMEKQRMKFAKDLE
LQRMQFFMKTQLEISQLKHGRRVVAAVNHHHCSNNNNNNSNSDSSN